snakers4/open_stt

Open STT

Archived

/ 100

Emerging

Comprehensive Russian speech recognition dataset with ~16M utterances spanning 20,000 hours across diverse domains (radio, public speech, audiobooks, YouTube, phone calls) with multiple annotation methodologies including forced alignment, subtitles, and manual annotation. Data is distributed in compressed OPUS format (356GB) alongside WAV archives, with helper utilities for format conversion and on-disk database construction. Designed for training end-to-end ASR systems, with quality-stratified subsets ranging from 70-99% accuracy for different use cases.

818 stars. No commits in the last 6 months.

Archived Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

818

Forks

Language

Python

License

—

Higher-rated alternatives

speechmatics/speechmatics-python

Python library and CLI for Speechmatics

gooofy/py-nltools

A collection of basic python modules for spoken natural language processing

IBM/MAX-Speech-to-Text-Converter

Converts spoken words into text form.

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...

Kini218/speech-to-text

Speech to text script on python

Explore Voice AI Tools

All categories Trending Voice AI directory Insights