snakers4/open_stt
Open STT
ArchivedComprehensive Russian speech recognition dataset with ~16M utterances spanning 20,000 hours across diverse domains (radio, public speech, audiobooks, YouTube, phone calls) with multiple annotation methodologies including forced alignment, subtitles, and manual annotation. Data is distributed in compressed OPUS format (356GB) alongside WAV archives, with helper utilities for format conversion and on-disk database construction. Designed for training end-to-end ASR systems, with quality-stratified subsets ranging from 70-99% accuracy for different use cases.
818 stars. No commits in the last 6 months.
Stars
818
Forks
87
Language
Python
License
—
Category
Last pushed
Mar 11, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/snakers4/open_stt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
speechmatics/speechmatics-python
Python library and CLI for Speechmatics
gooofy/py-nltools
A collection of basic python modules for spoken natural language processing
IBM/MAX-Speech-to-Text-Converter
Converts spoken words into text form.
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition,...
Kini218/speech-to-text
Speech to text script on python