robmsmt/ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Organizes diverse publicly and commercially available speech datasets across multiple languages and acoustic conditions in a single curated table, including conversational, read, lecture, and noisy speech corpora ranging from 5 hours (TIMIT) to 33,000+ hours (GigaSpeech). Covers both ASR training data and text-to-speech (TTS) datasets with direct download links, torrent sources, and LDC catalog codes, enabling researchers to compare dataset sizes and characteristics for benchmark evaluation.
231 stars. No commits in the last 6 months.
Stars
231
Forks
22
Language
Shell
License
Apache-2.0
Category
Last pushed
Aug 06, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/robmsmt/ASR-Audio-Data-Links"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ynop/audiomate
Python library for handling audio datasets.
davidmartinrius/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
reazon-research/ReazonSpeech
Massive open Japanese speech corpus
EgorLakomkin/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos