robmsmt/ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

/ 100

Emerging

Organizes diverse publicly and commercially available speech datasets across multiple languages and acoustic conditions in a single curated table, including conversational, read, lecture, and noisy speech corpora ranging from 5 hours (TIMIT) to 33,000+ hours (GigaSpeech). Covers both ASR training data and text-to-speech (TTS) datasets with direct download links, torrent sources, and LDC catalog codes, enabling researchers to compare dataset sizes and characteristics for benchmark evaluation.

231 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 9 / 25

Community 14 / 25

How are scores calculated?

Stars

231

Forks

Language

Shell

License

Apache-2.0

Higher-rated alternatives

ynop/audiomate

Python library for handling audio datasets.

davidmartinrius/speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...

common-voice/cv-dataset

Metadata and versioning details for the Common Voice dataset

reazon-research/ReazonSpeech

Massive open Japanese speech corpus

EgorLakomkin/KTSpeechCrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Explore Voice AI Tools

All categories Trending Voice AI directory Insights