candlewill/Speech-Corpus-Collection

A Collection of Speech Corpus for ASR and TTS

/ 100

Emerging

Curates publicly available speech datasets of varying size and language coverage, including large-scale corpora (LibriSpeech, roughly 1,000 hours), domain-specific collections (TED talks), and single-speaker databases suited to voice synthesis. It lists ASR training data such as VCTK and TED-LIUM alongside TTS corpora from CMU ARCTIC and the Blizzard Challenge, with preprocessing notes for datasets that require manual alignment, such as the World English Bible.

113 stars. No commits in the last 6 months.

Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 18 / 25


Stars

113

Forks

Language

—

License

MIT

Higher-rated alternatives

ynop/audiomate

Python library for handling audio datasets.

davidmartinrius/speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...

common-voice/cv-dataset

Metadata and versioning details for the Common Voice dataset

reazon-research/ReazonSpeech

Massive open Japanese speech corpus

EgorLakomkin/KTSpeechCrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos
