candlewill/Speech-Corpus-Collection
A Collection of Speech Corpus for ASR and TTS
Curates publicly available speech datasets across multiple scales and languages, including large-scale corpora (LibriSpeech's 1000 hours), specialized domains (TED talks), and single-speaker databases optimized for voice synthesis. Provides centralized access to diverse ASR training data like VCTK and TEDLIUM alongside TTS corpora from CMU ARCTIC and Blizzard Challenge resources, with preprocessing notes for datasets requiring manual alignment like the World English Bible.
113 stars. No commits in the last 6 months.
Stars
113
Forks
20
Language
—
License
MIT
Category
Last pushed
Jun 19, 2017
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/candlewill/Speech-Corpus-Collection"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ynop/audiomate
Python library for handling audio datasets.
davidmartinrius/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
reazon-research/ReazonSpeech
Massive open Japanese speech corpus
EgorLakomkin/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos