Nexdata-AI/1000-Hours-American-English-Conversational-Speech-Data-by-Mobile-Phone
American English Conversational Speech Dataset
This dataset provides about 1,000 hours (per the repository name) of natural American English conversations, recorded from 2,000 speakers using mobile phones. It includes the audio recordings and manually produced transcriptions, along with speaker identification and gender information. It is aimed at AI developers building or improving speech recognition and voiceprint recognition systems.
No commits in the last 6 months.
Use this if you need high-quality, real-world conversational speech data to train or test your American English speech AI models.
Not ideal if you require speech data in languages other than American English or need domain-specific vocabulary outside of general conversation.
Stars: 2
Forks: not listed
Language: not listed
License: not listed
Category: not listed
Last pushed: Aug 08, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Nexdata-AI/1000-Hours-American-English-Conversational-Speech-Data-by-Mobile-Phone"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
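The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the URL follows the pattern `quality/<category>/<owner>/<repo>` (inferred from the single curl example above) and that the endpoint returns JSON; neither is confirmed by this page. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout after it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL, percent-encoding each path segment."""
    segments = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(segments)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data without an API key.

    Assumes the response body is JSON; the schema is not documented here,
    so the parsed object is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "Nexdata-AI", "1000-Hours-American-English-Conversational-Speech-Data-by-Mobile-Phone")` issues the same GET request as the curl command. Keyless access is limited to 100 requests/day, so a script that polls many repositories may want to cache results.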
Higher-rated alternatives
double22a/speech_dataset
A dataset for speech recognition.
Jakobovski/free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
Ijwi-ry-Ikirundi-AI/Kirundi_Dataset
🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi language. Building AI...
lottev1991/Project-AIdol-Public-English-Dataset
Public female English corpus used for Project AI❤dol
Jahangirbd23/WenetSpeech-Yue
📑 Explore WenetSpeech-Yue, a comprehensive Cantonese speech corpus with rich annotations,...