Nexdata-AI/240-Hours-Hindi-Speech-Data-by-Mobile-Phone_Reading
Hindi Speech Dataset
This project provides 240 hours of read Hindi speech recorded on mobile phones in both quiet and noisy environments. The content spans news, entertainment, economy, and everyday spoken language. It is aimed at AI developers, researchers, and data scientists building speech recognition, machine translation, or voice biometric systems for Hindi.
No commits in the last 6 months.
Use this if you need a high-quality, real-world Hindi speech dataset for training or evaluating AI models that understand spoken Hindi.
Not ideal if you need speech data in a language other than Hindi, or domain-specific content outside economy, entertainment, news, and general spoken language.
Stars: 2
Forks: —
Language: —
License: —
Category:
Last pushed: Aug 08, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Nexdata-AI/240-Hours-Hindi-Speech-Data-by-Mobile-Phone_Reading"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
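The same request can be made from a script. The following is a minimal sketch using only the Python standard library. It assumes the path segments after `/quality/` are a category, a repository owner, and a repository name, and that the response body is JSON; neither is confirmed by this page. The function names `quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment.

    Assumption: the path layout is <category>/<owner>/<repo>.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the endpoint returns JSON; adjust the decoding if it does not.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality(
#     "voice-ai",
#     "Nexdata-AI",
#     "240-Hours-Hindi-Speech-Data-by-Mobile-Phone_Reading",
# )
```

With the keyless limit of 100 requests/day stated above, a script that polls this endpoint would do well to cache responses rather than refetch on every run.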
Higher-rated alternatives
double22a/speech_dataset
The dataset of Speech Recognition
Jakobovski/free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
Ijwi-ry-Ikirundi-AI/Kirundi_Dataset
🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi language. Building AI...
lottev1991/Project-AIdol-Public-English-Dataset
Public female English corpus used for Project AI❤dol
Jahangirbd23/WenetSpeech-Yue
📑 Explore WenetSpeech-Yue, a comprehensive Cantonese speech corpus with rich annotations,...