MahtaFetrat/ManaTTS-Persian-Speech-Dataset
ManaTTS is the largest open Persian speech dataset, with more than 114 hours of transcribed audio. The repository includes the data collection pipeline and tools, and the dataset is suitable for training Persian text-to-speech models.
No commits in the last 6 months.
Stars: 49
Forks: 5
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Jul 12, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/MahtaFetrat/ManaTTS-Persian-Speech-Dataset"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
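The same request can be made from a script. The following is a minimal Python sketch using only the standard library, not an official client. It assumes that the path segments after `/quality/` are category, owner, and repository (inferred from the curl URL above) and that the endpoint returns a JSON body; neither is confirmed by this page. The names `quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper).

    Assumption: the path layout is <category>/<owner>/<repo>, as the
    curl example suggests.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the response is JSON; the page does not document the format.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "MahtaFetrat", "ManaTTS-Persian-Speech-Dataset")` would issue the same request as the curl command. With the keyless limit of 100 requests/day, it may be worth caching the result rather than fetching on every run.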
Higher-rated alternatives
tihu-nlp/tihu: Persian Text-To-Speech
persiandataset/PersianSpeech: Persian ASR dataset
mmahdibarghi/finglish-dataset: Persian-to-Finglish dataset with recorded audio for every sentence; a TTS dataset used to train Tacotron 2
MahtaFetrat/VirgoolInformal-Speech-Dataset: A dataset of informal Persian audio and text chunks, along with a fully open processing...
IranTechNest/PersianSpeechRecognition: Persian Speech Recognition