MahtaFetrat/VirgoolInformal-Speech-Dataset
A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
No commits in the last 6 months.
Stars: 4
Forks: 2
Language: Jupyter Notebook
License: MIT
Category
Last pushed: Feb 08, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/MahtaFetrat/VirgoolInformal-Speech-Dataset"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
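The curl request above can also be issued from a script. The following is a minimal Python sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the `voice-ai` path segment is copied from the URL above, and the JSON response format is an assumption, since this page does not document what the endpoint returns. Because that format is unverified, the code falls back to returning the raw text.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(owner, repo, category="voice-ai"):
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters cannot break the URL.
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(owner, repo, category="voice-ai", timeout=10.0):
    """Fetch the record for one repository (hypothetical helper).

    Assumption: the response body is JSON. If it does not parse,
    the raw text is returned unchanged.
    """
    url = build_quality_url(owner, repo, category)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body
```

Calling `fetch_quality("MahtaFetrat", "VirgoolInformal-Speech-Dataset")` performs one request, which counts against the daily limit stated above. No API key handling is shown because the page does not say how a key is passed.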
Higher-rated alternatives
tihu-nlp/tihu: Persian Text-To-Speech
persiandataset/PersianSpeech: Persian ASR dataset
MahtaFetrat/ManaTTS-Persian-Speech-Dataset: ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio....
mmahdibarghi/finglish-dataset: Persian to Finglish dataset with all the sentences voice for TTS dataset used to train tacotron2
IranTechNest/PersianSpeechRecognition: Persian Speech Recognition