ARBML/klaam
Arabic speech recognition, classification and text-to-speech.
Built on wav2vec2 (XLSR-53) for speech recognition and FastSpeech2 with HiFi-GAN vocoding for synthesis, supporting Modern Standard Arabic and dialectal variants (Egyptian, Levantine, Gulf, North African). Integrates with Hugging Face Transformers for pretrained models and enables both inference and fine-tuning across recognition, dialect classification, and TTS tasks with modular Python APIs.
424 stars. No commits in the last 6 months.
Stars
424
Forks
85
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Sep 30, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ARBML/klaam"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files
gooofy/py-kaldi-asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as...
pykaldi/pykaldi
A Python wrapper for Kaldi
scarletcho/KoLM
Korean text normalization and language preparation package for LM in Kaldi-based ASR system