SILMA-AI/silma-tts
SILMA TTS v1 official repo: a lightweight, open, bilingual text-to-speech model
Built on the F5-TTS diffusion architecture, this 150M-parameter model reaches a real-time factor of about 0.12 on an RTX 4090 and supports instant voice cloning and zero-shot style transfer. It handles both Arabic (Fusha/MSA) and English, with native Arabic diacritization via CATT, NeMo text processing, and full tashkeel awareness. The model is fully compatible with F5-TTS v1.1.7 training pipelines, so the community can fine-tune it while keeping its inference optimizations over the base architecture.
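To make the real-time factor figure concrete, here is a minimal sketch of the arithmetic. It assumes the conventional definition (synthesis time divided by the duration of the generated audio), which this page does not spell out; the function names are illustrative and are not part of the silma-tts package.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Return synthesis time divided by generated audio duration.

    Assumes the conventional RTF definition; values below 1.0 mean
    the model generates audio faster than it takes to play back.
    """
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def estimated_synthesis_seconds(rtf: float, audio_seconds: float) -> float:
    """Estimate how long it takes to synthesize a clip at a given RTF."""
    return rtf * audio_seconds


if __name__ == "__main__":
    # At the quoted RTF of about 0.12, a 10-second clip would take
    # roughly 1.2 seconds to generate on comparable hardware.
    print(estimated_synthesis_seconds(0.12, 10.0))
```

Under this definition, an RTF of about 0.12 corresponds to generating audio roughly eight times faster than playback speed.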
6 stars and 1,118 monthly downloads. Available on PyPI.
Stars: 6
Forks: —
Language: Python
License: MIT
Category: (not listed)
Last pushed: Mar 15, 2026
Monthly downloads: 1,118
Commits (30d): 0
Dependencies: 25
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/SILMA-AI/silma-tts"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
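The same request can be sketched in Python with only the standard library. This is a hedged sketch: it assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, both inferred from the single URL above rather than from documented behavior. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (path pattern is an assumption)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the response body is JSON; the schema is not documented
    on this page, so the parsed object is returned unchanged.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    print(fetch_quality("voice-ai", "SILMA-AI", "silma-tts"))
```

With a limit of 100 unauthenticated requests per day, it is worth caching the response locally rather than calling the endpoint repeatedly.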
Higher-rated alternatives
KoljaB/RealtimeTTS
Converts text to speech in realtime
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python
pndurette/gTTS
Python library and CLI tool to interface with Google Translate's text-to-speech API
n1teshy/yapper-tts
offline text to speech and free SOTA LLM APIs to let your programs speak to you
dputhier/pygtftk
A python package and a set of shell commands to handle GTF files