wannaphong/ttsmms
TTS with The Massively Multilingual Speech (MMS) project
Wraps Meta's MMS VITS models to enable text-to-speech synthesis across 1,107 languages through a simple Python API with automatic model downloading and audio generation. Built on the VITS vocoder architecture, it provides straightforward synthesis methods that return either NumPy arrays or write directly to WAV files at 16kHz sampling rate. Integrates with the fairseq MMS project's pre-trained multilingual models, requiring only language codes to download and instantiate language-specific TTS engines.
235 stars and 310 monthly downloads. No commits in the last 6 months. Available on PyPI.
Stars
235
Forks
38
Language
Python
License
MIT
Category
Last pushed
Jul 12, 2024
Monthly downloads
310
Commits (30d)
0
Dependencies
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/wannaphong/ttsmms"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Related tools
KoljaB/RealtimeTTS
Converts text to speech in realtime
nateshmbhat/pyttsx3
Offline Text To Speech synthesis for python
pndurette/gTTS
Python library and CLI tool to interface with Google Translate's text-to-speech API
n1teshy/yapper-tts
offline text to speech and free SOTA LLM APIs to let your programs speak to you
dputhier/pygtftk
A python package and a set of shell commands to handle GTF files