Text-to-Speech Frameworks
End-to-end TTS architectures, models, and toolkits for synthesizing speech from text. Includes transformer-based, diffusion-based, and flow-matching approaches with various duration modeling techniques. Does NOT include voice cloning, speech recognition, speech evaluation metrics, or TTS paper collections.
12 text-to-speech frameworks are tracked. One scores above 70 (Verified tier). The highest-rated is voicepaw/so-vits-svc-fork at 89/100, with 9,281 stars and 4,743 monthly downloads. One of the top 10 is actively maintained.
Get all 12 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=text-to-speech-frameworks&limit=20"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
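The same request can be made from a script. The following is a minimal Python sketch using only the standard library; it reuses the endpoint and query parameters from the curl command above. The helper names `build_url` and `fetch_projects` are illustrative, and the shape of the JSON response is not documented here, so the code returns the parsed payload without assuming any field names.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="text-to-speech-frameworks",
              limit=20):
    """Build the dataset URL with the same query parameters as the curl call."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10.0):
    """Fetch the URL and return the parsed JSON payload.

    The response schema is an unknown here, so the payload is returned
    as-is (a dict or a list, whatever the API sends).
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` performs the request. Without a key, each call counts against the 100 requests/day limit, so it is worth caching the result locally when iterating.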
| # | Framework | Description | Score | Tier |
|---|---|---|---|---|
| 1 | voicepaw/so-vits-svc-fork | so-vits-svc fork with realtime support, improved interface and more features. | 89 | Verified |
| 2 | ssmall256/mlx-audio-io | Native audio I/O for MLX on macOS and Linux | | Established |
| 3 | ssmall256/mlx-spectro | High-performance STFT/iSTFT for Apple MLX with fused Metal kernels and... | | Established |
| 4 | sarulab-speech/UTMOSv2 | UTokyo-SaruLab MOS Prediction System | | Emerging |
| 5 | daniilrobnikov/vits | VITS: Conditional Variational Autoencoder with Adversarial Learning for... | | Experimental |
| 6 | MWM-io/SpecTNT-pytorch | Unofficial implementation of SpecTNT in pytorch | | Experimental |
| 7 | nipponjo/arabic-vocalization | Arabic deep-learning based diacritization models (Shakkala, Shakkelha)... | | Experimental |
| 8 | NTIA/alignnet | Train no-reference speech quality estimators with multiple datasets via... | | Experimental |
| 9 | kuntiniong/hk-insta-identifier | Hong Kong Instagram username identification with Romanized Cantonese linguistics | | Experimental |
| 10 | juanjosehr14/YingMusic-SVC | 🎤 Transform singing voices effortlessly with YingMusic-SVC, a robust... | | Experimental |
| 11 | mende237/Nda-Nda-Force-Aligner | Forced alignment of Nda‘ Nda’ a Cameroonian language | | Experimental |
| 12 | felipeoliverai/conformer-paper | PyTorch implementation of the paper: 𝐂𝐨𝐧𝐟𝐨𝐫𝐦𝐞𝐫: 𝐂𝐨𝐧𝐯𝐨𝐥𝐮𝐭𝐢𝐨𝐧-𝐚𝐮𝐠𝐦𝐞𝐧𝐭𝐞𝐝... | | Experimental |