Text-to-Speech Frameworks

End-to-end TTS architectures, models, and toolkits for synthesizing speech from text. Includes transformer-based, diffusion-based, and flow-matching approaches with various duration modeling techniques. Does NOT include voice cloning, speech recognition, speech evaluation metrics, or TTS paper collections.

There are 12 text-to-speech frameworks tracked. 1 score above 70 (verified tier). The highest-rated is voicepaw/so-vits-svc-fork at 89/100 with 9,281 stars and 4,743 monthly downloads. 1 of the top 10 are actively maintained.

Get all 12 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=text-to-speech-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 voicepaw/so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

89
Verified
2 ssmall256/mlx-audio-io

Native audio I/O for MLX on macOS and Linux

52
Established
3 ssmall256/mlx-spectro

High-performance STFT/iSTFT for Apple MLX with fused Metal kernels and...

51
Established
4 sarulab-speech/UTMOSv2

UTokyo-SaruLab MOS Prediction System

44
Emerging
5 daniilrobnikov/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for...

28
Experimental
6 MWM-io/SpecTNT-pytorch

Unofficial implementation of SpecTNT in pytorch

26
Experimental
7 nipponjo/arabic-vocalization

Arabic deep-learning based diacritization models (Shakkala, Shakkelha)...

20
Experimental
8 NTIA/alignnet

Train no-reference speech quality estimators with multiple datasets via...

17
Experimental
9 kuntiniong/hk-insta-identifier

Hong Kong Instagram username identification with Romanized Cantonese linguistics

15
Experimental
10 juanjosehr14/YingMusic-SVC

🎤 Transform singing voices effortlessly with YingMusic-SVC, a robust...

15
Experimental
11 mende237/Nda-Nda-Force-Aligner

Forced alignment of Nda‘ Nda’ a Cameroonian language

14
Experimental
12 felipeoliverai/conformer-paper

PyTorch implementation of the paper: 𝐂𝐨𝐧𝐟𝐨𝐫𝐦𝐞𝐫: 𝐂𝐨𝐧𝐯𝐨𝐥𝐮𝐭𝐢𝐨𝐧-𝐚𝐮𝐠𝐦𝐞𝐧𝐭𝐞𝐝...

10
Experimental