Text-to-Speech Frameworks: Voice AI Tools

This category tracks 62 text-to-speech framework tools. Ten score 50 or higher (Established tier). The highest-rated is coqui-ai/TTS at 69/100, with 44,801 stars and 214,937 monthly downloads.

Get all 62 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=text-to-speech-frameworks&limit=20"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000 requests/day.
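The curl command above can also be issued from a script. The following is a minimal Python sketch using only the standard library; the endpoint and the `domain`, `subcategory`, and `limit` parameters are copied from that command, while the helper names `build_url` and `fetch_projects` are hypothetical. The shape of the JSON response is not documented here, so the sketch decodes the body without assuming any field names, and it does not assume that a different `limit` value returns all 62 projects.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example; everything else is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="text-to-speech-frameworks", limit=20):
    """Build the dataset URL with the same query parameters as the curl command."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """GET the URL and decode the JSON body (the response schema is not assumed)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example (performs a network request, counts against the daily quota):
# data = fetch_projects(build_url())
# print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the request makes it possible to check the query string offline before spending any of the 100 keyless requests per day.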

# Tool Score Tier
1 coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research...

69
Established
2 yeyupiaoling/MASR

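A PyTorch implementation of a streaming and non-streaming automatic speech recognition framework, compatible with both online and offline recognition; currently supports Conformer, Squeezeformer, DeepSpeech2...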

63
Established
3 netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

59
Established
4 shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

55
Established
5 keithito/tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with...

51
Established
6 DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

51
Established
7 gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

51
Established
8 jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for...

50
Established
9 svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

50
Established
10 shivammehta25/Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

50
Established
11 metavoiceio/metavoice-src

Foundational model for human-like, expressive TTS

49
Emerging
12 mozilla/TTS

🤖💬 Deep learning for Text to Speech (Discussion...

48
Emerging
13 gooofy/zerovox

zero-shot realtime TTS system, fully offline, free and open source

45
Emerging
14 r9y9/deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech...

44
Emerging
15 spring-media/TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...

44
Emerging
16 xcmyz/FastSpeech

The Implementation of FastSpeech based on pytorch.

44
Emerging
17 soobinseo/Transformer-TTS

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

44
Emerging
18 jaywalnut310/glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

44
Emerging
19 jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity...

44
Emerging
20 descriptinc/melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

44
Emerging
21 jackaduma/CycleGAN-VC2

Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC2

43
Emerging
22 yl4579/StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for...

43
Emerging
23 NevilPatel01/RVC-WebUI-MacOS

Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs...

43
Emerging
24 israelg99/deepvoice

Deep Voice: Real-time Neural Text-to-Speech

43
Emerging
25 google/tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end...

42
Emerging
26 p0p4k/vits2_pytorch

unofficial vits2-TTS implementation in pytorch

42
Emerging
27 tugstugi/pytorch-dc-tts

Text to Speech with PyTorch (English and Mongolian)

42
Emerging
28 jpuigcerver/Laia

Laia: A deep learning toolkit for HTR based on Torch

41
Emerging
29 LEEYOONHYUNG/BVAE-TTS

Official implementation of BVAE-TTS

40
Emerging
30 yl4579/StyleTTS

Official Implementation of StyleTTS

39
Emerging
31 ishandutta2007/Awesome-Text-to-Speech

🎤 A curated list of the latest and most influential tools, models, and...

39
Emerging
32 pritishyuvraj/Voice-Conversion-GAN

Voice Conversion using CycleGANs for Non-Parallel Data

39
Emerging
33 nnsvs/nnsvs

Neural network-based singing voice synthesis library for research

38
Emerging
34 nipponjo/tts-arabic-pytorch

🎙️ Arabic TTS models (Tacotron2, FastPitch)

38
Emerging
35 maum-ai/univnet

Unofficial PyTorch Implementation of UnivNet Vocoder...

38
Emerging
36 spring-media/DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

38
Emerging
37 daniilrobnikov/vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with...

38
Emerging
38 persephone-tools/persephone

A tool for automatic phoneme transcription

37
Emerging
39 keonlee9420/Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family...

37
Emerging
40 coqui-ai/TTS-papers

🐸 collection of TTS papers

37
Emerging
41 r9y9/ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

37
Emerging
42 maum-ai/assem-vc

Official Code for Assem-VC @ICASSP2022

37
Emerging
43 yl4579/StyleTTS-VC

Official Implementation of StyleTTS-VC

36
Emerging
44 karim23657/Persian-tts-coqui

Persian/Farsi text-to-speech (TTS) training using Coqui TTS

36
Emerging
45 p0p4k/pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

36
Emerging
46 keonlee9420/Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning...

35
Emerging
47 hhguo/MSMC-TTS

Official Implementation of Multi-Stage Multi-Codebook (MSMC) TTS

34
Emerging
48 huckiyang/Voice2Series-Reprogramming

ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time...

34
Emerging
49 yl4579/HiFTNet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter...

34
Emerging
50 sophiefy/StellaVoiceChanger

Deep-learning-based voice changer, supporting local inference.

33
Emerging
51 double22a/asr_nlp_paper_code

Papers of ASR, Tools of ASR

33
Emerging
52 SungFeng-Huang/Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More...

32
Emerging
53 alessandroragano/scoreq

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

32
Emerging
54 binzhouchn/masr

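A Chinese speech recognition series; readers can use it to quickly train their own Chinese speech recognition models, or test results directly with the pretrained models.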

30
Emerging
55 HuuHuy227/XphoneBert_Vits2

VITS2 extended with XPhoneBERT encoder

28
Experimental
56 jreremy/conformer

PyTorch implementation of conformer with training script for end-to-end...

28
Experimental
57 ShawnPi233/HQ-SVC

Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice...

28
Experimental
58 keonlee9420/Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a...

27
Experimental
59 nafiuny/ICRCycleGAN-VC

Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and...

26
Experimental
60 zmeet-ai/tts-demo

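Supports male and female voices with a range of emotions; supports real-time and offline text-to-speech synthesis; supports voice changing for a single model voice, speech rate adjustment, and volume adjustment; supports custom voice models.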

24
Experimental
61 sil-ai/tts-singlish

TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.

20
Experimental
62 MahdeenSky/SoftVC-VITS-MusicSingerChanger

Google Colab for testing SoftVC VITS Singing Voice Conversion for AI...

12
Experimental
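
The tier labels in the list above appear to follow the score. The Python sketch below maps a score to a tier using cutoffs inferred from this listing only: every tool scoring 50 or more is labeled Established, scores from 30 to 49 are labeled Emerging, and scores of 28 or less are labeled Experimental. No tool in the list scores 29, so placing that value in Experimental is an assumption, and the function name `tier_for_score` is hypothetical rather than part of any published API.

```python
# Cutoffs inferred from the listing; they are not documented thresholds.
ESTABLISHED_MIN = 50
EMERGING_MIN = 30  # assumption: a score of 29 would fall in Experimental


def tier_for_score(score):
    """Return the tier label that the listing appears to assign to a score."""
    if score >= ESTABLISHED_MIN:
        return "Established"
    if score >= EMERGING_MIN:
        return "Emerging"
    return "Experimental"


# A few (tool, score) pairs copied from the list above.
sample = {
    "coqui-ai/TTS": 69,
    "shivammehta25/Neural-HMM": 50,
    "metavoiceio/metavoice-src": 49,
    "binzhouchn/masr": 30,
    "HuuHuy227/XphoneBert_Vits2": 28,
}

for tool, score in sample.items():
    print(f"{tool}: {score} -> {tier_for_score(score)}")
```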