Text To Speech Frameworks Voice AI Tools

There are 62 text to speech frameworks tools tracked. 10 score above 50 (established tier). The highest-rated is coqui-ai/TTS at 69/100 with 44,801 stars and 214,937 monthly downloads.

Get all 62 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=text-to-speech-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Tool	Score	Tier	Stars	Language
1	coqui-ai/TTS 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research...	69	Established	44,801	Python
2	yeyupiaoling/MASR Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2...	63	Established	724	Python
3	netease-youdao/EmotiVoice EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine	59	Established	8,455	Python
4	shivammehta25/Matcha-TTS [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching	55	Established	1,259	Jupyter Notebook
5	keithito/tacotron A TensorFlow implementation of Google's Tacotron speech synthesis with...	51	Established	2,988	Python
6	DigitalPhonetics/IMS-Toucan Controllable and fast Text-to-Speech for over 7000 languages!	51	Established	2,190	Python
7	gabrielmittag/NISQA NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment	51	Established	917	Python
8	jaywalnut310/vits VITS: Conditional Variational Autoencoder with Adversarial Learning for...	50	Established	7,837	Python
9	svc-develop-team/so-vits-svc SoftVC VITS Singing Voice Conversion	50	Established	28,008	Python
10	shivammehta25/Neural-HMM Neural HMMs are all you need (for high-quality attention-free TTS)	50	Established	164	Jupyter Notebook
11	metavoiceio/metavoice-src Foundational model for human-like, expressive TTS	49	Emerging	4,201	Python
12	mozilla/TTS :robot: :speech_balloon: Deep learning for Text to Speech (Discussion...	48	Emerging	10,123	Jupyter Notebook
13	gooofy/zerovox zero-shot realtime TTS system, fully offline, free and open source	45	Emerging	51	Python
14	r9y9/deepvoice3_pytorch PyTorch implementation of convolutional neural networks-based text-to-speech...	44	Emerging	1,982	Python
15	spring-media/TransformerTTS 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...	44	Emerging	1,161	Python
16	xcmyz/FastSpeech The Implementation of FastSpeech based on pytorch.	44	Emerging	880	Python
17	soobinseo/Transformer-TTS A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"	44	Emerging	690	Python
18	jaywalnut310/glow-tts A Generative Flow for Text-to-Speech via Monotonic Alignment Search	44	Emerging	704	Python
19	jik876/hifi-gan HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity...	44	Emerging	2,328	Python
20	descriptinc/melgan-neurips GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis	44	Emerging	1,037	Python
21	jackaduma/CycleGAN-VC2 Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2	43	Emerging	571	Python
22	yl4579/StarGANv2-VC StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for...	43	Emerging	518	Python
23	NevilPatel01/RVC-WebUI-MacOS Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs...	43	Emerging	31	Python
24	israelg99/deepvoice Deep Voice: Real-time Neural Text-to-Speech	43	Emerging	364	Python
25	google/tacotron Audio samples accompanying publications related to Tacotron, an end-to-end...	42	Emerging	539	HTML
26	p0p4k/vits2_pytorch unofficial vits2-TTS implementation in pytorch	42	Emerging	547	Python
27	tugstugi/pytorch-dc-tts Text to Speech with PyTorch (English and Mongolian)	42	Emerging	187	Jupyter Notebook
28	jpuigcerver/Laia Laia: A deep learning toolkit for HTR based on Torch	41	Emerging	151	Shell
29	LEEYOONHYUNG/BVAE-TTS Official implementation of BVAE-TTS	40	Emerging	173	Python
30	yl4579/StyleTTS Official Implementation of StyleTTS	39	Emerging	462	Python
31	ishandutta2007/Awesome-Text-to-Speech 🎤 A curated list of the latest and most influential tools, models, and...	39	Emerging	95	—
32	pritishyuvraj/Voice-Conversion-GAN Voice Conversion using Cycle GAN's For Non-Parallel Data	39	Emerging	125	Jupyter Notebook
33	nnsvs/nnsvs Neural network-based singing voice synthesis library for research	38	Emerging	742	Python
34	nipponjo/tts-arabic-pytorch 🎙️ Arabic TTS models (Tacotron2, FastPitch)	38	Emerging	137	Jupyter Notebook
35	maum-ai/univnet Unofficial PyTorch Implementation of UnivNet Vocoder...	38	Emerging	282	Python
36	spring-media/DeepPhonemizer Grapheme to phoneme conversion with deep learning.	38	Emerging	421	Python
37	daniilrobnikov/vits2 VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with...	38	Emerging	634	Jupyter Notebook
38	persephone-tools/persephone A tool for automatic phoneme transcription	37	Emerging	159	Python
39	keonlee9420/Comprehensive-Transformer-TTS A Non-Autoregressive Transformer based Text-to-Speech, supporting a family...	37	Emerging	328	Python
40	coqui-ai/TTS-papers 🐸 collection of TTS papers	37	Emerging	723	—
41	r9y9/ttslearn ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)	37	Emerging	267	Jupyter Notebook
42	maum-ai/assem-vc Official Code for Assem-VC @ICASSP2022	37	Emerging	269	Jupyter Notebook
43	yl4579/StyleTTS-VC Official Implementation of StyleTTS-VC	36	Emerging	197	Python
44	karim23657/Persian-tts-coqui Persian/Farsi text to speech(TTS) training using coqui tts	36	Emerging	199	Jupyter Notebook
45	p0p4k/pflowtts_pytorch Unofficial implementation of NVIDIA P-Flow TTS paper	36	Emerging	230	Python
46	keonlee9420/Comprehensive-Tacotron2 PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning...	35	Emerging	48	Python
47	hhguo/MSMC-TTS Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS	34	Emerging	169	Python
48	huckiyang/Voice2Series-Reprogramming ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time...	34	Emerging	73	TypeScript
49	yl4579/HiFTNet HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter...	34	Emerging	247	Python
50	sophiefy/StellaVoiceChanger Deep-learning-based voice changer, supporting local inference.	33	Emerging	96	Python
51	double22a/asr_nlp_paper_code Papers of ASR, Tools of ASR	33	Emerging	41	—
52	SungFeng-Huang/Meta-TTS Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More...	32	Emerging	194	Python
53	alessandroragano/scoreq SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)	32	Emerging	108	Python
54	binzhouchn/masr 中文语音识别系列，读者可以借助它快速训练属于自己的中文语音识别模型，或直接使用预训练模型测试效果。	30	Emerging	285	Python
55	HuuHuy227/XphoneBert_Vits2 VITS2 extended with XPhoneBERT encoder	28	Experimental	10	Python
56	jreremy/conformer Pytorch implementation of conformer with with training script for end-to-end...	28	Experimental	28	Python
57	ShawnPi233/HQ-SVC Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice...	28	Experimental	91	Python
58	keonlee9420/Comprehensive-E2E-TTS A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a...	27	Experimental	146	Python
59	nafiuny/ICRCycleGAN-VC Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and...	26	Experimental	15	Python
60	zmeet-ai/tts-demo 支持各种感情的男女声音，支持实时和离线文本合成tts语音；支持单模特声音变声，语音速率调整，语音音量大小调整；支持自定义语音模型。	24	Experimental	70	Java
61	sil-ai/tts-singlish TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.	20	Experimental	11	Python
62	MahdeenSky/SoftVC-VITS-MusicSingerChanger Google collab for testing SoftVC VITS Singing Voice Conversion for AI...	12	Experimental	13	Jupyter Notebook

Comparisons in this category

TTS and glow-tts (69 vs 44) Transformer-TTS and TransformerTTS (44 vs 44)