Text To Speech Frameworks Voice AI Tools
There are 62 text to speech frameworks tools tracked. 10 score above 50 (established tier). The highest-rated is coqui-ai/TTS at 69/100 with 44,801 stars and 214,937 monthly downloads.
Get all 62 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=text-to-speech-frameworks&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research... |
|
Established |
| 2 |
yeyupiaoling/MASR
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2... |
|
Established |
| 3 |
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
|
Established |
| 4 |
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching |
|
Established |
| 5 |
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with... |
|
Established |
| 6 |
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages! |
|
Established |
| 7 |
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment |
|
Established |
| 8 |
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for... |
|
Established |
| 9 |
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion |
|
Established |
| 10 |
shivammehta25/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS) |
|
Established |
| 11 |
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS |
|
Emerging |
| 12 |
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion... |
|
Emerging |
| 13 |
gooofy/zerovox
zero-shot realtime TTS system, fully offline, free and open source |
|
Emerging |
| 14 |
r9y9/deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech... |
|
Emerging |
| 15 |
spring-media/TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based... |
|
Emerging |
| 16 |
xcmyz/FastSpeech
The Implementation of FastSpeech based on pytorch. |
|
Emerging |
| 17 |
soobinseo/Transformer-TTS
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network" |
|
Emerging |
| 18 |
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search |
|
Emerging |
| 19 |
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity... |
|
Emerging |
| 20 |
descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis |
|
Emerging |
| 21 |
jackaduma/CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2 |
|
Emerging |
| 22 |
yl4579/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for... |
|
Emerging |
| 23 |
NevilPatel01/RVC-WebUI-MacOS
Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs... |
|
Emerging |
| 24 |
israelg99/deepvoice
Deep Voice: Real-time Neural Text-to-Speech |
|
Emerging |
| 25 |
google/tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end... |
|
Emerging |
| 26 |
p0p4k/vits2_pytorch
unofficial vits2-TTS implementation in pytorch |
|
Emerging |
| 27 |
tugstugi/pytorch-dc-tts
Text to Speech with PyTorch (English and Mongolian) |
|
Emerging |
| 28 |
jpuigcerver/Laia
Laia: A deep learning toolkit for HTR based on Torch |
|
Emerging |
| 29 |
LEEYOONHYUNG/BVAE-TTS
Official implementation of BVAE-TTS |
|
Emerging |
| 30 |
yl4579/StyleTTS
Official Implementation of StyleTTS |
|
Emerging |
| 31 |
ishandutta2007/Awesome-Text-to-Speech
🎤 A curated list of the latest and most influential tools, models, and... |
|
Emerging |
| 32 |
pritishyuvraj/Voice-Conversion-GAN
Voice Conversion using Cycle GAN's For Non-Parallel Data |
|
Emerging |
| 33 |
nnsvs/nnsvs
Neural network-based singing voice synthesis library for research |
|
Emerging |
| 34 |
nipponjo/tts-arabic-pytorch
🎙️ Arabic TTS models (Tacotron2, FastPitch) |
|
Emerging |
| 35 |
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder... |
|
Emerging |
| 36 |
spring-media/DeepPhonemizer
Grapheme to phoneme conversion with deep learning. |
|
Emerging |
| 37 |
daniilrobnikov/vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with... |
|
Emerging |
| 38 |
persephone-tools/persephone
A tool for automatic phoneme transcription |
|
Emerging |
| 39 |
keonlee9420/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family... |
|
Emerging |
| 40 |
coqui-ai/TTS-papers
🐸 collection of TTS papers |
|
Emerging |
| 41 |
r9y9/ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python) |
|
Emerging |
| 42 |
maum-ai/assem-vc
Official Code for Assem-VC @ICASSP2022 |
|
Emerging |
| 43 |
yl4579/StyleTTS-VC
Official Implementation of StyleTTS-VC |
|
Emerging |
| 44 |
karim23657/Persian-tts-coqui
Persian/Farsi text to speech(TTS) training using coqui tts |
|
Emerging |
| 45 |
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper |
|
Emerging |
| 46 |
keonlee9420/Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning... |
|
Emerging |
| 47 |
hhguo/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS |
|
Emerging |
| 48 |
huckiyang/Voice2Series-Reprogramming
ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time... |
|
Emerging |
| 49 |
yl4579/HiFTNet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter... |
|
Emerging |
| 50 |
sophiefy/StellaVoiceChanger
Deep-learning-based voice changer, supporting local inference. |
|
Emerging |
| 51 |
double22a/asr_nlp_paper_code
Papers of ASR, Tools of ASR |
|
Emerging |
| 52 |
SungFeng-Huang/Meta-TTS
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More... |
|
Emerging |
| 53 |
alessandroragano/scoreq
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024) |
|
Emerging |
| 54 |
binzhouchn/masr
中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。 |
|
Emerging |
| 55 |
HuuHuy227/XphoneBert_Vits2
VITS2 extended with XPhoneBERT encoder |
|
Experimental |
| 56 |
jreremy/conformer
Pytorch implementation of conformer with with training script for end-to-end... |
|
Experimental |
| 57 |
ShawnPi233/HQ-SVC
Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice... |
|
Experimental |
| 58 |
keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a... |
|
Experimental |
| 59 |
nafiuny/ICRCycleGAN-VC
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and... |
|
Experimental |
| 60 |
zmeet-ai/tts-demo
支持各种感情的男女声音,支持实时和离线文本合成tts语音;支持单模特声音变声,语音速率调整,语音音量大小调整;支持自定义语音模型。 |
|
Experimental |
| 61 |
sil-ai/tts-singlish
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm. |
|
Experimental |
| 62 |
MahdeenSky/SoftVC-VITS-MusicSingerChanger
Google collab for testing SoftVC VITS Singing Voice Conversion for AI... |
|
Experimental |