FastSpeech TTS Models Voice AI Tools
PyTorch implementations and variants of FastSpeech and FastSpeech2 architectures for neural text-to-speech synthesis. Does NOT include other TTS architectures (Transformer-TTS, Glow-TTS), vocoder implementations, or non-FastSpeech based speech synthesis models.
74 FastSpeech TTS model tools are tracked. Two score above 50 (Established tier). The highest-rated is TensorSpeech/TensorFlowTTS at 66/100, with 3,995 stars and 328 monthly downloads.
Get all 74 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=fastspeech-tts-models&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
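The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_url` and `fetch_projects` are hypothetical, the query parameters are copied from the curl command above, and the response schema is not documented here, so the parsed JSON is returned as-is without assuming any field names. Whether `limit` accepts values above 20 is also an assumption.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="fastspeech-tts-models", limit=20):
    """Build the dataset URL with the same query parameters as the curl call."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the dataset and return the parsed JSON (hypothetical helper).

    No API key is sent; the page states that keyless access is allowed
    at 100 requests/day. The shape of the returned JSON is not assumed.
    """
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one request against the daily quota; inspect the returned object before relying on any particular keys.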
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | TensorSpeech/TensorFlowTTS<br>😝 TensorFlowTTS: Real-Time State-of-the-art... | | Established |
| 2 | lucasnewman/nanospeech<br>A simple, hackable text-to-speech system in PyTorch and MLX | | Established |
| 3 | Tomiinek/Multilingual_Text_to_Speech<br>An implementation of Tacotron 2 that supports multilingual experiments with... | | Emerging |
| 4 | yl4579/PL-BERT<br>Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions | | Emerging |
| 5 | rishikksh20/FastSpeech2<br>PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End... | | Emerging |
| 6 | keonlee9420/STYLER<br>Official repository of STYLER: Style Factor Modeling with Rapidity and... | | Emerging |
| 7 | jxzhanggg/nonparaSeq2seqVC_code<br>Implementation code of non-parallel sequence-to-sequence VC | | Emerging |
| 8 | atomicoo/tacotron2-mandarin<br>Tensorflow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on... | | Emerging |
| 9 | rishikksh20/AdaSpeech<br>AdaSpeech: Adaptive Text to Speech for Custom Voice | | Emerging |
| 10 | saiteja-talluri/Speech2Face<br>Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face... | | Emerging |
| 11 | roatienza/efficientspeech<br>PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. | | Emerging |
| 12 | ORI-Muchim/Efficient-Speech<br>Lightweight Korean TTS Model based on FastSpeech2 | | Emerging |
| 13 | keonlee9420/Expressive-FastSpeech2<br>PyTorch Implementation of Non-autoregressive Expressive (emotional,... | | Emerging |
| 14 | neosapience/mlp-singer<br>Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing... | | Emerging |
| 15 | atomicoo/FCH-TTS<br>A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese,... | | Emerging |
| 16 | KevinMIN95/StyleSpeech<br>Official implementation of Meta-StyleSpeech and StyleSpeech | | Emerging |
| 17 | NATSpeech/NATSpeech<br>A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official... | | Emerging |
| 18 | CSTR-Edinburgh/magphase<br>MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications. | | Emerging |
| 19 | caizexin/tf_multispeakerTTS_fc<br>The Tensorflow version of multi-speaker TTS training with feedback constraint | | Emerging |
| 20 | Rongjiehuang/GenerSpeech<br>PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model... | | Emerging |
| 21 | keonlee9420/FastPitchFormant<br>PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based... | | Emerging |
| 22 | neosapience/editts<br>Official implementation of EdiTTS: Score-based Editing for Controllable... | | Emerging |
| 23 | ranchlai/mandarin-tts<br>Chinese Mandarin TTS text-to-speech (Chinese (Mandarin) speech synthesis), by FastSpeech 2,... | | Emerging |
| 24 | keonlee9420/StyleSpeech<br>PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive... | | Emerging |
| 25 | Rongjiehuang/Multi-Singer<br>PyTorch Implementation of Multi-Singer (ACM-MM'21) | | Emerging |
| 26 | mush42/optispeech<br>A lightweight end-to-end text-to-speech model | | Emerging |
| 27 | keonlee9420/Daft-Exprt<br>PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across... | | Emerging |
| 28 | hwRG/End-to-End-TTS-Fine-Tune<br>Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis. | | Emerging |
| 29 | lucasnewman/vocos-mlx<br>Implementation of 'Vocos: Closing the gap between time-domain and... | | Emerging |
| 30 | Labmem-Zhouyx/CDFSE_FastSpeech2<br>The Official Implementation of “Content-Dependent Fine-Grained Speaker... | | Emerging |
| 31 | yui-mhcp/text_to_speech<br>(Multi Speaker) Text-To-Speech (TTS) project | | Emerging |
| 32 | Executedone/Chinese-FastSpeech2<br>Continued training on the Biaobei (Data-Baker) dataset, with improvements to the original FastSpeech2 model: adds prosody representation and prosody prediction modules to make Chinese pronunciation more vivid and rhythmic | | Emerging |
| 33 | OpenTSLab/BELLE<br>Official implementation of BELLE "Bayesian Speech Synthesizers Can Learn... | | Emerging |
| 34 | gmltmd789/UnitSpeech<br>An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis... | | Emerging |
| 35 | lars76/fastspeech2-clean<br>Clean and modernized implementation of FastSpeech2/LightSpeech using IPA | | Emerging |
| 36 | andi611/ZeroSpeech-TTS-without-T<br>A Pytorch implementation for the ZeroSpeech 2019 challenge. | | Emerging |
| 37 | msalhab96/MultiSpeech<br>PyTorch implementation for MultiSpeech: Multi-Speaker Text to Speech with... | | Emerging |
| 38 | tuanh123789/AdaSpeech<br>An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for... | | Emerging |
| 39 | Adibian/ResGrad<br>Unofficial implementation of ResGrad: Residual Denoising Diffusion... | | Emerging |
| 40 | rishikksh20/LightSpeech<br>LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search | | Experimental |
| 41 | ga642381/FastSpeech2<br>Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to... | | Experimental |
| 42 | adasegroup/OSM-one-shot-multispeaker<br>Framework for one-shot multispeaker system based on Deep Learning | | Experimental |
| 43 | ShivamRajSharma/Transformer-Text-To-Speech<br>Pytorch implementation of Transformer-TTS for converting text into speech. | | Experimental |
| 44 | akashmjn/cs224n-gpu-that-talks<br>Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18) | | Experimental |
| 45 | deepkyu/ml-talking-face<br>Cloned repository from Hugging Face Spaces (CVPR 2022 Demo) | | Experimental |
| 46 | lucasnewman/e2-tts-mlx<br>Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive... | | Experimental |
| 47 | hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker<br>Multi-Speaker FastSpeech2 applicable to Korean. Description about train and... | | Experimental |
| 48 | eazhary/dctts2<br>Deep Convolution Text to Speech | | Experimental |
| 49 | revsic/tf-glow-tts<br>Tensorflow implementation of Glow-TTS | | Experimental |
| 50 | revsic/tf-mlptts<br>Tensorflow implementation of MLP-Mixer based TTS | | Experimental |
| 51 | yanghaha0908/FastHuBERT<br>Official implementation for Fast-HuBERT: An Efficient Training Framework for... | | Experimental |
| 52 | mush42/leanspeech<br>Unofficial pytorch implementation of LeanSpeech: The Microsoft Lightweight... | | Experimental |
| 53 | AppleHolic/FastSpeech2<br>Refactored version of https://github.com/ming024/FastSpeech2 | | Experimental |
| 54 | dacson/Demo-of-Text-to-Speech-based-on-Deep-Learning<br>Text to speech for Mandarin | | Experimental |
| 55 | erogol/ddc-samples<br>🐸💬 Coqui TTS Double Decoder Consistency samples | | Experimental |
| 56 | X-LANCE/VoiceFlow-TTS<br>[ICASSP 2024] This is the official code for "VoiceFlow: Efficient... | | Experimental |
| 57 | xcmyz/FastSpeech2<br>The Implementation of FastSpeech2 Based on Pytorch. | | Experimental |
| 58 | WWWWxp/M3-TTS<br>Pytorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment &... | | Experimental |
| 59 | QinHsiu/BiCLTTS<br>Bi-level Contrastive Learning for Text-to-Speech | | Experimental |
| 60 | X-LANCE/UniCATS-CTX-txt2vec<br>[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS | | Experimental |
| 61 | aqibahmad/speech2face<br>A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE... | | Experimental |
| 62 | monatis/german-tts<br>German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support | | Experimental |
| 63 | ssmlkl/MnTTS2<br>This is the experimental description of MnTTS2. | | Experimental |
| 64 | carankt/FastSpeech2<br>Implementation of FastSpeech 2 | | Experimental |
| 65 | ssumin6/Korean-TTS-Server<br>Korean text-to-speech | | Experimental |
| 66 | erogol/TTS_tf<br>WIP Tensorflow implementation of https://github.com/mozilla/TTS | | Experimental |
| 67 | clarenceluo78/singer-adaptive-svc<br>This repository is the implementation of project Converting to Realistic... | | Experimental |
| 68 | Orca0917/TransformerTTS<br>Unofficial PyTorch implementation of Transformer-TTS, a Transformer-based... | | Experimental |
| 69 | kowaalczyk/reformer-tts<br>An adaptation of Reformer: The Efficient Transformer for text-to-speech task. | | Experimental |
| 70 | keonlee9420/Deep-Learning-TTS-Template<br>This is a template for the Non-autoregressive Deep Learning-Based TTS model... | | Experimental |
| 71 | gateoneh92/Flow-Matching-TTS<br>⚡ Non-autoregressive TTS using Conditional Flow Matching - 5-20x faster than... | | Experimental |
| 72 | asiff00/TTS-Training-Blueprint<br>Intuitive understanding of Autoregressive TTS Models | | Experimental |
| 73 | davidalvarezdlt/samplernn_pase<br>Implementation of the paper "Problem-agnostic speech embeddings for... | | Experimental |
| 74 | zabir-nabil/fast-wavenet-mel2wav<br>Dummy Implementation, Will update later | | Experimental |