TTS Model Fine-Tuning Voice AI Tools
Repositories for fine-tuning and training text-to-speech models on custom datasets, including LoRA and full model adaptation. Does NOT include pre-built TTS services, inference-only implementations, or general voice cloning without model training.
There are 52 TTS model fine-tuning tools tracked. The highest-rated is ekwek1/soprano-factory, at 48/100 with 212 stars.
Get all 52 projects as JSON (note that the sample request sets `limit=20`):
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=tts-model-finetuning&limit=20"
Open to everyone: 100 requests/day with no key needed, or 1,000/day with a free key.
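The same request can be made from a script. The following is a minimal sketch using only the Python standard library; it reuses the endpoint and query parameters from the curl command above. The helper names `build_url` and `fetch_projects` are illustrative, the response schema is not described on this page, and whether `limit` accepts values above 20 is an assumption, so the decoded JSON is returned unchanged.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="tts-model-finetuning", limit=20):
    """Build the dataset URL with the same query parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the dataset and decode the JSON body.

    The response structure is not documented here, so the decoded
    object is returned as-is rather than unpacked into fields.
    """
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    data = fetch_projects()
    # Print only the start of the payload to inspect its shape.
    print(json.dumps(data, indent=2)[:500])
```

Unauthenticated calls count against the 100 requests/day allowance, so it may be worth caching the response locally while exploring it.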
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | ekwek1/soprano-factory<br>Soprano-Factory: Train your own 2000x realtime text-to-speech model | | Emerging |
| 2 | TuananhCR/Dia-Finetuning-Vietnamese<br>TTS Dia finetuning for Vietnamese | | Emerging |
| 3 | shhossain/BanglaTTS<br>BanglaTTS is a text-to-speech (TTS) system for Bangla language that works in... | | Emerging |
| 4 | thinhlpg/vixtts-demo<br>A Vietnamese Voice Cloning Text-to-Speech Model ✨ | | Emerging |
| 5 | dangvansam/viet-tts<br>VietTTS: An Open-Source Vietnamese Text to Speech | | Emerging |
| 6 | NTT123/vietTTS<br>Vietnamese Text to Speech library | | Emerging |
| 7 | modelscope/KAN-TTS<br>KAN-TTS is a speech-synthesis training framework, please try the demos we... | | Emerging |
| 8 | OwenTyme/voice-zero<br>Collection of samples suitable for use with zero-shot text to speech engines. | | Emerging |
| 9 | phatjkk/SpeakIt_Vietnamese_TTS<br>Vietnamese Text-to-Speech on Windows Project (zalo-speech) | | Emerging |
| 10 | yrom/finetune-index-tts<br>IndexTTS Fine-tuning notebooks | | Emerging |
| 11 | mozilla-ai/speech-to-text-finetune<br>Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language | | Emerging |
| 12 | Degon3399/XTTS_V2<br>This repository offers a framework for fine-tuning the XTTS_V2 model,... | | Emerging |
| 13 | seanghay/KLEA<br>An open-source Khmer Word to Speech Model. Just single word not sentence! | | Emerging |
| 14 | Jobix-Ai/Iso-Vox<br>STT 90% Solved: Isolate specific speakers from multi-speaker "cocktail... | | Emerging |
| 15 | quangvu3/coqui-xtts<br>Coqui XTTS model with Vietnamese added | | Emerging |
| 16 | mobassir94/comprehensive-bangla-tts<br>Aiming to achieve ultimate Multilingual TTS pipeline with main focus on... | | Emerging |
| 17 | megaease/easevoice-trainer<br>EaseVoice Trainer is a simple and user-friendly voice cloning and speech... | | Emerging |
| 18 | Troyanovsky/awesome-TTS-Colab<br>Collection of awesome TTS and voice cloning models to run with Google Colab | | Emerging |
| 19 | smtiitm/Fastspeech2_MFA<br>Indic TTS for Indian Languages: This is a project on developing... | | Emerging |
| 20 | LEMAS-Project/LEMAS-TTS<br>LEMAS-TTS is a multilingual zero-shot text-to-speech system, supporting 10... | | Emerging |
| 21 | asiff00/Training-TTS<br>Train and fine-tune text-to-speech models for Bengali and many other languages! | | Emerging |
| 22 | gokhaneraslan/XTTS_V2-finetuning<br>Training XTTS V2 and PEFT LORA Text-to-Speech (TTS) | | Emerging |
| 23 | zabir-nabil/bangla-tts<br>Bangla text to speech, Multilingual (Bangla, English) real-time speech... | | Emerging |
| 24 | veralvx/xtts-finetune<br>XTTS fine-tuning via CLI | | Experimental |
| 25 | GitHub30/asr-tts-vietnamese<br>Vietnamese Text-to-Speech API | | Experimental |
| 26 | mrmanna/Nvidia_Nemo_FastPitch_TTS_Example<br>How to Build a High-Quality Text-to-Speech (TTS) System Locally with Nvidia... | | Experimental |
| 27 | Kyubyong/speaker_adapted_tts<br>Making a TTS model with 1 minute of speech samples within 10 minutes | | Experimental |
| 28 | The-Data-Dilemma/Medibeng-Orpheus-3b-0.1-ft-Fine-Tuning<br>Medibeng-Orpheus-3b-0.1-ft- A TTS model for bilingual Bengali-English... | | Experimental |
| 29 | 6Morpheus6/IndicF5<br>High-Quality Text-to-Speech for Indian Languages | | Experimental |
| 30 | Taijul007/VieNeu-TTS<br>🎤 Generate realistic Vietnamese speech with VieNeu-TTS, an advanced... | | Experimental |
| 31 | pilarOG/unit_selection_tts<br>Toy example on how to build a unit selection TTS in Spanish | | Experimental |
| 32 | LEMAS-Project/LEMAS-Project<br>LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with... | | Experimental |
| 33 | dalmoon15/styletts2-dataset-toolkit<br>🎤 Streamline voice cloning with the StyleTTS2 Dataset Toolkit, a... | | Experimental |
| 34 | supevil/SoulX-Singer-Eval<br>🎤 Evaluate zero-shot Singing Voice Synthesis systems for quality, accuracy,... | | Experimental |
| 35 | Bangla-Language-Processing/Katha-Bangla-TTS<br>The first Bangla Text To Speech System for Bangladeshi Bangla (Katha) | | Experimental |
| 36 | deuxksy/today-vn-news<br>Automated Vietnamese news generation pipeline (TTS, FFmpeg, Hardware Acceleration) | | Experimental |
| 37 | 2tocom/F5-TTS-Vietnamese-Google-Colab<br>Vietnamese TTS, convert text to Vietnamese speech, text to speech... | | Experimental |
| 38 | iconclub/zalo-tts<br>Zalo Text-To-Speech for Python | | Experimental |
| 39 | HoseinAzad/SpeechT5-Non-English-TTS<br>Fine-tune SpeechT5 for non-English text-to-speech task, implemented in PyTorch. | | Experimental |
| 40 | harshanavkis/Hindi-TTS<br>Text to Speech system for Hindi language | | Experimental |
| 41 | ducnt18121997/Viet-Transformer-TTS<br>This is PyTorch Implementation of A Non-Autoregressive Transformer with... | | Experimental |
| 42 | ilya16/isp-tts<br>A simple TTS model developed for the Speech Synthesis and Voice Cloning... | | Experimental |
| 43 | babadue/seamless-m4t-v2-large-demo<br>Demonstration features of seamless-m4t-v2-large model | | Experimental |
| 44 | NhanPhamThanh-IT/Vietnamese-Voice-Search-Engine<br>🔎 Vietnamese Voice Search Engine - Vietnamese news search app with voice... | | Experimental |
| 45 | Salama1429/Text-to-speech_TTS_Model_Training<br>Training Text to speech model for German Language | | Experimental |
| 46 | HQQHQ/FinetuneSpeechT5-Spanish<br>This repository hosts the code and resources for fine-tuning a SpeechT5... | | Experimental |
| 47 | leanhtech/TextToSpeech_EN_VN<br>Text To Speech project (Operating Systems course, PTITHCM) | | Experimental |
| 48 | lukaszliniewicz/easy_xtts_trainer<br>A command line utility to easily finetune XTTS models in a fully automated... | | Experimental |
| 49 | CherokeeLanguage/IMS-Toucan<br>Cherokee Language TTS | | Experimental |
| 50 | usamireko/StableTTS-Training-Colab<br>A notebook created for training StableTTS models in Google Colab easily! | | Experimental |
| 51 | gas/pronunza-tts-galego-onnx-colab<br>Colab notebook for speech synthesis (TTS) in Galician using the Celtia ONNX model | | Experimental |
| 52 | NoerNova/IMS-Toucan-Shan<br>Fork of IMS-Toucan for fine-tuning on the Shan language | | Experimental |