gokhaneraslan/chatterbox-finetuning
Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with smart vocabulary extension. Features offline preprocessing, automatic VAD trimming, and voice cloning capabilities. Train custom TTS models with your own dataset in LJSpeech and file-based format.
The toolkit supports dual architectures—Llama-based Standard mode with grapheme tokenization for language fundamentals, and GPT-2-based Turbo mode that automatically merges a 50K+ token BPE vocabulary with multi-language grapheme sets for accelerated fine-tuning. Preprocessing is mandatory and offline, extracting speaker embeddings and acoustic tokens upfront to maximize training throughput; the pipeline handles automatic resampling to 16kHz training input and generates 24kHz output via vocoder, with integrated VAD support for inference trimming.
Stars
84
Forks
21
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 20, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gokhaneraslan/chatterbox-finetuning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
devnen/Chatterbox-TTS-Server
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible...
daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
jamiepine/voicebox
The open-source voice synthesis studio
Aivis-Project/AivisSpeech-Engine
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to...