gokhaneraslan/chatterbox-finetuning

Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with smart vocabulary extension. Features offline preprocessing, automatic VAD trimming, and voice cloning capabilities. Train custom TTS models with your own dataset in LJSpeech and file-based format.

51
/ 100
Established

The toolkit supports dual architectures—Llama-based Standard mode with grapheme tokenization for language fundamentals, and GPT-2-based Turbo mode that automatically merges a 50K+ token BPE vocabulary with multi-language grapheme sets for accelerated fine-tuning. Preprocessing is mandatory and offline, extracting speaker embeddings and acoustic tokens upfront to maximize training throughput; the pipeline handles automatic resampling to 16kHz training input and generates 24kHz output via vocoder, with integrated VAD support for inference trimming.

No Package No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 13 / 25
Community 19 / 25

How are scores calculated?

Stars

84

Forks

21

Language

Python

License

Apache-2.0

Last pushed

Feb 20, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gokhaneraslan/chatterbox-finetuning"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.