RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

56
/ 100
Established

Combines GPT language modeling with SoVITS vocoding to enable zero-shot TTS from 5-second samples and cross-lingual inference across English, Japanese, Korean, Cantonese, and Chinese. The WebUI integrates voice separation, automatic dataset segmentation, and ASR labeling to streamline training data preparation, achieving real-time inference speeds (RTF 0.028 on RTX 4060Ti) with minimal compute requirements.

55,896 stars.

No Package No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

55,896

Forks

6,104

Language

Python

License

MIT

Last pushed

Feb 09, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/RVC-Boss/GPT-SoVITS"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.