BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

48
/ 100
Emerging

Built on Coqui's XTTS-v2 multilingual model, this project provides both a web UI (Streamlit) and terminal interface for voice cloning across 16 languages with integrated recording and file upload capabilities. The architecture supports GPU acceleration via PyTorch CUDA and automatically downloads pretrained models on first run, with the cloning process requiring only a 10-second 24kHz WAV reference sample to generate speech in the target voice and language.

391 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

391

Forks

67

Language

Python

License

MIT

Last pushed

Dec 06, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/BoltzmannEntropy/xtts2-ui"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.