daswer123/xtts-webui
Webui for using XTTS and for finetuning it
Integrates XTTSv2 with modular voice processing pipelines supporting RVC, OpenVoice, and Resemble Enhance for post-processing synthesis results. Provides batch audio dubbing with automatic translation while preserving speaker identity, plus fine-tuning capabilities with custom model selection and optimized export. Runs locally on NVIDIA GPUs (6GB+ VRAM) via PyTorch/CUDA, with optional deepspeed acceleration and low-VRAM mode for resource-constrained setups.
877 stars. No commits in the last 6 months.
Stars
877
Forks
168
Language
Python
License
MIT
Category
Last pushed
Jan 17, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/daswer123/xtts-webui"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models
NeonGeckoCom/neon-tts-plugin-coqui
Coqui AI TTS plugin
Atm4x/tts-with-rvc
TTS with RVC-module to generate .wav audios