thinhlpg/vixtts-demo
A Vietnamese Voice Cloning Text-to-Speech Model ✨
Fine-tuned from XTTS-v2.0.3 on the viVoice dataset, this model supports multilingual voice cloning and is optimized for Vietnamese. It runs locally through a Gradio UI on Ubuntu/WSL2 with GPU acceleration (4GB+ VRAM recommended), or directly through a Hugging Face Space with no installation. It integrates Vinorm and Underthesea for Vietnamese text normalization, DeepFilterNet for noise removal, and DeepSpeed for accelerated inference.
509 stars. No commits in the last 6 months.
Stars: 509
Forks: 204
Language: Jupyter Notebook
License: MPL-2.0
Category:
Last pushed: Apr 04, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/thinhlpg/vixtts-demo"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
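The same request can be made from a script. The following is a minimal Python sketch of the keyless call shown in the curl example, using only the standard library. Several things here are assumptions rather than documented behavior: the function names `build_quality_url` and `fetch_quality` are hypothetical, the path segment `voice-ai` is treated as a category slug only because of its position in the URL, and the response body is assumed to be JSON. How a key is passed for the higher limit is not stated here, so the sketch does not attempt it.

```python
import json
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble a URL with the same shape as the curl example.

    Treating the first path segment as a "category" is an assumption
    based on the example URL, not on API documentation.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and decode the body.

    Assumes the endpoint returns JSON; json.loads raises ValueError
    if it does not.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, counted against the daily limit):
#   data = fetch_quality("voice-ai", "thinhlpg", "vixtts-demo")
#   print(json.dumps(data, indent=2, ensure_ascii=False))
```

Because the keyless tier allows 100 requests/day, a script that polls many repositories may want to cache responses locally rather than refetch them.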
Related tools
TuananhCR/Dia-Finetuning-Vietnamese
TTS Dia finetuning for Vietnamese
dangvansam/viet-tts
VietTTS: An Open-Source Vietnamese Text to Speech
NTT123/vietTTS
Vietnamese Text to Speech library
ekwek1/soprano-factory
Soprano-Factory: Train your own 2000x realtime text-to-speech model
modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at ...