modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
Built on a two-stage architecture combining SAM-BERT for linguistic feature extraction and HiFi-GAN for neural vocoding, KAN-TTS enables end-to-end trainable speech synthesis across 10+ languages including Mandarin variants and European languages. The framework integrates with ModelScope for model hosting and provides complete training pipelines from data preparation through inference, supporting multilingual and multi-speaker TTS customization.
526 stars. No commits in the last 6 months.
Stars
526
Forks
88
Language
Python
License
MIT
Category
Last pushed
Dec 28, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/modelscope/KAN-TTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TuananhCR/Dia-Finetuning-Vietnamese
TTS Dia finetuning for Vietnamese
thinhlpg/vixtts-demo
A Vietnamese Voice Cloning Text-to-Speech Model ✨
dangvansam/viet-tts
VietTTS: An Open-Source Vietnamese Text to Speech
NTT123/vietTTS
Vietnamese Text to Speech library
ekwek1/soprano-factory
Soprano-Factory: Train your own 2000x realtime text-to-speech model