modelscope/KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

/ 100

Emerging

Built on a two-stage architecture combining SAM-BERT for linguistic feature extraction and HiFi-GAN for neural vocoding, KAN-TTS enables end-to-end trainable speech synthesis across 10+ languages including Mandarin variants and European languages. The framework integrates with ModelScope for model hosting and provides complete training pipelines from data preparation through inference, supporting multilingual and multi-speaker TTS customization.

526 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

526

Forks

Language

Python

License

MIT

Higher-rated alternatives

TuananhCR/Dia-Finetuning-Vietnamese

TTS Dia finetuning for Vietnamese

thinhlpg/vixtts-demo

A Vietnamese Voice Cloning Text-to-Speech Model ✨

dangvansam/viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech

NTT123/vietTTS

Vietnamese Text to Speech library

ekwek1/soprano-factory

Soprano-Factory: Train your own 2000x realtime text-to-speech model

Explore Voice AI Tools

All categories Trending Voice AI directory Insights