NTT123/vietTTS

Vietnamese Text to Speech library

/ 100

Established

Combines a three-stage neural architecture—duration prediction, acoustic feature generation, and HiFiGAN vocoding—to synthesize Vietnamese speech from text. Trained on the denoised InfoRe dataset with forced alignment via Montreal Forced Aligner, it supports model finetuning on ground-truth mel-spectrograms and experimental multi-speaker synthesis on a separate branch. Implemented in JAX/Haiku with PyTorch vocoder conversion, enabling both offline synthesis and integration into production pipelines via pretrained model inference.

255 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25

How are scores calculated?

Stars

255

Forks

104

Language

Python

License

MIT

Compare

vietTTS and viet-tts

Related tools

TuananhCR/Dia-Finetuning-Vietnamese

TTS Dia finetuning for Vietnamese

thinhlpg/vixtts-demo

A Vietnamese Voice Cloning Text-to-Speech Model ✨

dangvansam/viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech

ekwek1/soprano-factory

Soprano-Factory: Train your own 2000x realtime text-to-speech model

modelscope/KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at ...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights