seanghay/KLEA
An open-source Khmer Word to Speech Model. Just single word not sentence!
Built on the VITS architecture, this model synthesizes audio waveforms from Khmer text using a conditional variational autoencoder trained on the kheng.info speech dataset of over 3000 recordings. It requires a pre-trained checkpoint (G_60000.pth) from Hugging Face and supports Python 3.9–3.11 via pip or UV, with straightforward API calls to generate WAV files from individual Khmer words.
Stars
19
Forks
5
Language
Python
License
MIT
Category
Last pushed
Dec 31, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/seanghay/KLEA"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TuananhCR/Dia-Finetuning-Vietnamese
TTS Dia finetuning for Vietnamese
thinhlpg/vixtts-demo
A Vietnamese Voice Cloning Text-to-Speech Model ✨
dangvansam/viet-tts
VietTTS: An Open-Source Vietnamese Text to Speech
NTT123/vietTTS
Vietnamese Text to Speech library
ekwek1/soprano-factory
Soprano-Factory: Train your own 2000x realtime text-to-speech model