Sovichea/khmer_segmenter
A zero-dependency, high-performance Khmer word segmenter using the Viterbi algorithm. Optimized for dictionary accuracy, ultra-low memory footprint, and edge deployment.
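For readers unfamiliar with the technique named above, here is a minimal sketch of dictionary-based Viterbi segmentation. It is not taken from the khmer_segmenter codebase: the function name `segment`, the toy lexicon, its counts, and the `unknown_cost` penalty are all made up for illustration. The idea is to treat every position in the text as a node, every dictionary word as a weighted edge, and pick the cheapest path from start to end with dynamic programming.

```python
import math


def segment(text, word_cost, unknown_cost=20.0):
    """Split text into the lowest-cost sequence of dictionary words.

    word_cost maps each known word to a non-negative cost (lower = more likely).
    Characters not covered by any word are emitted one at a time at unknown_cost.
    """
    max_word_len = max([1] + [len(w) for w in word_cost])
    n = len(text)
    best = [math.inf] * (n + 1)  # best[i]: minimum cost of segmenting text[:i]
    back = [0] * (n + 1)         # back[i]: start index of the last token on that path
    best[0] = 0.0

    for end in range(1, n + 1):
        for start in range(max(0, end - max_word_len), end):
            piece = text[start:end]
            if piece in word_cost:
                cost = word_cost[piece]
            elif end - start == 1:
                cost = unknown_cost  # fallback: a single unknown character
            else:
                continue
            if best[start] + cost < best[end]:
                best[end] = best[start] + cost
                back[end] = start

    # Follow the back-pointers from the end of the text to recover the tokens.
    tokens = []
    i = n
    while i > 0:
        tokens.append(text[back[i]:i])
        i = back[i]
    return tokens[::-1]


# Hypothetical toy lexicon with invented counts, converted to negative log-probabilities.
counts = {"ខ្ញុំ": 50, "ស្រឡាញ់": 20, "ភាសា": 30, "ខ្មែរ": 40}
total = sum(counts.values())
word_cost = {w: -math.log(c / total) for w, c in counts.items()}

words = list(counts)
text = "".join(words)  # the four words written without spaces, as in running Khmer text
print(segment(text, word_cost))
```

Because Khmer is written without spaces between words, a unigram cost model like this one is a common baseline; a production segmenter would also need to handle grapheme clusters, numbers, and out-of-vocabulary names, none of which this sketch attempts.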
Stars: 34
Forks: 4
Language: Python
License: MIT
Category:
Last pushed: Jan 08, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Sovichea/khmer_segmenter"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
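The same request can be made from Python using only the standard library. This is a sketch under assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the split of the path into category, owner, and repository is inferred from the single URL shown above, and the response format is not documented here, so the body is returned as raw text rather than parsed.

```python
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the three-part path layout is an assumption."""
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Send an unauthenticated GET request and return the response body as text."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_quality("nlp", "Sovichea", "khmer_segmenter")` issues the same GET request as the curl command and counts against the daily request limit.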
Higher-rated alternatives
PyThaiNLP/attacut
A Fast and Accurate Neural Thai Word Segmenter
VietHoang1512/khmer-nltk
Khmer language processing toolkit
UlugbekSalaev/UzTransliterator
UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language
seanghay/KhmerOCR
A Fast Khmer Optical Character Recognition (KhmerOCR)
AI4Bharat/IndicNLP-Transliteration
Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based...