pnnbao97/sea-g2p
Fast multilingual text-to-phoneme converter for South East Asian languages.
Implements a Rust-based core with memory-mapped binary lookup tables and string pooling for O(log n) word searches, achieving ~37,000 sentences/second throughput. Handles mixed Vietnamese-English text with specialized normalization for numbers, dates, and technical terms. Integrates as the phonemization backbone for Vietnamese TTS systems via a modular Python API supporting both sequential and parallel batch processing.
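The lookup idea described above (sorted tables searched in O(log n), with repeated strings stored once in a pool) can be illustrated with a small sketch. This is not the sea-g2p implementation or its API: the class `PooledLexicon`, its methods, and the placeholder phoneme strings are all hypothetical, and a plain in-memory list stands in for the memory-mapped binary table.

```python
from bisect import bisect_left


class PooledLexicon:
    """Toy lexicon (hypothetical, not the sea-g2p API).

    Words are kept sorted so lookups can use binary search, and each
    distinct phoneme string is stored only once in a shared pool.
    """

    def __init__(self, entries):
        # Sort the words once; every later lookup is then O(log n).
        self._words = sorted(entries)
        # String pool: one slot per distinct phoneme string.
        self._pool = []
        pool_index = {}
        # For each word (in sorted order), the index of its value in the pool.
        self._value_ids = []
        for word in self._words:
            phonemes = entries[word]
            if phonemes not in pool_index:
                pool_index[phonemes] = len(self._pool)
                self._pool.append(phonemes)
            self._value_ids.append(pool_index[phonemes])

    def lookup(self, word):
        # Binary search for the word; return None when it is absent.
        i = bisect_left(self._words, word)
        if i < len(self._words) and self._words[i] == word:
            return self._pool[self._value_ids[i]]
        return None

    @property
    def pool_size(self):
        return len(self._pool)


# Placeholder phoneme strings for illustration only.
lexicon = PooledLexicon({
    "to": "t uː",
    "two": "t uː",
    "too": "t uː",
    "tea": "t iː",
})
```

Here three words share one pooled phoneme string, so the pool holds two entries for four words. A real binary table would store offsets into a byte buffer instead of Python objects, but the search and deduplication steps follow the same pattern.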
Stars: 64
Forks: 18
Language: Rust
License: Apache-2.0
Category: (not listed)
Last pushed: Mar 18, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/pnnbao97/sea-g2p"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
thewh1teagle/phonikud
Hebrew grapheme to phoneme (G2P)
GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin or Pinyin (INTERSPEECH 2022)
Wikidepia/g2p-id
Indonesian Grapheme-to-Phoneme (IPA notation)
stefantaubert/pinyin-to-ipa
Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to...
AdolfVonKleist/Phonetisaurus
Phonetisaurus G2P