hans00/phonemize
Pure JS fast phonemizer with rule-based G2P prediction
Supports multiple output formats (IPA, ARPABET, Zhuyin) and modular multilingual G2P models for English, Chinese, Japanese, Korean, and Russian through preset imports. Uses a processing pipeline combining dictionary lookup (125,000+ entries), intelligent compound word decomposition, and LLM-generated phonetic rules for unknown words, achieving >10,000 words/second with zero startup overhead. Integrates seamlessly into Node.js projects via a simple API with options for custom pronunciations, tokenization with position tracking, and automatic text normalization (numbers, abbreviations, currency, dates).
Used by 1 other package. Available on npm.
Stars
24
Forks
1
Language
TypeScript
License
MIT
Category
Last pushed
Mar 04, 2026
Commits (30d)
0
Dependencies
2
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/hans00/phonemize"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
thewh1teagle/phonikud
Hebrew grapheme to phoneme (G2P)
GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
Wikidepia/g2p-id
Indonesian Grapheme-to-Phoneme (IPA notation)
stefantaubert/pinyin-to-ipa
Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to...
AdolfVonKleist/Phonetisaurus
Phonetisaurus G2P