hans00/phonemize

Pure JS fast phonemizer with rule-based G2P prediction

39
/ 100
Emerging

Supports multiple output formats (IPA, ARPABET, Zhuyin) and modular multilingual G2P models for English, Chinese, Japanese, Korean, and Russian through preset imports. Uses a processing pipeline combining dictionary lookup (125,000+ entries), intelligent compound word decomposition, and LLM-generated phonetic rules for unknown words, achieving >10,000 words/second with zero startup overhead. Integrates seamlessly into Node.js projects via a simple API with options for custom pronunciations, tokenization with position tracking, and automatic text normalization (numbers, abbreviations, currency, dates).

Used by 1 other package. Available on npm.

Maintenance 10 / 25
Adoption 7 / 25
Maturity 18 / 25
Community 4 / 25

How are scores calculated?

Stars

24

Forks

1

Language

TypeScript

License

MIT

Last pushed

Mar 04, 2026

Commits (30d)

0

Dependencies

2

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/hans00/phonemize"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.