Text Normalization Engines Voice AI Tools
Tools for normalizing written text into spoken forms across languages, handling numbers, dates, abbreviations, and special characters for TTS and speech processing. Does NOT include general text-to-speech synthesis, speech recognition, or audio processing.
There are 17 text normalization engines tools tracked. The highest-rated is speechio/chinese_text_normalization at 44/100 with 722 stars.
Get all 17 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=text-normalization-engines&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
speechio/chinese_text_normalization
Chinese text normalization for speech processing |
|
Emerging |
| 2 |
NickZaitsev/ru-normalizr
ru-normalizr — лучший open-source нормализатор русского текста. Приводит... |
|
Emerging |
| 3 |
gladiaio/normalization
A lightweight library for normalizing speech transcripts before computing WER |
|
Emerging |
| 4 |
34j/mecab-text-cleaner
Simple Python package (CLI/Python API) for getting japanese readings... |
|
Emerging |
| 5 |
repodiac/german_transliterate
Python module to clean and transliterate (i.e. normalize) German text... |
|
Emerging |
| 6 |
google-research-datasets/TextNormalizationCoveringGrammars
Covering grammars for English and Russian text normalization |
|
Emerging |
| 7 |
ducnt18121997/Viet-Text-Normalization
A Python library for text normalization, specifically designed for... |
|
Experimental |
| 8 |
tomaarsen/TTSTextNormalization
Convert English text from written expressions into spoken forms |
|
Experimental |
| 9 |
ScottishFold007/TTSAudioNormalizer
TTSAudioNormalizer is a specialized tool for TTS data production,... |
|
Experimental |
| 10 |
stefantaubert/english-text-normalization
Command-line interface (CLI) and library to normalize English texts. |
|
Experimental |
| 11 |
Agash/TTSTextNormalization
Modern .NET10 / C#14 library to normalize text (emojis, currency, numbers,... |
|
Experimental |
| 12 |
NetherQuartz/TextForSpeechNormalizer
A Python library to accentuate Russian text |
|
Experimental |
| 13 |
cewarman/NTPU_online_text_normalization
An online text normalization tool for Chinese-English mixed text-to-speech system |
|
Experimental |
| 14 |
rafalposwiata/text-normalization
Repository for text normalization research. |
|
Experimental |
| 15 |
Amir79Naziri/TextNormalization_Project
Implementing text normalization for Farsi(Persian) language. |
|
Experimental |
| 16 |
seanghay/khmertagger
KhmerTagger: Inverse Text Normalization for Khmer Automatic Speech Recognition |
|
Experimental |
| 17 |
bmwasaru/kiswahili-speech-normalization
Kiswahili text normalization utilities for speech datasets (ASR/TTS) |
|
Experimental |