Text Normalization Engines Voice AI Tools

Tools for normalizing written text into spoken forms across languages, handling numbers, dates, abbreviations, and special characters for TTS and speech processing. Does NOT include general text-to-speech synthesis, speech recognition, or audio processing.

17 text normalization engine tools are tracked. The highest-rated is speechio/chinese_text_normalization, scoring 44/100 with 722 stars.

Get all 17 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=text-normalization-engines&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
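The same request can be made from a script. The following is a minimal Python sketch, not an official client: the function names `build_quality_url` and `fetch_quality_dataset` are hypothetical, and only the endpoint and query parameters shown in the curl command above are taken from this page. The shape of the returned JSON is not documented here, so the sketch returns the decoded payload without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="voice-ai",
                      subcategory="text-normalization-engines",
                      limit=20):
    """Build the dataset query URL (hypothetical helper, not part of any SDK)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_quality_dataset(url, timeout=10):
    """Fetch the URL and decode the JSON body.

    The response schema is an unknown here, so the decoded object is
    returned as-is for the caller to inspect.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Print the URL only; call fetch_quality_dataset(url) to hit the network.
    print(build_quality_url())
```

Keeping URL construction separate from the network call makes it easy to check the query string offline and to stay within the unauthenticated daily request limit while experimenting.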

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | speechio/chinese_text_normalization | Chinese text normalization for speech processing | 44 | Emerging |
| 2 | NickZaitsev/ru-normalizr | ru-normalizr: the best open-source Russian text normalizer. Converts... | 43 | Emerging |
| 3 | gladiaio/normalization | A lightweight library for normalizing speech transcripts before computing WER | 41 | Emerging |
| 4 | 34j/mecab-text-cleaner | Simple Python package (CLI/Python API) for getting Japanese readings... | 38 | Emerging |
| 5 | repodiac/german_transliterate | Python module to clean and transliterate (i.e. normalize) German text... | 35 | Emerging |
| 6 | google-research-datasets/TextNormalizationCoveringGrammars | Covering grammars for English and Russian text normalization | 32 | Emerging |
| 7 | ducnt18121997/Viet-Text-Normalization | A Python library for text normalization, specifically designed for... | 29 | Experimental |
| 8 | tomaarsen/TTSTextNormalization | Convert English text from written expressions into spoken forms | 25 | Experimental |
| 9 | ScottishFold007/TTSAudioNormalizer | TTSAudioNormalizer is a specialized tool for TTS data production,... | 25 | Experimental |
| 10 | stefantaubert/english-text-normalization | Command-line interface (CLI) and library to normalize English texts. | 24 | Experimental |
| 11 | Agash/TTSTextNormalization | Modern .NET10 / C#14 library to normalize text (emojis, currency, numbers,... | 23 | Experimental |
| 12 | NetherQuartz/TextForSpeechNormalizer | A Python library to accentuate Russian text | 21 | Experimental |
| 13 | cewarman/NTPU_online_text_normalization | An online text normalization tool for Chinese-English mixed text-to-speech system | 18 | Experimental |
| 14 | rafalposwiata/text-normalization | Repository for text normalization research. | 15 | Experimental |
| 15 | Amir79Naziri/TextNormalization_Project | Implementing text normalization for Farsi (Persian) language. | 15 | Experimental |
| 16 | seanghay/khmertagger | KhmerTagger: Inverse Text Normalization for Khmer Automatic Speech Recognition | 15 | Experimental |
| 17 | bmwasaru/kiswahili-speech-normalization | Kiswahili text normalization utilities for speech datasets (ASR/TTS) | 14 | Experimental |