Mobile-Artificial-Intelligence/babylon

Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.

45
/ 100
Emerging

Supports multi-platform deployment across Linux, macOS, Windows, and Android with unified ONNX Runtime inference for offline operation. Provides three access layers—C/C++ library APIs, a CLI tool with phonemization and synthesis subcommands, and an embeddable REST server with web UI—enabling integration from embedded systems to web services. Features the high-quality Kokoro engine (24 kHz, 54+ voices) alongside VITS for model flexibility, with pronunciation backed by a 130k-entry dictionary.

No Package No Dependents
Maintenance 13 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

30

Forks

3

Language

Python

License

MIT

Last pushed

Mar 09, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Mobile-Artificial-Intelligence/babylon"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.