slp-rl/HebTTS

The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"

27
/ 100
Experimental

Built on a language modeling architecture that operates over discrete speech tokens conditioned on word-piece tokenization, enabling pronunciation inference from undiacriticized Hebrew text through contextual understanding. The system uses an autoregressive (AR) model paired with a non-autoregressive (NAR) decoder, trained on weakly supervised data from HebDB, and supports multi-speaker synthesis with optional Multi Band Diffusion vocoding for enhanced audio quality. Inference is accessible via command-line with flexible input handling (text strings, files, or CSV batch processing) and pre-trained checkpoints are provided.

108 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 9 / 25
Maturity 1 / 25
Community 15 / 25

How are scores calculated?

Stars

108

Forks

15

Language

Python

License

Last pushed

Jun 12, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/slp-rl/HebTTS"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.