slp-rl/HebTTS
The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"
Built on a language modeling architecture that operates over discrete speech tokens conditioned on word-piece tokenization, enabling pronunciation inference from undiacriticized Hebrew text through contextual understanding. The system uses an autoregressive (AR) model paired with a non-autoregressive (NAR) decoder, trained on weakly supervised data from HebDB, and supports multi-speaker synthesis with optional Multi Band Diffusion vocoding for enhanced audio quality. Inference is accessible via command-line with flexible input handling (text strings, files, or CSV batch processing) and pre-trained checkpoints are provided.
108 stars. No commits in the last 6 months.
Stars
108
Forks
15
Language
Python
License
—
Category
Last pushed
Jun 12, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/slp-rl/HebTTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
thewh1teagle/phonikud
Hebrew grapheme to phoneme (G2P)
GitYCC/g2pW
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
Wikidepia/g2p-id
Indonesian Grapheme-to-Phoneme (IPA notation)
stefantaubert/pinyin-to-ipa
Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to...
pnnbao97/sea-g2p
Fast multilingual text-to-phoneme converter for South East Asian languages.