shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

/ 100

Established

Based on the README, here's the technical summary: Implements conditional flow matching with ODE-based synthesis to achieve fast non-autoregressive TTS, supporting variable inference steps via Euler solver and controllable generation through temperature and speaking rate parameters. Built on PyTorch Lightning with Hydra configuration, it includes ONNX export/inference support and a Gradio interface for browser-based synthesis. Enables phoneme-level alignment extraction and supports multi-GPU training on custom datasets with automatic mel-spectrogram normalization.

1,259 stars.

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

How are scores calculated?

Stars

1,259

Forks

189

Language

Jupyter Notebook

License

MIT

Related tools

yeyupiaoling/MASR

Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Explore Voice AI Tools

All categories Trending Voice AI directory Insights