rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

63
/ 100
Established

Provides dual React and Gradio frontends with extensible architecture supporting voice cloning, audio separation, music generation, and speech-to-speech conversion alongside TTS synthesis. The modular extension system allows developers to add custom models independently without modifying core code, while maintaining compatibility with downstream applications like Silly Tavern through standardized API interfaces. Supports both local deployment and cloud execution via Google Colab, with pre-configured Docker containers for reproducible environment setup.

3,017 stars. Actively maintained with 6 commits in the last 30 days.

No Package No Dependents
Maintenance 17 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

3,017

Forks

305

Language

TypeScript

License

MIT

Category

text-to-speech

Last pushed

Feb 19, 2026

Commits (30d)

6

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rsxdalv/TTS-WebUI"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.