rsxdalv/TTS-WebUI
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
Provides dual React and Gradio frontends with an extensible architecture that supports voice cloning, audio separation, music generation, and speech-to-speech conversion alongside TTS synthesis. The modular extension system lets developers add custom models without modifying core code, while standardized API interfaces maintain compatibility with downstream applications such as SillyTavern. It supports local deployment and cloud execution via Google Colab, and ships pre-configured Docker containers for reproducible environment setup.
3,017 stars. Actively maintained with 6 commits in the last 30 days.
Stars: 3,017
Forks: 305
Language: TypeScript
License: MIT
Last pushed: Feb 19, 2026
Commits (30d): 6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rsxdalv/TTS-WebUI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
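For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not the service's official client: the function names `build_quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns JSON and signals an exhausted daily quota with HTTP 429, neither of which is stated on this page. How an API key is passed is not documented here, so the sketch covers the keyless tier only.

```python
import json
import urllib.error
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(url: str, timeout: float = 10.0):
    """Fetch and decode the response body.

    Assumptions (not confirmed by the page): the body is JSON, and
    exceeding the daily request limit yields HTTP 429.
    """
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return json.loads(response.read().decode("utf-8"))
    except urllib.error.HTTPError as err:
        if err.code == 429:  # assumed rate-limit status
            raise RuntimeError("Daily request limit reached") from err
        raise
```

Calling `fetch_quality(build_quality_url("voice-ai", "rsxdalv", "TTS-WebUI"))` issues the same request as the curl command. Because the keyless tier allows 100 requests per day, caching the result locally is a reasonable choice for anything run on a schedule.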
Related tools
playht/pyht
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
aedocw/epub2tts
Turn an epub or text file into an audiobook
DrewThomasson/VoxNovel
VoxNovel: generate audiobooks giving each character a different voice actor.
gianpaj/sexyvoice
Voice Cloning, Voice Call and Text to Speech platform. Perfect for content creators, developers,...
IDEA-Emdoor-Lab/UniTTS
A TTS Trained on Universal Audio.