rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

/ 100

Established

Provides dual React and Gradio frontends with extensible architecture supporting voice cloning, audio separation, music generation, and speech-to-speech conversion alongside TTS synthesis. The modular extension system allows developers to add custom models independently without modifying core code, while maintaining compatibility with downstream applications like Silly Tavern through standardized API interfaces. Supports both local deployment and cloud execution via Google Colab, with pre-configured Docker containers for reproducible environment setup.

3,017 stars. Actively maintained with 6 commits in the last 30 days.

No Package No Dependents

Maintenance 17 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

3,017

Forks

305

Language

TypeScript

License

MIT

Related tools

playht/pyht

PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API

aedocw/epub2tts

Turn an epub or text file into an audiobook

DrewThomasson/VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

gianpaj/sexyvoice

Voice Cloning, Voice Call and Text to Speech platform. Perfect for content creators, developers,...

IDEA-Emdoor-Lab/UniTTS

A TTS Trained on Universal Audio.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights