skshadan/TTS-RVC-API

Text to Speech using Coqui TTS + RVC

/ 100

Established

Combines Coqui TTS for synthesis with RVC (Retrieval-Based Voice Conversion) for rapid voice cloning using just 2-3 minutes of audio and Hubert embeddings. Exposes a FastAPI endpoint supporting emotion control, speed adjustment, and speaker selection across multiple trained RVC v2 models. Bridges text-to-speech with voice conversion by using Coqui's synthesized speech as input to RVC's conversion pipeline, enabling customizable neural voice generation without extensive training datasets.

113 stars.

No Package No Dependents

Maintenance 6 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

113

Forks

Language

Python

License

MIT

Related tools

herimor/voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

EveryVoiceTTS/EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

kadirnar/VoiceHub

VoiceHub: A Unified Inference Interface for TTS Models

NeonGeckoCom/neon-tts-plugin-coqui

Coqui AI TTS plugin

Atm4x/tts-with-rvc

TTS with RVC-module to generate .wav audios

Explore Voice AI Tools

All categories Trending Voice AI directory Insights