skshadan/TTS-RVC-API
Text to Speech using Coqui TTS + RVC
Combines Coqui TTS for synthesis with RVC (Retrieval-Based Voice Conversion) for rapid voice cloning using just 2-3 minutes of audio and Hubert embeddings. Exposes a FastAPI endpoint supporting emotion control, speed adjustment, and speaker selection across multiple trained RVC v2 models. Bridges text-to-speech with voice conversion by using Coqui's synthesized speech as input to RVC's conversion pipeline, enabling customizable neural voice generation without extensive training datasets.
113 stars.
Stars
113
Forks
22
Language
Python
License
MIT
Category
Last pushed
Nov 30, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/skshadan/TTS-RVC-API"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models
NeonGeckoCom/neon-tts-plugin-coqui
Coqui AI TTS plugin
Atm4x/tts-with-rvc
TTS with RVC-module to generate .wav audios