artcore-c/AI-Voice-Clone-with-Coqui-XTTS-v2
Free voice cloning for creators using Coqui XTTS-v2 on Google Colab. Clone your voice with just a few minutes of audio. Complete guide to build your own notebook.
Leverages a Transformer-based architecture with VQ-VAE for speaker embedding, extracting acoustic features (pitch, tone, cadence) from reference audio and synthesizing speech matching those characteristics across 16+ languages. Optimized for Google Colab's free T4 GPU (24kHz output, ~5-minute setup), with strict Python 3.11 + PyTorch 2.1.0 + transformers <4.50.0 dependency pinning to ensure model compatibility and prevent BeamSearchScorer failures.
Stars
34
Forks
14
Language
Python
License
MIT
Category
Last pushed
Feb 04, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/artcore-c/AI-Voice-Clone-with-Coqui-XTTS-v2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
OpenBMB/VoxCPM
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
JackismyShephard/ultimate-rvc
An app for creating audio-based content such as song covers and speech using Retrieval-based...
codename0og/codename-rvc-fork-4
Codename's rvc fork version 4, based on Applio.
ArkanDash/Advanced-RVC-Inference
Advanced RVC Inference for quicker and effortless model downloads