voice-cloning-app/Voice-Cloning-App
A Python/PyTorch app for easily synthesising human voices
Supports multilingual voice cloning through automated dataset generation from subtitles and audiobooks, with local or remote training across multiple GPUs. Built on a reworked Tacotron2 architecture paired with HiFi-GAN vocoding for high-quality synthesis. Integrates Mozilla's DSAlign for forced alignment, Silero for voice activity detection, and offers remote training via Google Colab notebooks.
1,443 stars. No commits in the last 6 months.
Stars: 1,443
Forks: 238
Language: Python
License: BSD-3-Clause
Category:
Last pushed: Dec 02, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/voice-cloning-app/Voice-Cloning-App"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
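The same request can be made from Python. The sketch below uses only the standard library. It assumes that the three path segments after /quality/ are a category, a repository owner, and a repository name (inferred from the shape of the URL above), and that the endpoint returns a JSON body. Neither assumption is confirmed by this page, and the function names are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL in the same shape as the curl command.

    The meaning of the three segments (category/owner/repo) is an
    assumption inferred from the example URL.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record and decode it.

    Assumes the response body is JSON; the page does not document
    the response format.
    """
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(build_quality_url("voice-ai", "voice-cloning-app", "Voice-Cloning-App"))
```

Calling fetch_quality("voice-ai", "voice-cloning-app", "Voice-Cloning-App") sends one request, which counts against the daily limit stated above.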
Higher-rated alternatives
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time