Voice Cloning Synthesis ML Frameworks
Tools and frameworks for cloning voices from audio samples to generate synthetic speech. Includes real-time voice cloning, multi-speaker synthesis, and voice conversion. Does NOT include general text-to-speech without cloning, speech recognition, or voice conversion without synthesis capabilities.
There are 31 voice cloning synthesis frameworks tracked. The highest-rated is neosun100/Step-Audio-R1.1 at 34/100 with 4 stars.
Get the projects as JSON (the example request sets limit=20, so it may not return all 31):
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=voice-cloning-synthesis&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
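The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_url` and `fetch_projects` are hypothetical, the response schema is not documented here so the parsed JSON is returned as-is, and whether the endpoint accepts a `limit` above 20 is an assumption to verify against the API.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="voice-cloning-synthesis",
              limit=20):
    """Build the query URL with the same parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and return the decoded JSON payload unchanged.

    The payload structure is not specified in this listing, so no
    fields are assumed; inspect the result before relying on any key.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example usage (performs a network request, so it is left commented out):
# data = fetch_projects(build_url())
# print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to change `limit` or `subcategory` without touching the fetch logic, and keeps the unauthenticated 100 requests/day quota in mind when experimenting.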
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 | neosun100/Step-Audio-R1.1: Step-Audio-R1.1: The First Audio Language Model with Test-Time Compute... | 34 | Emerging |
| 2 | IS2AI/TurkicASR: A multilingual ASR model that can recognize ten Turkic... | | Emerging |
| 3 | Aditya1Jhaveri/AI-Video-Dubbing: AI video dubbing using Google APIs automates translation and dubbing by... | | Experimental |
| 4 | seanpm2001/Phoneticut: Phoneticut is a voice actor replacement: Make a certain amount of sounds,... | | Experimental |
| 5 | Pearlssx/FireRedTTS2: 🔊 Generate long-form streaming TTS for multi-speaker dialogues, enhancing... | | Experimental |
| 6 | mahshid1378/tts-generation-webui: TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos,... | | Experimental |
| 7 | WildCraftsmanFilter/AI-Voice-Changer-Real-Time-Desktop: ⭐️ AI Voice-Changer Real-Time 2026 is advanced AI voice changer software... | | Experimental |
| 8 | Syedjunaid30/Video_Dubbing_with_ML_driven_Lip_Synchronization: AI-powered video dubbing tool that translates and synchronizes speech with... | | Experimental |
| 9 | grayhatdevelopers/deepdub: 🗣️ Videos for everyone. Implementation of "Automated Dubbing and Facial... | | Experimental |
| 10 | Dada-Tech/speech-to-code: Limited Keyword Speech Recognition using Transfer Learning | | Experimental |
| 11 | bharathraj-v/fastconformer-ctc-telugu: NVIDIA NeMo's stt_en_fastconformer_ctc_large finetuned on open-source telugu... | | Experimental |
| 12 | hadihaider055/vocal-dub: Dub audio into 50+ languages using AI. Whisper transcription, Google... | | Experimental |
| 13 | bseceenn/Fun-CosyVoice3-0.5B-2512-Deploy: 🎤 Deploy a simplified voice synthesis service with Fun-CosyVoice3-0.5B-2512,... | | Experimental |
| 14 | khalid-sha/arabic-ai-pronunciation: Guidelines and linguistic rules for improving Arabic pronunciation in AI... | | Experimental |
| 15 | sekalf/MioTTS-llama.cpp: Create fast, lightweight text-to-speech audio on your CPU with... | | Experimental |
| 16 | moaz11112/qwen3-tts-enhanced: 🎤 Clone voices in seconds with Qwen3-TTS Enhanced. Enjoy local, GPU-powered... | | Experimental |
| 17 | ExpertVagabond/qwen3-tts-apple-silicon: Qwen3-TTS on Apple Silicon with MLX - voice cloning and generation | | Experimental |
| 18 | ammosu/qwen3-tts-voice-clone: A full-stack voice cloning web application powered by Qwen3-TTS. Clone any... | | Experimental |
| 19 | PrathuashaKB/ASR-Using-Deep-Learning: Automatic Speech Recognition is a technique that processes human speech into... | | Experimental |
| 20 | thekartikeyamishra/VoiceCloner: The Voice Cloner is a Python-based project that leverages Tacotron 2 and... | | Experimental |
| 21 | johnsonhk88/qwen3-tts-hk-cantonese-finetune: Perform Qwen3-TTS model fine-tuning for the Hong Kong Cantonese language | | Experimental |
| 22 | NikhilKalloli/Voice-Recognition: A Streamlit web application for Voice recognition using a pre-trained speech... | | Experimental |
| 23 | Yur1G4/as: The "as" keyword in programming languages is commonly used for type... | | Experimental |
| 24 | SoheilGtex/Voice-Cloning-SV2TTS-: Safe, production-ready starter for voice cloning via SV2TTS (RTVC wrapper).... | | Experimental |
| 25 | gayatrriiii/VibeVoice-finetuning: 🎤 Train and fine-tune VibeVoice models with ease, tailored for specific... | | Experimental |
| 26 | vivek-i8/vaani-voice-authenticity: AI system for detecting AI-generated voice clones using speech embeddings... | | Experimental |
| 27 | Rafat-decodis/Robust-ASR-for-Low-Resource-Languages: Exploring Benchmark Gaps and Real-World Speech Generalization for Language... | | Experimental |
| 28 | yasuohasegawa/ios-fastspeech2-hifigan: On-device iOS Text-to-Speech using FastSpeech2 and HiFi-GAN (Japanese & English) | | Experimental |
| 29 | hussainkazarani/streaming-tts: voice cloning and real time streaming 📣 | | Experimental |
| 30 | chabandou/poisecast: A Podcast player with client-side ML voice isolation using onnxruntime-web... | | Experimental |
| 31 | lazaroborges/speecher: Edge Voice to Text Transcriber for iOS | | Experimental |