Voice Cloning Synthesis ML Frameworks

Tools and frameworks for cloning voices from audio samples to generate synthetic speech. Includes real-time voice cloning, multi-speaker synthesis, and voice conversion. Does NOT include general text-to-speech without cloning, speech recognition, or voice conversion without synthesis capabilities.

There are 31 voice cloning synthesis frameworks tracked. The highest-rated is neosun100/Step-Audio-R1.1 at 34/100 with 4 stars.

Get all 31 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=voice-cloning-synthesis&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Framework	Score	Tier	Stars	Language
1	neosun100/Step-Audio-R1.1 Step-Audio-R1.1: The First Audio Language Model with Test-Time Compute...	34	Emerging	4	Python
2	IS2AI/TurkicASR A multilingual ASR model that can recognize ten Turkic...	34	Emerging	82	Python
3	Aditya1Jhaveri/AI-Video-Dubbing AI video dubbing using Google APIs automates translation and dubbing by...	26	Experimental	6	Python
4	seanpm2001/Phoneticut Phoneticut is a voice actor replacement: Make a certain amount of sounds,...	24	Experimental	3	Csound Document
5	Pearlssx/FireRedTTS2 🔊 Generate long-form streaming TTS for multi-speaker dialogues, enhancing...	23	Experimental	1	—
6	mahshid1378/tts-generation-webui TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos,...	23	Experimental	2	TypeScript
7	WildCraftsmanFilter/AI-Voice-Changer-Real-Time-Desktop ⭐️ AI Voice-Changer Real-Time 2026 is advanced AI voice changer software...	23	Experimental	1	C++
8	Syedjunaid30/Video_Dubbing_with_ML_driven_Lip_Synchronization AI-powered video dubbing tool that translates and synchronizes speech with...	23	Experimental	7	Jupyter Notebook
9	grayhatdevelopers/deepdub 🗣️ Videos for everyone. Implementation of "Automated Dubbing and Facial...	23	Experimental	5	Shell
10	Dada-Tech/speech-to-code Limited Keyword Speech Recognition using Transfer Learning	22	Experimental	1	JavaScript
11	bharathraj-v/fastconformer-ctc-telugu NVIDIA NeMo's stt_en_fastconformer_ctc_large finetuned on open-source telugu...	22	Experimental	7	Jupyter Notebook
12	hadihaider055/vocal-dub Dub audio into 50+ languages using AI. Whisper transcription, Google...	22	Experimental	—	TypeScript
13	bseceenn/Fun-CosyVoice3-0.5B-2512-Deploy 🎤 Deploy a simplified voice synthesis service with Fun-CosyVoice3-0.5B-2512,...	22	Experimental	—	Python
14	khalid-sha/arabic-ai-pronunciation Guidelines and linguistic rules for improving Arabic pronunciation in AI...	22	Experimental	—	—
15	sekalf/MioTTS-llama.cpp Create fast, lightweight text-to-speech audio on your CPU with...	22	Experimental	—	C++
16	moaz11112/qwen3-tts-enhanced 🎤 Clone voices in seconds with Qwen3-TTS Enhanced. Enjoy local, GPU-powered...	22	Experimental	—	Python
17	ExpertVagabond/qwen3-tts-apple-silicon Qwen3-TTS on Apple Silicon with MLX - voice cloning and generation	22	Experimental	—	Python
18	ammosu/qwen3-tts-voice-clone A full-stack voice cloning web application powered by Qwen3-TTS. Clone any...	21	Experimental	—	TypeScript
19	PrathuashaKB/ASR-Using-Deep-Learning Automatic Speech Recognition is a technique that processes human speech into...	19	Experimental	4	Python
20	thekartikeyamishra/VoiceCloner The Voice Cloner is a Python-based project that leverages Tacotron 2 and...	19	Experimental	8	Python
21	johnsonhk88/qwen3-tts-hk-cantonese-finetune Perform Qwen3-TTS Model FIne Tune for Hong Kong Cantonese language	17	Experimental	3	Jupyter Notebook
22	NikhilKalloli/Voice-Recognition A Streamlit web application for Voice recognition using a pre-trained speech...	16	Experimental	2	PureBasic
23	Yur1G4/as The "as" keyword in programming languages is commonly used for type...	15	Experimental	1	—
24	SoheilGtex/Voice-Cloning-SV2TTS- Safe, production-ready starter for voice cloning via SV2TTS (RTVC wrapper)....	15	Experimental	7	Python
25	gayatrriiii/VibeVoice-finetuning 🎤 Train and fine-tune VibeVoice models with ease, tailored for specific...	14	Experimental	—	—
26	vivek-i8/vaani-voice-authenticity AI system for detecting AI-generated voice clones using speech embeddings...	14	Experimental	—	TypeScript
27	Rafat-decodis/Robust-ASR-for-Low-Resource-Languages Exploring Benchmark Gaps and Real-World Speech Generalization for Language...	13	Experimental	2	Jupyter Notebook
28	yasuohasegawa/ios-fastspeech2-hifigan On-device iOS Text-to-Speech using FastSpeech2 and HiFi-GAN (Japanese & English)	13	Experimental	2	C++
29	hussainkazarani/streaming-tts voice cloning and real time streaming 📣	11	Experimental	—	Python
30	chabandou/poisecast A Podcast player with client-side ML voice isolation using onnxruntime-web...	11	Experimental	—	TypeScript
31	lazaroborges/speecher Edge Voice to Text Transcriber for iOS	11	Experimental	5	Swift