Voice Cloning Synthesis ML Frameworks

Tools and frameworks for cloning voices from audio samples to generate synthetic speech. Includes real-time voice cloning, multi-speaker synthesis, and voice conversion. Does NOT include general text-to-speech without cloning, speech recognition, or voice conversion without synthesis capabilities.

There are 31 voice cloning synthesis frameworks tracked. The highest-rated is neosun100/Step-Audio-R1.1 at 34/100 with 4 stars.

Get all 31 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=voice-cloning-synthesis&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 neosun100/Step-Audio-R1.1

Step-Audio-R1.1: The First Audio Language Model with Test-Time Compute...

34
Emerging
2 IS2AI/TurkicASR

A multilingual ASR model that can recognize ten Turkic...

34
Emerging
3 Aditya1Jhaveri/AI-Video-Dubbing

AI video dubbing using Google APIs automates translation and dubbing by...

26
Experimental
4 seanpm2001/Phoneticut

Phoneticut is a voice actor replacement: Make a certain amount of sounds,...

24
Experimental
5 Pearlssx/FireRedTTS2

🔊 Generate long-form streaming TTS for multi-speaker dialogues, enhancing...

23
Experimental
6 mahshid1378/tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos,...

23
Experimental
7 WildCraftsmanFilter/AI-Voice-Changer-Real-Time-Desktop

⭐️ AI Voice-Changer Real-Time 2026 is advanced AI voice changer software...

23
Experimental
8 Syedjunaid30/Video_Dubbing_with_ML_driven_Lip_Synchronization

AI-powered video dubbing tool that translates and synchronizes speech with...

23
Experimental
9 grayhatdevelopers/deepdub

🗣️ Videos for everyone. Implementation of "Automated Dubbing and Facial...

23
Experimental
10 Dada-Tech/speech-to-code

Limited Keyword Speech Recognition using Transfer Learning

22
Experimental
11 bharathraj-v/fastconformer-ctc-telugu

NVIDIA NeMo's stt_en_fastconformer_ctc_large finetuned on open-source telugu...

22
Experimental
12 hadihaider055/vocal-dub

Dub audio into 50+ languages using AI. Whisper transcription, Google...

22
Experimental
13 bseceenn/Fun-CosyVoice3-0.5B-2512-Deploy

🎤 Deploy a simplified voice synthesis service with Fun-CosyVoice3-0.5B-2512,...

22
Experimental
14 khalid-sha/arabic-ai-pronunciation

Guidelines and linguistic rules for improving Arabic pronunciation in AI...

22
Experimental
15 sekalf/MioTTS-llama.cpp

Create fast, lightweight text-to-speech audio on your CPU with...

22
Experimental
16 moaz11112/qwen3-tts-enhanced

🎤 Clone voices in seconds with Qwen3-TTS Enhanced. Enjoy local, GPU-powered...

22
Experimental
17 ExpertVagabond/qwen3-tts-apple-silicon

Qwen3-TTS on Apple Silicon with MLX - voice cloning and generation

22
Experimental
18 ammosu/qwen3-tts-voice-clone

A full-stack voice cloning web application powered by Qwen3-TTS. Clone any...

21
Experimental
19 PrathuashaKB/ASR-Using-Deep-Learning

Automatic Speech Recognition is a technique that processes human speech into...

19
Experimental
20 thekartikeyamishra/VoiceCloner

The Voice Cloner is a Python-based project that leverages Tacotron 2 and...

19
Experimental
21 johnsonhk88/qwen3-tts-hk-cantonese-finetune

Perform Qwen3-TTS Model FIne Tune for Hong Kong Cantonese language

17
Experimental
22 NikhilKalloli/Voice-Recognition

A Streamlit web application for Voice recognition using a pre-trained speech...

16
Experimental
23 Yur1G4/as

The "as" keyword in programming languages is commonly used for type...

15
Experimental
24 SoheilGtex/Voice-Cloning-SV2TTS-

Safe, production-ready starter for voice cloning via SV2TTS (RTVC wrapper)....

15
Experimental
25 gayatrriiii/VibeVoice-finetuning

🎤 Train and fine-tune VibeVoice models with ease, tailored for specific...

14
Experimental
26 vivek-i8/vaani-voice-authenticity

AI system for detecting AI-generated voice clones using speech embeddings...

14
Experimental
27 Rafat-decodis/Robust-ASR-for-Low-Resource-Languages

Exploring Benchmark Gaps and Real-World Speech Generalization for Language...

13
Experimental
28 yasuohasegawa/ios-fastspeech2-hifigan

On-device iOS Text-to-Speech using FastSpeech2 and HiFi-GAN (Japanese & English)

13
Experimental
29 hussainkazarani/streaming-tts

voice cloning and real time streaming 📣

11
Experimental
30 chabandou/poisecast

A Podcast player with client-side ML voice isolation using onnxruntime-web...

11
Experimental
31 lazaroborges/speecher

Edge Voice to Text Transcriber for iOS

11
Experimental