SforAiDl/Neural-Voice-Cloning-With-Few-Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
ArchivedImplements speaker embedding extraction and multi-speaker generative modeling to achieve content-independent voice cloning with minimal samples, following the Baidu paper architecture. Combines a speaker encoder that learns voice identity fingerprints (pitch, accent, etc.) with a generative model trained on 84 speakers from VCTK dataset, enabling rapid speaker adaptation in 10-20 minutes. Built on PyTorch with DeepVoice3-inspired architecture and optimized for NVIDIA GPUs.
436 stars. No commits in the last 6 months.
Stars
436
Forks
121
Language
Python
License
MIT
Category
Last pushed
Feb 23, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/SforAiDl/Neural-Voice-Cloning-With-Few-Samples"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time