SforAiDl/Neural-Voice-Cloning-With-Few-Samples

This repository contains an implementation of "Neural Voice Cloning With Few Samples"

Archived

/ 100

Established

Implements speaker embedding extraction and multi-speaker generative modeling for content-independent voice cloning from a few samples, following the architecture of the Baidu paper. It combines a speaker encoder, which learns a fingerprint of voice identity (pitch, accent, and so on), with a generative model trained on 84 speakers from the VCTK dataset, enabling speaker adaptation in 10-20 minutes. Built on PyTorch with a DeepVoice3-inspired architecture and optimized for NVIDIA GPUs.
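As a rough illustration of the speaker-embedding idea described above, the sketch below pools a few per-clip embedding vectors into one speaker "fingerprint" and compares a new clip against it with cosine similarity. It is not code from the repository: the function names, the toy 3-dimensional vectors, and the choice of plain mean pooling are all assumptions made for illustration, and it uses only the Python standard library rather than PyTorch.

```python
import math


def mean_pool(vectors):
    """Average a list of equal-length vectors element-wise."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def l2_normalize(vector):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector] if norm > 0 else list(vector)


def speaker_embedding(sample_embeddings):
    """Hypothetical helper: pool per-clip embeddings into one speaker vector."""
    return l2_normalize(mean_pool(sample_embeddings))


def cosine_similarity(a, b):
    """Cosine similarity between two vectors."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a, b))


# Toy per-clip embeddings (made-up numbers, two clips per speaker).
speaker_a_clips = [[0.9, 0.1, 0.0], [1.0, 0.0, 0.1]]
speaker_b_clips = [[0.0, 0.2, 0.9], [0.1, 0.0, 1.0]]

emb_a = speaker_embedding(speaker_a_clips)
emb_b = speaker_embedding(speaker_b_clips)

# A new clip whose embedding resembles speaker A's clips.
new_clip = [0.95, 0.05, 0.05]
sim_a = cosine_similarity(new_clip, emb_a)
sim_b = cosine_similarity(new_clip, emb_b)
print("closest speaker:", "A" if sim_a > sim_b else "B")
```

In a real system the per-clip vectors would come from a trained encoder network rather than being written by hand, and the pooled embedding would condition the generative model instead of only being compared.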

436 stars. No commits in the last 6 months.

Archived, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25


Stars 436

Forks 121

Language Python

License MIT

Related tools

pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...

r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to automatically...

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
