SforAiDl/Neural-Voice-Cloning-With-Few-Samples

This repository contains an implementation of "Neural Voice Cloning With Few Samples".

Archived
Score: 50 / 100
Established

Implements speaker embedding extraction and multi-speaker generative modeling to achieve content-independent voice cloning from only a few samples, following the architecture of the Baidu paper. It combines a speaker encoder that learns voice identity fingerprints (pitch, accent, and so on) with a generative model trained on 84 speakers from the VCTK dataset, enabling speaker adaptation in 10-20 minutes. Built on PyTorch with a DeepVoice3-inspired architecture and optimized for NVIDIA GPUs.
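To make the "voice identity fingerprint" idea concrete, here is a minimal sketch of how per-sample speaker embeddings might be pooled into one speaker embedding and compared. It is not the repository's code: the real speaker encoder is a trained PyTorch network, while this example assumes the embeddings are already computed, and uses plain mean pooling and cosine similarity as stand-ins. The function names `mean_embedding` and `cosine_similarity` are hypothetical.

```python
import math
from typing import List, Sequence


def mean_embedding(sample_embeddings: Sequence[Sequence[float]]) -> List[float]:
    """Average per-sample embeddings into a single speaker embedding.

    Assumption: mean pooling stands in for whatever aggregation the
    trained speaker encoder actually performs.
    """
    if not sample_embeddings:
        raise ValueError("need at least one sample embedding")
    dim = len(sample_embeddings[0])
    if any(len(e) != dim for e in sample_embeddings):
        raise ValueError("all embeddings must have the same dimension")
    n = len(sample_embeddings)
    return [sum(e[i] for e in sample_embeddings) / n for i in range(dim)]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two embeddings of equal dimension."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)
```

In a setup like this, the pooled embedding would condition the multi-speaker generative model, and a high cosine similarity between two pooled embeddings would suggest the same voice.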

436 stars. No commits in the last 6 months.

Archived, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25

Stars: 436
Forks: 121
Language: Python
License: MIT
Last pushed: Feb 23, 2021
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/SforAiDl/Neural-Voice-Cloning-With-Few-Samples"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
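The same request can be made from Python using only the standard library. This is a minimal sketch under stated assumptions: the path layout (category, then owner, then repository) is inferred from the single curl example above, and the response is assumed to be JSON, which the page does not confirm. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Assumption: the path is always <category>/<owner>/<repo>.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body.

    Assumption: the body is JSON; json.loads raises an error otherwise.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "SforAiDl", "Neural-Voice-Cloning-With-Few-Samples")` issues the same request as the curl command. Unauthenticated calls count against the 100 requests/day limit, so cache results where possible.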