vlomme/Multi-Tacotron-Voice-Cloning

Phoneme multilingual(Russian-English) voice cloning based on

Archived
50
/ 100
Established

Implements a four-stage pipeline combining speaker verification (GE2E encoder), phonemic text-to-speech synthesis (Tacotron 2), and neural vocoding (WaveRNN) to enable few-shot voice cloning from seconds of audio. Uses phoneme-based representation with language-specific dictionaries to support both Russian and English in a unified model. Provides pretrained weights and curated multilingual datasets, with training extensible to additional languages via the phoneme dictionary approach.

397 stars. No commits in the last 6 months.

Archived Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25

How are scores calculated?

Stars

397

Forks

91

Language

Python

License

Last pushed

Feb 07, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/vlomme/Multi-Tacotron-Voice-Cloning"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.