vlomme/Multi-Tacotron-Voice-Cloning
Phoneme multilingual(Russian-English) voice cloning based on
ArchivedImplements a four-stage pipeline combining speaker verification (GE2E encoder), phonemic text-to-speech synthesis (Tacotron 2), and neural vocoding (WaveRNN) to enable few-shot voice cloning from seconds of audio. Uses phoneme-based representation with language-specific dictionaries to support both Russian and English in a unified model. Provides pretrained weights and curated multilingual datasets, with training extensible to additional languages via the phoneme dictionary approach.
397 stars. No commits in the last 6 months.
Stars
397
Forks
91
Language
Python
License
—
Category
Last pushed
Feb 07, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/vlomme/Multi-Tacotron-Voice-Cloning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model