CMsmartvoice/One-Shot-Voice-Cloning
:relaxed: One Shot Voice Cloning base on Unet-TTS
Combines a U-Net architecture with Adaptive Instance Normalization (AdaIN) layers to enable robust speaker and style transfer from a single reference audio sample, automatically estimating duration statistics without manual annotation. Built on TensorFlowTTS, it uses a three-stage pipeline (duration model, acoustic model, vocoder) trained exclusively on neutral speech corpus to synthesize arbitrary text in cloned voices. Supports both Python inference and Google Colab notebooks, with pre-trained models available for immediate use.
245 stars. No commits in the last 6 months.
Stars
245
Forks
43
Language
Jupyter Notebook
License
—
Category
Last pushed
Mar 22, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/CMsmartvoice/One-Shot-Voice-Cloning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
OpenBMB/VoxCPM
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
JackismyShephard/ultimate-rvc
An app for creating audio-based content such as song covers and speech using Retrieval-based...
codename0og/codename-rvc-fork-4
Codename's rvc fork version 4, based on Applio.
ArkanDash/Advanced-RVC-Inference
Advanced RVC Inference for quicker and effortless model downloads