CMsmartvoice/One-Shot-Voice-Cloning

One Shot Voice Cloning based on Unet-TTS

/ 100

Emerging

Combines a U-Net architecture with Adaptive Instance Normalization (AdaIN) layers to transfer speaker identity and speaking style from a single reference audio sample, estimating duration statistics automatically rather than from manual annotation. Built on TensorFlowTTS, it uses a three-stage pipeline (duration model, acoustic model, vocoder) trained exclusively on a neutral-speech corpus to synthesize arbitrary text in the cloned voice. Inference runs from Python or a Google Colab notebook, and pre-trained models are available for immediate use.
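To make the AdaIN step concrete, here is a minimal sketch of the generic AdaIN operation in plain Python: each content channel is normalized by its own mean and standard deviation over time, then rescaled to the reference (style) channel's statistics. It illustrates the general formula only and is not taken from the repository. The function names (`channel_stats`, `adain`), the list-of-lists layout, and the toy values are assumptions chosen for readability.

```python
import math


def channel_stats(channel, eps=1e-5):
    """Return (mean, std) of one channel's values over time.

    eps is added to the variance so the std is never zero.
    """
    mean = sum(channel) / len(channel)
    var = sum((v - mean) ** 2 for v in channel) / len(channel)
    return mean, math.sqrt(var + eps)


def adain(content, style, eps=1e-5):
    """Apply AdaIN channel by channel.

    content, style: lists of channels, each a list of floats over time.
    Time lengths may differ; channel counts must match.
    """
    if len(content) != len(style):
        raise ValueError("content and style need the same number of channels")
    out = []
    for c_chan, s_chan in zip(content, style):
        c_mean, c_std = channel_stats(c_chan, eps)
        s_mean, s_std = channel_stats(s_chan, eps)
        # Normalize with content statistics, then rescale with style statistics.
        out.append([s_std * (v - c_mean) / c_std + s_mean for v in c_chan])
    return out


# Toy features: 2 channels; the reference is shorter than the content.
content = [[0.0, 1.0, 2.0, 3.0], [10.0, 10.5, 11.0, 11.5]]
style = [[5.0, 7.0, 9.0], [-1.0, 0.0, 1.0]]
stylized = adain(content, style)
```

After the call, each channel of `stylized` keeps the time length of `content` but carries the per-channel mean and (approximately) the standard deviation of `style`. This is why a single reference sample can supply the statistics: only channel-wise summaries of the reference are used, so its length does not have to match the text being synthesized.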

245 stars. No commits in the last 6 months.

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 20 / 25


Stars

245

Forks

Language

Jupyter Notebook

License

—

Featured in

Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Higher-rated alternatives

OpenBMB/VoxCPM

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

JackismyShephard/ultimate-rvc

An app for creating audio-based content such as song covers and speech using Retrieval-based...

codename0og/codename-rvc-fork-4

Codename's rvc fork version 4, based on Applio.

ArkanDash/Advanced-RVC-Inference

Advanced RVC Inference for quicker and effortless model downloads
