gitmylo/bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
Uses HuBERT (a self-supervised speech model) with a custom quantizer to extract semantic tokens from reference voice samples, enabling high-quality voice cloning in Bark. The quantizer compresses HuBERT embeddings into discrete tokens that condition the text-to-speech model. Pretrained quantizers are available for multiple languages (English, Polish, German), and the modular design allows easy integration into existing Bark projects via simple Python APIs.
711 stars. No commits in the last 6 months.
Stars
711
Forks
116
Language
Python
License
MIT
Category
Last pushed
Sep 13, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gitmylo/bark-voice-cloning-HuBERT-quantizer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
OpenBMB/VoxCPM
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
JackismyShephard/ultimate-rvc
An app for creating audio-based content such as song covers and speech using Retrieval-based...
codename0og/codename-rvc-fork-4
Codename's rvc fork version 4, based on Applio.
ArkanDash/Advanced-RVC-Inference
Advanced RVC Inference for quicker and effortless model downloads