Ashish-Patnaik/kokoclone
Voice Cloning, Now Inside Kokoro. Generate natural multilingual speech and clone any target voice with ease.
Combines Kokoro-ONNX TTS with the Kanade voice-conversion model to enable both text-to-speech cloning and audio-to-audio re-voicing without transcription. Features VRAM-aware chunking with RoPE-ceiling enforcement for processing long recordings while respecting Transformer positional embedding limits, plus automatic hardware detection for CPU/GPU optimization. Supports 8 languages through a unified API layer exposed via Gradio web UI, CLI, and Python SDK.
Stars
62
Forks
10
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 11, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Ashish-Patnaik/kokoclone"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
thewh1teagle/kokoro-onnx
TTS with kokoro and onnx runtime
nazdridoy/kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with...
met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs...
Lyrcaxis/KokoroSharp
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and...
remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch...