remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Provides phoneme-level control with per-word timestamped captions and supports voice mixing via weighted combinations, enabling fine-grained audio generation and synthesis customization. Implements an OpenAI-compatible Speech API endpoint for drop-in integration with existing applications while offering a built-in web UI for standalone use. Includes Kubernetes/Helm deployment support and integrations with popular AI frameworks like SillyTavern and OpenWebUI.
4,585 stars.
Stars
4,585
Forks
764
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 04, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/remsky/Kokoro-FastAPI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
thewh1teagle/kokoro-onnx
TTS with kokoro and onnx runtime
nazdridoy/kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with...
met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs...
Lyrcaxis/KokoroSharp
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and...
lucasjinreal/Kokoros
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with...