neosun100/cosyvoice-docker
🎙️ CosyVoice All-in-One Docker - Production-ready TTS with Web UI, REST API & Voice Cloning
Built on Alibaba's Fun-CosyVoice3-0.5B model with integrated Fun-ASR-Nano for automatic voice transcription, the project delivers streaming output with a time-to-first-byte under 1.3 s through an OpenAI-compatible `/v1/audio/speech` endpoint. It features embedding caching (53% latency reduction on repeat voices), custom voice management through reference audio uploads, and multi-language support across Chinese dialects, English, Japanese, Korean, and European languages, all optimized for NVIDIA GPUs with a real-time factor below 1.0.
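As a rough illustration of what calling an OpenAI-compatible `/v1/audio/speech` endpoint could look like, the sketch below builds a request in the shape of OpenAI's speech API (`model`, `input`, `voice`) and streams the response to a file. The base URL, port, model identifier, and voice name are assumptions made for the example and are not taken from this page; check the project's own README for the real values. The `real_time_factor` helper only restates the usual definition of the metric (synthesis time divided by audio duration).

```python
import json
import urllib.request

# Assumptions (hypothetical, not from the project page):
BASE_URL = "http://localhost:8000"  # address where the container is reachable
MODEL = "cosyvoice"                 # model identifier the server expects
VOICE = "default"                   # name of a built-in or uploaded voice


def build_speech_request(text, voice=VOICE, model=MODEL, base_url=BASE_URL):
    """Build a POST request shaped like OpenAI's /v1/audio/speech API."""
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def stream_to_file(request, out_path, chunk_size=4096):
    """Send the request and write audio to disk chunk by chunk.

    Returns the number of bytes written.
    """
    written = 0
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out:
        while True:
            chunk = response.read(chunk_size)
            if not chunk:
                break
            out.write(chunk)
            written += len(chunk)
    return written


def real_time_factor(synthesis_seconds, audio_seconds):
    """RTF = synthesis time / audio duration; below 1.0 is faster than real time."""
    return synthesis_seconds / audio_seconds
```

With a server running at the assumed address, `stream_to_file(build_speech_request("Hello"), "hello.wav")` would save the returned audio; the output format depends on the server's defaults.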
Stars: 38
Forks: 6
Language: Python
License: Apache-2.0
Category:
Last pushed: Jan 18, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/neosun100/cosyvoice-docker"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
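The same lookup can be sketched in Python. The example below only mirrors the URL from the curl command; treating its three path segments as a domain, owner, and repository name is an assumption, as is the expectation that the response body is JSON. The names `quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Root taken from the curl example on this page.
API_ROOT = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(domain, owner, repo):
    """Join the three path segments seen in the curl example onto the API root."""
    parts = [quote(part, safe="") for part in (domain, owner, repo)]
    return API_ROOT + "/" + "/".join(parts)


def fetch_quality(domain, owner, repo, timeout=10):
    """GET the record for one repository.

    Assumes (unverified) that the response body is JSON.
    """
    url = quality_url(domain, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "neosun100", "cosyvoice-docker")` would issue the same request as the curl command; with the keyless limit of 100 requests/day, caching the result locally is a sensible precaution when polling many repositories.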
Higher-rated alternatives
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models
NeonGeckoCom/neon-tts-plugin-coqui
Coqui AI TTS plugin
Atm4x/tts-with-rvc
TTS with RVC-module to generate .wav audios