neosun100/cosyvoice-docker

🎙️ CosyVoice All-in-One Docker - Production-ready TTS with Web UI, REST API & Voice Cloning

44 / 100

Emerging

Built on Alibaba's Fun-CosyVoice3-0.5B model with integrated Fun-ASR-Nano for automatic voice transcription, it streams audio with a time to first byte under 1.3 s through the OpenAI-compatible `/v1/audio/speech` endpoint. Features include embedding caching (a 53% latency reduction on repeat voices), custom voice management through reference audio uploads, and multi-language support across Chinese dialects, English, Japanese, Korean, and European languages. Everything is optimized for NVIDIA GPUs, with a real-time factor below 1.0.
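As a rough illustration of what calling an OpenAI-compatible `/v1/audio/speech` endpoint could look like, here is a minimal Python sketch using only the standard library. The request fields (`model`, `input`, `voice`, `response_format`) follow OpenAI's speech API; the base URL, port, model identifier, and voice name are assumptions for illustration and are not taken from this project's documentation. The `real_time_factor` helper just spells out the usual definition (synthesis time divided by audio duration).

```python
import json
import urllib.request

# Assumption: the container is reachable locally on this port; adjust to your deployment.
BASE_URL = "http://localhost:8000"


def build_speech_request(text, voice, base_url=BASE_URL,
                         model="cosyvoice", response_format="wav"):
    """Build a POST request shaped like OpenAI's /v1/audio/speech API."""
    payload = {
        "model": model,                      # hypothetical model identifier
        "input": text,                       # text to synthesize
        "voice": voice,                      # hypothetical voice name
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def stream_speech_to_file(request, out_path, chunk_size=4096):
    """Write the response body to disk chunk by chunk; return bytes written."""
    total = 0
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out:
        while True:
            chunk = response.read(chunk_size)
            if not chunk:
                break
            out.write(chunk)
            total += len(chunk)
    return total


def real_time_factor(synthesis_seconds, audio_seconds):
    """RTF = synthesis time / audio duration; below 1.0 is faster than real time."""
    return synthesis_seconds / audio_seconds


if __name__ == "__main__":
    # Requires a running server; "my_voice" is a placeholder voice name.
    req = build_speech_request("Hello from CosyVoice.", voice="my_voice")
    written = stream_speech_to_file(req, "hello.wav")
    print(f"wrote {written} bytes")
```

Reading the response in chunks rather than all at once is what lets a client start consuming audio soon after the first byte arrives, which is where the time-to-first-byte figure matters.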

No Package · No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 13 / 25

Community 14 / 25

Language: Python

License: Apache-2.0

Higher-rated alternatives

herimor/voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

EveryVoiceTTS/EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

kadirnar/VoiceHub

VoiceHub: A Unified Inference Interface for TTS Models

NeonGeckoCom/neon-tts-plugin-coqui

Coqui AI TTS plugin

Atm4x/tts-with-rvc

TTS with RVC-module to generate .wav audios
