journey-ad/CosyVoice2-Ex
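CosyVoice2 feature extensions (pretrained voice inference / 3s rapid cloning / natural-language control / automatic recognition / voice model saving / API)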
Builds on the CosyVoice2 base model, adding a REST API and web UI for voice cloning and synthesis, with multilingual text normalization and style-based instruction control (for example, "speak slowly, use a cute tone"). It uses pre-trained acoustic models from ModelScope, with optional `ttsfrd` text processing acceleration on Linux, and can be deployed via Conda environments or standalone Windows executables.
189 stars. No commits in the last 6 months.
Stars: 189
Forks: 28
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 13, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/journey-ad/CosyVoice2-Ex"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
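The same request can be made from Python. This is a minimal sketch using only the standard library. It assumes that the URL follows a `category/owner/repo` pattern (inferred from the single example above) and that the response body is JSON; neither is confirmed by this page. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path layout is <category>/<owner>/<repo>,
    generalized from the one URL shown above.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body.

    Assumption: the body is JSON; the response schema is not documented here.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(quality_url("voice-ai", "journey-ad", "CosyVoice2-Ex"))
```

Because the unauthenticated limit is 100 requests/day, it is worth caching results locally rather than calling `fetch_quality` repeatedly for the same repository.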
Higher-rated alternatives
herimor/voxtream
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
EveryVoiceTTS/EveryVoice
The EveryVoice TTS Toolkit - Text To Speech for your language
kadirnar/VoiceHub
VoiceHub: A Unified Inference Interface for TTS Models
NeonGeckoCom/neon-tts-plugin-coqui
Coqui AI TTS plugin
Atm4x/tts-with-rvc
TTS with RVC-module to generate .wav audios