shibing624/parrots
Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy.
Builds on distilwhisper for ASR and GPT-SoVITS for multi-speaker TTS, with IndexTTS2 support for emotion control via reference audio, emotion vectors, or emotion inferred from text. Offers streaming TTS for low-latency real-time synthesis, decouples speaker identity from emotion, and provides both a CLI and a Python API.
526 stars and 238 monthly downloads. Available on PyPI.
Stars: 526
Forks: 95
Language: Python
License: Apache-2.0
Category:
Last pushed: Nov 11, 2025
Monthly downloads: 238
Commits (30d): 0
Dependencies: 20
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shibing624/parrots"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
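The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, as the single URL above suggests; neither is confirmed here. The helper names `quality_url` and `fetch_quality` are hypothetical, and no response fields are assumed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumption: other projects follow the same
    <category>/<owner>/<repo> layout as the example URL.
    """
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response (hypothetical helper).

    Assumption: the body is JSON. The schema is not documented here,
    so the decoded object is returned unchanged.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Each call counts against the daily request limit.
    data = fetch_quality("voice-ai", "shibing624", "parrots")
    print(json.dumps(data, indent=2, ensure_ascii=False))
```

Keeping URL construction separate from the network call makes it possible to check the path without spending a request.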
Related tools
MainRo/deepspeech-server
A testing server for a speech to text service based on coqui.ai
altunenes/parakeet-rs
Very fast speech-to-text, diarization, and streaming (even on CPU) with NVIDIA Parakeet in Rust
thewh1teagle/pyannote-rs
pyannote audio diarization in rust
codyw912/open-asr-server
OpenAI-compatible ASR server with pluggable local backends (Parakeet, Whisper, Lightning...
Frikallo/parakeet.cpp
Ultra fast and portable Parakeet implementation for on-device inference in C++ using Axiom with...