MainRo/deepspeech-server
A testing server for a speech to text service based on coqui.ai
Provides an HTTP API for real-time speech-to-text inference using Coqui STT models, with YAML-based configuration for tuning model parameters (beam width, language model alpha/beta) and server settings. Supports TensorFlow Lite models with optional scorer files for domain-specific phrase recognition, accepting audio payloads via POST requests with configurable size limits.
219 stars and 1,072 monthly downloads. No commits in the last 6 months. Available on PyPI.
Stars
219
Forks
70
Language
Python
License
MPL-2.0
Category
Last pushed
Jul 12, 2022
Monthly downloads
1,072
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/MainRo/deepspeech-server"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
shibing624/parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
altunenes/parakeet-rs
very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust
thewh1teagle/pyannote-rs
pyannote audio diarization in rust
PaddlePaddle/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS,...
daanzu/deepspeech-websocket-server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments