MainRo/deepspeech-server

A testing server for a speech to text service based on coqui.ai

/ 100

Established

Provides an HTTP API for real-time speech-to-text inference using Coqui STT models, with YAML-based configuration for tuning model parameters (beam width, language model alpha/beta) and server settings. Supports TensorFlow Lite models with optional scorer files for domain-specific phrase recognition, accepting audio payloads via POST requests with configurable size limits.

219 stars and 1,072 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m No Dependents

Maintenance 0 / 25

Adoption 17 / 25

Maturity 25 / 25

Community 23 / 25

How are scores calculated?

Stars

219

Forks

Language

Python

License

MPL-2.0

Related tools

shibing624/parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成，支持多语言，准确率高

altunenes/parakeet-rs

very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust

thewh1teagle/pyannote-rs

pyannote audio diarization in rust

PaddlePaddle/Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS,...

daanzu/deepspeech-websocket-server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Explore Voice AI Tools

All categories Trending Voice AI directory Insights