daanzu/deepspeech-websocket-server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

45
/ 100
Emerging

Implements streaming inference via Mozilla's DeepSpeech with voice activity detection (VAD) on the client to segment utterances and filter noise. The server handles multi-user connections using gevent WebSockets with sequential decoding, while the client streams raw audio from PyAudio-compatible microphone interfaces. Supports DeepSpeech v0.2+ with configurable CTC decoder parameters (language model weight, beam width, vocabulary insertion penalty) and optional utterance saving for model validation.

103 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

103

Forks

31

Language

Python

License

MPL-2.0

Last pushed

May 29, 2020

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/daanzu/deepspeech-websocket-server"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.