SEPIA-Framework/sepia-stt-server

SEPIA server to support open-source speech recognition via WebSocket connection.

44
/ 100
Emerging

# Technical Summary Full-duplex Python FastAPI server supporting multiple pluggable open-source ASR engines (Vosk, Coqui, Deepspeech, Scribosermo) with standardized WebSocket API for streaming audio and receiving real-time partial/final transcriptions. Features modular architecture enabling per-engine configuration, optional post-processing, speaker identification, grammar constraints, confidence scores, and word timestamps—all configurable on-the-fly via HTTP REST and WebSocket events. Includes Docker multi-architecture support (x86-64, ARM 32/64-bit) optimized for resource-constrained devices like Raspberry Pi 4, with token-based user authentication and tight integration with SEPIA Framework clients.

136 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

136

Forks

23

Language

Python

License

MIT

Last pushed

Nov 07, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/SEPIA-Framework/sepia-stt-server"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.