alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Built on compact models of about 50 MB each, Vosk provides continuous large-vocabulary transcription, zero-latency responses through its streaming API, and a reconfigurable vocabulary, with support for more than 20 languages. The toolkit also supports speaker identification and offers bindings for Python, Java, Node.js, C++, Rust, and Go, scaling from embedded devices to server clusters. Primary use cases include chatbot integration, smart home voice control, virtual assistants, and media subtitle generation.
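As a rough illustration of the streaming API and reconfigurable vocabulary mentioned above, here is a minimal Python sketch. It assumes the `vosk` package from PyPI, a model directory you have already downloaded, and a 16-bit mono PCM WAV file. The helper names (`wav_chunks`, `stream_transcripts`, `transcribe_file`) are hypothetical and not part of Vosk; only `Model`, `KaldiRecognizer`, `AcceptWaveform`, `Result`, `PartialResult`, and `FinalResult` come from the library.

```python
import json
import wave


def wav_chunks(path, frames_per_chunk=4000):
    """Yield raw PCM byte chunks from a WAV file (assumed 16-bit mono)."""
    with wave.open(path, "rb") as wf:
        while True:
            data = wf.readframes(frames_per_chunk)
            if not data:
                break
            yield data


def stream_transcripts(recognizer, chunks):
    """Feed audio chunks to a recognizer and yield (kind, text) pairs.

    `recognizer` is expected to behave like vosk.KaldiRecognizer.
    """
    for chunk in chunks:
        if recognizer.AcceptWaveform(chunk):
            # The recognizer detected the end of an utterance.
            yield "final", json.loads(recognizer.Result()).get("text", "")
        else:
            # Still mid-utterance: report the running hypothesis.
            yield "partial", json.loads(recognizer.PartialResult()).get("partial", "")
    # Flush whatever audio is still buffered at the end of the stream.
    yield "final", json.loads(recognizer.FinalResult()).get("text", "")


def transcribe_file(model_dir, wav_path, phrases=None):
    """Hypothetical convenience wrapper; requires `pip install vosk`."""
    from vosk import Model, KaldiRecognizer  # imported lazily on purpose

    model = Model(model_dir)
    with wave.open(wav_path, "rb") as wf:
        rate = wf.getframerate()
    if phrases:
        # Restrict recognition to a JSON list of phrases.
        rec = KaldiRecognizer(model, rate, json.dumps(phrases))
    else:
        rec = KaldiRecognizer(model, rate)
    results = stream_transcripts(rec, wav_chunks(wav_path))
    return [text for kind, text in results if kind == "final" and text]
```

A call such as `transcribe_file("model", "speech.wav", phrases=["turn on the light", "[unk]"])` would limit recognition to the listed phrases; both paths are placeholders.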
14,377 stars and 320,271 monthly downloads. Used by 6 other packages. Available on PyPI.
Stars: 14,377
Forks: 1,687
Language: Jupyter Notebook
License: Apache-2.0
Category:
Last pushed: Feb 22, 2026
Monthly downloads: 320,271
Commits (30d): 0
Dependencies: 5
Reverse dependents: 6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/alphacep/vosk-api"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
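The same request can be sketched in Python using only the standard library. This is an illustration under assumptions: the listing does not document the response format, so treating the body as JSON is a guess, and reading the path as category/owner/repo is inferred from the single URL shown above. The function names are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the path layout is inferred, not documented."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the record, assuming the endpoint returns a JSON body."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "alphacep", "vosk-api")` issues the same request as the curl command; with the keyless limit of 100 requests per day, caching the result locally is a sensible default.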
Related tools
huggingface/speech-to-speech
Build local voice agents with open-source models
linto-ai/WebVoiceSDK
Building blocks for voice-enabled applications in the browser
Picovoice/speech-to-text-benchmark
Speech-to-text benchmark framework
vox-serve/vox-serve
A Streaming-Native Serving Engine for TTS/STS Models
Lex-au/Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and...