alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

/ 100

Verified

Built on lightweight acoustic models (50MB), Vosk delivers continuous large-vocabulary transcription with zero-latency streaming and reconfigurable vocabulary across 20+ languages. The toolkit supports speaker identification and exposes a streaming API with bindings across Python, Java, Node.js, C++, Rust, and Go, scaling from embedded devices to server clusters. Primary use cases include chatbot integration, smart home voice control, virtual assistants, and media subtitle generation.

14,377 stars and 320,271 monthly downloads. Used by 6 other packages. Available on PyPI.

Maintenance 10 / 25

Adoption 25 / 25

Maturity 25 / 25

Community 21 / 25

How are scores calculated?

Stars

14,377

Forks

1,687

Language

Jupyter Notebook

License

Apache-2.0

Featured in

Things AI Won't Tell You About Building a Voice App

Related tools

huggingface/speech-to-speech

Build local voice agents with open-source models

linto-ai/WebVoiceSDK

Buildings block for voice-enabled applications in the browser

Picovoice/speech-to-text-benchmark

speech to text benchmark framework

vox-serve/vox-serve

A Streaming-Native Serving Engine for TTS/STS Models

Lex-au/Orpheus-FastAPI

High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights