alphacep/vosk

VOSK Speech Recognition Toolkit

/ 100

Established

Audio fingerprinting and LSH-based indexing enable training on massive speech datasets (100k+ hours) without neural networks, with incremental model improvement through direct sample addition. The system segments audio into chunks, stores them in a hash-indexed database for fast lookup during decoding, and integrates with Kaldi for phoneme alignment and segmentation. Supports lifelong learning paradigms with built-in verification tools to identify and correct recognition gaps.

493 stars and 335,415 monthly downloads. Used by 6 other packages. No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 25 / 25

Maturity 18 / 25

Community 18 / 25

How are scores calculated?

Stars

493

Forks

Language

License

Apache-2.0

Compare

vosk and vosk-browser vosk and vosk-asterisk vosk and IBus-Speech-To-Text vosk and vosk-cli-dictation

Related tools

k2-fsa/sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and...

ccoreilly/vosk-browser

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

alphacep/vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

solyarisoftware/voskJs

Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.

alphacep/vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Explore Voice AI Tools

All categories Trending Voice AI directory Insights