alphacep/vosk
VOSK Speech Recognition Toolkit
Audio fingerprinting and LSH-based indexing enable training on massive speech datasets (100k+ hours) without neural networks, with incremental model improvement through direct sample addition. The system segments audio into chunks, stores them in a hash-indexed database for fast lookup during decoding, and integrates with Kaldi for phoneme alignment and segmentation. Supports lifelong learning paradigms with built-in verification tools to identify and correct recognition gaps.
493 stars and 335,415 monthly downloads. Used by 6 other packages. No commits in the last 6 months. Available on PyPI.
Stars
493
Forks
56
Language
C
License
Apache-2.0
Category
Last pushed
Jul 13, 2022
Monthly downloads
335,415
Commits (30d)
0
Dependencies
5
Reverse dependents
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/alphacep/vosk"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and...
ccoreilly/vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
solyarisoftware/voskJs
Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.
alphacep/vosk-asterisk
Speech Recognition in Asterisk with Vosk Server