vosk and IBus-Speech-To-Text
The VOSK Speech Recognition Toolkit is an automatic speech recognition (ASR) engine, while IBus-Speech-To-Text is an IBus input method engine that uses VOSK as its recognition backend. The two projects are complements rather than competitors.
About vosk
alphacep/vosk
VOSK Speech Recognition Toolkit
Audio fingerprinting and indexing based on locality-sensitive hashing (LSH) enable training on massive speech datasets (100k+ hours) without neural networks, and the model improves incrementally as new samples are added directly. The system segments audio into chunks, stores them in a hash-indexed database for fast lookup during decoding, and integrates with Kaldi for phoneme alignment and segmentation. It supports lifelong learning, with built-in verification tools for identifying and correcting recognition gaps.
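To make the idea of a hash-indexed chunk database more concrete, here is a minimal sketch of random-hyperplane LSH in plain Python. It is an illustration only and not the project's actual code: the names `chunk`, `lsh_key`, and `ChunkIndex` are hypothetical, the "fingerprint" is simply a list of numbers, and the choice of random hyperplanes as the hash family is an assumption made for this example.

```python
import random
from collections import defaultdict


def chunk(samples, size):
    """Split a sequence into consecutive, non-overlapping chunks of `size` items.

    A trailing remainder shorter than `size` is dropped.
    """
    return [samples[i:i + size] for i in range(0, len(samples) - size + 1, size)]


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw `num_bits` random hyperplanes (normal vectors) of dimension `dim`."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def lsh_key(vector, hyperplanes):
    """Hash a vector to an integer: one bit per hyperplane, set by the sign of the dot product."""
    key = 0
    for plane in hyperplanes:
        dot = sum(p * v for p, v in zip(plane, vector))
        key = (key << 1) | (1 if dot >= 0 else 0)
    return key


class ChunkIndex:
    """Hypothetical hash-indexed store: vectors pointing in similar directions tend to share a bucket."""

    def __init__(self, dim, num_bits=8, seed=0):
        self.hyperplanes = make_hyperplanes(num_bits, dim, seed)
        self.buckets = defaultdict(list)

    def add(self, vector, label):
        """Incrementally add one labelled sample; no retraining step is involved."""
        self.buckets[lsh_key(vector, self.hyperplanes)].append((vector, label))

    def lookup(self, vector):
        """Return the labels stored in the bucket that `vector` hashes to."""
        key = lsh_key(vector, self.hyperplanes)
        return [label for _, label in self.buckets.get(key, [])]


if __name__ == "__main__":
    index = ChunkIndex(dim=4)
    # Toy "fingerprints": four numbers per chunk, labelled with a made-up phoneme.
    for piece, label in zip(chunk([0.2, 0.9, -0.4, 0.1, -0.7, 0.3, 0.8, -0.2], 4), ["a", "b"]):
        index.add(piece, label)
    print(index.lookup([0.2, 0.9, -0.4, 0.1]))
```

In this sketch, adding a sample is a single bucket append, which mirrors the incremental improvement described above, and a lookup touches only one bucket rather than the whole database. A real system would need proper acoustic features, several hash tables, and a distance check on the candidates in each bucket.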
About IBus-Speech-To-Text
PhilippeRo/IBus-Speech-To-Text
A speech to text IBus engine using VOSK