voxtype and whisper-overlay
These are near-identical competitors—both implement push-to-talk speech-to-text overlays for Wayland using similar architectures, so users would typically choose one based on maturity (voxtype) versus active maintenance (whisper-overlay).
About voxtype
peteonrails/voxtype
Voice-to-text with push-to-talk for Wayland compositors
Supports 7 offline transcription engines (Whisper, Moonshine, SenseVoice, etc.) with native CJK language support, meeting mode for continuous transcription with speaker attribution and multi-format export, and flexible text input via compositor keybindings or evdev fallback. Architecture uses whisper.cpp for local inference with configurable post-processing pipelines, optional Ollama integration for grammar correction, and multiple typing backends (wtype, dotool, ydotool, clipboard) with automatic fallback chains based on environment capabilities.
About whisper-overlay
oddlama/whisper-overlay
A wayland overlay providing speech-to-text functionality for any application via a global push-to-talk hotkey
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work