gurjar1/OmniDictate
Free, open-source, real-time dictation for Windows. Runs locally (no cloud!), uses AI, and types directly into any application via a user-friendly GUI.
OmniDictate uses the optimized `faster-whisper` library with support for the `large-v3-turbo` model, running transcription on an NVIDIA GPU via CUDA and falling back to the CPU when no GPU is available. It offers voice activity detection (VAD) and push-to-talk modes, spoken punctuation commands, and hallucination filtering to clean up the transcribed text. The app is built with Python and PyTorch and types by simulating keyboard input in any Windows application. All configuration (model size, language, sensitivity thresholds, hotkeys) is managed through its slate-glass GUI.
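To illustrate what spoken punctuation commands and hallucination filtering can look like as a text post-processing step, here is a minimal sketch in plain Python. It is not OmniDictate's actual code: the phrase table, the list of filtered phrases, and the function names `apply_spoken_punctuation`, `is_probable_hallucination`, and `postprocess` are all hypothetical, and the real project may use different commands and heuristics.

```python
import re

# Hypothetical phrase-to-symbol table; the project's real command list may differ.
SPOKEN_PUNCTUATION = {
    "exclamation mark": "!",
    "question mark": "?",
    "new line": "\n",
    "period": ".",
    "comma": ",",
}

# Hypothetical examples of phrases speech models sometimes emit on silence or noise.
KNOWN_HALLUCINATIONS = {
    "thanks for watching",
    "thank you for watching",
}


def apply_spoken_punctuation(text: str) -> str:
    """Replace spoken commands such as 'comma' with the matching symbol."""
    # Handle longer phrases first so multi-word commands are matched whole.
    for phrase in sorted(SPOKEN_PUNCTUATION, key=len, reverse=True):
        symbol = SPOKEN_PUNCTUATION[phrase]
        # Swallow the whitespace before the command so the symbol
        # attaches to the preceding word.
        pattern = r"\s*\b" + re.escape(phrase) + r"\b"
        text = re.sub(pattern, lambda _m, s=symbol: s, text, flags=re.IGNORECASE)
    # Drop spaces left over directly after an inserted line break.
    return re.sub(r"\n[ \t]+", "\n", text)


def is_probable_hallucination(text: str) -> bool:
    """Flag empty output, known filler phrases, and one word repeated 4+ times."""
    normalized = re.sub(r"[^\w\s]", "", text).strip().lower()
    if not normalized or normalized in KNOWN_HALLUCINATIONS:
        return True
    words = normalized.split()
    return len(words) >= 4 and len(set(words)) == 1


def postprocess(text: str) -> str:
    """Return text ready for typing, or an empty string if it should be dropped."""
    if is_probable_hallucination(text):
        return ""
    return apply_spoken_punctuation(text)
```

With this sketch, `postprocess("hello comma how are you question mark")` yields `hello, how are you?`. A simple table like this also rewrites the literal words "period" and "comma" when they are meant as ordinary words, so a real implementation would likely need extra context rules.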
Stars: 111
Forks: 12
Language: Python
License: —
Category:
Last pushed: Dec 12, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gurjar1/OmniDictate"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
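The same request can be made from Python with only the standard library. This is a sketch under assumptions: the `category/owner/repo` path layout is inferred from the single URL shown above, the response is assumed to be JSON, and the helper names `quality_url` and `fetch_quality` are made up for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the path layout is inferred from the curl example."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the API returns JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("voice-ai", "gurjar1", "OmniDictate")
```

The fields of the returned object are not documented here, so inspect the decoded result before relying on any particular key.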
Higher-rated alternatives
jatinkrmalik/vocalinux
Free, open-source, 100% offline voice dictation for Linux. Speak and type anywhere via...
mkiol/dsnote
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to...
peteonrails/voxtype
Voice-to-text with push-to-talk for Wayland compositors
themanyone/voice_typing
State-of-the-art offline (or networked) voice typing everywhere + text terminals (Linux or WFL...
oddlama/whisper-overlay
A wayland overlay providing speech-to-text functionality for any application via a global...