whisper-clip and whisper-speech-to-text
These are ecosystem siblings that serve different deployment environments: WhisperClip is a desktop application that runs Whisper locally through `faster-whisper`, while whisper-speech-to-text is a JavaScript library that sends audio recorded in the browser to OpenAI's Whisper endpoint. They neither compete nor need to be used together.
About whisper-clip
gustavostz/whisper-clip
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.
Technical Summary
Leverages `faster-whisper` for accelerated local inference with quantization options (int8, float16) and optional CUDA GPU support, ensuring all audio processing remains on-device. Exposes a FastAPI-based transcription server enabling remote mobile clients via iOS Shortcuts and Android to submit audio over VPN (Tailscale/Meshnet) and receive clipboard-ready transcriptions. Includes real-time audio visualization, configurable hotword biasing, and a system tray interface with global hotkey activation.
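As a rough illustration of the local inference path described above, the following is a minimal sketch and not WhisperClip's actual code. It assumes the `faster-whisper` package is installed; the helper names `pick_compute_type`, `join_segments`, and `transcribe`, the `small` model size, and the rule of using float16 on CUDA and int8 on CPU are all hypothetical choices made for the example.

```python
from typing import Iterable, Optional


def pick_compute_type(device: str) -> str:
    """Hypothetical rule: float16 on a CUDA GPU, int8 quantization on CPU."""
    return "float16" if device == "cuda" else "int8"


def join_segments(texts: Iterable[str]) -> str:
    """Merge per-segment text into a single clipboard-ready string."""
    return " ".join(t.strip() for t in texts if t.strip())


def transcribe(
    path: str,
    device: str = "cpu",
    model_size: str = "small",  # assumed model size, not taken from the project
    hotwords: Optional[str] = None,
) -> str:
    """Transcribe an audio file on-device and return the full text."""
    # Imported lazily so the helpers above work without the package installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(
        model_size,
        device=device,
        compute_type=pick_compute_type(device),
    )
    # `hotwords` biases decoding toward the given terms; segments are
    # produced lazily, so joining them is what actually runs the decoding.
    segments, _info = model.transcribe(path, hotwords=hotwords)
    return join_segments(segment.text for segment in segments)
```

A call such as `transcribe("memo.wav", device="cuda", hotwords="Tailscale Meshnet")` would return the text as one string, which an application could then copy to the clipboard. The file name here is a placeholder.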
About whisper-speech-to-text
nitaiaharoni1/whisper-speech-to-text
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via OpenAI's Whisper, intended for web applications.
Provides real-time transcription with pause/resume control during recording sessions. Uses the Web Audio API to capture microphone input and streams audio to OpenAI's Whisper endpoint via callback-based async methods. Handles microphone permissions automatically, making it suitable for embedding directly into React, Vue, or vanilla JavaScript web applications without additional permission management.
Scores updated daily from GitHub, PyPI, and npm data.