TheStageAI/TheWhisper
Optimized Whisper models for streaming and on-device use
Fine-tuned Whisper variants support flexible chunk sizes (10s-30s vs. original 30s fixed) and deliver platform-specific optimizations: CoreML engines for Apple Silicon (~2W power, ~2GB RAM) and NVIDIA GPU acceleration (220 tok/s on L40s). Streaming inference is available across both platforms with word-level timestamps and multilingual support, deployable via Python API or local REST endpoints integrated with Electron/web frontends.
821 stars.
Stars
821
Forks
55
Language
Python
License
MIT
Category
Last pushed
Mar 01, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/TheStageAI/TheWhisper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
ChetanXpro/nodejs-whisper
NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++...
vilassn/whisper_android
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
sandrohanea/whisper.net
Whisper.net. Speech to text made simple using Whisper Models
mybigday/whisper.rn
React Native binding of whisper.cpp.