TheStageAI/TheWhisper

Optimized Whisper models for streaming and on-device use

/ 100

Emerging

Fine-tuned Whisper variants support flexible chunk sizes (10s-30s vs. original 30s fixed) and deliver platform-specific optimizations: CoreML engines for Apple Silicon (~2W power, ~2GB RAM) and NVIDIA GPU acceleration (220 tok/s on L40s). Streaming inference is available across both platforms with word-level timestamps and multilingual support, deployable via Python API or local REST endpoints integrated with Electron/web frontends.

821 stars.

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 13 / 25

Community 15 / 25

How are scores calculated?

Stars

821

Forks

Language

Python

License

MIT

Featured in

Things AI Won't Tell You About Building a Voice App

Compare

TheWhisper and whisper.cpp

Higher-rated alternatives

ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

ChetanXpro/nodejs-whisper

NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++...

vilassn/whisper_android

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

sandrohanea/whisper.net

Whisper.net. Speech to text made simple using Whisper Models

mybigday/whisper.rn

React Native binding of whisper.cpp.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights