argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
Wraps OpenAI's Whisper models in Core ML format optimized for Apple Silicon, enabling real-time streaming transcription with word-level timestamps and voice activity detection. Provides a Swift SDK, a local HTTP server with a Deepgram-compatible WebSocket API, and companion tools for speaker diarization and text-to-speech. Supports model customization through fine-tuning workflows and deployment to HuggingFace repositories for easy distribution across projects.
5,775 stars. Actively maintained with 3 commits in the last 30 days.
Stars
5,775
Forks
516
Language
Swift
License
MIT
Category
Voice AI
Last pushed
Mar 12, 2026
Commits (30d)
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/argmaxinc/WhisperKit"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
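The endpoint above can also be called from a script. A minimal sketch in Python, assuming the API returns a JSON object; the field names used here (`name`, `stars`, `forks`, `language`) are guesses based on the stats shown on this page, not a documented schema:

```python
import json
from urllib.request import urlopen

# Endpoint from the curl example above; the free tier needs no API key.
API_URL = "https://pt-edge.onrender.com/api/v1/quality/voice-ai/argmaxinc/WhisperKit"

def fetch_repo_stats(url: str = API_URL) -> dict:
    """Fetch the JSON payload for one repository."""
    with urlopen(url) as resp:
        return json.load(resp)

def summarize(repo: dict) -> str:
    """One-line summary; field names are assumed, not a documented schema."""
    return f'{repo["name"]}: {repo["stars"]} stars, {repo["forks"]} forks ({repo["language"]})'

# Example using the stats shown on this page (no network call):
sample = {"name": "argmaxinc/WhisperKit", "stars": 5775, "forks": 516, "language": "Swift"}
print(summarize(sample))  # argmaxinc/WhisperKit: 5775 stars, 516 forks (Swift)
```

Check the actual response shape once before relying on these keys, since the schema shown here is illustrative.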
Related tools
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data,...
xenova/whisper-web
ML-powered speech recognition directly in your browser
vasistalodagala/whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets...
r0227n/flutter_whisper_kit
🎤 A Flutter plugin for running WhisperKit speech-to-text models on-device, powered by...