nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Built on OpenAI's Whisper, CrisperWhisper employs a custom tokenizer and attention loss mechanism during training to achieve precise word-level timestamp alignment, particularly around disfluencies and pauses. It integrates seamlessly with both 🤗 Transformers and Faster Whisper pipelines, enabling deployment in existing speech recognition workflows. The model prioritizes verbatim transcription including fillers ("um", "uh") and speech artifacts, ranking first on the OpenASR Leaderboard for verbatim datasets like TED-LIUM and AMI.
927 stars. No commits in the last 6 months.
Stars
927
Forks
48
Language
Python
License
—
Category
Last pushed
Jun 03, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/nyrahealth/CrisperWhisper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
linto-ai/linto-stt
An automatic speech recognition API
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper