nyrahealth/CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

/ 100

Emerging

Built on OpenAI's Whisper, CrisperWhisper uses a custom tokenizer and an attention loss during training to produce precise word-level timestamps, particularly around disfluencies and pauses. It works with both 🤗 Transformers and Faster Whisper pipelines, so it can be dropped into existing speech recognition workflows. The model prioritizes verbatim transcription, including fillers ("um", "uh") and speech artifacts, and ranks first on the OpenASR Leaderboard for verbatim datasets such as TED-LIUM and AMI.
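As a rough illustration of the 🤗 Transformers route, the sketch below loads the model through the standard automatic-speech-recognition pipeline with word-level timestamps and then picks filler words out of the result. The model id `nyrahealth/CrisperWhisper` is taken from the repository name, while the helper names `transcribe` and `find_fillers`, the `FILLERS` spellings, and the sample output are assumptions made for illustration; check the project README for the exact tokens the model emits.

```python
# Assumed filler spellings (hypothetical); adjust to match the model's actual output.
FILLERS = frozenset({"um", "uh", "[um]", "[uh]"})


def find_fillers(chunks, fillers=FILLERS):
    """Return (word, start, end) for each word-level chunk that looks like a filler."""
    hits = []
    for chunk in chunks:
        # Normalize: drop surrounding whitespace and sentence punctuation, lowercase.
        word = chunk["text"].strip().strip(".,!?").lower()
        if word in fillers:
            start, end = chunk["timestamp"]
            hits.append((word, start, end))
    return hits


def transcribe(audio_path, model_id="nyrahealth/CrisperWhisper"):
    """Run the Transformers ASR pipeline with word-level timestamps (downloads the model)."""
    from transformers import pipeline  # imported lazily so the helper above has no dependencies

    asr = pipeline("automatic-speech-recognition", model=model_id)
    return asr(audio_path, return_timestamps="word")


# Hand-written sample in the shape the pipeline returns for word timestamps.
sample = {
    "text": " So um I think [UH] yes.",
    "chunks": [
        {"text": " So", "timestamp": (0.0, 0.3)},
        {"text": " um", "timestamp": (0.5, 0.8)},
        {"text": " I", "timestamp": (1.0, 1.1)},
        {"text": " think", "timestamp": (1.1, 1.5)},
        {"text": " [UH]", "timestamp": (1.6, 1.9)},
        {"text": " yes.", "timestamp": (2.2, 2.6)},
    ],
}

for word, start, end in find_fillers(sample["chunks"]):
    print(f"{word} from {start:.1f}s to {end:.1f}s")
```

With real audio, `find_fillers(transcribe("talk.wav")["chunks"])` would apply the same filter to the model's output, where `talk.wav` is a placeholder path.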

927 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 15 / 25

Stars

927

Forks

Language

Python

License

—

Higher-rated alternatives

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

tsmdt/whisply

💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and...

jim60105/docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...

linto-ai/linto-stt

An automatic speech recognition API

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
