whisperX and whisper-run

WhisperX is a mature, widely-adopted framework that combines Whisper ASR with word-level alignment and diarization, while whisper-run is a lightweight wrapper around Faster Whisper that adds diarization as a simpler alternative for users prioritizing speed over the comprehensive timestamp precision that WhisperX provides.

whisperX

Verified

whisper-run

Emerging

Maintenance 20/25

Adoption 25/25

Maturity 25/25

Community 20/25

Maintenance 0/25

Adoption 12/25

Maturity 18/25

Community 8/25

Stars: 20,758

Forks: 2,188

Downloads: 864,629

Commits (30d): 15

Language: Python

License: BSD-2-Clause

Stars: 9

Forks: 1

Downloads: 1,184

Commits (30d): 0

Language: Python

License: Apache-2.0

No risk flags

Stale 6m

About whisperX

m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Builds on OpenAI's Whisper by combining faster-whisper for batched GPU inference (70x speedup) with wav2vec2 forced phoneme alignment to achieve sub-word timing accuracy. Integrates pyannote-audio for speaker diarization and includes VAD preprocessing to reduce hallucinations while maintaining quality. Supports multiple languages with automatic language-specific alignment model selection from HuggingFace and torchaudio.

About whisper-run

gorkemkaramolla/whisper-run

Faster Whisper with Speaker Diarization

Related comparisons

whisperX and whisply whisperX and whisper-diarization whisperX and docker-whisperX whisperX and CrisperWhisper whisperX and whisperVideo whisperX and whisper-v3-diarization

Scores updated daily from GitHub, PyPI, and npm data. How scores work