whisper-diarization and whisper-run

These are competitors offering similar end-to-end solutions for combining Whisper ASR with speaker diarization, though the second prioritizes inference speed optimization while the first has gained significantly more community adoption.

whisper-diarization

Established

whisper-run

Emerging

Maintenance 10/25

Adoption 10/25

Maturity 16/25

Community 20/25

Maintenance 0/25

Adoption 12/25

Maturity 18/25

Community 8/25

Stars: 5,437

Forks: 500

Downloads: —

Commits (30d): 0

Language: Jupyter Notebook

License: BSD-2-Clause

Stars: 9

Forks: 1

Downloads: 1,184

Commits (30d): 0

Language: Python

License: Apache-2.0

No Package No Dependents

Stale 6m

About whisper-diarization

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Combines Whisper with NVIDIA NeMo's voice activity detection and speaker embedding models (MarbleNet/TitaNet) to attribute transcribed text to individual speakers. Uses source separation (Demucs) for vocal extraction, CTC-forced alignment for precise timestamp correction, and punctuation-based realignment to compensate for temporal drift across segments. Outputs speaker-labeled transcriptions with segment-level timestamps, supporting configurable Whisper models and parallel inference modes for systems with sufficient VRAM.

About whisper-run

gorkemkaramolla/whisper-run

Faster Whisper with Speaker Diarization

Related comparisons

whisper-diarization and whisperX whisper-diarization and whisper-v3-diarization whisper-diarization and whisperX whisper-diarization and whisper-v3-diarization

Scores updated daily from GitHub, PyPI, and npm data. How scores work