whisper-diarization and whisper-run

These are competitors offering similar end-to-end solutions for combining Whisper ASR with speaker diarization, though the second prioritizes inference speed optimization while the first has gained significantly more community adoption.

whisper-diarization
56
Established
whisper-run
38
Emerging
Maintenance 10/25
Adoption 10/25
Maturity 16/25
Community 20/25
Maintenance 0/25
Adoption 12/25
Maturity 18/25
Community 8/25
Stars: 5,437
Forks: 500
Downloads:
Commits (30d): 0
Language: Jupyter Notebook
License: BSD-2-Clause
Stars: 9
Forks: 1
Downloads: 1,184
Commits (30d): 0
Language: Python
License: Apache-2.0
No Package No Dependents
Stale 6m

About whisper-diarization

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Combines Whisper with NVIDIA NeMo's voice activity detection and speaker embedding models (MarbleNet/TitaNet) to attribute transcribed text to individual speakers. Uses source separation (Demucs) for vocal extraction, CTC-forced alignment for precise timestamp correction, and punctuation-based realignment to compensate for temporal drift across segments. Outputs speaker-labeled transcriptions with segment-level timestamps, supporting configurable Whisper models and parallel inference modes for systems with sufficient VRAM.

About whisper-run

gorkemkaramolla/whisper-run

Faster Whisper with Speaker Diarization

Scores updated daily from GitHub, PyPI, and npm data. How scores work