Whisper Diarization Voice AI Tools

Tools that combine OpenAI Whisper (or a similar ASR system) with speaker diarization to identify and separate speakers in audio. This category does NOT include general transcription without speaker identification, or standalone diarization tools without an ASR component.

This category tracks 24 Whisper diarization tools. One scores above 70 (the Verified tier). The highest-rated is m-bain/whisperX at 90/100, with 20,758 stars and 864,629 monthly downloads. One of the top 10 is actively maintained.

Get all 24 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=whisper-diarization&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
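The curl request above can also be issued from a script. The following is a minimal Python sketch using only the standard library. The endpoint and the `domain`, `subcategory`, and `limit` query parameters are taken from the command above; the helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented here, so the sketch simply prints whatever comes back. Note that the command above passes `limit=20` while 24 projects are tracked; whether the API accepts a larger `limit` is an assumption you would need to verify.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="whisper-diarization", limit=20):
    """Build the request URL (hypothetical helper, not part of any official client)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and decode the body as JSON.

    The response schema is not documented here, so the decoded
    object is returned as-is without assuming any field names.
    """
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    data = fetch_projects(build_url())
    # Print only the start of the payload to inspect its structure.
    print(json.dumps(data, indent=2)[:500])
```

Unauthenticated requests count against the 100 requests/day allowance, so it is worth caching the response locally while exploring the data.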

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | m-bain/whisperX | WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) | 90 | Verified |
| 2 | tsmdt/whisply | 💬 Fast, cross-platform CLI and GUI for batch transcription, translation,... | 62 | Established |
| 3 | MahmoudAshraf97/whisper-diarization | Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper | 56 | Established |
| 4 | linto-ai/linto-stt | An automatic speech recognition API | 50 | Established |
| 5 | jim60105/docker-whisperX | Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level... | 50 | Established |
| 6 | ringger/transcribe-critic | Multi-source transcript merging inspired by textual criticism — LLM... | 48 | Emerging |
| 7 | linto-ai/linto-studio | Transcription and annotation interface for recorded audio or video files | 40 | Emerging |
| 8 | gorkemkaramolla/whisper-run | Faster Whisper with Speaker Diarization | 38 | Emerging |
| 9 | nyrahealth/CrisperWhisper | Verbatim Automatic Speech Recognition with improved word-level timestamps... | 36 | Emerging |
| 10 | showlab/whisperVideo | Find out who said what in the video. | 35 | Emerging |
| 11 | linto-ai/linto-diarization | Speaker diarization service | 30 | Emerging |
| 12 | orianemartin/WhispGrid | A Whisper to TextGrid script that I use to automatize Corpus Annotation on... | 28 | Experimental |
| 13 | TharanaBope/whisper-v3-diarization | Production-ready audio transcription & speaker diarization CLI & GUI using... | 28 | Experimental |
| 14 | linto-ai/linto-punctuation | LinTO Platform punctuation service. | 24 | Experimental |
| 15 | 6ixGODD/audex | Smart Medical Recording & Transcription System with voice recognition and... | 23 | Experimental |
| 16 | tltrogl/diaremot2-on | DiaRemot2-ON: CPU-only audio intelligence pipeline (Faster-Whisper, ONNX,... | 21 | Experimental |
| 17 | Cinnamon/whisper-jargon | [SIGDIAL'24] Improving Speech Recognition with Jargon Injection | 20 | Experimental |
| 18 | x2agi/x2agi-speechkit | 🎧 X2AGI speech services: ASR, diarization, AI reports (gRPC, REST clients) | 17 | Experimental |
| 19 | host452b/casts_down | Cross-platform CLI to download & transcribe podcasts locally — Apple... | 16 | Experimental |
| 20 | ElmiraGhorbani/gpt-speaker-diarization | Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4)... | 16 | Experimental |
| 21 | luizomf/sussu | Educational CLI for transcription with OpenAI Whisper | 16 | Experimental |
| 22 | AathifZahir/WhisprSplit | A powerful, local speech-to-text transcription system that combines OpenAI's... | 12 | Experimental |
| 23 | Jpzinn654/speaker-diarization-portuguese | This project implements speaker diarization for Portuguese audio using... | 12 | Experimental |
| 24 | mahshid1378/WhisperX | WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) | 11 | Experimental |