Whisper Diarization Voice AI Tools
Tools that combine OpenAI Whisper (or similar ASR) with speaker diarization to identify and separate speakers in audio. Does NOT include general transcription without speaker identification, or standalone diarization tools without ASR components.
There are 24 whisper diarization tools tracked. 1 score above 70 (verified tier). The highest-rated is m-bain/whisperX at 90/100 with 20,758 stars and 864,629 monthly downloads. 1 of the top 10 are actively maintained.
Get all 24 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=whisper-diarization&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
|
Verified |
| 2 |
tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation,... |
|
Established |
| 3 |
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper |
|
Established |
| 4 |
linto-ai/linto-stt
An automatic speech recognition API |
|
Established |
| 5 |
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level... |
|
Established |
| 6 |
ringger/transcribe-critic
Multi-source transcript merging inspired by textual criticism — LLM... |
|
Emerging |
| 7 |
linto-ai/linto-studio
Transcription and annotation interface for recorded audio or video files |
|
Emerging |
| 8 |
gorkemkaramolla/whisper-run
Faster Whisper with Speaker Diarization |
|
Emerging |
| 9 |
nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps... |
|
Emerging |
| 10 |
showlab/whisperVideo
Find out who said what in the video. |
|
Emerging |
| 11 |
linto-ai/linto-diarization
Speaker diarization service |
|
Emerging |
| 12 |
orianemartin/WhispGrid
A Whisper to TextGrid script that I use to automatize Corpus Annotation on... |
|
Experimental |
| 13 |
TharanaBope/whisper-v3-diarization
Production-ready audio transcription & speaker diarization CLI & GUI using... |
|
Experimental |
| 14 |
linto-ai/linto-punctuation
LinTO Platform punctuation service. |
|
Experimental |
| 15 |
6ixGODD/audex
Smart Medical Recording & Transcription System with voice recognition and... |
|
Experimental |
| 16 |
tltrogl/diaremot2-on
DiaRemot2-ON: CPU-only audio intelligence pipeline (Faster-Whisper, ONNX,... |
|
Experimental |
| 17 |
Cinnamon/whisper-jargon
[SIGDIAL'24] Improving Speech Recognition with Jargon Injection |
|
Experimental |
| 18 |
x2agi/x2agi-speechkit
🎧 X2AGI speech services: ASR, diarization, AI reports (gRPC, REST clients) |
|
Experimental |
| 19 |
host452b/casts_down
Cross-platform CLI to download & transcribe podcasts locally — Apple... |
|
Experimental |
| 20 |
ElmiraGhorbani/gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4)... |
|
Experimental |
| 21 |
luizomf/sussu
CLI educacional para transcrição com OpenAI Whisper |
|
Experimental |
| 22 |
AathifZahir/WhisprSplit
A powerful, local speech-to-text transcription system that combines OpenAI's... |
|
Experimental |
| 23 |
Jpzinn654/speaker-diarization-portuguese
This project implements speaker diarization for Portuguese audio using... |
|
Experimental |
| 24 |
mahshid1378/WhisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
|
Experimental |