wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Organizes diarization research across modular subsections—supervised and online approaches, speaker embedding techniques, clustering methods, and emerging LLM-based post-processing—alongside standardized benchmark datasets (DIHARD, AISHELL-4) and evaluation tools. Covers the full diarization pipeline including speaker change detection, audio feature extraction, and data augmentation strategies, plus audio-visual extensions and joint ASR-diarization systems. Serves as a reference for both classical clustering-based and modern end-to-end neural architectures handling variable speaker counts and overlapped speech scenarios.
1,851 stars. No commits in the last 6 months.
Stars
1,851
Forks
238
Language
—
License
Apache-2.0
Category
Last pushed
Jul 22, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/wq2012/awesome-diarization"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
felixbur/nkululeko
Machine learning speaker characteristics
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems
juanmc2005/diart
A python package to build AI-powered real-time audio applications
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
hitachi-speech/EEND
End-to-End Neural Diarization