wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

/ 100

Emerging

Organizes diarization research across modular subsections—supervised and online approaches, speaker embedding techniques, clustering methods, and emerging LLM-based post-processing—alongside standardized benchmark datasets (DIHARD, AISHELL-4) and evaluation tools. Covers the full diarization pipeline including speaker change detection, audio feature extraction, and data augmentation strategies, plus audio-visual extensions and joint ASR-diarization systems. Serves as a reference for both classical clustering-based and modern end-to-end neural architectures handling variable speaker counts and overlapped speech scenarios.

1,851 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

1,851

Forks

238

Language

—

License

Apache-2.0

Higher-rated alternatives

felixbur/nkululeko

Machine learning speaker characteristics

claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

juanmc2005/diart

A python package to build AI-powered real-time audio applications

astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

hitachi-speech/EEND

End-to-End Neural Diarization

Explore ML Frameworks

All categories Trending ML Framework directory Insights