wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

49
/ 100
Emerging

Organizes diarization research across modular subsections—supervised and online approaches, speaker embedding techniques, clustering methods, and emerging LLM-based post-processing—alongside standardized benchmark datasets (DIHARD, AISHELL-4) and evaluation tools. Covers the full diarization pipeline including speaker change detection, audio feature extraction, and data augmentation strategies, plus audio-visual extensions and joint ASR-diarization systems. Serves as a reference for both classical clustering-based and modern end-to-end neural architectures handling variable speaker counts and overlapped speech scenarios.

1,851 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 21 / 25

How are scores calculated?

Stars

1,851

Forks

238

Language

License

Apache-2.0

Last pushed

Jul 22, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/wq2012/awesome-diarization"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.