Speaker Diarization Embedding ML Frameworks

Tools and frameworks for speaker diarization, speaker embedding, and speaker recognition/verification in audio. Does NOT include general speech recognition, speech synthesis, or voice cloning systems.

There are 38 speaker diarization embedding frameworks tracked. 2 score above 50 (established tier). The highest-rated is felixbur/nkululeko at 64/100 with 43 stars and 1,562 monthly downloads.

Get all 38 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=speaker-diarization-embedding&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Framework	Score	Tier	Stars	Language
1	felixbur/nkululeko Machine learning speaker characteristics	64	Established	43	Python
2	claritychallenge/clarity Clarity Challenge toolkit - software for building Clarity Challenge systems	54	Established	179	Python
3	juanmc2005/diart A python package to build AI-powered real-time audio applications	47	Emerging	1,944	Python
4	astorfi/3D-convolutional-speaker-recognition :speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification	44	Emerging	792	Python
5	wq2012/awesome-diarization A curated list of awesome Speaker Diarization papers, libraries, datasets,...	42	Emerging	1,851	—
6	hitachi-speech/EEND End-to-End Neural Diarization	40	Emerging	423	Python
7	itmo-mbss-lab/sr_labs_book The project is related to the development of labs for the ITMO Speaker...	35	Emerging	15	Jupyter Notebook
8	mostafa-kermaninia/speech-processing-toolkit A comprehensive machine learning pipeline for robust Speaker Identification...	34	Emerging	4	Jupyter Notebook
9	yxshee/speech-command-recognition speech command recognition using CNNs, with preprocessing, model training,...	34	Emerging	4	Jupyter Notebook
10	metacore-stack/modular-auto-specch-recog-toolkit Building a modular, open-source toolkit that advances automatic speech...	32	Emerging	8	Python
11	georgygospodinov/speech_course Deep Learning for Speech	31	Emerging	109	Jupyter Notebook
12	matlab-deep-learning/deepspeech This repo provides the pretrained DeepSpeech model in MATLAB. The model is...	28	Experimental	7	MATLAB
13	BiometricVox/DAE_SpeakerID Denoising autoencoders for speaker identification on MCE 2018 challenge	28	Experimental	12	Python
14	rorizzz/YOLO-Stutter YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection	27	Experimental	20	Jupyter Notebook
15	zycv/Speaker-Recognition-Based-on-Deep-Learning-An-Overview This repo is to list the references papers of 《Speaker Recognition Based on...	27	Experimental	41	—
16	matlab-deep-learning/wav2vec-2.0 This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB.	27	Experimental	8	—
17	Paradeluxe/Praditor Praditor: A DBSCAN-Based Automation for Speech Onset Detection	26	Experimental	5	Python
18	rorizzz/Stutter-Solver Stutter-Solver: End-to-end Cross-lingual Dysfluency Detection	26	Experimental	3	Jupyter Notebook
19	MingLunHan/CIF-ColDec [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with...	26	Experimental	25	—
20	zabir-nabil/awesome-speaker-recognition-verification A curated list of awesome speaker recognition/verification papers, projects,...	25	Experimental	15	—
21	A5hG0/Lyrics-To-Song-Generator Step-by-step toolkit for DiffSinger voice synthesis. Preprocessing scripts +...	22	Experimental	—	Python
22	Erenyegar2/modular-auto-specch-recog-toolkit 🎤 Build and deploy advanced automatic speech recognition systems with this...	22	Experimental	—	Python
23	tarun-bisht/wav2vec2-asr wav2vec2 asr with transformers	22	Experimental	16	Jupyter Notebook
24	soohyunme/foreigner_speech Foreigner Korean speech voice recognition hackathon - CSLEE	22	Experimental	1	Python
25	RhysonYang-2030/ASACA-Automatic-Speech-Analysis-for-Cognitive-Assessment The automatic system that can extract PRAAT-like speech features from raw...	22	Experimental	4	Python
26	kaistmm/seed-pytorch [INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement...	22	Experimental	57	Python
27	debanjan06/noise-robust-asr 🔊 Advanced Noise-Robust ASR System with Dynamic Adaptation Cutting-edge...	20	Experimental	6	Python
28	j-schmied/RealTimeSpeechRecognition Various approaches for speech recognition and speaker diarization.	18	Experimental	7	Jupyter Notebook
29	zsl24/Speech-Processing-Doc 一个关于语音算法技术汇总的文档	16	Experimental	4	—
30	lottev1991/grimesai-svs-labs HTK-style label files for GrimesAI dry stems, for training SVS AI models.	15	Experimental	—	—
31	shashikg/X-Vector-Based-Speaker-Diarization Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker...	15	Experimental	16	Jupyter Notebook
32	thuantn210823/SpeakerDiarization This repo reimplemented several popular EEND models, covering everything...	14	Experimental	7	Python
33	yuriyvnv/WAVe Word Aligned Verification of Synthetic Speech for Automatic Speech Recognition	14	Experimental	3	Python
34	JeffT13/rd-diarization Diarizing Legal Proceedings with d-vectors.	13	Experimental	6	Jupyter Notebook
35	rorizzz/TbDD Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection	13	Experimental	5	Jupyter Notebook
36	jackaduma/speaker_recognition_models.pytorch speaker recognition / speaker verification models in pytorch implementation	12	Experimental	4	—
37	SimoneCff/SAND-Challenge-Task-1-Parthenope classify dysarthria severity in ALS patients.	11	Experimental	2	Jupyter Notebook
38	Karthick47v2/mock-buddy-audio-server audio processing service for mock-buddy	11	Experimental	2	PureBasic