Speaker Diarization Embedding ML Frameworks

Tools and frameworks for speaker diarization, speaker embedding, and speaker recognition/verification in audio. Does NOT include general speech recognition, speech synthesis, or voice cloning systems.

There are 38 speaker diarization embedding frameworks tracked. 2 score above 50 (established tier). The highest-rated is felixbur/nkululeko at 64/100 with 43 stars and 1,562 monthly downloads.

Get all 38 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=speaker-diarization-embedding&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 felixbur/nkululeko

Machine learning speaker characteristics

64
Established
2 claritychallenge/clarity

Clarity Challenge toolkit - software for building Clarity Challenge systems

54
Established
3 juanmc2005/diart

A python package to build AI-powered real-time audio applications

47
Emerging
4 astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

44
Emerging
5 wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets,...

42
Emerging
6 hitachi-speech/EEND

End-to-End Neural Diarization

40
Emerging
7 itmo-mbss-lab/sr_labs_book

The project is related to the development of labs for the ITMO Speaker...

35
Emerging
8 mostafa-kermaninia/speech-processing-toolkit

A comprehensive machine learning pipeline for robust Speaker Identification...

34
Emerging
9 yxshee/speech-command-recognition

speech command recognition using CNNs, with preprocessing, model training,...

34
Emerging
10 metacore-stack/modular-auto-specch-recog-toolkit

Building a modular, open-source toolkit that advances automatic speech...

32
Emerging
11 georgygospodinov/speech_course

Deep Learning for Speech

31
Emerging
12 matlab-deep-learning/deepspeech

This repo provides the pretrained DeepSpeech model in MATLAB. The model is...

28
Experimental
13 BiometricVox/DAE_SpeakerID

Denoising autoencoders for speaker identification on MCE 2018 challenge

28
Experimental
14 rorizzz/YOLO-Stutter

YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection

27
Experimental
15 zycv/Speaker-Recognition-Based-on-Deep-Learning-An-Overview

This repo is to list the references papers of 《Speaker Recognition Based on...

27
Experimental
16 matlab-deep-learning/wav2vec-2.0

This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB.

27
Experimental
17 Paradeluxe/Praditor

Praditor: A DBSCAN-Based Automation for Speech Onset Detection

26
Experimental
18 rorizzz/Stutter-Solver

Stutter-Solver: End-to-end Cross-lingual Dysfluency Detection

26
Experimental
19 MingLunHan/CIF-ColDec

[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with...

26
Experimental
20 zabir-nabil/awesome-speaker-recognition-verification

A curated list of awesome speaker recognition/verification papers, projects,...

25
Experimental
21 A5hG0/Lyrics-To-Song-Generator

Step-by-step toolkit for DiffSinger voice synthesis. Preprocessing scripts +...

22
Experimental
22 Erenyegar2/modular-auto-specch-recog-toolkit

🎤 Build and deploy advanced automatic speech recognition systems with this...

22
Experimental
23 tarun-bisht/wav2vec2-asr

wav2vec2 asr with transformers

22
Experimental
24 soohyunme/foreigner_speech

Foreigner Korean speech voice recognition hackathon - CSLEE

22
Experimental
25 RhysonYang-2030/ASACA-Automatic-Speech-Analysis-for-Cognitive-Assessment

The automatic system that can extract PRAAT-like speech features from raw...

22
Experimental
26 kaistmm/seed-pytorch

[INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement...

22
Experimental
27 debanjan06/noise-robust-asr

🔊 Advanced Noise-Robust ASR System with Dynamic Adaptation Cutting-edge...

20
Experimental
28 j-schmied/RealTimeSpeechRecognition

Various approaches for speech recognition and speaker diarization.

18
Experimental
29 zsl24/Speech-Processing-Doc

一个关于语音算法技术汇总的文档

16
Experimental
30 lottev1991/grimesai-svs-labs

HTK-style label files for GrimesAI dry stems, for training SVS AI models.

15
Experimental
31 shashikg/X-Vector-Based-Speaker-Diarization

Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker...

15
Experimental
32 thuantn210823/SpeakerDiarization

This repo reimplemented several popular EEND models, covering everything...

14
Experimental
33 yuriyvnv/WAVe

Word Aligned Verification of Synthetic Speech for Automatic Speech Recognition

14
Experimental
34 JeffT13/rd-diarization

Diarizing Legal Proceedings with d-vectors.

13
Experimental
35 rorizzz/TbDD

Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection

13
Experimental
36 jackaduma/speaker_recognition_models.pytorch

speaker recognition / speaker verification models in pytorch implementation

12
Experimental
37 SimoneCff/SAND-Challenge-Task-1-Parthenope

classify dysarthria severity in ALS patients.

11
Experimental
38 Karthick47v2/mock-buddy-audio-server

audio processing service for mock-buddy

11
Experimental