Speaker Diarization Embedding ML Frameworks
Tools and frameworks for speaker diarization, speaker embedding, and speaker recognition/verification in audio. Does NOT include general speech recognition, speech synthesis, or voice cloning systems.
There are 38 speaker diarization embedding frameworks tracked. 2 score above 50 (established tier). The highest-rated is felixbur/nkululeko at 64/100 with 43 stars and 1,562 monthly downloads.
Get all 38 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=speaker-diarization-embedding&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
felixbur/nkululeko
Machine learning speaker characteristics |
|
Established |
| 2 |
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems |
|
Established |
| 3 |
juanmc2005/diart
A python package to build AI-powered real-time audio applications |
|
Emerging |
| 4 |
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification |
|
Emerging |
| 5 |
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets,... |
|
Emerging |
| 6 |
hitachi-speech/EEND
End-to-End Neural Diarization |
|
Emerging |
| 7 |
itmo-mbss-lab/sr_labs_book
The project is related to the development of labs for the ITMO Speaker... |
|
Emerging |
| 8 |
mostafa-kermaninia/speech-processing-toolkit
A comprehensive machine learning pipeline for robust Speaker Identification... |
|
Emerging |
| 9 |
yxshee/speech-command-recognition
speech command recognition using CNNs, with preprocessing, model training,... |
|
Emerging |
| 10 |
metacore-stack/modular-auto-specch-recog-toolkit
Building a modular, open-source toolkit that advances automatic speech... |
|
Emerging |
| 11 |
georgygospodinov/speech_course
Deep Learning for Speech |
|
Emerging |
| 12 |
matlab-deep-learning/deepspeech
This repo provides the pretrained DeepSpeech model in MATLAB. The model is... |
|
Experimental |
| 13 |
BiometricVox/DAE_SpeakerID
Denoising autoencoders for speaker identification on MCE 2018 challenge |
|
Experimental |
| 14 |
rorizzz/YOLO-Stutter
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection |
|
Experimental |
| 15 |
zycv/Speaker-Recognition-Based-on-Deep-Learning-An-Overview
This repo is to list the references papers of 《Speaker Recognition Based on... |
|
Experimental |
| 16 |
matlab-deep-learning/wav2vec-2.0
This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB. |
|
Experimental |
| 17 |
Paradeluxe/Praditor
Praditor: A DBSCAN-Based Automation for Speech Onset Detection |
|
Experimental |
| 18 |
rorizzz/Stutter-Solver
Stutter-Solver: End-to-end Cross-lingual Dysfluency Detection |
|
Experimental |
| 19 |
MingLunHan/CIF-ColDec
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with... |
|
Experimental |
| 20 |
zabir-nabil/awesome-speaker-recognition-verification
A curated list of awesome speaker recognition/verification papers, projects,... |
|
Experimental |
| 21 |
A5hG0/Lyrics-To-Song-Generator
Step-by-step toolkit for DiffSinger voice synthesis. Preprocessing scripts +... |
|
Experimental |
| 22 |
Erenyegar2/modular-auto-specch-recog-toolkit
🎤 Build and deploy advanced automatic speech recognition systems with this... |
|
Experimental |
| 23 |
tarun-bisht/wav2vec2-asr
wav2vec2 asr with transformers |
|
Experimental |
| 24 |
soohyunme/foreigner_speech
Foreigner Korean speech voice recognition hackathon - CSLEE |
|
Experimental |
| 25 |
RhysonYang-2030/ASACA-Automatic-Speech-Analysis-for-Cognitive-Assessment
The automatic system that can extract PRAAT-like speech features from raw... |
|
Experimental |
| 26 |
kaistmm/seed-pytorch
[INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement... |
|
Experimental |
| 27 |
debanjan06/noise-robust-asr
🔊 Advanced Noise-Robust ASR System with Dynamic Adaptation Cutting-edge... |
|
Experimental |
| 28 |
j-schmied/RealTimeSpeechRecognition
Various approaches for speech recognition and speaker diarization. |
|
Experimental |
| 29 |
zsl24/Speech-Processing-Doc
一个关于语音算法技术汇总的文档 |
|
Experimental |
| 30 |
lottev1991/grimesai-svs-labs
HTK-style label files for GrimesAI dry stems, for training SVS AI models. |
|
Experimental |
| 31 |
shashikg/X-Vector-Based-Speaker-Diarization
Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker... |
|
Experimental |
| 32 |
thuantn210823/SpeakerDiarization
This repo reimplemented several popular EEND models, covering everything... |
|
Experimental |
| 33 |
yuriyvnv/WAVe
Word Aligned Verification of Synthetic Speech for Automatic Speech Recognition |
|
Experimental |
| 34 |
JeffT13/rd-diarization
Diarizing Legal Proceedings with d-vectors. |
|
Experimental |
| 35 |
rorizzz/TbDD
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection |
|
Experimental |
| 36 |
jackaduma/speaker_recognition_models.pytorch
speaker recognition / speaker verification models in pytorch implementation |
|
Experimental |
| 37 |
SimoneCff/SAND-Challenge-Task-1-Parthenope
classify dysarthria severity in ALS patients. |
|
Experimental |
| 38 |
Karthick47v2/mock-buddy-audio-server
audio processing service for mock-buddy |
|
Experimental |