Speaker Diarization Embedding Voice AI Tools

There are 37 speaker diarization embedding tools tracked. 1 score above 70 (verified tier). The highest-rated is espnet/espnet at 96/100 with 9,768 stars and 21,531 monthly downloads. 1 of the top 10 are actively maintained.

Get all 37 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speaker-diarization-embedding&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Tool	Score	Tier	Stars	Language
1	espnet/espnet End-to-End Speech Processing Toolkit	96	Verified	9,768	Python
2	yeyupiaoling/PPASR 基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Confor...	63	Established	875	Python
3	flashlight/wav2letter Facebook AI Research's Automatic Speech Recognition Toolkit	59	Established	6,446	C++
4	zzw922cn/Automatic_Speech_Recognition End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow	50	Established	2,839	Python
5	yeyupiaoling/PaddlePaddle-DeepSpeech 基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows，Linux下训练和预测，支持Nvidia Jetson开发板预测。	50	Established	758	Python
6	modelscope/ClearerVoice-Studio An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained...	47	Emerging	3,962	Python
7	gfdb/wav2aug A general purpose task-agnostic speech augmentation policy	46	Emerging	16	Python
8	google/uis-rnn This is the library for the Unbounded Interleaved-State Recurrent Neural...	44	Emerging	1,589	Python
9	pannous/tensorflow-speech-recognition 🎙Speech recognition using the tensorflow deep learning framework,...	44	Emerging	2,176	Python
10	philipperemy/deep-speaker Deep Speaker: an End-to-End Neural Speaker Embedding System.	44	Emerging	939	Python
11	noahchalifour/rnnt-speech-recognition End-to-end speech recognition using RNN Transducers in Tensorflow 2.0	44	Emerging	249	Python
12	santi-pdp/pase Problem Agnostic Speech Encoder	42	Emerging	447	Python
13	haoheliu/voicefixer_main General Speech Restoration	41	Emerging	284	Python
14	filippogiruzzi/voice_activity_detection Voice Activity Detection based on Deep Learning & TensorFlow	41	Emerging	371	Python
15	bricewalker/Hey-Jetson Deep Learning based Automatic Speech Recognition with attention for the...	40	Emerging	199	Jupyter Notebook
16	kgnlp/allophant A multilingual phoneme recognizer capable of generalizing zero-shot to...	39	Emerging	29	Python
17	Picovoice/falcon On-device speaker diarization powered by deep learning	38	Emerging	69	Python
18	Berkeley-Speech-Group/sylber Sylber: Syllabic Embedding Representation of Speech from Raw Audio	37	Emerging	74	Jupyter Notebook
19	chenmingxiang110/Chinese-automatic-speech-recognition Chinese speech recognition	36	Emerging	159	Jupyter Notebook
20	mravanelli/pytorch-kaldi pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid...	35	Emerging	2,396	Python
21	wq2012/SpeakerRecognitionFromScratch Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家	35	Emerging	47	Python
22	lucko515/speech-recognition-neural-network This is the end-to-end Speech Recognition neural network, deployed in Keras....	34	Emerging	190	HTML
23	weimeng23/speech-recognition-learning-resources :white_check_mark: A list of speech recognition learning resources including...	33	Emerging	68	—
24	shahules786/mayavoz Pytorch based speech enhancement toolkit.	33	Emerging	336	Python
25	Speaker-Identification/You-Only-Speak-Once Deep Learning - one shot learning for speaker recognition using Filter Banks	32	Emerging	171	Jupyter Notebook
26	EuleMitKeule/speaker-recognition Speaker recognition service for Home Assistant using voice embeddings. Train...	30	Emerging	17	Python
27	tuanio/noisy-student-training-asr Pytorch implementation of Noisy Student Training for Automatic Speech...	28	Experimental	99	Python
28	speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on...	27	Experimental	374	HTML
29	victor369basu/End2EndAutomaticSpeechRecognition In this repository, I have developed an end to end Automatic speech...	26	Experimental	34	Python
30	ASR-project/Multilingual-PR Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM....	24	Experimental	258	Python
31	hanifabd/voice-activity-detection-vad-realtime Real-time Voice Activity Detection (VAD) with some example use case like...	24	Experimental	106	Python
32	idiap/zff_vad Unsupervised Voice Activity Detection by Modeling Source and System...	22	Experimental	24	Python
33	AlexKly/Simple-Voice-Activity-Detector-using-MFCC-based-on-FPGA-Kintex Voice Activity Detector based on MFCC features and DNN model	20	Experimental	29	VHDL
34	AmirAbaskohi/Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children Preschool evaluation is crucial because it gives teachers and parents...	20	Experimental	20	Jupyter Notebook
35	PranavPutsa1006/Speaker-Diarization Identifying individual speakers in an audio stream based on the unique...	19	Experimental	18	Jupyter Notebook
36	IIP-Sogang/olkavs-avspeech The Introduction of the OLKAVS Dataset	18	Experimental	37	Python
37	jmaczan/asr-dysarthria Research on Automatic Speech Recognition for dysarthric speech	12	Experimental	19	Jupyter Notebook

Comparisons in this category

PPASR and PaddlePaddle-DeepSpeech (63 vs 50)