Speaker Diarization Embedding Voice AI Tools

There are 37 speaker diarization embedding tools tracked. 1 score above 70 (verified tier). The highest-rated is espnet/espnet at 96/100 with 9,768 stars and 21,531 monthly downloads. 1 of the top 10 are actively maintained.

Get all 37 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speaker-diarization-embedding&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 espnet/espnet

End-to-End Speech Processing Toolkit

96
Verified
2 yeyupiaoling/PPASR

基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Confor...

63
Established
3 flashlight/wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

59
Established
4 zzw922cn/Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

50
Established
5 yeyupiaoling/PaddlePaddle-DeepSpeech

基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。

50
Established
6 modelscope/ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained...

47
Emerging
7 gfdb/wav2aug

A general purpose task-agnostic speech augmentation policy

46
Emerging
8 google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural...

44
Emerging
9 pannous/tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework,...

44
Emerging
10 philipperemy/deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

44
Emerging
11 noahchalifour/rnnt-speech-recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

44
Emerging
12 santi-pdp/pase

Problem Agnostic Speech Encoder

42
Emerging
13 haoheliu/voicefixer_main

General Speech Restoration

41
Emerging
14 filippogiruzzi/voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

41
Emerging
15 bricewalker/Hey-Jetson

Deep Learning based Automatic Speech Recognition with attention for the...

40
Emerging
16 kgnlp/allophant

A multilingual phoneme recognizer capable of generalizing zero-shot to...

39
Emerging
17 Picovoice/falcon

On-device speaker diarization powered by deep learning

38
Emerging
18 Berkeley-Speech-Group/sylber

Sylber: Syllabic Embedding Representation of Speech from Raw Audio

37
Emerging
19 chenmingxiang110/Chinese-automatic-speech-recognition

Chinese speech recognition

36
Emerging
20 mravanelli/pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid...

35
Emerging
21 wq2012/SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

35
Emerging
22 lucko515/speech-recognition-neural-network

This is the end-to-end Speech Recognition neural network, deployed in Keras....

34
Emerging
23 weimeng23/speech-recognition-learning-resources

:white_check_mark: A list of speech recognition learning resources including...

33
Emerging
24 shahules786/mayavoz

Pytorch based speech enhancement toolkit.

33
Emerging
25 Speaker-Identification/You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

32
Emerging
26 EuleMitKeule/speaker-recognition

Speaker recognition service for Home Assistant using voice embeddings. Train...

30
Emerging
27 tuanio/noisy-student-training-asr

Pytorch implementation of Noisy Student Training for Automatic Speech...

28
Experimental
28 speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on...

27
Experimental
29 victor369basu/End2EndAutomaticSpeechRecognition

In this repository, I have developed an end to end Automatic speech...

26
Experimental
30 ASR-project/Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM....

24
Experimental
31 hanifabd/voice-activity-detection-vad-realtime

Real-time Voice Activity Detection (VAD) with some example use case like...

24
Experimental
32 idiap/zff_vad

Unsupervised Voice Activity Detection by Modeling Source and System...

22
Experimental
33 AlexKly/Simple-Voice-Activity-Detector-using-MFCC-based-on-FPGA-Kintex

Voice Activity Detector based on MFCC features and DNN model

20
Experimental
34 AmirAbaskohi/Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children

Preschool evaluation is crucial because it gives teachers and parents...

20
Experimental
35 PranavPutsa1006/Speaker-Diarization

Identifying individual speakers in an audio stream based on the unique...

19
Experimental
36 IIP-Sogang/olkavs-avspeech

The Introduction of the OLKAVS Dataset

18
Experimental
37 jmaczan/asr-dysarthria

Research on Automatic Speech Recognition for dysarthric speech

12
Experimental

Comparisons in this category