Speaker Diarization Embedding Voice AI Tools
There are 37 speaker diarization embedding tools tracked. 1 score above 70 (verified tier). The highest-rated is espnet/espnet at 96/100 with 9,768 stars and 21,531 monthly downloads. 1 of the top 10 are actively maintained.
Get all 37 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=speaker-diarization-embedding&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
espnet/espnet
End-to-End Speech Processing Toolkit |
|
Verified |
| 2 |
yeyupiaoling/PPASR
基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Confor... |
|
Established |
| 3 |
flashlight/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit |
|
Established |
| 4 |
zzw922cn/Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow |
|
Established |
| 5 |
yeyupiaoling/PaddlePaddle-DeepSpeech
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。 |
|
Established |
| 6 |
modelscope/ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained... |
|
Emerging |
| 7 |
gfdb/wav2aug
A general purpose task-agnostic speech augmentation policy |
|
Emerging |
| 8 |
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural... |
|
Emerging |
| 9 |
pannous/tensorflow-speech-recognition
🎙Speech recognition using the tensorflow deep learning framework,... |
|
Emerging |
| 10 |
philipperemy/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System. |
|
Emerging |
| 11 |
noahchalifour/rnnt-speech-recognition
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0 |
|
Emerging |
| 12 |
santi-pdp/pase
Problem Agnostic Speech Encoder |
|
Emerging |
| 13 |
haoheliu/voicefixer_main
General Speech Restoration |
|
Emerging |
| 14 |
filippogiruzzi/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow |
|
Emerging |
| 15 |
bricewalker/Hey-Jetson
Deep Learning based Automatic Speech Recognition with attention for the... |
|
Emerging |
| 16 |
kgnlp/allophant
A multilingual phoneme recognizer capable of generalizing zero-shot to... |
|
Emerging |
| 17 |
Picovoice/falcon
On-device speaker diarization powered by deep learning |
|
Emerging |
| 18 |
Berkeley-Speech-Group/sylber
Sylber: Syllabic Embedding Representation of Speech from Raw Audio |
|
Emerging |
| 19 |
chenmingxiang110/Chinese-automatic-speech-recognition
Chinese speech recognition |
|
Emerging |
| 20 |
mravanelli/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid... |
|
Emerging |
| 21 |
wq2012/SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家 |
|
Emerging |
| 22 |
lucko515/speech-recognition-neural-network
This is the end-to-end Speech Recognition neural network, deployed in Keras.... |
|
Emerging |
| 23 |
weimeng23/speech-recognition-learning-resources
:white_check_mark: A list of speech recognition learning resources including... |
|
Emerging |
| 24 |
shahules786/mayavoz
Pytorch based speech enhancement toolkit. |
|
Emerging |
| 25 |
Speaker-Identification/You-Only-Speak-Once
Deep Learning - one shot learning for speaker recognition using Filter Banks |
|
Emerging |
| 26 |
EuleMitKeule/speaker-recognition
Speaker recognition service for Home Assistant using voice embeddings. Train... |
|
Emerging |
| 27 |
tuanio/noisy-student-training-asr
Pytorch implementation of Noisy Student Training for Automatic Speech... |
|
Experimental |
| 28 |
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on... |
|
Experimental |
| 29 |
victor369basu/End2EndAutomaticSpeechRecognition
In this repository, I have developed an end to end Automatic speech... |
|
Experimental |
| 30 |
ASR-project/Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM.... |
|
Experimental |
| 31 |
hanifabd/voice-activity-detection-vad-realtime
Real-time Voice Activity Detection (VAD) with some example use case like... |
|
Experimental |
| 32 |
idiap/zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System... |
|
Experimental |
| 33 |
AlexKly/Simple-Voice-Activity-Detector-using-MFCC-based-on-FPGA-Kintex
Voice Activity Detector based on MFCC features and DNN model |
|
Experimental |
| 34 |
AmirAbaskohi/Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children
Preschool evaluation is crucial because it gives teachers and parents... |
|
Experimental |
| 35 |
PranavPutsa1006/Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique... |
|
Experimental |
| 36 |
IIP-Sogang/olkavs-avspeech
The Introduction of the OLKAVS Dataset |
|
Experimental |
| 37 |
jmaczan/asr-dysarthria
Research on Automatic Speech Recognition for dysarthric speech |
|
Experimental |