cvqluu/simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

46
/ 100
Emerging

Combines Silero VAD for speech detection, SpeechBrain embeddings (X-Vector or ECAPA-TDNN) for speaker representation, and spectral or agglomerative clustering to identify and separate speakers. Optionally integrates ESPnet ASR models for transcription alongside diarization. Supports configurable embedding and clustering methods, allowing users to trade off between speed and accuracy for different use cases.

155 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

155

Forks

32

Language

Python

License

GPL-3.0

Last pushed

May 02, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/cvqluu/simple_diarizer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.