juanmc2005/diart

A Python package to build AI-powered real-time audio applications

Score: 54 / 100 (Established)

Leverages speaker segmentation and embedding models with incremental clustering for real-time speaker diarization that improves accuracy as conversations progress. Offers modular pipelines for voice activity detection and transcription, integrates pre-trained models from Hugging Face and Pyannote, and supports custom model integration via ONNX and PyTorch. Provides WebSocket support for web deployment and includes CLI tools for streaming from microphones or audio files.
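The incremental clustering idea mentioned above can be illustrated with a minimal sketch. This is not diart's actual implementation or API: the class name `IncrementalClusterer`, the `new_speaker_threshold` parameter, and the running-mean centroid update are all assumptions chosen for illustration. Each incoming speaker embedding is either assigned to the nearest existing centroid, which is then refined, or opens a new speaker, so centroid estimates can sharpen as more speech is observed.

```python
import math


def cosine_distance(a, b):
    """Cosine distance between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


class IncrementalClusterer:
    """Hypothetical online clusterer: one centroid per speaker seen so far."""

    def __init__(self, new_speaker_threshold=0.5):
        # Assumed hyperparameter: maximum distance at which an embedding
        # is still attributed to an existing speaker.
        self.new_speaker_threshold = new_speaker_threshold
        self.centroids = []  # one embedding-sized list per speaker
        self.counts = []     # number of embeddings merged into each centroid

    def assign(self, embedding):
        """Return a speaker index for the embedding, updating centroids."""
        if self.centroids:
            distances = [cosine_distance(embedding, c) for c in self.centroids]
            best = min(range(len(distances)), key=distances.__getitem__)
            if distances[best] <= self.new_speaker_threshold:
                # Running mean: fold the new embedding into the centroid.
                n = self.counts[best]
                self.centroids[best] = [
                    (c * n + x) / (n + 1)
                    for c, x in zip(self.centroids[best], embedding)
                ]
                self.counts[best] = n + 1
                return best
        # No centroid is close enough: register a new speaker.
        self.centroids.append(list(embedding))
        self.counts.append(1)
        return len(self.centroids) - 1


clusterer = IncrementalClusterer(new_speaker_threshold=0.5)
stream = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels = [clusterer.assign(e) for e in stream]
```

In a real pipeline the embeddings would come from a speaker embedding model applied to short audio chunks rather than from hand-written 2-D vectors, and the threshold would need tuning for that model.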

1,944 stars. No commits in the last 6 months. Available on PyPI.

Status: Stale (6 months)
Maintenance: 0 / 25
Adoption: 10 / 25
Maturity: 25 / 25
Community: 19 / 25


Stars: 1,944
Forks: 159
Language: Python
License: MIT
Last pushed: Feb 12, 2025
Commits (30d): 0
Dependencies: 20

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/juanmc2005/diart"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
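The same request can be made from Python with only the standard library. This is a rough sketch under stated assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path pattern is inferred from the single URL shown above, and the response is assumed to be UTF-8 text. The response schema is not documented here, so the body is returned unparsed.

```python
import urllib.request

# Base path taken from the curl example; the path pattern beyond it is inferred.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality endpoint URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the raw response body (assumed to be UTF-8 text)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example call (performs a network request, so it is left commented out):
# print(fetch_quality("ml-frameworks", "juanmc2005", "diart"))
```

Since the keyless tier allows 100 requests per day, caching the response locally is a sensible choice for anything that runs repeatedly.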