jtkim-kaist/VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

43 / 100 (Emerging)

The toolkit implements multi-resolution cochleagram (MRCG) feature extraction and exposes configurable post-processing parameters (hang_before, hang_over, on/off_length) to correct common VAD errors such as false positives and dropouts. It is built on TensorFlow, with Python and MATLAB support. The included dataset was recorded in four real-world acoustic environments (bus stop, construction site, park, room) at a 16 kHz sampling rate, with manual speech annotations and low SNR conditions (2-18 dB).
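To illustrate what this kind of post-processing does, here is a minimal Python sketch that works on a list of per-frame 0/1 speech decisions. It is not the toolkit's code. The function names, the `max_gap` parameter, and the exact meaning given to `hang_before` and `hang_over` (a number of frames to extend each speech run by) are assumptions chosen for illustration.

```python
from typing import List


def apply_hangover(decisions: List[int], hang_before: int, hang_over: int) -> List[int]:
    """Extend every speech frame by hang_before frames to the left and
    hang_over frames to the right (assumed semantics, in frames)."""
    n = len(decisions)
    out = [0] * n
    for i, d in enumerate(decisions):
        if d:
            start = max(0, i - hang_before)
            end = min(n, i + hang_over + 1)
            for j in range(start, end):
                out[j] = 1
    return out


def fill_short_gaps(decisions: List[int], max_gap: int) -> List[int]:
    """Relabel non-speech runs of at most max_gap frames as speech when they
    lie between two speech runs (a hypothetical dropout fix)."""
    out = list(decisions)
    n = len(out)
    i = 0
    while i < n:
        if out[i] == 0:
            j = i
            while j < n and out[j] == 0:
                j += 1
            # The gap spans [i, j); fill it only if speech bounds it on both sides.
            if i > 0 and j < n and (j - i) <= max_gap:
                for k in range(i, j):
                    out[k] = 1
            i = j
        else:
            i += 1
    return out


if __name__ == "__main__":
    raw = [0, 0, 1, 0, 1, 0, 0, 0, 1, 0]
    print(fill_short_gaps(raw, max_gap=1))
    print(apply_hangover(raw, hang_before=1, hang_over=1))
```

Filling short gaps targets dropouts inside an utterance, while the hangover pads utterance boundaries so that weak onsets and trailing sounds are not clipped.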

869 stars. No commits in the last 6 months.

No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 25 / 25
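
The four sub-scores listed here add up to the overall score of 43 out of 100. The Python sketch below checks that arithmetic. It assumes the overall score is a plain sum of four categories capped at 25 each, which matches these numbers but is not confirmed as the site's actual method. `total_score` is a hypothetical helper.

```python
from typing import Dict

# Sub-scores as listed for this repository (each out of 25).
SUB_SCORES: Dict[str, int] = {
    "Maintenance": 0,
    "Adoption": 10,
    "Maturity": 8,
    "Community": 25,
}


def total_score(scores: Dict[str, int], max_per_category: int = 25) -> int:
    """Sum the category scores after checking each lies in its allowed range.
    Assumption: the overall score is the unweighted sum of the categories."""
    for name, value in scores.items():
        if not 0 <= value <= max_per_category:
            raise ValueError(f"{name} score {value} is outside 0..{max_per_category}")
    return sum(scores.values())


if __name__ == "__main__":
    print(total_score(SUB_SCORES), "/", 25 * len(SUB_SCORES))
```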


Stars: 869
Forks: 233
Language: MATLAB
License: none
Last pushed: Jun 09, 2021
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jtkim-kaist/VAD"
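
The same request can be sketched in Python using only the standard library. The endpoint is the one in the curl command above. The helper names `build_url` and `fetch_quality`, the reading of the path as category/owner/repo, and the assumption that the response body is JSON are illustrative and not confirmed by this page.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Compose the quality endpoint URL (path layout inferred from the curl example)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.
    Assumption: the endpoint returns a JSON document."""
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request, so it counts toward the daily quota.
    print(fetch_quality("voice-ai", "jtkim-kaist", "VAD"))
```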

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.