AkojimaSLP/Beamforming-for-speech-enhancement

Simple delay-and-sum, MVDR, and CGMM-MVDR

/ 100

Emerging

Implements three beamforming algorithms for multi-channel audio processing: delay-and-sum, MVDR (Minimum Variance Distortionless Response), and CGMM-MVDR, which uses complex Gaussian mixture models for improved noise robustness. The code processes raw audio files and outputs enhanced speech, and the test scripts demonstrate each algorithm's performance on real data.
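To make the simplest of the three concrete, here is a minimal delay-and-sum sketch. It is not taken from the repository: it assumes time-domain signals, known integer sample delays per microphone, and plain Python lists. The function name `delay_and_sum` and its parameters are hypothetical.

```python
def delay_and_sum(channels, delays):
    """Time-domain delay-and-sum beamformer (illustrative sketch only).

    channels: list of equal-length sample lists, one per microphone.
    delays:   assumed-known arrival delay of the target in each channel,
              in whole samples (non-negative integers).
    Returns the average of the channels after undoing each delay.
    """
    if len(channels) != len(delays):
        raise ValueError("need exactly one delay per channel")
    n_samples = len(channels[0])
    if any(len(ch) != n_samples for ch in channels):
        raise ValueError("all channels must have the same length")
    if any(d < 0 for d in delays):
        raise ValueError("delays must be non-negative")

    # Only samples that exist in every channel after alignment are usable.
    usable = n_samples - max(delays)
    output = []
    for t in range(usable):
        # Reading channel m at t + delays[m] undoes its arrival delay,
        # so the target adds coherently while uncorrelated noise averages down.
        aligned_sum = sum(ch[t + d] for ch, d in zip(channels, delays))
        output.append(aligned_sum / len(channels))
    return output
```

With two microphones where the second receives the target two samples late, calling `delay_and_sum([mic0, mic1], [0, 2])` returns the aligned average. MVDR and CGMM-MVDR replace this fixed averaging with weights derived from estimated spatial covariance matrices.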

279 stars. No commits in the last 6 months.

No License · Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 23 / 25

Stars: 279
Forks:
Language: Python
License: None
Category: keyword-speech-recognition
Last pushed: Jan 19, 2019
Commits (30d):
GitHub

Keyword Speech Recognition · 112 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AkojimaSLP/Beamforming-for-speech-enhancement"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
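The same request can be sketched in Python with only the standard library. This is an assumption-laden example: the helper names `build_url` and `fetch_quality` are hypothetical, the `voice-ai/<owner>/<repo>` path pattern is inferred from the single URL above, and the response is returned as raw text because its schema is not documented here.

```python
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(owner, repo, domain="voice-ai"):
    """Build the endpoint URL (path pattern assumed from the one example)."""
    return f"{BASE_URL}/{domain}/{owner}/{repo}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch the quality record and return the response body as text.

    No API key is sent, so the keyless daily request limit applies.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Example call (performs a network request):
# print(fetch_quality("AkojimaSLP", "Beamforming-for-speech-enhancement"))
```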

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

shenasa-ai/speech2text

A Deep-Learning-Based Persian Speech Recognition System
