AkojimaSLP/Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR
Implements three beamforming algorithms for multi-channel audio processing: delay-and-sum, MVDR (Minimum Variance Distortionless Response), and CGMM-MVDR which leverages complex Gaussian mixture models for improved noise robustness. The framework processes raw audio files and outputs enhanced speech, with test scripts demonstrating each algorithm's performance on real data.
279 stars. No commits in the last 6 months.
Stars
279
Forks
82
Language
Python
License
—
Category
Last pushed
Jan 19, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AkojimaSLP/Beamforming-for-speech-enhancement"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models
shenasa-ai/speech2text
A Deep-Learning-Based Persian Speech Recognition System