A-SHOJAEI/cross-lingual-phoneme-aware-speech-enhancement-with-adaptive-masking
Multi-stage speech enhancement system that leverages cross-lingual phoneme embeddings to guide adaptive time-frequency masking for noise reduction in low-resource languages. The model uses phoneme-conditioned attention to learn language-agnostic acoustic patterns from high-resource languages (English, Spanish) and transfers them to low-resource lan
Stars: n/a
Forks: n/a
Language: Python
License: MIT
Category: (not shown)
Last pushed: Feb 21, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/A-SHOJAEI/cross-lingual-phoneme-aware-speech-enhancement-with-adaptive-masking"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000 requests/day.
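The same request can be made from Python. The sketch below is illustrative only: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the path layout (category, then owner, then repository) is inferred from the single curl example above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, mirroring the pattern seen in the curl example.

    The (category, owner, repo) layout is an assumption inferred from that
    one example, not a documented contract.
    """
    # Percent-encode each path segment so unusual characters cannot break the URL.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository without an API key.

    Assumes the response body is JSON; json.loads raises ValueError otherwise.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    return json.loads(body)


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality(
#     "ml-frameworks",
#     "A-SHOJAEI",
#     "cross-lingual-phoneme-aware-speech-enhancement-with-adaptive-masking",
# )
```

Keeping URL construction separate from the network call makes the path logic easy to check offline, and keyless callers should stay within the 100 requests/day limit stated above.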
Higher-rated alternatives
iver56/audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the...
marl/openl3
OpenL3: Open-source deep audio and image embeddings
ductho-le/WaveDL
A Scalable Deep Learning Framework for Wave-Based Inverse Problems
Spijkervet/torchaudio-augmentations
Audio transformations library for PyTorch
torchsynth/torchsynth
A GPU-optional modular synthesizer in PyTorch, 16200x faster than realtime, for audio ML researchers.