pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

67
/ 100
Established

Provides GPU-accelerated audio transforms (spectrograms, MelSpectrograms, MFCC) and speech processing functions like forced alignment, all implemented as differentiable PyTorch operations for end-to-end training. Includes compliance interfaces that replicate Kaldi feature extraction, enabling seamless migration from traditional speech processing frameworks while maintaining gradient flow through the audio pipeline.

2,838 stars. Actively maintained with 1 commit in the last 30 days.

No Package No Dependents
Maintenance 16 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

2,838

Forks

764

Language

Python

License

BSD-2-Clause

Last pushed

Mar 13, 2026

Commits (30d)

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/pytorch/audio"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.