iver56/torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

50
/ 100
Established

Transforms are implemented as `nn.Module` subclasses and support GPU acceleration (CUDA), making them composable directly into neural network architectures with minimal training overhead. The library provides flexible randomization control through `per_batch`, `per_example`, and `per_channel` modes, allowing fine-grained augmentation strategies; most transforms are differentiable to enable end-to-end training. It includes 15+ waveform transforms (pitch shift, filtering, gain modulation, impulse response convolution) and handles batched multichannel audio natively.

1,136 stars.

No Package No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

1,136

Forks

100

Language

Python

License

MIT

Last pushed

Nov 24, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/iver56/torch-audiomentations"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.