sp-nitech/diffsptk

A differentiable version of SPTK

/ 100

Established

Implements classic speech processing algorithms (STFT, mel-cepstral analysis, LPC, WORLD vocoder components) as differentiable PyTorch layers, enabling end-to-end optimization of signal processing pipelines within neural networks. Built on PyTorch 2.3.1+, it supports both object-oriented modules and functional APIs for flexible integration with audio synthesis and analysis tasks. Covers specialized operations like pitch extraction, spectral envelope analysis (CheapTrick), aperiodicity estimation (D4C), and polyphase quadrature mirror filter banks for subband decomposition.

196 stars. Available on PyPI.

Maintenance 10 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 14 / 25

How are scores calculated?

Stars

196

Forks

Language

Python

License

Apache-2.0

Related frameworks

trigeorgis/mdm

A TensorFlow implementation of the Mnemonic Descent Method.

Michedev/DDPMs-Pytorch

Implementation of various DDPM papers to understand how they work

clovaai/fewshot-font-generation

The unified repository for few-shot font generation methods. This repository includes FUNIT...

clovaai/mxfont

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font...

openclimatefix/diffusion_weather

Testing out Diffusion-based models for weather and PV forecasting

Explore ML Frameworks

All categories Trending ML Framework directory Insights