sp-nitech/diffsptk
A differentiable version of SPTK
Implements classic speech processing algorithms (STFT, mel-cepstral analysis, LPC, WORLD vocoder components) as differentiable PyTorch layers, enabling end-to-end optimization of signal processing pipelines within neural networks. Built on PyTorch 2.3.1+, it supports both object-oriented modules and functional APIs for flexible integration with audio synthesis and analysis tasks. Covers specialized operations like pitch extraction, spectral envelope analysis (CheapTrick), aperiodicity estimation (D4C), and polyphase quadrature mirror filter banks for subband decomposition.
196 stars. Available on PyPI.
Stars
196
Forks
20
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 26, 2026
Commits (30d)
0
Dependencies
11
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/sp-nitech/diffsptk"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
trigeorgis/mdm
A TensorFlow implementation of the Mnemonic Descent Method.
Michedev/DDPMs-Pytorch
Implementation of various DDPM papers to understand how they work
clovaai/fewshot-font-generation
The unified repository for few-shot font generation methods. This repository includes FUNIT...
clovaai/mxfont
Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font...
openclimatefix/diffusion_weather
Testing out Diffusion-based models for weather and PV forecasting