OpenPPL/ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

49
/ 100
Emerging

Supports extensible quantization via 27 composable optimization passes and a custom execution engine handling 99 ONNX operators natively, enabling per-operator and per-tensor bit-width/granularity control. Integrates with 10+ inference frameworks including TensorRT, OpenPPL, OpenVINO, NCNN, and MNN, with hardware-specific quantization strategies and QAT capabilities. Features FP8 quantization (E4M3/E5M2 formats), graph fusion, pattern matching, and bias correction for low-latency edge deployment.

1,788 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25

How are scores calculated?

Stars

1,788

Forks

274

Language

Python

License

Apache-2.0

Last pushed

Mar 28, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/OpenPPL/ppq"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.