OpenPPL/ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

/ 100

Emerging

Supports extensible quantization via 27 composable optimization passes and a custom execution engine handling 99 ONNX operators natively, enabling per-operator and per-tensor bit-width/granularity control. Integrates with 10+ inference frameworks including TensorRT, OpenPPL, OpenVINO, NCNN, and MNN, with hardware-specific quantization strategies and QAT capabilities. Features FP8 quantization (E4M3/E5M2 formats), graph fusion, pattern matching, and bias correction for low-latency edge deployment.

1,788 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

How are scores calculated?

Stars

1,788

Forks

274

Language

Python

License

Apache-2.0

Higher-rated alternatives

Xilinx/brevitas

Brevitas: neural network quantization in PyTorch

fastmachinelearning/qonnx

QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX

open-mmlab/mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

google/qkeras

QKeras: a quantization deep learning library for Tensorflow Keras

tensorflow/model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization...

Explore ML Frameworks

All categories Trending ML Framework directory Insights