Tencent/PocketFlow

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

/ 100

Emerging

Supports multiple compression techniques—channel pruning, weight sparsification, and quantization—with automatic hyperparameter tuning via reinforcement learning (DDPG), Gaussian Processes, or Tree-structured Parzen Estimators to optimize compression ratios without manual tuning. The framework employs a learner-optimizer loop where compression algorithms generate candidate models that are evaluated and fed back to guide the search space exploration. Includes training enhancements like knowledge distillation, multi-GPU distributed training, and group fine-tuning to minimize accuracy degradation on deep learning models for mobile and resource-constrained deployment.

2,914 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

How are scores calculated?

Stars

2,914

Forks

492

Language

Python

License

—

Higher-rated alternatives

NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit...

mlcommons/inference

Reference implementations of MLPerf® inference benchmarks

datamade/usaddress

:us: a python library for parsing unstructured United States address strings into address components

GRAAL-Research/deepparse

Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning

mlcommons/training

Reference implementations of MLPerf® training benchmarks

Explore ML Frameworks

All categories Trending ML Framework directory Insights