Tencent/PocketFlow
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
Supports multiple compression techniques—channel pruning, weight sparsification, and quantization—with automatic hyperparameter tuning via reinforcement learning (DDPG), Gaussian Processes, or Tree-structured Parzen Estimators to optimize compression ratios without manual tuning. The framework employs a learner-optimizer loop where compression algorithms generate candidate models that are evaluated and fed back to guide the search space exploration. Includes training enhancements like knowledge distillation, multi-GPU distributed training, and group fine-tuning to minimize accuracy degradation on deep learning models for mobile and resource-constrained deployment.
2,914 stars. No commits in the last 6 months.
Stars
2,914
Forks
492
Language
Python
License
—
Category
Last pushed
Mar 31, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Tencent/PocketFlow"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit...
mlcommons/inference
Reference implementations of MLPerf® inference benchmarks
datamade/usaddress
:us: a python library for parsing unstructured United States address strings into address components
GRAAL-Research/deepparse
Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning
mlcommons/training
Reference implementations of MLPerf® training benchmarks