LLM Pruning Compression Transformer Models

Tools and methods for reducing the size and computational cost of large language models through structural pruning, layer removal, and parameter elimination. Does NOT include quantization, distillation-only approaches, or general model optimization techniques.

There are 17 llm pruning compression models tracked. 2 score above 50 (established tier). The highest-rated is VainF/Torch-Pruning at 69/100 with 3,267 stars and 21,337 monthly downloads.

Get all 17 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-pruning-compression&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 VainF/Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision...

69
Established
2 peremartra/optipfair

Structured pruning and bias visualization for Large Language Models. Tools...

58
Established
3 horseee/LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language...

40
Emerging
4 CASIA-LMC-Lab/FLAP

[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models

37
Emerging
5 princeton-nlp/LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via...

35
Emerging
6 VITA-Group/LiGO

[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer...

31
Emerging
7 ahazeemi/dPrune

🌿 dPrune: A Framework for Data Pruning

30
Emerging
8 oshindutta/TVAprune

[ICML 2024 Es-FoMo] - Efficient LLM Pruning with Global Token-Dependency...

28
Experimental
9 namgyu-youn/PyTorch-Pruning

Benchmark and profile pruning researches and open-sources

26
Experimental
10 horseee/LLaMA-Pruning

Structural Pruning for LLaMA

26
Experimental
11 ZhengaoLi/DISP-LLM-Dimension-Independent-Structural-Pruning

An implementation of the DISP-LLM method from the NeurIPS 2024 paper:...

25
Experimental
12 hexuandeng/DRPruning

Implementation for our paper “DRPruning: Efficient Large Language Model...

23
Experimental
13 cliang1453/SAGE

No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for...

22
Experimental
14 gszfwsb/Data-Whisperer

Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for...

20
Experimental
15 visresearch/SDMPrune

The official implementation of "SDMPrune: Self-Distillation MLP Pruning for...

17
Experimental
16 thegreat-art/pruneren

🛠️ Optimize LLMs with advanced pruning strategies and real-time...

15
Experimental
17 Adam-Mazur/Lazy-Llama

An implementation of LazyLLM token pruning for LLaMa 2 model family.

14
Experimental