arrmansa/Temporal-Neuron-Variance-Pruning-Demo
An implementation of "Variance Pruning: Pruning Language Models via Temporal Neuron Variance" by Berry Weinstein and Yonatan Belinkov.
No commits in the last 6 months.
Stars: 1
Forks: —
Language: Jupyter Notebook
License: Unlicense
Category:
Last pushed: Feb 07, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/arrmansa/Temporal-Neuron-Variance-Pruning-Demo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
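The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which the page does not state.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def build_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for an owner/repo pair (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay in their segment.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository (hypothetical helper).

    Assumption: the endpoint returns a JSON body. The page only shows the
    curl command, so the response format is not confirmed.
    """
    request = urllib.request.Request(
        build_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Printing the URL performs no network request.
    print(build_url("arrmansa", "Temporal-Neuron-Variance-Pruning-Demo"))
```

Calling `fetch_quality("arrmansa", "Temporal-Neuron-Variance-Pruning-Demo")` performs the actual request. With the keyless limit of 100 requests/day, it may be worth caching the result rather than fetching on every run.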
Higher-rated alternatives
Tencent/AngelSlim
Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
liyucheng09/Selective_Context
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40%...
nebuly-ai/optimate
A collection of libraries to optimise AI model performances
kyo-takano/chinchilla
A toolkit for scaling law research ⚖
antgroup/glake
GLake: optimizing GPU memory management and IO transmission.