IAAR-Shanghai/Awesome-Attention-Heads
An awesome repository and a comprehensive survey on the interpretability of LLM attention heads.
Provides a curated research platform built around a peer-reviewed survey paper accepted by *Patterns* (Cell Press). The survey organizes attention-head studies into a four-stage cognitive framework (Knowledge Recalling, In-Context Identification, Latent Reasoning, and Expression Preparation) to systematize mechanistic interpretability research. The repository also includes a structured paper taxonomy with experimental methodology classifications and causal analysis techniques, such as path patching, attribution heads, and mediation analysis, for isolating functional circuits within transformer attention mechanisms across diverse LLM tasks.
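To make the causal-analysis idea concrete, here is a minimal sketch of activation patching on attention heads, a simplified form of the path-patching technique the survey catalogs. The toy one-layer attention model, all weights, and all names here are illustrative assumptions, not code from the repository:

```python
import numpy as np

rng = np.random.default_rng(0)
D, H, T = 8, 2, 4           # model dim, number of heads, sequence length
Dh = D // H                 # per-head dimension

# Random frozen weights for a single (toy) attention layer.
Wq, Wk, Wv, Wo = (rng.standard_normal((D, D)) * 0.3 for _ in range(4))

def attention(x, patch_head=None, patch_src=None):
    """Run one attention layer over x (T, D). If patch_head is given,
    replace that head's output with the same head's output taken from
    patch_src (activations cached from another run)."""
    q = (x @ Wq).reshape(T, H, Dh)
    k = (x @ Wk).reshape(T, H, Dh)
    v = (x @ Wv).reshape(T, H, Dh)
    scores = np.einsum('thd,shd->hts', q, k) / np.sqrt(Dh)
    probs = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)
    head_out = np.einsum('hts,shd->thd', probs, v)   # (T, H, Dh)
    if patch_head is not None:
        head_out[:, patch_head, :] = patch_src[:, patch_head, :]
    return head_out, head_out.reshape(T, D) @ Wo

# "Clean" and "corrupted" inputs stand in for two contrasting prompts.
x_clean = rng.standard_normal((T, D))
x_corrupt = rng.standard_normal((T, D))

clean_heads, clean_out = attention(x_clean)
_, corrupt_out = attention(x_corrupt)

# Patch head 0's clean activation into the corrupted run; the size of the
# resulting output shift is a crude measure of that head's causal effect.
_, patched_out = attention(x_corrupt, patch_head=0, patch_src=clean_heads)
effect = np.linalg.norm(patched_out - corrupt_out)
print(f"causal effect of head 0: {effect:.3f}")
```

In real path-patching studies the same swap is done inside a full transformer, one head (or one head-to-head path) at a time, to isolate which heads carry task-relevant information.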
400 stars. No commits in the last 6 months.
Stars
400
Forks
12
Language
TeX
License
—
Category
Last pushed
Mar 02, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/IAAR-Shanghai/Awesome-Attention-Heads"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune, and deploy at scale.
liangyuwang/Tiny-DeepSpeed
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
microsoft/Text2Grad
🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model...
catherinesyeh/attention-viz
Visualizing query-key interactions in language + vision transformers (VIS 2023)
FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its...