MaxwellCalkin/interpretability-toolkit
Practical mechanistic interpretability tools — activation caching, linear probes, activation patching, circuit discovery, and visualization for transformer models
Stars
—
Forks
—
Language
Python
License
MIT
Last pushed
Mar 02, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/MaxwellCalkin/interpretability-toolkit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
inseq-team/inseq
Interpretability for sequence generation models 🐛 🔍
jessevig/bertviz
BertViz: Visualize Attention in Transformer Models
EleutherAI/knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models.
hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for...
cdpierse/transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model...