cdpierse/transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
Supports attribution-based explainability for both text and vision transformers through gradient-based methods (Integrated Gradients), generating token-level importance scores. Provides multiple explainer classes tailored to different task types—sequence classification, pairwise classification, question answering, and image classification—with built-in visualization as interactive HTML or static PNG outputs. Integrates directly with Hugging Face transformers' model and tokenizer APIs, supporting any pretrained or fine-tuned model from the ecosystem.
1,413 stars. No commits in the last 6 months.
Stars: 1,413
Forks: 100
Language: Jupyter Notebook
License: Apache-2.0
Last pushed: Aug 30, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/cdpierse/transformers-interpret"
Open to everyone: 100 requests/day with no key needed; a free API key raises the limit to 1,000 requests/day.
Higher-rated alternatives
jessevig/bertviz
BertViz: Visualize Attention in Transformer Models
inseq-team/inseq
Interpretability for sequence generation models 🐛 🔍
EleutherAI/knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models.
hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for...
taufeeque9/codebook-features
Sparse and discrete interpretability tool for neural networks