jessevig/bertviz

BertViz: Visualize Attention in Transformer Models

65 / 100 (Established)

Provides three complementary visualization modes: head view (individual attention heads), model view (an overview of all layers and heads), and neuron view (query/key vector decomposition). Together these enable multi-level analysis of transformer attention mechanisms. Integrates directly with Hugging Face transformers via a simple Python API that works in Jupyter and Colab notebooks by extracting attention tensors from model outputs. Supports encoder models (BERT), decoder models (GPT-2), and encoder-decoder architectures (BART, T5) with interactive HTML visualizations.
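The notebook workflow described above can be sketched as follows. This is a minimal sketch, not the project's documented API surface: `collect_attention` is a hypothetical helper name, and it assumes `pip install bertviz transformers` plus a model checkpoint such as `bert-base-uncased`.

```python
# Hypothetical helper: run one forward pass and gather what the
# BertViz views need (per-layer attention tensors plus token strings).
def collect_attention(model, tokenizer, text):
    inputs = tokenizer(text, return_tensors="pt")
    # With output_attentions=True, transformers returns one attention
    # tensor per layer, each shaped (batch, heads, seq_len, seq_len).
    outputs = model(**inputs, output_attentions=True)
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
    return outputs.attentions, tokens

def demo():
    # Call demo() inside a Jupyter/Colab cell to render the
    # interactive HTML visualization (assumes network access to
    # download the checkpoint).
    from transformers import AutoModel, AutoTokenizer
    from bertviz import head_view

    name = "bert-base-uncased"
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModel.from_pretrained(name)
    attention, tokens = collect_attention(
        model, tokenizer, "The cat sat on the mat"
    )
    head_view(attention, tokens)
```

`model_view` and `neuron_view` follow the same pattern of passing extracted attention data; the views render as interactive HTML directly in the notebook output cell.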

7,945 stars. Available on PyPI.

Maintenance 10 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 20 / 25


Stars: 7,945
Forks: 871
Language: Python
License: Apache-2.0
Last pushed: Jan 08, 2026
Commits (30d): 0
Dependencies: 8

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/jessevig/bertviz"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
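The same endpoint can be queried from Python. A minimal sketch, assuming only what the curl command above shows: the base path `/api/v1/quality` followed by ecosystem, owner, and repo segments. The response schema and the mechanism for supplying an API key are not documented here, so this only builds the URL and returns the raw JSON.

```python
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(ecosystem, owner, repo):
    # Mirror the path structure of the curl example above.
    return f"{API_BASE}/{ecosystem}/{owner}/{repo}"

def fetch_quality(ecosystem, owner, repo):
    # GET the score card and parse it as JSON (field names unknown,
    # so the dict is returned as-is).
    with urllib.request.urlopen(quality_url(ecosystem, owner, repo)) as resp:
        return json.load(resp)

if __name__ == "__main__":
    print(fetch_quality("transformers", "jessevig", "bertviz"))
```

Keeping the URL construction in its own function makes it easy to point the client at other repositories in the same ecosystem without string surgery.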