chrisliu298/awesome-sparse-autoencoders

A resource repository of sparse autoencoders for large language models

/ 100

Experimental

This repository helps AI researchers and machine learning engineers stay current with the latest advancements in sparse autoencoders, particularly for understanding how large language models (LLMs) work internally. It curates a comprehensive list of academic papers and blog posts on this topic. You get a structured overview of research, which is valuable for anyone focused on mechanistic interpretability of LLMs.

No commits in the last 6 months.

Use this if you are an AI researcher or machine learning engineer actively involved in understanding or developing mechanistic interpretability for large language models.

Not ideal if you are looking for ready-to-use code, a tutorial for building sparse autoencoders, or general information on LLMs without a focus on their internal mechanisms.

AI-research LLM-interpretability mechanistic-interpretability machine-learning-engineering AI-safety

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

—

License

Apache-2.0

Higher-rated alternatives

BradyFU/Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

FoundationVision/Liquid

(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators

Paranioar/Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification),...

Yangyi-Chen/Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the...

thuml/AutoTimes

Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"

Explore Transformer Models

All categories Trending Transformer directory Insights