alexiglad/EBT

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

Score: 55 / 100 (Established)

Energy-Based Transformers reformulate transformer inference as an energy-minimization problem, enabling iterative refinement ("System 2 thinking") at every token prediction rather than a single feed-forward pass. The approach scales favorably across data, model depth, parameter count, and FLOPs while improving generalization, demonstrated through multi-modal experiments on NLP, vision, and video. Built on PyTorch Lightning, the codebase provides modular training and inference pipelines with distributed training, HuggingFace dataset integration, and W&B logging, plus minimal examples for quick experimentation.
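
As a rough illustration of the energy-minimization inference loop described above (a minimal sketch only; EnergyNet, predict, and all hyperparameters below are hypothetical placeholders, not the repo's actual classes or API):

# Toy sketch of energy-based inference: a scalar energy network plus a
# gradient-descent refinement loop over the candidate prediction.
import torch

class EnergyNet(torch.nn.Module):
    """Hypothetical stand-in: maps (context, candidate) to a scalar energy."""
    def __init__(self, dim=32):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(2 * dim, 64),
            torch.nn.ReLU(),
            torch.nn.Linear(64, 1),
        )

    def forward(self, context, candidate):
        return self.net(torch.cat([context, candidate], dim=-1)).squeeze(-1)

def predict(model, context, steps=8, lr=0.1):
    # Start from a random candidate and refine it by descending the energy:
    # the iterative "System 2" loop, in contrast to one feed-forward pass.
    candidate = torch.randn_like(context, requires_grad=True)
    for _ in range(steps):
        energy = model(context, candidate).sum()
        grad, = torch.autograd.grad(energy, candidate)
        candidate = (candidate - lr * grad).detach().requires_grad_(True)
    return candidate.detach()

model = EnergyNet()
context = torch.randn(1, 32)
prediction = predict(model, context)

The number of descent steps acts as a compute knob: more steps spend more FLOPs refining a single prediction.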


No package · No dependents
Maintenance: 10/25
Adoption: 10/25
Maturity: 15/25
Community: 20/25


Stars: 613
Forks: 85
Language: Python
License: Apache-2.0
Last pushed: Mar 01, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/alexiglad/EBT"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
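
For programmatic use, the same endpoint can be queried from Python with only the standard library (a usage sketch; the response schema is not documented here, so this just pretty-prints whatever JSON the API returns):

# Fetch the quality data for alexiglad/EBT and print the raw JSON payload.
import json
import urllib.request

url = "https://pt-edge.onrender.com/api/v1/quality/generative-ai/alexiglad/EBT"
with urllib.request.urlopen(url) as resp:
    data = json.load(resp)
print(json.dumps(data, indent=2))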