alexiglad/EBT

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

/ 100

Established

Energy-Based Transformers reformulate transformer inference as an energy minimization problem, enabling iterative refinement ("System 2 thinking") at every token prediction rather than a single feed-forward pass. The approach scales favorably across multiple axes (data, model depth, parameters, and FLOPs) while improving generalization, as demonstrated in experiments across several modalities (NLP, vision, video). Built on PyTorch Lightning, the codebase provides modular training and inference pipelines with support for distributed training, HuggingFace dataset integration, and W&B logging, along with minimal examples for quick experimentation.
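To make the idea of inference as energy minimization concrete, here is a minimal toy sketch in plain Python. It is not the repository's API: the `energy`, `energy_grad`, and `refine` functions, the quadratic energy, and the step size are all assumptions chosen for illustration. In an actual energy-based model the energy would presumably come from a learned network and its gradient from automatic differentiation; here a hand-written quadratic stands in so the refinement loop is easy to follow.

```python
# Toy illustration only: NOT the EBT repository's API.
# A hand-written quadratic energy stands in for a learned energy model.

def energy(context, prediction):
    """Hypothetical energy: low when prediction is close to 2 * context."""
    return sum((p - 2.0 * c) ** 2 for c, p in zip(context, prediction))


def energy_grad(context, prediction):
    """Analytic gradient of `energy` with respect to the prediction."""
    return [2.0 * (p - 2.0 * c) for c, p in zip(context, prediction)]


def refine(context, prediction, steps=10, step_size=0.25):
    """Iteratively lower the energy by gradient descent on the prediction.

    Returns the refined prediction and the energy recorded before the
    first step and after every step.
    """
    trajectory = [energy(context, prediction)]
    for _ in range(steps):
        grad = energy_grad(context, prediction)
        prediction = [p - step_size * g for p, g in zip(prediction, grad)]
        trajectory.append(energy(context, prediction))
    return prediction, trajectory


context = [1.0, -0.5, 3.0]
initial_guess = [0.0, 0.0, 0.0]  # arbitrary starting prediction
refined, energies = refine(context, initial_guess)

print("initial energy:", energies[0])
print("final energy:  ", energies[-1])
print("refined prediction:", refined)
```

In this sketch the `steps` argument plays the role of extra inference-time computation: each additional step moves the prediction further down the energy surface, which is one way to read the "iterative refinement" described above.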

613 stars.

No Package, No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 20 / 25


Stars: 613
Forks:
Language: Python
License: Apache-2.0

Related tools

NVIDIA-NeMo/NeMo

A scalable generative AI framework built for researchers and developers working on Large...

vlm-run/vlmrun-hub

A hub for various industry-specific schemas to be used with VLMs.

HyperGAI/HPT

HPT - Open Multimodal LLMs from HyperGAI

yash9439/Falcon-Local-AI-Model

Explore this GitHub repository housing 3 versions of Falcon code for text generation. Each...

DorsaRoh/transformer-from-scratch

Complete transformer from scratch, using only numpy
