gmlwns2000/sttabt
[ICLR2023] Official code of Sparse Token Transformer with Attention Back-Tracking
This project helps machine learning engineers run Transformer-based models more efficiently on devices with limited memory and compute, such as mobile phones or edge devices. It takes an existing Transformer model and sparsifies its tokens using attention back-tracking, yielding a model that uses less memory and runs faster while retaining strong accuracy on tasks such as image classification and natural language processing. The intended users are engineers deploying models in resource-constrained environments.
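To make the underlying idea concrete, here is a minimal, generic sketch of attention-guided token pruning in PyTorch. It is not the repository's implementation, it omits the layer-wise back-tracking the paper describes, and every name in it is hypothetical.

```python
# Generic sketch of attention-guided token pruning, for illustration only.
# This is NOT the STTABT implementation; every name here is hypothetical.
# Idea: score each token by how much attention it receives, then keep only
# the top-k tokens before the next Transformer layer.
import torch

def prune_tokens(hidden_states, attention_probs, keep_ratio=0.5):
    """hidden_states:   (batch, seq_len, dim) token representations
    attention_probs: (batch, heads, seq_len, seq_len) softmaxed attention
    keep_ratio:      fraction of tokens to keep
    """
    # Importance = attention each token receives, averaged over heads
    # and summed over query positions -> (batch, seq_len).
    importance = attention_probs.mean(dim=1).sum(dim=1)
    k = max(1, int(importance.size(1) * keep_ratio))
    keep_idx = importance.topk(k, dim=1).indices       # (batch, k)
    keep_idx, _ = keep_idx.sort(dim=1)                 # preserve token order
    batch_idx = torch.arange(hidden_states.size(0)).unsqueeze(1)
    return hidden_states[batch_idx, keep_idx]          # (batch, k, dim)

# Toy usage: prune half of 128 tokens in a batch of 2.
h = torch.randn(2, 128, 64)
attn = torch.softmax(torch.randn(2, 8, 128, 128), dim=-1)
print(prune_tokens(h, attn, keep_ratio=0.5).shape)     # torch.Size([2, 64, 64])
```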
No commits in the last 6 months.
Use this if you are a machine learning engineer working with Transformer models and need to reduce their computational and memory footprint for deployment on mobile or edge devices without significantly sacrificing accuracy.
Not ideal if you are working with non-Transformer neural network architectures or if your deployment environment has ample computational resources and memory.
Stars: 8
Forks: —
Language: Jupyter Notebook
License: —
Category: transformers
Last pushed: Mar 15, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/gmlwns2000/sttabt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
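For scripted access, a minimal Python sketch using only the standard library is shown below. It assumes the endpoint returns a JSON object; the specific field names are whatever the API returns and are not guaranteed here.

```python
# Minimal sketch of querying the endpoint shown above from Python.
# Assumes the response is a JSON object; field names are not guaranteed.
import json
import urllib.request

URL = "https://pt-edge.onrender.com/api/v1/quality/transformers/gmlwns2000/sttabt"

with urllib.request.urlopen(URL, timeout=10) as resp:
    data = json.loads(resp.read().decode("utf-8"))

# Print whichever metadata fields the endpoint returns.
for key, value in data.items():
    print(f"{key}: {value}")
```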
Higher-rated alternatives
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features...
kanishkamisra/minicons
Utility for behavioral and representational analyses of Language Models
lucidrains/simple-hierarchical-transformer
Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT
lucidrains/dreamer4
Implementation of Danijar's latest iteration for his Dreamer line of work
Nicolepcx/Transformers-in-Action
This is the corresponding code for the book Transformers in Action