bhavnicksm/vanilla-transformer-jax
JAX/Flax implementation of 'Attention Is All You Need' by Vaswani et al. (https://arxiv.org/abs/1706.03762)
No commits in the last 6 months. Available on PyPI.
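For context, the core operation the paper introduces is scaled dot-product attention, Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V. The sketch below is a minimal, generic JAX rendering of that formula; it illustrates the technique only and is not this package's API.

import jax.numpy as jnp
from jax.nn import softmax

def scaled_dot_product_attention(q, k, v):
    # softmax(Q K^T / sqrt(d_k)) V, per Vaswani et al. (2017)
    d_k = q.shape[-1]
    scores = q @ jnp.swapaxes(k, -2, -1) / jnp.sqrt(d_k)
    return softmax(scores, axis=-1) @ v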
Stars: 15
Forks: 3
Language: Python
License: MIT
Category: Transformers
Last pushed: Aug 16, 2021
Monthly downloads: 47
Commits (30d): 0
Dependencies: 3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/bhavnicksm/vanilla-transformer-jax"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
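A minimal Python sketch for reading the response with no extra dependencies; the JSON field names used here (stars, forks, monthly_downloads) are assumptions about the payload, not documented behavior.

import json
import urllib.request

# Same endpoint as the curl example above.
URL = ("https://pt-edge.onrender.com/api/v1/quality/"
       "transformers/bhavnicksm/vanilla-transformer-jax")

with urllib.request.urlopen(URL) as resp:
    data = json.load(resp)

# Field names are assumed; print the raw payload to confirm the schema.
print(data.get("stars"), data.get("forks"), data.get("monthly_downloads"))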
Higher-rated alternatives
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
kyegomez/SparseAttention
PyTorch implementation of the sparse attention from the paper "Generating Long Sequences with Sparse Transformers"
AbdelStark/attnres
Rust implementation of Attention Residuals from MoonshotAI/Kimi
takara-ai/go-attention
A full attention mechanism and transformer in pure Go.