Vision Transformer Implementations Transformer Models

Reference implementations and educational repositories of Vision Transformer architectures across frameworks (TensorFlow, PyTorch, Keras). Includes core ViT models and variants for standard vision tasks. Does NOT include specialized vision-language models, 3D vision, medical imaging, or hybrid architectures that significantly depart from standard ViT design.

There are 41 vision transformer implementations models tracked. The highest-rated is Kohulan/DECIMER-Image_Transformer at 48/100 with 345 stars.

Get all 41 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=vision-transformer-implementations&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 Kohulan/DECIMER-Image_Transformer

DECIMER Image Transformer is a deep-learning-based tool designed for...

48
Emerging
2 sovit-123/vision_transformers

Vision Transformers for image classification, image segmentation, and object...

46
Emerging
3 fcakyon/video-transformers

Easiest way of fine-tuning HuggingFace video classification models

46
Emerging
4 leaderj1001/BottleneckTransformers

Bottleneck Transformers for Visual Recognition

40
Emerging
5 qubvel/transformers-notebooks

Inference and fine-tuning examples for vision models from 🤗 Transformers

39
Emerging
6 rishikksh20/convolution-vision-transformers

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

38
Emerging
7 cmhungsteve/Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention,...

38
Emerging
8 alohays/awesome-visual-representation-learning-with-transformers

Awesome Transformers (self-attention) in Computer Vision

37
Emerging
9 mmaaz60/EdgeNeXt

[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently...

35
Emerging
10 sayakpaul/robustness-vit

Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).

35
Emerging
11 xmindflow/Awesome-Transformer-in-Medical-Imaging

[MedIA Journal] An ultimately comprehensive paper list of Vision...

35
Emerging
12 adaptivetokensampling/ATS

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral...

34
Emerging
13 EMalagoli92/GCViT-TensorFlow

TensorFlow 2.X reimplementation of Global Context Vision Transformers, Ali...

31
Emerging
14 RLado/STB-VMM

STB-VMM: Swin Transformer Based Video Motion Magnification (official repository)

30
Emerging
15 varchasvee108/vision-transformer-maze-agent

Vision Transformer agent that learns to navigate mazes while visualizing...

29
Experimental
16 ziplab/HVT

[ICCV 2021] Official implementation of "Scalable Vision Transformers with...

29
Experimental
17 GiannakopoulosIlias/vision-transformer-network-for-mr-electrical-properties-tomography

A 3D Vision Transformer-based neural network for reconstructing electrical...

29
Experimental
18 EMalagoli92/CvT-TensorFlow

TensorFlow 2.X reimplementation of CvT: Introducing Convolutions to Vision...

24
Experimental
19 rajatsaini0294/awesome-image-transformer

List of all the papers on Transformers for Vision.

22
Experimental
20 sayakpaul/vision-transformers-tf

A non-exhaustive collection of vision transformer models implemented in TensorFlow.

21
Experimental
21 jmanuelc87/vision-transformer

Implementation of different vision transformer models for classification,...

19
Experimental
22 Kotomiya07/kuzushiji-vision

くずし字認識システム

19
Experimental
23 RubenCasal/owl_vit_detector

NanoOWL Detection System enables real-time open-vocabulary object detection...

18
Experimental
24 MingSun-Tse/Awesome-Efficient-ViT

Recent Advances on Efficient Vision Transformers

17
Experimental
25 danilodjor/image-retrieval-using-transformers

This repository contains code used to perform image retrieval using...

16
Experimental
26 chagmgang/dinov2-remote-sensing

Implementation dino v2 for remote sensing with huggingface transformers

16
Experimental
27 vitality-vis/vitality-vis.github.io

Promoting Serendipitous Discovery of Academic Literature with Transformers &...

16
Experimental
28 revanurambareesh/instantaneous_transformer

Official repo of Instantaneous Transformers for Video based Physiology...

15
Experimental
29 uakarsh/TiLT-Implementation

Implementation of the paper: Going Full-TILT Boogie on Document...

15
Experimental
30 GuillaumeZahnd/vision-transformer

Vision Transformer

14
Experimental
31 tim-roderick/VST

Video Summarization Transformer: Implementation in PyTorch of the...

14
Experimental
32 matin-ghorbani/Video-Classification-Transformers

Implement a video classification using transformers

13
Experimental
33 ahmedgh970/convnext-charm

Official Tensorflow implementation of ConvNeXt-ChARM: ConvNeXt-based...

13
Experimental
34 koc-lab/vispool

Enhancing Transformer Encoders with Vector Visibility Graph Neural Networks...

12
Experimental
35 mbari-org/vitstrain

Fine-tune vision transformer models to classify Plankton, UAV(drone),...

11
Experimental
36 DaniGarciaPerez/vision_transformer

A repo to explore the implementation of a Vision Transformer from scratch...

11
Experimental
37 4rtux/3D-CNN-Action-Recognition-Model

Identificación de actividades cotidianas basado en visión por computador y...

11
Experimental
38 EMalagoli92/VAN-Classification-TensorFlow

TensorFlow 2.X reimplementation of Visual Attention Network, Meng-Hao Guo,...

10
Experimental
39 nachiket273/VisTrans

Implementations of transformers based models for different vision tasks

10
Experimental
40 nakshatrasinghh/Vision-Transformer

Tensorflow implementation of the Vision Transformer (Bye-Bye Convolutions)

10
Experimental
41 Justin900429/vision-transformer

Implement the vision transformer using pytorch

10
Experimental