Vision Transformer Implementations: Transformer Models

Reference implementations and educational repositories of Vision Transformer architectures across frameworks (TensorFlow, PyTorch, Keras). Includes core ViT models and variants for standard vision tasks. Does NOT include specialized vision-language models, 3D vision, medical imaging, or hybrid architectures that significantly depart from standard ViT design.

41 Vision Transformer implementation projects are tracked. The highest-rated is Kohulan/DECIMER-Image_Transformer, which scores 48/100 and has 345 stars.

Get the projects as JSON (the example request sets limit=20; 41 projects are tracked)

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=vision-transformer-implementations&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
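For scripted access, the same request can be sketched in Python using only the standard library. The endpoint and query parameters are copied from the curl command; the helper names `build_url` and `fetch_projects` are hypothetical, the response schema is not documented on this page (so the decoded JSON is returned unchanged), and whether the API accepts `limit` values above 20 is an assumption that has not been verified here.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the query URL for this category (hypothetical helper)."""
    params = {
        "domain": "transformers",
        "subcategory": "vision-transformer-implementations",
        "limit": limit,  # assumption: values other than 20 may or may not be accepted
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=10):
    """Send the GET request and decode the JSON body.

    The response structure is not described on this page, so the
    decoded object is returned as-is without further interpretation.
    """
    with urlopen(build_url(limit), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_projects()` issues one request against the unauthenticated daily quota mentioned above; `build_url()` can be used on its own to inspect the URL without touching the network.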

#	Model	Score	Tier	Stars	Language
1	Kohulan/DECIMER-Image_Transformer DECIMER Image Transformer is a deep-learning-based tool designed for...	48	Emerging	345	Python
2	sovit-123/vision_transformers Vision Transformers for image classification, image segmentation, and object...	46	Emerging	65	Python
3	fcakyon/video-transformers Easiest way of fine-tuning HuggingFace video classification models	46	Emerging	148	Python
4	leaderj1001/BottleneckTransformers Bottleneck Transformers for Visual Recognition	40	Emerging	279	Python
5	qubvel/transformers-notebooks Inference and fine-tuning examples for vision models from 🤗 Transformers	39	Emerging	165	Jupyter Notebook
6	rishikksh20/convolution-vision-transformers PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers	38	Emerging	226	Python
7	cmhungsteve/Awesome-Transformer-Attention An ultimately comprehensive paper list of Vision Transformer/Attention,...	38	Emerging	5,022	—
8	alohays/awesome-visual-representation-learning-with-transformers Awesome Transformers (self-attention) in Computer Vision	37	Emerging	269	—
9	mmaaz60/EdgeNeXt [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently...	35	Emerging	411	Python
10	sayakpaul/robustness-vit Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).	35	Emerging	122	Jupyter Notebook
11	xmindflow/Awesome-Transformer-in-Medical-Imaging [MedIA Journal] An ultimately comprehensive paper list of Vision...	35	Emerging	218	—
12	adaptivetokensampling/ATS Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral...	34	Emerging	104	Shell
13	EMalagoli92/GCViT-TensorFlow TensorFlow 2.X reimplementation of Global Context Vision Transformers, Ali...	31	Emerging	7	Python
14	RLado/STB-VMM STB-VMM: Swin Transformer Based Video Motion Magnification (official repository)	30	Emerging	50	Python
15	varchasvee108/vision-transformer-maze-agent Vision Transformer agent that learns to navigate mazes while visualizing...	29	Experimental	3	Python
16	ziplab/HVT [ICCV 2021] Official implementation of "Scalable Vision Transformers with...	29	Experimental	33	Python
17	GiannakopoulosIlias/vision-transformer-network-for-mr-electrical-properties-tomography A 3D Vision Transformer-based neural network for reconstructing electrical...	29	Experimental	9	Python
18	EMalagoli92/CvT-TensorFlow TensorFlow 2.X reimplementation of CvT: Introducing Convolutions to Vision...	24	Experimental	3	Python
19	rajatsaini0294/awesome-image-transformer List of all the papers on Transformers for Vision.	22	Experimental	7	—
20	sayakpaul/vision-transformers-tf A non-exhaustive collection of vision transformer models implemented in TensorFlow.	21	Experimental	10	—
21	jmanuelc87/vision-transformer Implementation of different vision transformer models for classification,...	19	Experimental	—	Python
22	Kotomiya07/kuzushiji-vision Kuzushiji (cursive Japanese script) recognition system	19	Experimental	—	Python
23	RubenCasal/owl_vit_detector NanoOWL Detection System enables real-time open-vocabulary object detection...	18	Experimental	2	C++
24	MingSun-Tse/Awesome-Efficient-ViT Recent Advances on Efficient Vision Transformers	17	Experimental	55	—
25	danilodjor/image-retrieval-using-transformers This repository contains code used to perform image retrieval using...	16	Experimental	3	Python
26	chagmgang/dinov2-remote-sensing Implementation dino v2 for remote sensing with huggingface transformers	16	Experimental	36	Jupyter Notebook
27	vitality-vis/vitality-vis.github.io Promoting Serendipitous Discovery of Academic Literature with Transformers &...	16	Experimental	1	JavaScript
28	revanurambareesh/instantaneous_transformer Official repo of Instantaneous Transformers for Video based Physiology...	15	Experimental	22	Python
29	uakarsh/TiLT-Implementation Implementation of the paper: Going Full-TILT Boogie on Document...	15	Experimental	18	Jupyter Notebook
30	GuillaumeZahnd/vision-transformer Vision Transformer	14	Experimental	—	Python
31	tim-roderick/VST Video Summarization Transformer: Implementation in PyTorch of the...	14	Experimental	10	Jupyter Notebook
32	matin-ghorbani/Video-Classification-Transformers Implement a video classification using transformers	13	Experimental	8	Jupyter Notebook
33	ahmedgh970/convnext-charm Official Tensorflow implementation of ConvNeXt-ChARM: ConvNeXt-based...	13	Experimental	5	Python
34	koc-lab/vispool Enhancing Transformer Encoders with Vector Visibility Graph Neural Networks...	12	Experimental	3	Python
35	mbari-org/vitstrain Fine-tune vision transformer models to classify Plankton, UAV(drone),...	11	Experimental	5	Python
36	DaniGarciaPerez/vision_transformer A repo to explore the implementation of a Vision Transformer from scratch...	11	Experimental	—	Python
37	4rtux/3D-CNN-Action-Recognition-Model Identification of everyday activities based on computer vision and...	11	Experimental	—	Python
38	EMalagoli92/VAN-Classification-TensorFlow TensorFlow 2.X reimplementation of Visual Attention Network, Meng-Hao Guo,...	10	Experimental	1	Python
39	nachiket273/VisTrans Implementations of transformers based models for different vision tasks	10	Experimental	1	Python
40	nakshatrasinghh/Vision-Transformer Tensorflow implementation of the Vision Transformer (Bye-Bye Convolutions)	10	Experimental	1	Python
41	Justin900429/vision-transformer Implement the vision transformer using pytorch	10	Experimental	1	Python

Comparisons in this category

Awesome-Transformer-Attention and awesome-visual-representation-learning-with-transformers (38 vs 37)
Awesome-Transformer-Attention and Awesome-Transformer-in-Medical-Imaging (38 vs 35)
convolution-vision-transformers and CvT-TensorFlow (38 vs 24)
vision_transformers and vision-transformers-tf (46 vs 21)