ViT Image Classification Transformer Models
Tools and implementations for training Vision Transformers on image classification tasks across various datasets (MNIST, CIFAR-10, custom domains). Includes from-scratch implementations, fine-tuning tutorials, and comparative studies. Does NOT include vision-language models, object detection, medical imaging, 3D vision, or other downstream vision tasks beyond classification.
There are 34 vit image classification models tracked. The highest-rated is UdbhavPrasad072300/Transformer-Implementations at 48/100 with 69 stars and 31 monthly downloads.
Get all 34 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=vit-image-classification&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
UdbhavPrasad072300/Transformer-Implementations
Library - Vanilla, ViT, DeiT, BERT, GPT |
|
Emerging |
| 2 |
jaehyunnn/ViTPose_pytorch
An unofficial implementation of ViTPose [Y. Xu et al., 2022] |
|
Emerging |
| 3 |
tintn/vision-transformer-from-scratch
A Simplified PyTorch Implementation of Vision Transformer (ViT) |
|
Emerging |
| 4 |
icon-lab/ResViT
Official Implementation of ResViT: Residual Vision Transformers for... |
|
Emerging |
| 5 |
gupta-abhay/pytorch-vit
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale |
|
Emerging |
| 6 |
NVlabs/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges... |
|
Emerging |
| 7 |
rishikksh20/CrossViT-pytorch
Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer... |
|
Emerging |
| 8 |
sayakpaul/probing-vits
Probing the representations of Vision Transformers. |
|
Emerging |
| 9 |
all-things-vits/code-samples
Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and... |
|
Emerging |
| 10 |
kyegomez/MC-ViT
Implementation of the model: "(MC-ViT)" from the paper: "Memory... |
|
Emerging |
| 11 |
jordandeklerk/SwinViT
Modified Swin Transformer model in PyTorch on CIFAR-10 for image classification |
|
Experimental |
| 12 |
Orion-AI-Lab/televit
Teleconnection-driven vision transformers for improved long-term forecasting |
|
Experimental |
| 13 |
vishvaRam/Fine-Tuning-Siglip2-Vit-Model
This repository offers tools and guidance for fine-tuning the Siglip2 Vision... |
|
Experimental |
| 14 |
sayannath/ViT-TF-Hub-Application
Build and fine-tune your Image Classifier using a Vision Transformer Model... |
|
Experimental |
| 15 |
shub-garg/Vision-Transformer-VIT-for-MNIST
This repository implements a Vision Transformer (ViT) to classify... |
|
Experimental |
| 16 |
r-dug/GCViT_Classifier
Image classifier and training script, using GCViT |
|
Experimental |
| 17 |
godofpdog/ViT_PyTorch
This is a simple PyTorch implementation of Vision Transformer (ViT)... |
|
Experimental |
| 18 |
benisalla/Tiny-ViT-Transformer-from-scratch
This repository offers a straightforward implementation of Vision... |
|
Experimental |
| 19 |
guglielmocamporese/visual-transformer-pytorch
An easy and minimal implementation of the Visual Transformer (ViT) in... |
|
Experimental |
| 20 |
wambugu71/SmartAgriImage_classification_ViT
Vision Transformer trained with thousands of agricultural diseases in... |
|
Experimental |
| 21 |
PRITHIVSAKTHIUR/Vit-Mature-Content-Detection
Vit-Mature-Content-Detection is an image classification vision-language... |
|
Experimental |
| 22 |
zubairmk83/ViTP
🌟 Pretrain domain-specific models using visual instructions to enhance... |
|
Experimental |
| 23 |
bikhanal/vision-transformer
Implementation of Vision Transformer (ViT) from scratch for image classification. |
|
Experimental |
| 24 |
sergio-sanz-rodriguez/Vision-Transformers-Image-Classification
Development of Vision Transformer (ViT) networks for multi-class image... |
|
Experimental |
| 25 |
Vitgracer/ViT-from-scratch
Simple minimal Vision Transformer implementation in PyTorch |
|
Experimental |
| 26 |
Sid7on1/ViT-Vision-Transformer
ViT-ClassiPy is a lightweight Vision Transformer built from scratch using... |
|
Experimental |
| 27 |
AddictivelyRecursive/lightweight-multimodal-transformer-pipeline
Lightweight multimodal transformer pipeline comparing MobileViT and... |
|
Experimental |
| 28 |
jhtobigs/ViT_Survey
Vision Transformer Survey and Implementation |
|
Experimental |
| 29 |
jordandeklerk/ViT
Implementing a vision transformer model in PyTorch on CIFAR-10 |
|
Experimental |
| 30 |
Nahom32/ViT
An implementation of the vision transformer using CIFAR-10. |
|
Experimental |
| 31 |
codebywiam/visual-transformer
A deep learning project using Vision Transformer (ViT) to classify bean leaf... |
|
Experimental |
| 32 |
PawanKonwar/Huggingface-Image-Project
Custom Vision Transformer (ViT) implementation for image classification.... |
|
Experimental |
| 33 |
conceptofmind/DeepViT-flax
Implementation of Deep Vision Transformer in Flax |
|
Experimental |
| 34 |
KimiaaK/vision-transformer-HuggingFace
This project utilizes ViT via HuggingFace to classify 9 strawberry diseases. |
|
Experimental |