pytorch-vit and ViT_PyTorch

pytorch-vit

Emerging

ViT_PyTorch

Experimental

Maintenance 0/25

Adoption 10/25

Maturity 16/25

Community 17/25

Maintenance 0/25

Adoption 7/25

Maturity 8/25

Community 10/25

Stars: 306

Forks: 36

Downloads: —

Commits (30d): 0

Language: Python

License: MIT

Stars: 25

Forks: 3

Downloads: —

Commits (30d): 0

Language: Python

License: —

Stale 6m No Package No Dependents

No License Stale 6m No Package No Dependents

About pytorch-vit

gupta-abhay/pytorch-vit

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

This project helps machine learning engineers and researchers classify images more accurately and efficiently. It takes raw image data as input and produces highly accurate classifications by leveraging transformer architectures, which are typically used for text. This is ideal for those working on computer vision tasks who want to explore cutting-edge models.

image-classification computer-vision deep-learning machine-learning-research

About ViT_PyTorch

godofpdog/ViT_PyTorch

This is a simple PyTorch implementation of Vision Transformer (ViT) described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale"

This project helps machine learning engineers and researchers quickly set up and train a Vision Transformer (ViT) model for image classification tasks. You input a dataset of images, and it outputs a trained model capable of categorizing new images. This is for professionals building advanced computer vision systems.

image-classification deep-learning computer-vision model-training vision-transformers

Related comparisons

pytorch-vit and Transformer-Implementations pytorch-vit and vision-transformer-from-scratch

Scores updated daily from GitHub, PyPI, and npm data. How scores work