ChristophReich1996/MaxViT
PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].
This project offers a pre-configured architecture for computer vision tasks, specifically for image classification. It takes image data as input and outputs classifications. This tool is for machine learning engineers and researchers who are building and experimenting with advanced deep learning models for image analysis.
164 stars. No commits in the last 6 months.
Use this if you are a machine learning practitioner looking to implement or research the MaxViT architecture for image classification within a PyTorch environment.
Not ideal if you are a non-developer or need a ready-to-use application for image classification without custom model building.
Stars
164
Forks
18
Language
Python
License
MIT
Category
Last pushed
Jul 12, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/ChristophReich1996/MaxViT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...
microsoft/CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
BR-IDL/PaddleViT
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
gaohuang/MSDNet
Multi-Scale Dense Networks for Resource Efficient Image Classification (ICLR 2018 Oral)