ChristophReich1996/MaxViT

PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022].

/ 100

Emerging

This project provides a PyTorch implementation of the MaxViT architecture for computer vision, specifically image classification. The model takes images as input and outputs class predictions. It is aimed at machine learning engineers and researchers who build and experiment with deep learning models for image analysis.

164 stars. No commits in the last 6 months.

Use this if you are a machine learning practitioner looking to implement or research the MaxViT architecture for image classification within a PyTorch environment.

Not ideal if you are a non-developer or need a ready-to-use application for image classification without custom model building.

deep-learning image-classification computer-vision model-building pytorch

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 14 / 25

Stars: 164

Forks:

Language: Python

License: MIT

Higher-rated alternatives

NVlabs/FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...

ViTAE-Transformer/ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...

microsoft/CvT

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

BR-IDL/PaddleViT

PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

gaohuang/MSDNet

Multi-Scale Dense Networks for Resource Efficient Image Classification (ICLR 2018 Oral)
