Vision Transformer Optimization Computer Vision Tools

There are 24 vision transformer optimization tools tracked. 1 score above 50 (established tier). The highest-rated is BR-IDL/PaddleViT at 51/100 with 1,241 stars.

Get all 24 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=computer-vision&subcategory=vision-transformer-optimization&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 BR-IDL/PaddleViT

:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for...

51
Established
2 pathak22/unsupervised-video

[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web

48
Emerging
3 IBM/CrossViT

Official implementation of CrossViT. https://arxiv.org/abs/2103.14899

45
Emerging
4 NVlabs/GCVit

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

43
Emerging
5 ViTAE-Transformer/ViTDet

Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer...

42
Emerging
6 bytedance/SPTSv2

The official implementation of SPTS v2: Single-Point Text Spotting

41
Emerging
7 wjun0830/QD-DETR

Official pytorch repository for "QD-DETR : Query-Dependent Video...

41
Emerging
8 Seokju-Cho/Volumetric-Aggregation-Transformer

Official Implementation of VAT

39
Emerging
9 amazon-science/glass-text-spotting

Official implementation for "GLASS: Global to Local Attention for Scene-Text...

39
Emerging
10 insitro/ChannelViT

Channel Vision Transformers: An Image Is Worth C x 16 x 16 Words

38
Emerging
11 ViTAE-Transformer/QFormer

The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"

37
Emerging
12 kkakkkka/ETRIS

[ICCV-2023] The official code of Bridging Vision and Language Encoders:...

36
Emerging
13 dlut-dimt/ReCoNet

ECCV 2022 | Recurrent Correction Network for Fast and Efficient...

34
Emerging
14 PediaMedAI/ViTASD

[ICASSP 2023] Official Implementation of ViTASD: Robust Vision Transformer...

32
Emerging
15 Haochen-Wang409/DropPos

[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing...

32
Emerging
16 maclong01/DeBiFormer

[ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with...

29
Experimental
17 d3tk/REOrder

Does patch ordering affect context-limited vision transformers?

26
Experimental
18 ViTAE-Transformer/ViTAE-Transformer-Scene-Text-Detection

A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@...

25
Experimental
19 demidovd98/sm-vit

Official repository for the paper "Salient Mask-Guided Vision Transformer...

23
Experimental
20 lorebianchi98/FG-OVD

[CVPR 2024 Highlight] Official repository of the paper "The devil is in the...

22
Experimental
21 ruohaoguo/ovavss

Official Implementation of "Open-Vocabulary Audio-Visual Semantic...

21
Experimental
22 LeapLabTHU/DAT-Segmentation

Repository of Vision Transformer with Deformable Attention (CVPR2022) and...

18
Experimental
23 LeapLabTHU/DAT-Detection

Repository of Vision Transformer with Deformable Attention (CVPR2022) and...

15
Experimental
24 ruohaoguo/pavsodr

Official Implementation of "Instance-Level Panoramic Audio-Visual Saliency...

14
Experimental