Vision Transformer Optimization Computer Vision Tools
There are 24 vision transformer optimization tools tracked. 1 score above 50 (established tier). The highest-rated is BR-IDL/PaddleViT at 51/100 with 1,241 stars.
Get all 24 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=computer-vision&subcategory=vision-transformer-optimization&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
BR-IDL/PaddleViT
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for... |
|
Established |
| 2 |
pathak22/unsupervised-video
[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web |
|
Emerging |
| 3 |
IBM/CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899 |
|
Emerging |
| 4 |
NVlabs/GCVit
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers |
|
Emerging |
| 5 |
ViTAE-Transformer/ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer... |
|
Emerging |
| 6 |
bytedance/SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting |
|
Emerging |
| 7 |
wjun0830/QD-DETR
Official pytorch repository for "QD-DETR : Query-Dependent Video... |
|
Emerging |
| 8 |
Seokju-Cho/Volumetric-Aggregation-Transformer
Official Implementation of VAT |
|
Emerging |
| 9 |
amazon-science/glass-text-spotting
Official implementation for "GLASS: Global to Local Attention for Scene-Text... |
|
Emerging |
| 10 |
insitro/ChannelViT
Channel Vision Transformers: An Image Is Worth C x 16 x 16 Words |
|
Emerging |
| 11 |
ViTAE-Transformer/QFormer
The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention" |
|
Emerging |
| 12 |
kkakkkka/ETRIS
[ICCV-2023] The official code of Bridging Vision and Language Encoders:... |
|
Emerging |
| 13 |
dlut-dimt/ReCoNet
ECCV 2022 | Recurrent Correction Network for Fast and Efficient... |
|
Emerging |
| 14 |
PediaMedAI/ViTASD
[ICASSP 2023] Official Implementation of ViTASD: Robust Vision Transformer... |
|
Emerging |
| 15 |
Haochen-Wang409/DropPos
[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing... |
|
Emerging |
| 16 |
maclong01/DeBiFormer
[ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with... |
|
Experimental |
| 17 |
d3tk/REOrder
Does patch ordering affect context-limited vision transformers? |
|
Experimental |
| 18 |
ViTAE-Transformer/ViTAE-Transformer-Scene-Text-Detection
A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@... |
|
Experimental |
| 19 |
demidovd98/sm-vit
Official repository for the paper "Salient Mask-Guided Vision Transformer... |
|
Experimental |
| 20 |
lorebianchi98/FG-OVD
[CVPR 2024 Highlight] Official repository of the paper "The devil is in the... |
|
Experimental |
| 21 |
ruohaoguo/ovavss
Official Implementation of "Open-Vocabulary Audio-Visual Semantic... |
|
Experimental |
| 22 |
LeapLabTHU/DAT-Segmentation
Repository of Vision Transformer with Deformable Attention (CVPR2022) and... |
|
Experimental |
| 23 |
LeapLabTHU/DAT-Detection
Repository of Vision Transformer with Deformable Attention (CVPR2022) and... |
|
Experimental |
| 24 |
ruohaoguo/pavsodr
Official Implementation of "Instance-Level Panoramic Audio-Visual Saliency... |
|
Experimental |