Gorilla-Lab-SCUT/PaDT
[ICLR 2026] Official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs"
36
/ 100
Emerging
251 stars.
No Package
No Dependents
Maintenance
6 / 25
Adoption
10 / 25
Maturity
9 / 25
Community
11 / 25
Stars
251
Forks
13
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 31, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/Gorilla-Lab-SCUT/PaDT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
col14m/cadrille
[ICLR2026] cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning
43
filaPro/cad-recode
[ICCV2025] CAD-Recode: Reverse Engineering CAD Code from Point Clouds
41
pengsongyou/openscene
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
36
cambrian-mllm/cambrian-s
Cambrian-S: Towards Spatial Supersensing in Video
36
worldbench/3EED
[NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D
36