NKU-MetautoAI/awesome-large-vision-language-models
Advances in recent large vision language models (LVLMs)
This resource helps you navigate the rapidly evolving landscape of large language and vision models. It provides a structured overview, including key details like release dates, organizations, and model parameters for various models. Researchers, AI practitioners, and data scientists who need to stay current with the latest advancements in large AI models will find this useful for their work.
No commits in the last 6 months.
Use this if you need a quick way to compare and select appropriate large language or vision models for your research or development projects.
Not ideal if you are looking for ready-to-use APIs or direct implementations of these models without further technical setup.
Stars
15
Forks
—
Language
—
License
—
Category
Last pushed
Sep 23, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/NKU-MetautoAI/awesome-large-vision-language-models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification),...
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the...
thuml/AutoTimes
Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"