Vision Language Instruction Tuning Diffusion Models
There are 1 vision language instruction tuning models tracked. The highest-rated is FoundationVision/LlamaGen at 35/100 with 1,939 stars.
Get all 1 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=vision-language-instruction-tuning&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation |
|
Emerging |