T2I Evaluation Benchmarks (Diffusion Models)
Benchmarks, datasets, and metrics for evaluating text-to-image generation quality and alignment. Does NOT include tools for generating images, training models, or prompt optimization.
50 T2I evaluation benchmark projects are tracked. One scores above 70 (Verified tier). The highest-rated is Vchitect/VBench at 73/100, with 1,537 stars and 3,530 monthly downloads. One of the top 10 is actively maintained.
Get the projects as JSON (the example request below sets limit=20)
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=t2i-evaluation-benchmarks&limit=20"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
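The same request can be made from a script. The following is a minimal Python sketch, not an official client: it rebuilds the URL from the curl example and decodes the response as JSON. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not documented on this page, so the sketch returns the decoded payload without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="diffusion",
              subcategory="t2i-evaluation-benchmarks",
              limit=20):
    """Build the query URL (hypothetical helper; the parameter names
    mirror the query string in the curl example)."""
    query = urlencode({
        "domain": domain,
        "subcategory": subcategory,
        "limit": limit,
    })
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Perform the GET request and decode the body as JSON.

    The response structure is an unknown here, so the decoded
    object is returned as-is for the caller to inspect.
    """
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Example usage (performs a network request, so it is left commented out):
# data = fetch_projects(build_url())
# print(json.dumps(data, indent=2)[:1000])
```

Whether the endpoint accepts a `limit` larger than 20 is not stated here, so the default matches the curl example rather than the full count of 50.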
| # | Model | Description | Score | Tier |
|---|---|---|---|---|
| 1 | Vchitect/VBench | [CVPR2024 Highlight] VBench - We Evaluate Video Generation | 73 | Verified |
| 2 | VectorSpaceLab/OmniGen | OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340 | | Established |
| 3 | EndlessSora/focal-frequency-loss | [ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis | | Established |
| 4 | JIA-Lab-research/DreamOmni2 | This project is the official implementation of 'DreamOmni2: Multimodal... | | Emerging |
| 5 | PKU-YuanGroup/ChronoMagic-Bench | [NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic... | | Emerging |
| 6 | SkyworkAI/UniPic | Open-source SOTA multi-image editing model | | Emerging |
| 7 | Amshaker/Mobile-O | Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device | | Emerging |
| 8 | ViStoryBench/vistorybench | [CVPR 2026] ViStoryBench: AI Story Visualization Benchmark | | Emerging |
| 9 | nupurkmr9/syncd | SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization... | | Emerging |
| 10 | uni-medical/UniMedVL | Official implementation of "UniMedVL: Unifying Medical Multimodal... | | Emerging |
| 11 | zai-org/CogView2 | official code repo for paper "CogView2: Faster and Better Text-to-Image... | | Emerging |
| 12 | Karine-Huang/T2I-CompBench | [Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image... | | Emerging |
| 13 | zai-org/CogView4 | CogView4, CogView3-Plus and CogView3 (ECCV 2024) | | Emerging |
| 14 | tobran/GALIP | [CVPR2023] A faster, smaller, and better text-to-image model for large-scale training | | Emerging |
| 15 | OpenGVLab/GenExam | GenExam: A Multidisciplinary Text-to-Image Exam | | Emerging |
| 16 | AIDC-AI/Ovis-U1 | A unified model that seamlessly integrates multimodal understanding,... | | Emerging |
| 17 | JustusThies/NeuralTexGen | Image-space texture optimization of 3D meshes using PyTorch | | Emerging |
| 18 | humansensinglab/ITI-GEN | [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation | | Emerging |
| 19 | inclusionAI/Ming-UniVision | Code release for Ming-UniVision: Joint Image Understanding and Generation... | | Emerging |
| 20 | 360CVGroup/PlanGen | Unified layout planning and image generation, ICCV2025 | | Experimental |
| 21 | lxa9867/ImageFolder | High-performance Image Tokenizers for VAR and AR | | Experimental |
| 22 | boomb0om/text2image-benchmark | Benchmark for generative image models | | Experimental |
| 23 | FoundationVision/OmniTokenizer | [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint... | | Experimental |
| 24 | KlingAIResearch/IMBA-Loss | [ICCV 2025] Official Implementation of the Paper "Imbalance in Balance:... | | Experimental |
| 25 | migs2021/migs | MIGS: Meta Image Generation from Scene Graphs (BMVC 2021) | | Experimental |
| 26 | microsoft/BizGenEval | Bridging the gap between image generation and real-world design: a benchmark... | | Experimental |
| 27 | GordonChen19/STENCIL | [ICIP2025 Spotlight] Efficient and High-Fidelity Image Generation | | Experimental |
| 28 | EPFL-VILAB/search-over-tokens | SoT is a framework for test-time search in autoregressive (AR) image... | | Experimental |
| 29 | roeiherz/CanonicalSg2Im | Code for "Learning Canonical Representations for Scene Graph to Image... | | Experimental |
| 30 | bcmi/F2GAN-Few-Shot-Image-Generation | Fusing-and-Filling GAN (F2GAN) for few-shot image generation, ACM MM2020 | | Experimental |
| 31 | yongchoooon/stellar | [AAAI'26 Workshops Oral] STELLAR: Scene Text Editor for Low-Resource... | | Experimental |
| 32 | TIGER-AI-Lab/VIEScore | Visual Instruction-guided Explainable Metric. Code for "Towards Explainable... | | Experimental |
| 33 | ali-vilab/IDEA-Bench | Official repository of IDEA-Bench | | Experimental |
| 34 | yunqing-me/A-Closer-Look-at-FSIG | The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2022 | | Experimental |
| 35 | hysts/CogView2_demo | Unofficial demo app for CogView2 | | Experimental |
| 36 | 1jsingh/Divide-Evaluate-and-Refine | Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating... | | Experimental |
| 37 | matsuolab/multibanana | [CVPR 2026 Main] MultiBanana: A Challenging Benchmark for Multi-Reference... | | Experimental |
| 38 | zeyofu/Commonsense-T2I | Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models... | | Experimental |
| 39 | wzhlearning/Tex2Sem | Official Implementation of “Tex2Sem: Learning from Textures to Semantics... | | Experimental |
| 40 | bowen-upenn/ControlText | ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering... | | Experimental |
| 41 | FtmsdtHosseini/IDPL-PFOD | An Image Dataset of Printed Farsi Text for OCR Research | | Experimental |
| 42 | AIGCResearch/styleme3d | Official repo for StyleMe3D | | Experimental |
| 43 | yczhou001/LongBench-T2I | Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex... | | Experimental |
| 44 | 360CVGroup/HiCo_T2I | Layout Conditioned Image Generation, NeurIPS2024 | | Experimental |
| 45 | hadi-hosseini/T2I-FineEval | [ECCV 2024 Workshop EVAL-FoMo] T2I-FineEval: Fine-Grained Compositional... | | Experimental |
| 46 | j-min/VPGen | Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) | | Experimental |
| 47 | K1nght/T2I-ConBench | T2I-ConBench: Text-to-Image Benchmark for Continual Post-training | | Experimental |
| 48 | pmh9960/GCDP | Official PyTorch implementation of "Learning to Generate Semantic Layouts... | | Experimental |
| 49 | AIGCResearch/Awesome-Story-Visualization | A Survey of Story Visualization | | Experimental |
| 50 | HaoyuanYang-2023/ImagineFSL | Official implementation of "ImagineFSL: Self-Supervised Pretraining Matters... | | Experimental |