j-min/DallEval

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

/ 100

Emerging

Provides comprehensive evaluation frameworks across four dimensions: visual reasoning skills via DETR-based object detection and spatial understanding, demographic biases (gender/skin tone) in generated outputs, image quality through FID scoring, and image-text alignment using CLIP retrieval and VL-T5 captioning. Includes inference implementations for multiple text-to-image models (DALL-E variants, minDALL-E, Stable Diffusion) enabling cross-model benchmark comparisons.

143 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 8 / 25

How are scores calculated?

Stars

143

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

ai-forever/ru-dalle

Generate images from texts. In Russian

KR-HappyFace/KoDALLE

🇰🇷 Text to Image in Korean

kevinhermawan/Canvas

DALL·E playground for the Mac

simonsanvil/DALL-E-Explained

Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other...

alxschwrz/dalle2_python

DALLE2 in the command line.

Explore Diffusion Models

All categories Trending Diffusion directory Insights