j-min/DallEval
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)
An evaluation framework for text-to-image generation models covering four dimensions: visual reasoning skills (DETR-based object detection and spatial understanding), social biases (gender and skin tone) in generated images, image quality (FID score), and image-text alignment (CLIP retrieval and VL-T5 captioning). The repository also includes inference code for several text-to-image models (DALL-E variants, minDALL-E, Stable Diffusion), so the same benchmarks can be compared across models.
143 stars. No commits in the last 6 months.
Stars: 143
Forks: 6
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Jun 10, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/j-min/DallEval"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
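The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, both of which are inferred from the single URL above rather than from any documented schema. The names `quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one example URL on this page.
    """
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response.

    Assumption: the body is JSON; the response fields are not documented
    here, so the decoded object is returned without further parsing.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("diffusion", "j-min", "DallEval")` would issue the same GET request as the curl command. Unauthenticated use counts against the 100 requests/day limit mentioned above.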
Higher-rated alternatives
ai-forever/ru-dalle
Generate images from texts. In Russian
KR-HappyFace/KoDALLE
🇰🇷 Text to Image in Korean
kevinhermawan/Canvas
DALL·E playground for the Mac
simonsanvil/DALL-E-Explained
Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other...
alxschwrz/dalle2_python
DALLE2 in the command line.