AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
Organizes research papers on diffusion-based text-to-image generation across multiple research directions—unconditional generation, personalization, and text-guided editing—with papers indexed by venue and year. Beyond core T2I methods, it catalogs emerging intersections like Diffusion Transformers, LLM integration, and federated learning applications. The repository includes curated datasets, toolkits, and links to production systems, serving as a comprehensive reference for tracking the evolution of generative model architectures from 2020 onwards.
750 stars.
Stars
750
Forks
40
Language
TeX
License
MIT
Category
Last pushed
Dec 25, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/AlonzoLeeeooo/awesome-text-to-image-studies"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
eps696/aphantasia
CLIP + FFT/DWT/RGB = text to image/video