AlonzoLeeeooo/awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

/ 100

Emerging

Organizes research papers on diffusion-based text-to-image generation across multiple research directions—unconditional generation, personalization, and text-guided editing—with papers indexed by venue and year. Beyond core T2I methods, it catalogs emerging intersections like Diffusion Transformers, LLM integration, and federated learning applications. The repository includes curated datasets, toolkits, and links to production systems, serving as a comprehensive reference for tracking the evolution of generative model architectures from 2020 onwards.

750 stars.

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

750

Forks

Language

TeX

License

MIT

Higher-rated alternatives

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

FoundationVision/VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...

nerdyrodent/VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

eps696/aphantasia

CLIP + FFT/DWT/RGB = text to image/video

Explore Diffusion Models

All categories Trending Diffusion directory Insights