Text-to-Image Generation Diffusion Models
Tools and implementations for generating images from text prompts using diffusion models, GANs, or CLIP-guided approaches. Does NOT include image editing tools, inpainting, video generation, or evaluation benchmarks.
There are 43 text-to-image generation models tracked; 2 score above 50 (the established tier). The highest-rated is NVlabs/Sana at 60/100 with 5,000 stars. 1 of the top 10 is actively maintained.
Get all 43 projects as JSON:

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=text-to-image-generation&limit=43"

The API is open to everyone at 100 requests/day with no key; a free key raises the limit to 1,000/day.
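The same query can be scripted. A minimal Python sketch follows; the response field names (`name`, `tier`, `score`) are assumptions about the payload shape, not documented API behavior:

```python
from urllib.parse import urlencode

BASE = "https://pt-edge.onrender.com/api/v1/datasets/quality"

def quality_url(domain: str, subcategory: str, limit: int = 20) -> str:
    """Build the dataset query URL from the parameters shown in the curl example."""
    params = {"domain": domain, "subcategory": subcategory, "limit": limit}
    return f"{BASE}?{urlencode(params)}"

def by_tier(projects: list[dict], tier: str) -> list[dict]:
    """Filter a decoded payload by tier; the 'tier' field name is an assumption."""
    return [p for p in projects if p.get("tier") == tier]

# Hypothetical sample payload illustrating the assumed response shape:
sample = [
    {"name": "NVlabs/Sana", "score": 60, "tier": "Established"},
    {"name": "nerdyrodent/VQGAN-CLIP", "tier": "Emerging"},
]

print(quality_url("diffusion", "text-to-image-generation", limit=43))
print([p["name"] for p in by_tier(sample, "Established")])
```

In a real script you would fetch `quality_url(...)` with `urllib.request.urlopen` or `requests.get` and decode the JSON body before filtering.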
| # | Model | Description | Tier |
|---|---|---|---|
| 1 | NVlabs/Sana | SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer | Established |
| 2 | FoundationVision/VAR | [NeurIPS 2024 Best Paper Award] [GPT beats diffusion🔥] [scaling laws in... | Established |
| 3 | nerdyrodent/VQGAN-CLIP | Just playing with getting VQGAN+CLIP running locally, rather than having to... | Emerging |
| 4 | huggingface/finetrainers | Scalable and memory-optimized training of diffusion models | Emerging |
| 5 | AssemblyAI-Community/MinImagen | MinImagen: A minimal implementation of the Imagen text-to-image model | Emerging |
| 6 | eps696/aphantasia | CLIP + FFT/DWT/RGB = text to image/video | Emerging |
| 7 | AlonzoLeeeooo/awesome-text-to-image-studies | A collection of awesome text-to-image generation studies. | Emerging |
| 8 | kyegomez/LUMIERE | Implementation of the text-to-video model LUMIERE from the paper: "A... | Emerging |
| 9 | parlance-zz/dualdiffusion | Dual Diffusion is a generative diffusion model for music trained on video... | Emerging |
| 10 | nerdyrodent/CLIP-Guided-Diffusion | Just playing with getting CLIP Guided Diffusion running locally, rather than... | Emerging |
| 11 | AIDC-AI/Ovis-Image | Ovis-Image is a 7B text-to-image model specifically optimized for... | Emerging |
| 12 | WZDTHU/NiT | [NeurIPS 2025] Native-resolution diffusion Transformer | Emerging |
| 13 | kamalkraj/stable-diffusion-tritonserver | Deploy a Stable Diffusion model with ONNX/TensorRT + Triton Server | Emerging |
| 14 | songweige/rich-text-to-image | Rich-Text-to-Image Generation | Emerging |
| 15 | mehdidc/feed_forward_vqgan_clip | Feed-forward VQGAN-CLIP model, where the goal is to eliminate the need for... | Emerging |
| 16 | woctezuma/stable-diffusion-safety-checker | Python package to apply the Safety Checker from Stable Diffusion. | Emerging |
| 17 | rockerBOO/sd-ext | Scripts and extensions for Stable Diffusion | Emerging |
| 18 | slowy07/luna | Text-to-image generation with Stable Diffusion | Emerging |
| 19 | OutofAi/StableFace | Build your own face app with Stable Diffusion 2.1 | Emerging |
| 20 | huggingface/instruction-tuned-sd | Code for instruction-tuning Stable Diffusion. | Emerging |
| 21 | HFAiLab/clip-gen | CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP | Emerging |
| 22 | DiT-3D/DiT-3D | 🔥 Official codebase of "DiT-3D: Exploring Plain Diffusion Transformers for... | Emerging |
| 23 | amaralibey/nanoCLIP | A lightweight text-to-image retrieval model [Web App] | Emerging |
| 24 | py-img-gen/python-image-generation | 🎨 Repository containing the code for the book "Image Generation with Python" (Pythonで学ぶ画像生成) | Emerging |
| 25 | gmongaras/Stable-Diffusion-3-From-Scratch | A repo that attempts to train Stable Diffusion 3 from scratch | Experimental |
| 26 | saharmor/anima | Turn text into video using Stable Diffusion and Google FILM | Experimental |
| 27 | Qiyuan-Ge/PaintMind | Fast and controllable text-to-image model. | Experimental |
| 28 | hila-chefer/TargetCLIP | [ECCV 2022] Official PyTorch implementation of the paper Image-Based... | Experimental |
| 29 | nahyeonkaty/textboost | TextBoost: Towards One-Shot Personalization of Text-to-Image Models via... | Experimental |
| 30 | ShivamDuggal4/UNITE-tokenization-generation | Single-stage end-to-end training for tokenization and generation | Experimental |
| 31 | ji-code25/Point-Transformer-Diffusion | Point Transformer Diffusion is a novel generative model for 3D point cloud... | Experimental |
| 32 | ouhenio/StyleGAN3-CLIP-notebooks | A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and... | Experimental |
| 33 | nhtlongcs/live-novel | Self-hosted application that generates illustrations for a novel by highlighting... | Experimental |
| 34 | defgsus/clipig | OpenAI CLIP-based image generator with complex config-file-controlled... | Experimental |
| 35 | SaiBalaji-PSS/Stable-Diffusion-Catalyst | A macOS Catalyst app which uses Apple's CoreML Stable Diffusion package to... | Experimental |
| 36 | contrebande-labs/charred | CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for... | Experimental |
| 37 | jdh-algo/JoyType | JoyType: A Robust Design for Multilingual Visual Text Creation | Experimental |
| 38 | monk1337/OpenAI-CLIP-Image-search | OpenAI's CLIP neural network | Experimental |
| 39 | EngineeringAI-LAB/MIS-DiT-AST | Training-free sketch-to-scene generation. | Experimental |
| 40 | tripletclip/TripletCLIP | [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional... | Experimental |
| 41 | johnsutor/emoji-painter | Paint with emojis. | Experimental |
| 42 | TrieuPhi/Image-Caption | A project that collects image-captioning models, using the... | Experimental |
| 43 | linsun449/cliper.code | Official PyTorch implementation of the paper "CLIPer:... | Experimental |