Text to Image Generation Transformer Models
Tools for generating, manipulating, and editing images from text prompts using diffusion models and related generative techniques. Does NOT include general image classification, detection, or non-generative image processing tasks.
There are 50 text-to-image generation models tracked. One scores above 70 (Verified tier). The highest-rated is filipstrand/mflux at 79/100, with 1,882 stars and 31,305 monthly downloads. One of the top 10 is actively maintained.
Get all 50 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=text-to-image-generation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
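For scripted access, the same request can be sketched in Python using only the standard library. This is a minimal sketch: the endpoint and the `domain`, `subcategory`, and `limit` parameters are copied from the curl command above, while the helper names `build_url` and `fetch_projects` are hypothetical. The response schema is not documented on this page, so the sketch returns the decoded JSON without assuming any fields. Whether a higher `limit` returns all 50 projects is also an assumption to verify against the API.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="text-to-image-generation",
              limit=20):
    """Build the request URL with the query parameters shown in the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and decode the JSON body (hypothetical helper; needs network access)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    url = build_url()
    print(url)
    # Uncomment to perform the request. Keyless access is limited to
    # 100 requests/day according to this page.
    # data = fetch_projects(url)
    # print(json.dumps(data, indent=2)[:500])
```

Calling `build_url()` with its defaults reproduces the URL from the curl command exactly, so the two approaches can be used interchangeably.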
| # | Model | Description | Score | Tier |
|---|---|---|---|---|
| 1 | filipstrand/mflux | MLX native implementations of state-of-the-art generative image models | 79 | Verified |
| 2 | potamides/DeTikZify | Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ. | | Emerging |
| 3 | FoundationVision/Infinity | [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for... | | Emerging |
| 4 | zai-org/CogView | Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView:... | | Emerging |
| 5 | EleutherAI/DALLE-mtf | Open-AI's DALL-E for large scale training in mesh-tensorflow. | | Emerging |
| 6 | TextGeneratorio/text-generator.io | Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io | | Emerging |
| 7 | kyegomez/Fusion3D | An extremely experimental model that intakes images and generates 3D scenes... | | Emerging |
| 8 | Alpha-VLLM/Lumina-T2X | Lumina-T2X is a unified framework for Text to Any Modality Generation | | Emerging |
| 9 | amazon-science/text_generation_diffusion_llm_topic | Topic Embedding, Text Generation and Modeling using diffusion | | Emerging |
| 10 | RishabSA/Sketch2Graphviz | Sketch2Graphviz allows you to convert sketches or images of graphs and... | | Emerging |
| 11 | ivonajdenkoska/tulip | [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP" | | Emerging |
| 12 | allenai/x-lxmert | PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer... | | Experimental |
| 13 | The-Swarm-Corporation/DART | DART (Diffusion-Autoregressive Recursive Transformer) is a novel hybrid... | | Experimental |
| 14 | aimagelab/Emuru-autoregressive-text-img | Official PyTorch implementation for "Zero-Shot Styled Text Image Generation,... | | Experimental |
| 15 | renan-siqueira/image-to-text-tool | This tool processes images and generates textual descriptions using advanced... | | Experimental |
| 16 | FredyRivera-dev/Flux2-from-scratch | This repo proposes to implement the Flux2 model from scratch | | Experimental |
| 17 | EchoSingh/GitHub_Profile_Picture | A guide code to generate your ai profile picture | | Experimental |
| 18 | affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition | Inverse DALL-E for Optical Character Recognition | | Experimental |
| 19 | robjsliwa/mlx-sd-single-file-models | Single safetensors file support Apple MLX Stable Diffusion | | Experimental |
| 20 | andyngdz/exogen_backend | ExoGen Backend | | Experimental |
| 21 | ArchitAnant/stroxapi | Text to Handwriting generation using Diffusion | | Experimental |
| 22 | ZicoDiegoRR/stable-diffusion-xl-colab-ui | An interactive Jupyter notebook leveraging IPython widgets for the UI and... | | Experimental |
| 23 | PRITHIVSAKTHIUR/Kontext-Photo-Mate-v2 | Kontext-Photo-Mate-v2 is an advanced image manipulation application built on... | | Experimental |
| 24 | avijit-jana/huggingface-nlp-image-tool | An end‑to‑end application leveraging Hugging Face pretrained models for... | | Experimental |
| 25 | inuwamobarak/stable-diffusion | Implementing a diffusion framework with Hugging Face. Stable diffusion... | | Experimental |
| 26 | Hadifard/image-colorization-project | Automatic black and white photo and video colorization using deep learning... | | Experimental |
| 27 | maocide/PlotCaption | A local, private AI tool to turn any image into rich character lore for... | | Experimental |
| 28 | alecremer/Darkroom-CV | An End-to-End, Model-Agnostic Computer Vision Framework | | Experimental |
| 29 | VachanVY/diffusion-transformer | Pytorch and JAX Implementation of Scalable Diffusion Models with... | | Experimental |
| 30 | kalpthakkar/ChromaVision-Object-aware-Image-Colorization | AI-driven object-aware image colorization system that restores grayscale... | | Experimental |
| 31 | pky1987/Verilume-True-Light-Image-Generator | Verilume is a high-fidelity image generation and editing framework built... | | Experimental |
| 32 | PRITHIVSAKTHIUR/GALLO-3XL | High Quality Image Generation Model - Powered with NVIDIA A100 | | Experimental |
| 33 | ndrohith09/11th-hour | With the use of AI, summarise your movies and bring back the colour in older films. | | Experimental |
| 34 | johnamit/sit-faf-generate-edit | A deep learning project for Fundus Autofluorescence (FAF) image generation,... | | Experimental |
| 35 | PRITHIVSAKTHIUR/Flux-Krea-multi-GPU-Pool | A Python-based multi-GPU image generation pipeline using Huggingface... | | Experimental |
| 36 | PRITHIVSAKTHIUR/Flux.1-dev-4bit | FLUX.1-dev model with 4-bit quantization, quantized model maintains image... | | Experimental |
| 37 | PRITHIVSAKTHIUR/Sub-Memory-Efficient-Merging-FluxKreaDev | black-forest-labs/FLUX.1-dev and black-forest-labs/FLUX.1-Krea-dev. This... | | Experimental |
| 38 | devs-org-in/RemovBG | Image Background Remover powered by the RMBG V1.4 model from BRIA AI... | | Experimental |
| 39 | krishnakoushik225/CLAP-Optimized-Text-to-Audio-Generation-AudioLDM- | Inference-time optimization for diffusion-based text-to-audio generation... | | Experimental |
| 40 | sdtrkl/ai-photo-editing-with-inpainting | This project is part of Generative AI Nanodegree by Udacity | | Experimental |
| 41 | dheeren-tejani/mini-sd | A lightweight, end-to-end implementation of Stable Diffusion built from... | | Experimental |
| 42 | YashBaraii/GiggleGen | AI Meme Generator | | Experimental |
| 43 | kantkrishan0206-crypto/gen-image3.0 | a powerful, large-scale, multimodal model for Text-to-Image generation. | | Experimental |
| 44 | charudatta10/ai-igen | AI image generator | | Experimental |
| 45 | edcalderin/textual-diffuser | TextualDiffuser is a text-to-image generation tool powered by Stable... | | Experimental |
| 46 | damianoimola/diffit | DiffiT: Diffusion Vision Transformers for Image Generation and DiffiP a... | | Experimental |
| 47 | jiaowoguanren0615/DiT-Pytorch | This is a warehouse for DiT-pytorch-model, can be used to generate your image dataset | | Experimental |
| 48 | Eyelor/text-to-image-item-generator | A Python workflow for generating random item images using models from Hugging Face. | | Experimental |
| 49 | SARIT42/Image-InPainting-SAM | A combination of Image segmentation, Image editing and in-place Image... | | Experimental |
| 50 | SaiLikith14/text_to_image_generation | text to image generation | | Experimental |