Multimodal Conditioned Generation Diffusion Models

Tools that generate images from non-text inputs (voice, speech, captions, parameters, specifications) or combine multiple input modalities to drive image generation. Does NOT include pure text-to-image, video generation, or image editing tools.

There are 166 multimodal conditioned generation models tracked. 3 score above 50 (established tier). The highest-rated is sakalond/StableGen at 58/100 with 699 stars. 1 of the top 10 are actively maintained.

Get all 166 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=multimodal-conditioned-generation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 sakalond/StableGen

Transform your 3D texturing workflow with the power of generative AI,...

58
Established
2 neggles/animatediff-cli

a CLI utility/library for AnimateDiff stable diffusion generation

57
Established
3 victordibia/peacasso

UI interface for experimenting with multimodal (text, image) models (stable...

53
Established
4 ai-forever/Kandinsky-2

Kandinsky 2 β€” multilingual text2image latent diffusion model

47
Emerging
5 SyntheticAutonomicMind/ALICE

Artificial Latent Image Composition Engine

40
Emerging
6 samedii/perceptor

Modular image generation library

40
Emerging
7 carefree0910/carefree-drawboard

🎨 Infinite Drawboard in Python

40
Emerging
8 Teriks/dgenerate

dgenerate is a scriptable command line tool (and library) for generating...

40
Emerging
9 carefree0910/carefree-creator

AI magics meet Infinite draw board.

39
Emerging
10 movenb3at/snap-pocket

SnapPocket β€” Capture. Transform. Save.

37
Emerging
11 bil9148/LyricDiffusion

LyricDiffusion is a versatile application that transforms song lyrics into...

37
Emerging
12 NeuralRealm/StableFusion

Transform text into images and images into new ones using AI. Our...

37
Emerging
13 LUMI2049/deepfake-detection-streamlit

πŸ” Detect deepfakes with advanced AI using EfficientNetB7 and an attention...

37
Emerging
14 mwydmuch/ZoomVideoComposer

Pyhton script for generating zoom in/out videos from a set of images

36
Emerging
15 vicgalle/stable-diffusion-aesthetic-gradients

Personalization for Stable Diffusion via Aesthetic Gradients 🎨

35
Emerging
16 gordicaleksa/stable_diffusion_playground

Playing around with stable diffusion. Generated images are reproducible...

35
Emerging
17 cloneofsimo/paint-with-words-sd

Implementation of Paint-with-words with Stable Diffusion : method from...

35
Emerging
18 minimaxir/stable-diffusion-negative-prompt

Jupyter Notebooks for experimenting with negative prompting with Stable...

35
Emerging
19 yownas/shift-attention

In stable diffusion, generate a sequence of images shifting attention in the prompt.

33
Emerging
20 open-mmlab/Live2Diff

Live2Diff: A Pipeline that processes Live video streams by a uni-directional...

33
Emerging
21 mix1009/stable-diffusion-video-maker

Video generation tool for Stable Diffusion.

33
Emerging
22 lunarring/latentblending

Create butter-smooth transitions between prompts, powered by stable diffusion

33
Emerging
23 AvishakeAdhikary/Text-To-Image-Generator

Python GUI application that generates images based on user prompts using the...

32
Emerging
24 prompthero/openjourney

A fine-tuned model based on Stable Diffusion to create images in the style...

32
Emerging
25 JSJeong-me/Generate_AI_for_Image

μƒμ„±ν˜•AIλŠ” μ–΄λ–»κ²Œ 이미지λ₯Ό λ§Œλ“€κΉŒμš”?

32
Emerging
26 Kittensx/Simple_KES

Simple KES blends single or multiple schedulers and provides a noise...

31
Emerging
27 rootLocalGhost/ArtTic-LAB

An open-source AI image generation suite. Features a beautiful UI along with...

31
Emerging
28 Nota-NetsPresso/BK-SDM

A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]

31
Emerging
29 aagdev/mlimgsynth

Image synthesis using machine learning

31
Emerging
30 Leonm99/Stable-Diffusion-Video2Video

Small script for AUTOMATIC1111/stable-diffusion-webui to run video through img2img.

31
Emerging
31 dimitreOliveira/stable-diffusion-textual-inversion-app

Custom Textual-Inversion for Stable-Diffusion models with Keras.

30
Emerging
32 sbaier1/pyttv

A tool for generating (music-)videos using generative models

30
Emerging
33 Alyxsissy/Sissy-Research-Projects

Alyxsissy x Sissy Research Institute collaborative projects.

30
Emerging
34 RandomGamingDev/MCSkinsGen

A repository for a Minecraft skin generator based off of Stable Diffusion...

30
Emerging
35 Purushothaman-natarajan/Synth-SONAR

A controllable SONAR image generation with text-to-image diffusion models,...

30
Emerging
36 lasthero3819/DeepFakeAI-GUI-2026

DeepFake Soft AI is a powerful and user-friendly desktop software designed...

29
Experimental
37 ototadana/imgflw

A demo application for image editing using LLM

29
Experimental
38 sofiadparamo/DiffusionCraft

Diffusioncraft python img2img pipeline integration using diffusers with...

28
Experimental
39 torresflo/Picture-Machine

A little Python application to generate pictures from a text prompt. Based...

28
Experimental
40 miguelCalado/prompt-to-prompt-tensorflow

TensorFlow implementation of the "Prompt-to-Prompt Image Editing with Cross...

28
Experimental
41 sshh12/planet-diffusion

Fine-tuning stable diffusion to generate planet/moon textures.

27
Experimental
42 albertotrunk/depth2video

stable diffusion V2 depth2video - animation - coherence

27
Experimental
43 Zeqiang-Lai/Anything2Image

Generate image from anything with ImageBind and Stable Diffusion

27
Experimental
44 paolorechia/openimagegenius

Stable Diffusion

26
Experimental
45 numz/StableDiffusionPygameInpaintIsometricMap

This technical demo is an open-source project that allows users to customize...

26
Experimental
46 Anashel-RPG/echoai

Echo AI - Passsive AI Image Generation

26
Experimental
47 Logeswaran123/Stable-Diffusion-Playground

An application that generates images or videos using Stable Diffusion models.

26
Experimental
48 DN6/giffusion

Create GIFs and Videos using Stable Diffusion

25
Experimental
49 h2oai/wave-image-styling-playground

A interactive playground to style and edit images, generate art and have fun.

25
Experimental
50 schmidtdominik/stablediffusion-interpolation-tools

Tools for smoothly interpolating between prompts for Stable Diffusion models

25
Experimental
51 habedi/mosaic-art-maker

Making mosaic art using a Stable Diffusion model

25
Experimental
52 Extraltodeus/noise_latent_perlinpinpin

This allows to create latent spaces filled with perlin-based noise that can...

25
Experimental
53 MonishSoundarRaj/image-generator-streamlit

IMAGINATE HUB: Text-to-Image Streamlit App Select from six cutting-edge...

25
Experimental
54 sae-llm-coconut/coconut-ai

Python library that ease the installation process of Stable Diffusion, and...

24
Experimental
55 sayakpaul/caption-upsampling

This repository implements the idea of "caption upsampling" from DALL-E 3...

24
Experimental
56 RubenGres/Seg2Sat

Using StableDiffusion and ControlNet to generate synthetic aerial images

24
Experimental
57 uninterruptedpowersupply/Stable-Diffusion-Mobile-UI

For generating images using Stable Diffusion 1.5 with as less as 2gb Vram

24
Experimental
58 chenruiRae/GalaxySD

Fine-tuning sd-1.5 for galaxy image generation by given morphological prompts.

23
Experimental
59 inferless/stable-diffusion-xl-turbo

A distilled and cost-effective variant of SDXL that delivers high-quality...

23
Experimental
60 matt-dreyer/ai_art_generator

This script generates AI-based artwork based on a text prompt. It uses...

22
Experimental
61 sumitsahoo/img-to-video-svd

img-to-video-svd

22
Experimental
62 mtkaya/stable-diffusion-text2img

Stable Diffusion Text-to-Image & Image-to-Image Generator with Gradio

22
Experimental
63 palakv07/Text_to_video

A text prompt to video generation

22
Experimental
64 FxThomas08/stable-diffusion

Generate high-quality images from text prompts using an efficient latent...

22
Experimental
65 CodelarXgusaw/score

πŸ§ͺ Enhance scientific writing with AI-generated expert-level empirical...

22
Experimental
66 hrvojet/bela-dataset-gen

Hungarian (Belote) card dataset generator

22
Experimental
67 inferless/animagine-xl-3.0

High-quality image generation from text prompts, with improved hand anatomy...

22
Experimental
68 abgache/Genmoji

Reproduction of Apple Intelligence Genmoji for apple style emoji generation...

22
Experimental
69 pica-labs/picachain

⚑️ Build quick LLM pipelines for AI applications

22
Experimental
70 YouvenZ/Imagegen_ink

Inkscape extension to create image with generative model within inkscape.

21
Experimental
71 mmehmetisik/ai-text-to-image-generator

AI-powered image generation tool using Hugging Face API and Stable...

21
Experimental
72 melchisedech333/laser-scanning-microscopy

πŸ§ͺ Reproducing the concept of Confocal Laser Scanning Microscope. Using...

21
Experimental
73 nihaljn/multimodal-prompting

Enabling the use of multiple modalities while prompting Stable Diffusion

21
Experimental
74 thinh-vu/ai_artist

Image generator using Stable Diffusion AI model

21
Experimental
75 inferless/animagine-xl-3.1

Generates high-quality anime images with improved hand anatomy and new...

21
Experimental
76 akx/lcm_test

Quick and dirty Streamlit UI for Latent Consistency Models

21
Experimental
77 shirayu/purepale

🎨 A simple web interface of image generations

21
Experimental
78 lcysyzxdxc/AGIQA-3k-Database

[IEEE TCSVT2023] A Fine-grained Subjective Perception & Alignment Database...

21
Experimental
79 karand120497/glaze

Glaze is langchain for images. If you're building a text-to-image app with...

21
Experimental
80 Haoke98/FrameDiffusion

A frame2frame, video2video Video Editor based on the stable-diffusion

21
Experimental
81 geocine/sd-easy-mode

Easy Mode Stable Diffusion process allows you to generate images based on...

21
Experimental
82 sshh12/terrain-diffusion-app

An infinite collaborative inpainter which allows users to dynamic generate...

21
Experimental
83 dlimeng/awesome-ai-generated

η‹¬η«‹εΌ€ζΊεˆ›δ½œθ€…οΌŒη§―η΄―AIη”Ÿζˆη›Έε…³οΌŒιšη€ε­¦δΉ ζδΊ€θ΅„ζ–™οΌˆζ•™η¨‹οΌŒεŽŸη†οΌŒζ–°ι—»οΌŒδ½Ώη”¨οΌ‰

21
Experimental
84 boostcampaitech5/level3_cv_finalproject-cv-16

"μ•—, γ€Žμ΄ μ„Έκ³„γ€λ‘œλΆ€ν„°μ˜ μ†λ‹˜μ΄ λ‚΄κ²Œ μ°Ύμ•„μ™”λ‹€!?"λŠ” μ‚¬μ§„μ—μ„œ 선택 인물만 μΆ”μΆœν•΄μ„œ μ›ν•˜λŠ” ν™”ν’μœΌλ‘œ μ• λ‹ˆλ©”μ΄μ…˜ν™” ν•˜λŠ” ν”„λ‘œμ νŠΈμž…λ‹ˆλ‹€.

20
Experimental
85 360CVGroup/Bridge_Diffusion_Model

Chinese-native image generation while compatible with SD eco-system,...

20
Experimental
86 LyFl0w/TextureMaker

TextureMaker is an innovative Minecraft tool that utilizes AI (stable...

20
Experimental
87 akleine/sdcpp-on-android

Create images from text using sd.cpp on Android

20
Experimental
88 inferless/playground-v2.5

Generate highly aesthetic 1024x1024 images with superior quality, flexible...

20
Experimental
89 yangheng95/SuperResolutionAnimeDiffusion

Super Resolution Anime Diffusion, waifu2x

20
Experimental
90 SJTU-TES/awesome-sjtu-tes

SJTU Technology Engage Square

19
Experimental
91 inferless/sdxl-lightning

A lightning-fast text-to-image generation model that generate high-quality...

19
Experimental
92 palaashatri/jforge

Text-to-image powered by Stable Diffusion. Image upscaling with ESRGAN....

19
Experimental
93 shinn-bit/image-maker

AIη”»εƒη”Ÿζˆγ‚’γƒ—γƒͺ(Colabη‰ˆοΌ‰- Stable Diffusion + LoRA + ControlNet...

19
Experimental
94 hyeonsangjeon/AIsketcher

Text-to-image generation using Huggingface stable diffusion ControlNet...

19
Experimental
95 YUKII2K3/Image-generator

A powerful AI image generator powered by Stable Diffusion, Gradio, and...

19
Experimental
96 DonMischo/Billions-of-Wildcards-for-Stable-Diffusion

Billions of Wildcards for Stable Diffusion and maybe other generative AI

19
Experimental
97 samuelvinay91/image-generation

Cloud-native Image Generation Service with diffusion models, VAE, and...

19
Experimental
98 Cominclip/Long_Video_Generation

A pipeline to generate long videos according to text prompt

18
Experimental
99 andreafailla/Diff2GIF-Animated-Diffusion-Models

Create your own animated network visualization by exploiting a diffusion model!

18
Experimental
100 RhythrosaLabs/loom

Loom is a Streamlit-based application designed to generate and process...

18
Experimental
101 s-du/FocusPocusAI

Realtime diffusion (LCM-LoRA) from screen capture or webcam, for...

17
Experimental
102 sidhantls/minimal-casteer

Minimal and extensible implementation of CASteer

17
Experimental
103 yeonsumia/momentum-textual-inversion

[NCSOFT 23' winter internship] Enhance personalizing stable diffusion for...

16
Experimental
104 cesarolvr/gif-generator-ai

A command-line tool to create GIFs from AI-generated images. Works only in...

16
Experimental
105 shadowlamer/diffusezx

DiffuseZX is a demo project showcasing the capabilities of pretrained Stable...

15
Experimental
106 causeri3/selfusion-pi

Art installation for Fusion 2025 - automated selfies, diffuser...

15
Experimental
107 DolicaAkelloEgwel/studies-show

Not tech-art, but TeX-art ???

15
Experimental
108 ylp1455/Flask-A-Graph

"Flask-A-Graph' is a Flask app that uses Stable Diffusion Pipeline to...

15
Experimental
109 Stefan356/Stable-Diffusion-X-Dreamhold

FHOOE Semester Project, AI based visualization of interactive fiction game

15
Experimental
110 PRITHIVSAKTHIUR/Canopus-Realism

Realistic Image Generation, Realistic trigger works properly, better for...

15
Experimental
111 jerenchen/simple-diffusion-pose-gen

A simple Python/Godot example of an AI prompt-based 3D human pose generator

15
Experimental
112 FareedKhan-dev/digiart-Powerful-AI-Image-generator

DigiArt is a powerful AI image generator built on top of Stable Diffusion....

14
Experimental
113 tigrisdata-community/tigris-text-to-image

Text-To-Image Generator with Hugging Face model, Fly GPUs and Tigris as storage

14
Experimental
114 aredden/denoisemoji

command-line tool that allows you to create a pseudo diffusion denoised...

14
Experimental
115 5anjana/StyleSwap

Localized fashion image editing using text-driven latent diffusion and...

14
Experimental
116 LazyShuyaa/Tensor.art-Scaper

A simple scraper for tensor.art.. it will help u to generate images

14
Experimental
117 nhatm6586/ScreenDiffusion

🌌 Transform your screen into living art with real-time AI rendering,...

14
Experimental
118 tejank10/null-text-inversion

Unofficial Implementation of Null-text Inversion (https://arxiv.org/abs/2211.09794)

14
Experimental
119 Robin-WZQ/Text-Guide-Chinese-Landscape-Painting-Generation

Fine Tuning Stable Diffusion on Chinese Landscape Painting Generation(εŸΊδΊŽζ‰©ζ•£ζ¨‘εž‹ηš„δΈ­ε›½ε±±ζ°΄η”»η”Ÿζˆ)

14
Experimental
120 Ba5bit/echoscape

Interactive generative city simulation using webcam motion, audio input,...

14
Experimental
121 PsorTheDoctor/generative-ai

Generative AI techniques including Stable Diffusion based image generation.

14
Experimental
122 ytl0623/Stable-Diffusion-LINE-app-Vercel

Implements Stable-Diffusion text-to-image function on LINE app with Vercel platform

14
Experimental
123 nub340/DreamsOfAJaguar

Dreams of a Jaguar is a side scrolling video game made with pygame and...

13
Experimental
124 aimagelab/Alfie

Democratising RGBA Image Generation With No $$$ (AI4VA@ECCV24)

13
Experimental
125 ranjanashwin/digital-twin-generator

Advanced digital twin generator using IPAdapter FaceID with batch image...

13
Experimental
126 bigsk1/sd-ascii-image

Stable Diffusion Script to turn images into Ascii Art

13
Experimental
127 jzbor/sdiff-gtk

GTK+ front end for AI image generation using StableDiffusion

13
Experimental
128 soheil-mp/GameAssetLab

AI-powered game asset generation studio - transform prompts into...

13
Experimental
129 ihsavru/sd-text-effects

Inspired by Adobe Firefly's text effects, Stable Diffusion Text Effects...

12
Experimental
130 IlIllII/diffusion-sketch

Draw with Stable Diffusion

12
Experimental
131 Friday202/ComicGenerator

Comic generator that works using Stable Diffusion 1.5

12
Experimental
132 jan1na/Un-Stable-Diffusion

Test the text- and image-encoder CLIP against adversarial text attacks using...

12
Experimental
133 graceduansu/IdentifyingPromptedArtists

Identifying Prompted Artist Names from Generated Images: a large-scale...

12
Experimental
134 dilums/ml-bytedance-sdxl-lightning

experiments with ByteDance's SDXL-Lightning model for generating images from...

12
Experimental
135 Fardeen37/BRANDGENAI

BrandGenAI is a text-based AI Iamge/logo generator that uses Stable...

12
Experimental
136 zeittresor/pygame-mizzzout

Simple skillgame written in python using pygame with sound and AI generated...

11
Experimental
137 Razorwings18/Stable-Totem

UNMAINTAINED! Stable Totem is a user-friendly desktop application for...

11
Experimental
138 viniciusslvdor/image-automation

Simple Python Web application for OpenAI Image models inference

11
Experimental
139 epauliat/Project_4BIM

4BIM @ INSA LYON - Group project on AI used to create sketches of people...

11
Experimental
140 TheCleverIdiott/stable-diffusion-3

gradio based starter transcript for generating images from text

11
Experimental
141 dizys/nyu-cv-final-project

NYU CV Final Project: build a AI-generated v.s. real-world-captured image classifier

11
Experimental
142 dieBaerigenBerchtesgadener/PlOtter

Ein selbstmalender Bilderrahmen

11
Experimental
143 adithya-s-k/diffusechain

🎨Streamlining the creation of consistent diffusion images with AI-powered...

11
Experimental
144 AstraBert/awesome-tiny-sd

Tiny stable diffusion chatbot based on segmind/small-sd to generate images...

11
Experimental
145 RolandoAndrade/stable-diffusion-instagram-publisher

Small script used to generate images from a prompt, define the caption and...

11
Experimental
146 PoyBoi/AynAssg

Generate Images, Upscale Images, Fix Faces and Replace background using...

11
Experimental
147 ixtal23/neuroimage

This is an image generation tool that implements the generating of images by...

11
Experimental
148 brucethemoose/StableDiffusion-Video2Video

Experiments using modified Diffusers img2img and Vapoursynth for temporally...

11
Experimental
149 MohamedAliRashad/Imaginate

Imaginate is a gradio app for generating images from initial images and prompts

11
Experimental
150 Sher-Kal/hf-space-poetry-art

Portfolio demo: Hugging Face Space with text + image generation (poetry & painting)

11
Experimental
151 kianelbo/magic-rugs

A dataset of oriental rugs and a diffusion image generator

11
Experimental
152 R4F405/python-stable-diffusion-generator

An image generator implemented in Python using the powerful Stable Diffusion...

11
Experimental
153 Pixel4bit/pxd-image-generator

Streamlit-based application that allows you to generate AI images from text...

11
Experimental
154 mohammadzainabbas/Deep-Learning-CS

🎨 Monet like Painting πŸ‘©πŸ»β€πŸŽ¨ with stable diffusion model πŸ‘¨πŸ»β€πŸ’»

11
Experimental
155 zzbuzzard/sd-variant-anim

Create animations where each frame is an AI-generated variation of the one before.

11
Experimental
156 M26I/image-generation-system

A fast, lightweight command-line tool for text-to-image generation using...

11
Experimental
157 ty70/stable-diffusion-image-generator

A simple and flexible Python script for generating images using Stable...

11
Experimental
158 GagDrag/ThreadShift

ThreadShift leverages advanced segmentation, image editing, and generation...

10
Experimental
159 DebarghaSanyal/Music_And_Video_Generator

This project uses Stable Diffusion to generate AI-driven images & musics...

10
Experimental
160 dhanushreddy291/lcm-sdxl-cog

Generate high-quality images faster with Latent Consistency Models (LCM), a...

10
Experimental
161 ncdisrup-ai/Captioning4Generation

Caption image and with that base generate new images (with stable diffusion...

10
Experimental
162 aahmed-se/generate_image

Stable Diffusion Image Generator

10
Experimental
163 gopalchand/rotopy

Combine PNG or JPEG files in a folder into a movie file. Useful for...

10
Experimental
164 AtharvaTaras/Stable-Diffusion-Library

A library of images generated using Stable Diffusion

10
Experimental
165 inferless/stable-diffusion-3-5-large-turbo

A fast, optimized diffusion model that generates high-quality images from...

10
Experimental
166 player29879/Kandinsky

Kandinsky 2 β€” multilingual text2image latent diffusion model

10
Experimental