Multimodal Conditioned Generation Diffusion Models
Tools that generate images from non-text inputs (voice, speech, captions, parameters, specifications) or combine multiple input modalities to drive image generation. Does NOT include pure text-to-image, video generation, or image editing tools.
This category tracks 166 multimodal conditioned generation models. Three score above 50 (the Established tier). The highest-rated is sakalond/StableGen at 58/100 with 699 stars. Only 1 of the top 10 is actively maintained.
Get all 166 projects as JSON (the example request below sets limit=20)
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=multimodal-conditioned-generation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
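The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the function names `build_url` and `fetch_projects` are hypothetical, the query parameters are copied from the curl command above, and the shape of the returned JSON is not documented here, so the decoded payload is returned as-is.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="diffusion",
              subcategory="multimodal-conditioned-generation",
              limit=20):
    """Build the query URL with the same parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Send an unauthenticated GET request and decode the JSON body.

    Assumption: the endpoint answers with UTF-8 encoded JSON. The payload
    structure is unknown, so it is returned without further processing.
    """
    request = Request(build_url(limit=limit), headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects()` performs one network request, which counts against the 100 requests/day limit for keyless access, so it is worth caching the result locally when experimenting.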
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | sakalond/StableGen: Transform your 3D texturing workflow with the power of generative AI,... | | Established |
| 2 | neggles/animatediff-cli: a CLI utility/library for AnimateDiff stable diffusion generation | | Established |
| 3 | victordibia/peacasso: UI interface for experimenting with multimodal (text, image) models (stable... | | Established |
| 4 | ai-forever/Kandinsky-2: Kandinsky 2 - multilingual text2image latent diffusion model | | Emerging |
| 5 | SyntheticAutonomicMind/ALICE: Artificial Latent Image Composition Engine | | Emerging |
| 6 | samedii/perceptor: Modular image generation library | | Emerging |
| 7 | carefree0910/carefree-drawboard: Infinite Drawboard in Python | | Emerging |
| 8 | Teriks/dgenerate: dgenerate is a scriptable command line tool (and library) for generating... | | Emerging |
| 9 | carefree0910/carefree-creator: AI magics meet Infinite draw board. | | Emerging |
| 10 | movenb3at/snap-pocket: SnapPocket - Capture. Transform. Save. | | Emerging |
| 11 | bil9148/LyricDiffusion: LyricDiffusion is a versatile application that transforms song lyrics into... | | Emerging |
| 12 | NeuralRealm/StableFusion: Transform text into images and images into new ones using AI. Our... | | Emerging |
| 13 | LUMI2049/deepfake-detection-streamlit: Detect deepfakes with advanced AI using EfficientNetB7 and an attention... | | Emerging |
| 14 | mwydmuch/ZoomVideoComposer: Python script for generating zoom in/out videos from a set of images | | Emerging |
| 15 | vicgalle/stable-diffusion-aesthetic-gradients: Personalization for Stable Diffusion via Aesthetic Gradients | | Emerging |
| 16 | gordicaleksa/stable_diffusion_playground: Playing around with stable diffusion. Generated images are reproducible... | | Emerging |
| 17 | cloneofsimo/paint-with-words-sd: Implementation of Paint-with-words with Stable Diffusion: method from... | | Emerging |
| 18 | minimaxir/stable-diffusion-negative-prompt: Jupyter Notebooks for experimenting with negative prompting with Stable... | | Emerging |
| 19 | yownas/shift-attention: In stable diffusion, generate a sequence of images shifting attention in the prompt. | | Emerging |
| 20 | open-mmlab/Live2Diff: Live2Diff: A Pipeline that processes Live video streams by a uni-directional... | | Emerging |
| 21 | mix1009/stable-diffusion-video-maker: Video generation tool for Stable Diffusion. | | Emerging |
| 22 | lunarring/latentblending: Create butter-smooth transitions between prompts, powered by stable diffusion | | Emerging |
| 23 | AvishakeAdhikary/Text-To-Image-Generator: Python GUI application that generates images based on user prompts using the... | | Emerging |
| 24 | prompthero/openjourney: A fine-tuned model based on Stable Diffusion to create images in the style... | | Emerging |
| 25 | JSJeong-me/Generate_AI_for_Image: How does generative AI create images? | | Emerging |
| 26 | Kittensx/Simple_KES: Simple KES blends single or multiple schedulers and provides a noise... | | Emerging |
| 27 | rootLocalGhost/ArtTic-LAB: An open-source AI image generation suite. Features a beautiful UI along with... | | Emerging |
| 28 | Nota-NetsPresso/BK-SDM: A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24] | | Emerging |
| 29 | aagdev/mlimgsynth: Image synthesis using machine learning | | Emerging |
| 30 | Leonm99/Stable-Diffusion-Video2Video: Small script for AUTOMATIC1111/stable-diffusion-webui to run video through img2img. | | Emerging |
| 31 | dimitreOliveira/stable-diffusion-textual-inversion-app: Custom Textual-Inversion for Stable-Diffusion models with Keras. | | Emerging |
| 32 | sbaier1/pyttv: A tool for generating (music-)videos using generative models | | Emerging |
| 33 | Alyxsissy/Sissy-Research-Projects: Alyxsissy x Sissy Research Institute collaborative projects. | | Emerging |
| 34 | RandomGamingDev/MCSkinsGen: A repository for a Minecraft skin generator based off of Stable Diffusion... | | Emerging |
| 35 | Purushothaman-natarajan/Synth-SONAR: A controllable SONAR image generation with text-to-image diffusion models,... | | Emerging |
| 36 | lasthero3819/DeepFakeAI-GUI-2026: DeepFake Soft AI is a powerful and user-friendly desktop software designed... | | Experimental |
| 37 | ototadana/imgflw: A demo application for image editing using LLM | | Experimental |
| 38 | sofiadparamo/DiffusionCraft: Diffusioncraft python img2img pipeline integration using diffusers with... | | Experimental |
| 39 | torresflo/Picture-Machine: A little Python application to generate pictures from a text prompt. Based... | | Experimental |
| 40 | miguelCalado/prompt-to-prompt-tensorflow: TensorFlow implementation of the "Prompt-to-Prompt Image Editing with Cross... | | Experimental |
| 41 | sshh12/planet-diffusion: Fine-tuning stable diffusion to generate planet/moon textures. | | Experimental |
| 42 | albertotrunk/depth2video: stable diffusion V2 depth2video - animation - coherence | | Experimental |
| 43 | Zeqiang-Lai/Anything2Image: Generate image from anything with ImageBind and Stable Diffusion | | Experimental |
| 44 | paolorechia/openimagegenius: Stable Diffusion | | Experimental |
| 45 | numz/StableDiffusionPygameInpaintIsometricMap: This technical demo is an open-source project that allows users to customize... | | Experimental |
| 46 | Anashel-RPG/echoai: Echo AI - Passive AI Image Generation | | Experimental |
| 47 | Logeswaran123/Stable-Diffusion-Playground: An application that generates images or videos using Stable Diffusion models. | | Experimental |
| 48 | DN6/giffusion: Create GIFs and Videos using Stable Diffusion | | Experimental |
| 49 | h2oai/wave-image-styling-playground: An interactive playground to style and edit images, generate art and have fun. | | Experimental |
| 50 | schmidtdominik/stablediffusion-interpolation-tools: Tools for smoothly interpolating between prompts for Stable Diffusion models | | Experimental |
| 51 | habedi/mosaic-art-maker: Making mosaic art using a Stable Diffusion model | | Experimental |
| 52 | Extraltodeus/noise_latent_perlinpinpin: This allows creating latent spaces filled with Perlin-based noise that can... | | Experimental |
| 53 | MonishSoundarRaj/image-generator-streamlit: IMAGINATE HUB: Text-to-Image Streamlit App Select from six cutting-edge... | | Experimental |
| 54 | sae-llm-coconut/coconut-ai: Python library that eases the installation process of Stable Diffusion, and... | | Experimental |
| 55 | sayakpaul/caption-upsampling: This repository implements the idea of "caption upsampling" from DALL-E 3... | | Experimental |
| 56 | RubenGres/Seg2Sat: Using StableDiffusion and ControlNet to generate synthetic aerial images | | Experimental |
| 57 | uninterruptedpowersupply/Stable-Diffusion-Mobile-UI: For generating images using Stable Diffusion 1.5 with as little as 2 GB VRAM | | Experimental |
| 58 | chenruiRae/GalaxySD: Fine-tuning sd-1.5 for galaxy image generation by given morphological prompts. | | Experimental |
| 59 | inferless/stable-diffusion-xl-turbo: A distilled and cost-effective variant of SDXL that delivers high-quality... | | Experimental |
| 60 | matt-dreyer/ai_art_generator: This script generates AI-based artwork based on a text prompt. It uses... | | Experimental |
| 61 | sumitsahoo/img-to-video-svd: img-to-video-svd | | Experimental |
| 62 | mtkaya/stable-diffusion-text2img: Stable Diffusion Text-to-Image & Image-to-Image Generator with Gradio | | Experimental |
| 63 | palakv07/Text_to_video: A text prompt to video generation | | Experimental |
| 64 | FxThomas08/stable-diffusion: Generate high-quality images from text prompts using an efficient latent... | | Experimental |
| 65 | CodelarXgusaw/score: Enhance scientific writing with AI-generated expert-level empirical... | | Experimental |
| 66 | hrvojet/bela-dataset-gen: Hungarian (Belote) card dataset generator | | Experimental |
| 67 | inferless/animagine-xl-3.0: High-quality image generation from text prompts, with improved hand anatomy... | | Experimental |
| 68 | abgache/Genmoji: Reproduction of Apple Intelligence Genmoji for apple style emoji generation... | | Experimental |
| 69 | pica-labs/picachain: Build quick LLM pipelines for AI applications | | Experimental |
| 70 | YouvenZ/Imagegen_ink: Inkscape extension to create image with generative model within inkscape. | | Experimental |
| 71 | mmehmetisik/ai-text-to-image-generator: AI-powered image generation tool using Hugging Face API and Stable... | | Experimental |
| 72 | melchisedech333/laser-scanning-microscopy: Reproducing the concept of Confocal Laser Scanning Microscope. Using... | | Experimental |
| 73 | nihaljn/multimodal-prompting: Enabling the use of multiple modalities while prompting Stable Diffusion | | Experimental |
| 74 | thinh-vu/ai_artist: Image generator using Stable Diffusion AI model | | Experimental |
| 75 | inferless/animagine-xl-3.1: Generates high-quality anime images with improved hand anatomy and new... | | Experimental |
| 76 | akx/lcm_test: Quick and dirty Streamlit UI for Latent Consistency Models | | Experimental |
| 77 | shirayu/purepale: A simple web interface of image generations | | Experimental |
| 78 | lcysyzxdxc/AGIQA-3k-Database: [IEEE TCSVT2023] A Fine-grained Subjective Perception & Alignment Database... | | Experimental |
| 79 | karand120497/glaze: Glaze is langchain for images. If you're building a text-to-image app with... | | Experimental |
| 80 | Haoke98/FrameDiffusion: A frame2frame, video2video Video Editor based on the stable-diffusion | | Experimental |
| 81 | geocine/sd-easy-mode: Easy Mode Stable Diffusion process allows you to generate images based on... | | Experimental |
| 82 | sshh12/terrain-diffusion-app: An infinite collaborative inpainter which allows users to dynamically generate... | | Experimental |
| 83 | dlimeng/awesome-ai-generated: Independent open-source author collecting AI-generation material, submitted while learning: tutorials, principles, news, usage | | Experimental |
| 84 | boostcampaitech5/level3_cv_finalproject-cv-16: "Oh, a visitor from 'this world' has come to me!?" is a project that extracts only the selected person from a photo and animates them in the desired style. | | Experimental |
| 85 | 360CVGroup/Bridge_Diffusion_Model: Chinese-native image generation while compatible with SD eco-system,... | | Experimental |
| 86 | LyFl0w/TextureMaker: TextureMaker is an innovative Minecraft tool that utilizes AI (stable... | | Experimental |
| 87 | akleine/sdcpp-on-android: Create images from text using sd.cpp on Android | | Experimental |
| 88 | inferless/playground-v2.5: Generate highly aesthetic 1024x1024 images with superior quality, flexible... | | Experimental |
| 89 | yangheng95/SuperResolutionAnimeDiffusion: Super Resolution Anime Diffusion, waifu2x | | Experimental |
| 90 | SJTU-TES/awesome-sjtu-tes: SJTU Technology Engage Square | | Experimental |
| 91 | inferless/sdxl-lightning: A lightning-fast text-to-image generation model that generates high-quality... | | Experimental |
| 92 | palaashatri/jforge: Text-to-image powered by Stable Diffusion. Image upscaling with ESRGAN.... | | Experimental |
| 93 | shinn-bit/image-maker: AI image generation app (Colab version) - Stable Diffusion + LoRA + ControlNet... | | Experimental |
| 94 | hyeonsangjeon/AIsketcher: Text-to-image generation using Huggingface stable diffusion ControlNet... | | Experimental |
| 95 | YUKII2K3/Image-generator: A powerful AI image generator powered by Stable Diffusion, Gradio, and... | | Experimental |
| 96 | DonMischo/Billions-of-Wildcards-for-Stable-Diffusion: Billions of Wildcards for Stable Diffusion and maybe other generative AI | | Experimental |
| 97 | samuelvinay91/image-generation: Cloud-native Image Generation Service with diffusion models, VAE, and... | | Experimental |
| 98 | Cominclip/Long_Video_Generation: A pipeline to generate long videos according to text prompt | | Experimental |
| 99 | andreafailla/Diff2GIF-Animated-Diffusion-Models: Create your own animated network visualization by exploiting a diffusion model! | | Experimental |
| 100 | RhythrosaLabs/loom: Loom is a Streamlit-based application designed to generate and process... | | Experimental |
| 101 | s-du/FocusPocusAI: Realtime diffusion (LCM-LoRA) from screen capture or webcam, for... | | Experimental |
| 102 | sidhantls/minimal-casteer: Minimal and extensible implementation of CASteer | | Experimental |
| 103 | yeonsumia/momentum-textual-inversion: [NCSOFT '23 winter internship] Enhance personalizing stable diffusion for... | | Experimental |
| 104 | cesarolvr/gif-generator-ai: A command-line tool to create GIFs from AI-generated images. Works only in... | | Experimental |
| 105 | shadowlamer/diffusezx: DiffuseZX is a demo project showcasing the capabilities of pretrained Stable... | | Experimental |
| 106 | causeri3/selfusion-pi: Art installation for Fusion 2025 - automated selfies, diffuser... | | Experimental |
| 107 | DolicaAkelloEgwel/studies-show: Not tech-art, but TeX-art ??? | | Experimental |
| 108 | ylp1455/Flask-A-Graph: "Flask-A-Graph" is a Flask app that uses Stable Diffusion Pipeline to... | | Experimental |
| 109 | Stefan356/Stable-Diffusion-X-Dreamhold: FHOOE Semester Project, AI based visualization of interactive fiction game | | Experimental |
| 110 | PRITHIVSAKTHIUR/Canopus-Realism: Realistic Image Generation, Realistic trigger works properly, better for... | | Experimental |
| 111 | jerenchen/simple-diffusion-pose-gen: A simple Python/Godot example of an AI prompt-based 3D human pose generator | | Experimental |
| 112 | FareedKhan-dev/digiart-Powerful-AI-Image-generator: DigiArt is a powerful AI image generator built on top of Stable Diffusion.... | | Experimental |
| 113 | tigrisdata-community/tigris-text-to-image: Text-To-Image Generator with Hugging Face model, Fly GPUs and Tigris as storage | | Experimental |
| 114 | aredden/denoisemoji: command-line tool that allows you to create a pseudo diffusion denoised... | | Experimental |
| 115 | 5anjana/StyleSwap: Localized fashion image editing using text-driven latent diffusion and... | | Experimental |
| 116 | LazyShuyaa/Tensor.art-Scaper: A simple scraper for tensor.art. It will help you generate images | | Experimental |
| 117 | nhatm6586/ScreenDiffusion: Transform your screen into living art with real-time AI rendering,... | | Experimental |
| 118 | tejank10/null-text-inversion: Unofficial Implementation of Null-text Inversion (https://arxiv.org/abs/2211.09794) | | Experimental |
| 119 | Robin-WZQ/Text-Guide-Chinese-Landscape-Painting-Generation: Fine Tuning Stable Diffusion on Chinese Landscape Painting Generation (Chinese landscape painting generation based on diffusion models) | | Experimental |
| 120 | Ba5bit/echoscape: Interactive generative city simulation using webcam motion, audio input,... | | Experimental |
| 121 | PsorTheDoctor/generative-ai: Generative AI techniques including Stable Diffusion based image generation. | | Experimental |
| 122 | ytl0623/Stable-Diffusion-LINE-app-Vercel: Implements Stable-Diffusion text-to-image function on LINE app with Vercel platform | | Experimental |
| 123 | nub340/DreamsOfAJaguar: Dreams of a Jaguar is a side scrolling video game made with pygame and... | | Experimental |
| 124 | aimagelab/Alfie: Democratising RGBA Image Generation With No $$$ (AI4VA@ECCV24) | | Experimental |
| 125 | ranjanashwin/digital-twin-generator: Advanced digital twin generator using IPAdapter FaceID with batch image... | | Experimental |
| 126 | bigsk1/sd-ascii-image: Stable Diffusion Script to turn images into Ascii Art | | Experimental |
| 127 | jzbor/sdiff-gtk: GTK+ front end for AI image generation using StableDiffusion | | Experimental |
| 128 | soheil-mp/GameAssetLab: AI-powered game asset generation studio - transform prompts into... | | Experimental |
| 129 | ihsavru/sd-text-effects: Inspired by Adobe Firefly's text effects, Stable Diffusion Text Effects... | | Experimental |
| 130 | IlIllII/diffusion-sketch: Draw with Stable Diffusion | | Experimental |
| 131 | Friday202/ComicGenerator: Comic generator that works using Stable Diffusion 1.5 | | Experimental |
| 132 | jan1na/Un-Stable-Diffusion: Test the text- and image-encoder CLIP against adversarial text attacks using... | | Experimental |
| 133 | graceduansu/IdentifyingPromptedArtists: Identifying Prompted Artist Names from Generated Images: a large-scale... | | Experimental |
| 134 | dilums/ml-bytedance-sdxl-lightning: experiments with ByteDance's SDXL-Lightning model for generating images from... | | Experimental |
| 135 | Fardeen37/BRANDGENAI: BrandGenAI is a text-based AI Image/logo generator that uses Stable... | | Experimental |
| 136 | zeittresor/pygame-mizzzout: Simple skillgame written in python using pygame with sound and AI generated... | | Experimental |
| 137 | Razorwings18/Stable-Totem: UNMAINTAINED! Stable Totem is a user-friendly desktop application for... | | Experimental |
| 138 | viniciusslvdor/image-automation: Simple Python Web application for OpenAI Image models inference | | Experimental |
| 139 | epauliat/Project_4BIM: 4BIM @ INSA LYON - Group project on AI used to create sketches of people... | | Experimental |
| 140 | TheCleverIdiott/stable-diffusion-3: gradio based starter transcript for generating images from text | | Experimental |
| 141 | dizys/nyu-cv-final-project: NYU CV Final Project: build an AI-generated vs. real-world-captured image classifier | | Experimental |
| 142 | dieBaerigenBerchtesgadener/PlOtter: A self-painting picture frame | | Experimental |
| 143 | adithya-s-k/diffusechain: Streamlining the creation of consistent diffusion images with AI-powered... | | Experimental |
| 144 | AstraBert/awesome-tiny-sd: Tiny stable diffusion chatbot based on segmind/small-sd to generate images... | | Experimental |
| 145 | RolandoAndrade/stable-diffusion-instagram-publisher: Small script used to generate images from a prompt, define the caption and... | | Experimental |
| 146 | PoyBoi/AynAssg: Generate Images, Upscale Images, Fix Faces and Replace background using... | | Experimental |
| 147 | ixtal23/neuroimage: This is an image generation tool that implements the generating of images by... | | Experimental |
| 148 | brucethemoose/StableDiffusion-Video2Video: Experiments using modified Diffusers img2img and Vapoursynth for temporally... | | Experimental |
| 149 | MohamedAliRashad/Imaginate: Imaginate is a gradio app for generating images from initial images and prompts | | Experimental |
| 150 | Sher-Kal/hf-space-poetry-art: Portfolio demo: Hugging Face Space with text + image generation (poetry & painting) | | Experimental |
| 151 | kianelbo/magic-rugs: A dataset of oriental rugs and a diffusion image generator | | Experimental |
| 152 | R4F405/python-stable-diffusion-generator: An image generator implemented in Python using the powerful Stable Diffusion... | | Experimental |
| 153 | Pixel4bit/pxd-image-generator: Streamlit-based application that allows you to generate AI images from text... | | Experimental |
| 154 | mohammadzainabbas/Deep-Learning-CS: Monet-like painting with stable diffusion model | | Experimental |
| 155 | zzbuzzard/sd-variant-anim: Create animations where each frame is an AI-generated variation of the one before. | | Experimental |
| 156 | M26I/image-generation-system: A fast, lightweight command-line tool for text-to-image generation using... | | Experimental |
| 157 | ty70/stable-diffusion-image-generator: A simple and flexible Python script for generating images using Stable... | | Experimental |
| 158 | GagDrag/ThreadShift: ThreadShift leverages advanced segmentation, image editing, and generation... | | Experimental |
| 159 | DebarghaSanyal/Music_And_Video_Generator: This project uses Stable Diffusion to generate AI-driven images & music... | | Experimental |
| 160 | dhanushreddy291/lcm-sdxl-cog: Generate high-quality images faster with Latent Consistency Models (LCM), a... | | Experimental |
| 161 | ncdisrup-ai/Captioning4Generation: Caption image and with that base generate new images (with stable diffusion... | | Experimental |
| 162 | aahmed-se/generate_image: Stable Diffusion Image Generator | | Experimental |
| 163 | gopalchand/rotopy: Combine PNG or JPEG files in a folder into a movie file. Useful for... | | Experimental |
| 164 | AtharvaTaras/Stable-Diffusion-Library: A library of images generated using Stable Diffusion | | Experimental |
| 165 | inferless/stable-diffusion-3-5-large-turbo: A fast, optimized diffusion model that generates high-quality images from... | | Experimental |
| 166 | player29879/Kandinsky: Kandinsky 2 - multilingual text2image latent diffusion model | | Experimental |