muzishen/IMAGHarmony
🧩 IMAGHarmony 🧩: Controllable image editing with consistent object quantity and layout. A structure-aware framework that ensures high fidelity and coherence in complex multi-object edits. It integrates harmony-aware attention and preference-guided noise selection to enable precise, stable, and semantically aligned generation.
IMAGHarmony builds on Stable Diffusion XL and IP-Adapter. A harmony-aware module jointly encodes object structure and semantic information, and preference-guided noise initialization stabilizes generation. The framework is trained and evaluated on HarmonyBench, a curated benchmark of diverse multi-object editing scenarios that supports dual-category edits and fine-grained control over object count and spatial arrangement. The repository provides training and inference code, plus a Gradio demo for interactive testing.
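As a rough illustration of what preference-guided noise selection can look like, the sketch below draws several candidate noise vectors from different seeds, scores each one, and keeps the best. This is a generic best-of-N pattern, not the repository's implementation: `sample_noise`, `select_noise`, and `toy_score` are hypothetical names, the noise is a plain list of floats rather than an SDXL latent tensor, and the scorer is a stand-in for whatever preference model the authors actually use.

```python
import random
from typing import Callable, List, Sequence, Tuple

Noise = List[float]


def sample_noise(seed: int, dim: int) -> Noise:
    """Draw a Gaussian noise vector from a seeded generator (reproducible)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


def select_noise(
    seeds: Sequence[int],
    dim: int,
    score_fn: Callable[[Noise], float],
) -> Tuple[int, Noise]:
    """Return the (seed, noise) pair whose noise gets the highest score.

    Hypothetical helper: in a real pipeline score_fn would wrap a learned
    preference model; here it is any callable that maps noise to a float.
    """
    if not seeds:
        raise ValueError("at least one candidate seed is required")
    best_seed = max(seeds, key=lambda s: score_fn(sample_noise(s, dim)))
    return best_seed, sample_noise(best_seed, dim)


def toy_score(noise: Noise) -> float:
    """Placeholder scorer (assumption): prefers noise whose mean is near zero."""
    return -abs(sum(noise) / len(noise))


# Example: pick the best of 8 candidate seeds for a 16-dimensional noise vector.
best_seed, best_noise = select_noise(range(8), 16, toy_score)
```

Because each candidate is regenerated from its seed, only the winning seed needs to be stored to reproduce the chosen initialization.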
Stars: 607
Forks: 45
Language: Python
License: Apache-2.0
Category:
Last pushed: Oct 18, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/muzishen/IMAGHarmony"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
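The same request can be made from Python with only the standard library. This is a minimal sketch: the endpoint path comes from the curl command above, while `build_url` and `fetch_quality` are hypothetical helper names, and the assumption that the response body is JSON is not confirmed by this page.

```python
import json
import urllib.request
from typing import Any

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/diffusion"


def build_url(owner: str, repo: str) -> str:
    """Build the per-repository endpoint URL."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> Any:
    """Fetch the data for one repository.

    Assumption: the endpoint returns a JSON document. The schema is not
    documented here, so the parsed value is returned as-is.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Usage (performs a network request and counts against the daily limit):
# data = fetch_quality("muzishen", "IMAGHarmony")
# print(data)
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than calling `fetch_quality` on every run.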
Higher-rated alternatives
River-Zhang/ICEdit
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image...
bytedance/InfiniteYou
🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
AMAP-ML/FE2E
[CVPR 2026] Beyond Generation: Advancing Image Editing Priors for Depth and Normal Estimation
TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting...
ermongroup/SDEdit
PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations