open-mmlab/PowerPaint

[ECCV 2024] PowerPaint, a high-quality, versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting, and shape-guided object inpainting with only a single model.

49 / 100 (Emerging)

Built on a diffusion architecture, PowerPaint uses task-specific prompt embeddings (e.g., "P_obj" for insertion, "P_ctxt" for removal), injected through cross-attention layers, to unify four distinct inpainting tasks in a single model. The v2 iteration adds BrushNet-style conditioning while keeping cross-attention as the path for task prompt injection, and supports ControlNet modules (Canny, Depth, Pose) for additional spatial control during generation. Model weights are distributed via Hugging Face and OpenXLab, the inference code is Gradio-based, and the project integrates into the MMagic ecosystem.
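
The task-prompt idea described above can be illustrated with a minimal Python sketch. It is not PowerPaint's actual code: the function name `build_task_prompt`, the task keys, the `P_shape` token, and the choice of `P_ctxt` for outpainting are assumptions made for illustration. Only "P_obj" for insertion and "P_ctxt" for removal come from the description.

```python
# Illustrative sketch only; not taken from the PowerPaint repository.

# Task tokens named in the description above.
P_OBJ = "P_obj"    # text-guided object insertion
P_CTXT = "P_ctxt"  # object removal

# Assumption: token name not given in the description.
P_SHAPE = "P_shape"

# Hypothetical task keys mapped to the task token that would be
# injected through cross-attention alongside the user's text.
TASK_TOKENS = {
    "text-guided": P_OBJ,
    "object-removal": P_CTXT,
    "image-outpainting": P_CTXT,  # assumption: reuses the context token
    "shape-guided": P_SHAPE,      # assumption
}


def build_task_prompt(task: str, user_prompt: str = "") -> str:
    """Append the task token for `task` to the user's text prompt."""
    try:
        token = TASK_TOKENS[task]
    except KeyError:
        raise ValueError(f"unknown task: {task!r}") from None
    # strip() handles the case where the user prompt is empty.
    return f"{user_prompt} {token}".strip()


if __name__ == "__main__":
    print(build_task_prompt("text-guided", "a red vase"))
    print(build_task_prompt("object-removal"))
```

In a real pipeline the token would correspond to a learned embedding rather than a literal string; the sketch only shows how one model can be switched between tasks by changing the prompt.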

No Package, No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 17 / 25

Stars: 1,059
Forks: 76
Language: Python
License: MIT
Last pushed: Dec 20, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/open-mmlab/PowerPaint"

Open to everyone: 100 requests/day with no key, or 1,000 requests/day with a free key.
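
The same request can be made from Python with only the standard library. This is a hedged sketch: the helper names `quality_url` and `fetch_quality` are hypothetical, the path pattern `<category>/<owner>/<repo>` is generalized from the single URL shown above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (pattern inferred from one example)."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the response body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Performs a network call; counts against the daily request limit.
    print(fetch_quality("diffusion", "open-mmlab", "PowerPaint"))
```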