ali-vilab/AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

/ 100

Emerging

Built on Stable Diffusion v2.1 and ControlNet, AnyDoor leverages DINOv2 for semantic feature extraction to enable precise object-level customization without task-specific training. The system accepts reference objects and target masks, using a diffusion-based architecture to synthesize realistic placements while preserving object identity and scene context. Training uses multi-dataset supervision (COCO, UVO, LVIS) and supports inference on downstream applications like virtual try-on and face swapping through optional domain-specific fine-tuning.

4,222 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

4,222

Forks

372

Language

Python

License

MIT

Higher-rated alternatives

jolibrain/joliGEN

Generative AI Image and Video Toolset with GANs and Diffusion for Real-World Applications

zhangmozhe/Deep-Exemplar-based-Video-Colorization

The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".

naver-ai/StyleKeeper

Official Pytorch implementation of "StyleKeeper: Prevent Content Leakage using Negative Visual...

lixiaowen-xw/DiffuEraser

DiffuEraser is a diffusion model for video inpainting, which performs great content completeness...

ironjr/semantic-draw

Official code for the CVPR 2025 paper "SemanticDraw: Towards Real-Time Interactive Content...

Explore Diffusion Models

All categories Trending Diffusion directory Insights