ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Built on Stable Diffusion v2.1 and ControlNet, AnyDoor leverages DINOv2 for semantic feature extraction to enable precise object-level customization without task-specific training. The system accepts reference objects and target masks, using a diffusion-based architecture to synthesize realistic placements while preserving object identity and scene context. Training uses multi-dataset supervision (COCO, UVO, LVIS) and supports inference on downstream applications like virtual try-on and face swapping through optional domain-specific fine-tuning.
4,222 stars. No commits in the last 6 months.
Stars
4,222
Forks
372
Language
Python
License
MIT
Category
Last pushed
Apr 08, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/ali-vilab/AnyDoor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jolibrain/joliGEN
Generative AI Image and Video Toolset with GANs and Diffusion for Real-World Applications
zhangmozhe/Deep-Exemplar-based-Video-Colorization
The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".
naver-ai/StyleKeeper
Official Pytorch implementation of "StyleKeeper: Prevent Content Leakage using Negative Visual...
lixiaowen-xw/DiffuEraser
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness...
ironjr/semantic-draw
Official code for the CVPR 2025 paper "SemanticDraw: Towards Real-Time Interactive Content...