davidelobba/TEMU-VTOFF

[ICLR 2026] "Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals"

/ 100

Emerging

Based on the README, here's the technical summary: Implements a dual-Diffusion Transformer (dual-DiT) architecture that combines pretrained feature extraction with text-enhanced generation, using a multimodal hybrid attention mechanism to integrate garment descriptions with person features for synthesizing occluded regions. A lightweight DINOv2-based garment aligner module conditions generation on target in-shop images rather than traditional denoising objectives. Supports multi-category garment handling (upper/lower/full-body) across Dress Code and VITON-HD datasets, with pre-extracted features from CLIP, OpenCLIP, and T5 encoders, and requires Stable Diffusion 3 Medium via HuggingFace.

No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 15 / 25

Community 11 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

Zheng-Chong/CatVTON

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight...

rizavelioglu/tryoffdiff

[CVPR'25-Demo] Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment...

muzishen/IMAGGarment

[TVCG 2026] 🎨 IMAGGarment🎨 : Fine-Grained Garment Generation with Controllable Structure,...

muzishen/IMAGDressing

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It...

nxnai/Voost

[SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual...

Explore Diffusion Models

All categories Trending Diffusion directory Insights