TencentARC/GenCompositor

[ICLR 2026] GenCompositor: Generative Video Compositing with Diffusion Transformer

46
/ 100
Emerging

Based on the README, here's a technical summary: Combines a Diffusion Transformer backbone (CogVideoX-5B) with specialized modules for video compositing: a masked-token background preservation branch maintains target video consistency, a DiT fusion block with full self-attention integrates foreground elements, and Extended Rotary Position Embedding (ERoPE) enables flexible spatial layout control. Includes SAM2-based foreground segmentation, trajectory-guided motion injection, and trains on a curated 61K-video dataset (VideoComp) with interactive user control over element placement and dynamics.

150 stars.

No Package No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 15 / 25
Community 8 / 25

How are scores calculated?

Stars

150

Forks

6

Language

Python

License

Last pushed

Mar 16, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/TencentARC/GenCompositor"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.