TIGER-AI-Lab/AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
Leverages image-to-video (I2V) diffusion models to reduce video editing to single-frame image editing, enabling diverse editing tasks (stylization, object manipulation, semantic changes) through plug-and-play integration with any image editing method. Uses latent-space DDIM inversion and plug-and-play (PnP) feature injection to propagate first-frame edits temporally while maintaining appearance and motion consistency across frames. Supports multiple I2V backbones (i2vgen-xl, ConsistI2V, SEINE) with a modular architecture compatible with InstantStyle, InstructPix2Pix, and other image editors.
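The pipeline described above has two parts: edit the first frame with any image editor, then let the I2V backbone regenerate the whole clip from that edited frame. The sketch below is a minimal illustration of that flow, not code from the repo: the first-frame edit uses the real diffusers InstructPix2Pix pipeline named in the description, the file paths and prompt are placeholders, and the propagation step is summarized in comments rather than called, since the repo exposes it through its own scripts.

import torch
from PIL import Image
from diffusers import StableDiffusionInstructPix2PixPipeline

# Step 1: edit only the first frame of the source video with an off-the-shelf
# image editor (here InstructPix2Pix via diffusers).
editor = StableDiffusionInstructPix2PixPipeline.from_pretrained(
    "timbrooks/instruct-pix2pix", torch_dtype=torch.float16
).to("cuda")

first_frame = Image.open("frame_000.png")  # placeholder path to the source video's first frame
edited_frame = editor(
    "turn the car into a sports car",      # example edit instruction
    image=first_frame,
    num_inference_steps=20,
    image_guidance_scale=1.5,
).images[0]
edited_frame.save("edited_first_frame.png")

# Step 2 (handled by AnyV2V's scripts, summarized here): the source video is
# DDIM-inverted with the chosen I2V backbone (i2vgen-xl, ConsistI2V, or SEINE),
# then re-sampled conditioned on the edited first frame while the inverted
# spatial and temporal attention features are injected (PnP guidance), so the
# edit propagates across frames with consistent appearance and motion.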
649 stars. No commits in the last 6 months.
Stars: 649
Forks: 49
Language: Jupyter Notebook
License: MIT
Category: Generative AI
Last pushed: Oct 29, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/TIGER-AI-Lab/AnyV2V"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
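The same call from Python, as a minimal sketch using the standard requests library (the endpoint is the one in the curl line above; the header used to pass an API key is not shown in the listing, so it is omitted here):

import requests

# Anonymous access is rate-limited to 100 requests/day per the listing.
URL = "https://pt-edge.onrender.com/api/v1/quality/generative-ai/TIGER-AI-Lab/AnyV2V"

resp = requests.get(URL, timeout=30)
resp.raise_for_status()   # surface 4xx/5xx errors
print(resp.json())        # response schema is defined by the service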
Higher-rated alternatives
open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄:...
jdh-algo/JoyVASA
Diffusion-based Portrait and Animal Animation
haidog-yaqub/EzAudio
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
CMLab-Korea/Awesome-Video-Frame-Interpolation
[IEEE TCSVT'26] 🂡 AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation
linzhiqiu/t2v_metrics
Evaluating text-to-image/video/3D models with VQAScore