menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
Decomposes video into spatial components (character, motion, scene) to enable independent control over each attribute during synthesis. Built on Stable Diffusion with motion modules and pose guidance, it supports animating static character images with 3D motion data or driving videos while generalizing to novel characters and scenes. Provides both local PyTorch inference and a Gradio web interface, with pre-trained weights available via HuggingFace and ModelScope.
1,575 stars. No commits in the last 6 months.
Stars
1,575
Forks
70
Language
Python
License
Apache-2.0
Category
Diffusion
Last pushed
Jun 19, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/menyifang/MIMO"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hao-ai-lab/FastVideo
A unified inference and post-training framework for accelerated video generation.
thu-ml/TurboDiffusion
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
ModelTC/LightX2V
Light Image Video Generation Inference Framework
PKU-YuanGroup/Helios
Helios: Real-Time Long Video Generation Model
PKU-YuanGroup/MagicTime
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators