G-U-N/Gen-L-Video
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
Extends existing short-video diffusion models to generate and edit videos with hundreds of frames and multiple semantic segments through temporal co-denoising, without additional training. Supports diverse conditioning modalities (pose, depth, sketch, canny, semantic segmentation) via T2I-Adapters and ControlNet, plus spatial editing through SAM and Grounding DINO integration. Offers both one-shot tuning and training-free inference pipelines compatible with Stable Diffusion-based architectures.
307 stars.
Stars
307
Forks
34
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Oct 19, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/G-U-N/Gen-L-Video"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hao-ai-lab/FastVideo
A unified inference and post-training framework for accelerated video generation.
thu-ml/TurboDiffusion
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
ModelTC/LightX2V
Light Image Video Generation Inference Framework
PKU-YuanGroup/Helios
Helios: Real Real-Time Long Video Generation Model
PKU-YuanGroup/MagicTime
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators