ExponentialML/Text-To-Video-Finetuning

Finetune ModelScope's Text To Video model using Diffusers 🧨

Archived

41 / 100

Emerging

Supports training full models or parameter-efficient LoRA adapters compatible with Stable Diffusion webui extensions. Implements memory optimizations, including gradient checkpointing and Torch 2.0's Scaled Dot-Product Attention, that enable training on GPUs with ≤16GB VRAM. Provides flexible data preprocessing with automatic video captioning support, and YAML-based configuration for training on custom video datasets.
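To illustrate why LoRA adapters count as parameter-efficient, here is a minimal sketch in plain Python of the standard low-rank update, y = W x + (alpha / rank) * B (A x), with the base weight W kept frozen. It is not taken from this repository: the function names, the layer size (320 x 320), and the rank (16) are assumptions chosen only for illustration.

```python
# Illustrative LoRA sketch in plain Python (no PyTorch).
# All names and sizes here are hypothetical, not the repository's code.

def matvec(matrix, vector):
    """Multiply a matrix (given as a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]

def lora_forward(W, A, B, x, alpha, rank):
    """Compute W x + (alpha / rank) * B (A x); only A and B would be trained."""
    base = matvec(W, x)                 # frozen pretrained path
    low_rank = matvec(B, matvec(A, x))  # adapter path through the rank bottleneck
    scale = alpha / rank
    return [b + scale * l for b, l in zip(base, low_rank)]

def trainable_params(d_in, d_out, rank):
    """Return (full fine-tuning count, LoRA count) for one linear layer."""
    return d_in * d_out, rank * (d_in + d_out)

# Tiny example: 2x2 frozen weight with a rank-1 adapter.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[1.0, 1.0]]    # shape (rank, d_in)
B = [[0.0], [0.0]]  # shape (d_out, rank); zero-initialized so the adapter starts as a no-op
x = [2.0, 3.0]

y = lora_forward(W, A, B, x, alpha=1.0, rank=1)

# Parameter counts for an assumed 320 x 320 projection at rank 16.
full, lora = trainable_params(d_in=320, d_out=320, rank=16)
```

Under these assumed sizes the adapter trains about a tenth of the parameters of the full layer, which is one reason adapter training tends to fit on smaller GPUs.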

697 stars. No commits in the last 6 months.

Archived, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 9 / 25

Community 22 / 25


Stars: 697

Forks: 111

Language: Python

License: MIT

Higher-rated alternatives

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

bghira/SimpleTuner

A general fine-tuning kit geared toward image/video/audio diffusion models.

mcmonkeyprojects/SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an...

nateraw/stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI
