huggingface/finetrainers

Scalable and memory-optimized training of diffusion models

/ 100

Emerging

Supports distributed training via DDP, FSDP-2, and HSDP, plus LoRA and full-rank fine-tuning across multiple video generation models (LTX-Video, CogVideoX, HunyuanVideo, Wan) and image models (Flux, CogView4). Built on the Diffusers framework with pluggable attention backends (flash, flex, xformers) and enables FP8 quantization-aware training to reduce VRAM requirements for single-GPU and large-scale scenarios.

1,343 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

1,343

Forks

140

Language

Python

License

Apache-2.0

Higher-rated alternatives

NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

FoundationVision/VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...

nerdyrodent/VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

eps696/aphantasia

CLIP + FFT/DWT/RGB = text to image/video

AssemblyAI-Community/MinImagen

MinImagen: A minimal implementation of the Imagen text-to-image model

Explore Diffusion Models

All categories Trending Diffusion directory Insights