sayakpaul/diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Quality score: 46 / 100 (Emerging)
Combines `torch.compile()` with `torchao`'s quantization schemes (INT8, FP8) to accelerate diffusion model inference, achieving 53.88% speedup on Flux.1-Dev and 27.33% on CogVideoX-5b. Provides end-to-end recipes for both inference optimization and experimental FP8 training, with serialization strategies to reduce framework overhead. Integrates directly into the Hugging Face `diffusers` pipeline as an official quantization backend, supporting automatic quantization (`autoquant`) with minimal code changes.
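A minimal sketch of what this integration looks like in user code, using the `TorchAoConfig` quantization backend that diffusers exposes (the model ID, the `"int8wo"` scheme string, and the generation settings are illustrative; the repo's recipes cover the exact configurations benchmarked):

```python
import torch
from diffusers import FluxPipeline, FluxTransformer2DModel, TorchAoConfig

# INT8 weight-only quantization of the transformer via torchao
# ("int8wo" is one of the scheme strings TorchAoConfig accepts).
quant_config = TorchAoConfig("int8wo")
transformer = FluxTransformer2DModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,
)
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    transformer=transformer,
    torch_dtype=torch.bfloat16,
).to("cuda")

# Compiling the quantized transformer is where most of the speedup comes from.
pipe.transformer = torch.compile(pipe.transformer, mode="max-autotune", fullgraph=True)

image = pipe(
    "a photo of an astronaut riding a horse",
    num_inference_steps=28,
).images[0]
image.save("flux_int8.png")
```

Note this requires a CUDA GPU and downloading the Flux.1-Dev weights; swapping `"int8wo"` for other scheme strings (or using autoquant) changes only the config line.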

No package published; no downstream dependents tracked.

Score breakdown:
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 10 / 25


Stars: 397
Forks: 16
Language: Python
License: Apache-2.0
Last pushed: Jan 08, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/sayakpaul/diffusers-torchao"

Open to everyone: 100 requests/day with no API key. A free key raises the limit to 1,000 requests/day.
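The same endpoint can be called from Python with the standard library. A small sketch, assuming the endpoint shape shown in the curl command above and assuming (unverified) that it returns JSON; the `quality_url` and `fetch_quality` helper names are my own:

```python
import json
import urllib.request
from urllib.parse import quote

BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-API URL for a repo (path shape taken from the curl example)."""
    return f"{BASE}/{quote(category)}/{quote(owner)}/{quote(repo)}"


def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch and decode the response, assuming the API returns a JSON object."""
    with urllib.request.urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)


# Example:
# data = fetch_quality("diffusion", "sayakpaul", "diffusers-torchao")
```

Percent-encoding the path segments keeps the helper safe for owner or repo names with unusual characters.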