xlite-dev/Awesome-DiT-Inference

📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉

Quality score: 51 / 100 (Established)

Organizes diffusion inference optimization papers across five technical areas: sampling algorithms (DPM-Solver variants, parallel denoising), feature-caching strategies (DeepCache, layer caching), tensor parallelism and distributed inference, quantization methods, and attention optimizations. Covers both UNet and Transformer-based (DiT) architectures, with implementation links and research citations. Aimed at practitioners optimizing generative model deployments, it complements a parallel resource on LLM inference.

526 stars. Actively maintained with 1 commit in the last 30 days.

No package, no dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 12 / 25


Stars: 526
Forks: 26
Language: Python
License: GPL-3.0
Last pushed: Feb 25, 2026
Commits (30d): 1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/xlite-dev/Awesome-DiT-Inference"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
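The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions not stated on this page: that the path follows the pattern `/quality/<category>/<owner>/<repo>` (inferred from the single curl example above) and that the response body is JSON. The function names `quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is an inferred pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example."""
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body as JSON (assumed response format)."""
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    # urlopen raises urllib.error.HTTPError on 4xx/5xx, e.g. when the daily limit is hit.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("diffusion", "xlite-dev", "Awesome-DiT-Inference")
# print(json.dumps(data, indent=2))
```

Because the keyless tier allows 100 requests per day, it may be worth caching the decoded result locally rather than calling `fetch_quality` repeatedly for the same repository.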