dome272/Paella

Official Implementation of Paella https://arxiv.org/abs/2211.07292v2

/ 100

Emerging

Operates on compressed, quantized latent spaces conditioned with CLIP embeddings to achieve high-fidelity image generation in under 10 steps and 500ms per image. Beyond text-to-image synthesis, supports latent space interpolation and image manipulations including inpainting, outpainting, and structural editing. Prioritizes accessibility with minimalistic codebases—training and sampling implementations fit under 140 lines—enabling rapid experimentation and community contribution.

748 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 16 / 25

How are scores calculated?

Stars

748

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

xie-lab-ml/Golden-Noise-for-Diffusion-Models

[ICCV2025] The code of our work "Golden Noise for Diffusion Models: A Learning Framework".

UNIC-Lab/RadioDiff

This is the code for the paper "RadioDiff: An Effective Generative Diffusion Model for...

yulewang97/ERDiff

[NeurIPS 2023 Spotlight] Official Repo for "Extraction and Recovery of Dpatio-temporal Structure...

pantheon5100/pid_diffusion

This repository is the official implementation of the paper: Physics Informed Distillation for...

zju-pi/diff-sampler

An open-source toolbox for fast sampling of diffusion models. Official implementations of our...

Explore Diffusion Models

All categories Trending Diffusion directory Insights