madaan/minimal-text-diffusion
A minimal implementation of diffusion models for text generation
Implements discrete diffusion for text using a BERT encoder with tied embedding/output weights, where the model learns to predict clean embeddings from noisy versions via progressive denoising steps. Supports classifier-guided generation for conditional sampling and includes word-level tokenization with options for BPE. Built on PyTorch with configurable sequence lengths and diffusion schedules, designed for training on arbitrary text corpora with minimal dependencies.
410 stars. No commits in the last 6 months.
Stars
410
Forks
38
Language
Python
License
MIT
Category
Last pushed
May 11, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/madaan/minimal-text-diffusion"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
quantgirluk/aleatory
📦 Python library for Stochastic Processes Simulation and Visualisation
blei-lab/treeffuser
Treeffuser is an easy-to-use package for probabilistic prediction and probabilistic regression...
yuanchenyang/smalldiffusion
Simple and readable code for training and sampling from diffusion models
TuftsBCB/RegDiffusion
Diffusion model for gene regulatory network inference.
lvyufeng/denoising-diffusion-mindspore
Implementation of Denoising Diffusion Probabilistic Model in MindSpore