Understanding Diffusion Models via Code Execution
- URL: http://arxiv.org/abs/2512.07201v1
- Date: Mon, 08 Dec 2025 06:25:07 GMT
- Title: Understanding Diffusion Models via Code Execution
- Authors: Cheng Yu
- Abstract summary: This technical report aims to provide researchers with a clear, implementation-first understanding of how diffusion models work in practice and how code and theory correspond. Our code and pre-trained models are available at: https://github.com/disanda/GM/tree/main/DDPM-DDIM-ClassifierFree.
- Score: 9.831893034739037
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Diffusion models have achieved remarkable performance in generative modeling, yet their theoretical foundations are often intricate, and the gap between mathematical formulations in papers and practical open-source implementations can be difficult to bridge. Existing tutorials primarily focus on deriving equations, offering limited guidance on how diffusion models actually operate in code. To address this, we present a concise implementation of approximately 300 lines that explains diffusion models from a code-execution perspective. Our minimal example preserves the essential components -- including forward diffusion, reverse sampling, the noise-prediction network, and the training loop -- while removing unnecessary engineering details. This technical report aims to provide researchers with a clear, implementation-first understanding of how diffusion models work in practice and how code and theory correspond. Our code and pre-trained models are available at: https://github.com/disanda/GM/tree/main/DDPM-DDIM-ClassifierFree.
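The components the abstract names (forward diffusion, reverse sampling, the noise-prediction network, the training loop) can be sketched in a few lines. This is a minimal illustration of the standard DDPM forward process and epsilon-prediction loss, not the paper's actual ~300-line implementation; the `predict_noise` argument is a hypothetical stand-in for a trained U-Net, and the linear beta schedule is an assumption.

```python
import numpy as np

T = 1000
betas = np.linspace(1e-4, 0.02, T)   # assumed linear noise schedule
alphas = 1.0 - betas
alpha_bars = np.cumprod(alphas)      # cumulative product, \bar{alpha}_t

def forward_diffuse(x0, t, rng):
    """Sample x_t ~ q(x_t | x_0) in closed form: scaled data plus scaled noise."""
    eps = rng.standard_normal(x0.shape)
    xt = np.sqrt(alpha_bars[t]) * x0 + np.sqrt(1.0 - alpha_bars[t]) * eps
    return xt, eps

def training_loss(predict_noise, x0, rng):
    """One step of the simple epsilon-prediction objective || eps - eps_theta(x_t, t) ||^2."""
    t = rng.integers(0, T)           # uniformly sampled timestep
    xt, eps = forward_diffuse(x0, t, rng)
    return np.mean((eps - predict_noise(xt, t)) ** 2)

rng = np.random.default_rng(0)
x0 = rng.standard_normal((8, 8))     # toy "image"
# With a zero "network", the loss reduces to the mean squared noise.
loss = training_loss(lambda xt, t: np.zeros_like(xt), x0, rng)
```

A real training loop would repeat `training_loss` over minibatches and backpropagate through the network; reverse (DDPM/DDIM) sampling then inverts the schedule above step by step.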
Related papers
- An Introduction to Flow Matching and Diffusion Models [1.2277343096128712]
This tutorial provides a self-contained introduction to diffusion and flow-based generative models from first principles. We develop the necessary mathematical background in ordinary and stochastic differential equations and derive the core algorithms of flow matching and denoising diffusion models. We then provide a step-by-step guide to building image and video generators, including training methods, guidance, and architectural design.
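The core flow-matching objective the tutorial derives can be sketched briefly. This is an illustrative toy, assuming the common straight-line (linear interpolation) probability path with constant target velocity x1 - x0; `velocity_net` is a hypothetical placeholder for a learned model.

```python
import numpy as np

def interpolate(x0, x1, t):
    """Point on the straight-line path x_t = (1 - t) * x0 + t * x1, t in [0, 1]."""
    return (1.0 - t) * x0 + t * x1

def fm_loss(velocity_net, x0, x1, t):
    """Conditional flow-matching loss: regress onto the target velocity x1 - x0."""
    xt = interpolate(x0, x1, t)
    target = x1 - x0
    return np.mean((velocity_net(xt, t) - target) ** 2)

rng = np.random.default_rng(0)
x0 = rng.standard_normal(16)   # noise sample
x1 = rng.standard_normal(16)   # data sample
# A network that already outputs x1 - x0 drives the loss to zero.
perfect = lambda xt, t: x1 - x0
loss = fm_loss(perfect, x0, x1, 0.5)
```

Sampling then integrates the learned velocity field from t = 0 to t = 1, e.g. with a simple Euler scheme.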
arXiv Detail & Related papers (2025-06-02T05:07:25Z)
- Generalized Interpolating Discrete Diffusion [65.74168524007484]
Masked diffusion is a popular choice due to its simplicity and effectiveness. We introduce generalized interpolating discrete diffusion (GIDD), a new family of models that offers greater flexibility in the design of the noising process. Exploiting GIDD's flexibility, we explore a hybrid approach combining masking and uniform noise, leading to improved sample quality.
arXiv Detail & Related papers (2025-03-06T14:30:55Z)
- Generative Diffusion Modeling: A Practical Handbook [25.81859481634996]
This handbook covers diffusion probabilistic models, score-based generative models, consistency models, rectified flow, and related methods. Content encompasses the fundamentals of diffusion models, the pre-training process, and various post-training methods. Designed as a practical guide, it emphasizes clarity and usability over theoretical depth.
arXiv Detail & Related papers (2024-12-22T21:02:36Z)
- DiffusionTrend: A Minimalist Approach to Virtual Fashion Try-On [103.89972383310715]
DiffusionTrend harnesses latents rich in prior information to capture the nuances of garment details. It delivers a visually compelling try-on experience, underscoring the potential of training-free diffusion models.
arXiv Detail & Related papers (2024-12-19T02:24:35Z)
- Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review [63.31328039424469]
This tutorial provides a comprehensive survey of methods for fine-tuning diffusion models to optimize downstream reward functions.
We explain the application of various RL algorithms, including PPO, differentiable optimization, reward-weighted MLE, value-weighted sampling, and path consistency learning.
arXiv Detail & Related papers (2024-07-18T17:35:32Z)
- Simple and Effective Masked Diffusion Language Models [48.68198363304619]
We show that simple masked discrete diffusion is more performant than previously thought.
Our objective has a simple form -- it is a mixture of classical masked language modeling losses.
On language modeling benchmarks, a range of masked diffusion models trained with modern engineering practices achieves a new state-of-the-art.
arXiv Detail & Related papers (2024-06-11T17:51:40Z)
- Guided Diffusion from Self-Supervised Diffusion Features [49.78673164423208]
Guidance serves as a key concept in diffusion models, yet its effectiveness is often limited by the need for extra data annotation or pretraining.
We propose a framework to extract guidance from, and specifically for, diffusion models.
arXiv Detail & Related papers (2023-12-14T11:19:11Z)
- Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models [77.83923746319498]
We propose a framework called Diff-Instruct to instruct the training of arbitrary generative models.
We show that Diff-Instruct results in state-of-the-art single-step diffusion-based models.
Experiments on refining GAN models show that the Diff-Instruct can consistently improve the pre-trained generators of GAN models.
arXiv Detail & Related papers (2023-05-29T04:22:57Z)
- Diffusion Models as Masked Autoencoders [52.442717717898056]
We revisit generatively pre-training visual representations in light of recent interest in denoising diffusion models.
While directly pre-training with diffusion models does not produce strong representations, we condition diffusion models on masked input and formulate diffusion models as masked autoencoders (DiffMAE).
We perform a comprehensive study on the pros and cons of design choices and build connections between diffusion models and masked autoencoders.
arXiv Detail & Related papers (2023-04-06T17:59:56Z)
- Towards Practical Plug-and-Play Diffusion Models [19.846094740800254]
Diffusion-based generative models have achieved remarkable success in image generation.
Direct use of publicly available off-the-shelf models for guidance fails due to poor performance on noisy inputs.
Existing practice is to fine-tune the guidance models with labeled data corrupted with noises.
arXiv Detail & Related papers (2022-12-12T15:29:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this list (including all information) and is not responsible for any consequences arising from its use.