Diffusion Models: A Comprehensive Survey of Methods and Applications
- URL: http://arxiv.org/abs/2209.00796v2
- Date: Tue, 6 Sep 2022 02:20:10 GMT
- Title: Diffusion Models: A Comprehensive Survey of Methods and Applications
- Authors: Ling Yang, Zhilong Zhang, Shenda Hong, Wentao Zhang
- Abstract summary: Diffusion models are a class of deep generative models that have shown impressive results on various tasks with dense theoretical founding.
Recent studies have shown great enthusiasm on improving the performance of diffusion model.
- Score: 10.557289965753437
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Diffusion models are a class of deep generative models that have shown
impressive results on various tasks with dense theoretical founding. Although
diffusion models have achieved impressive quality and diversity of sample
synthesis than other state-of-the-art models, they still suffer from costly
sampling procedure and sub-optimal likelihood estimation. Recent studies have
shown great enthusiasm on improving the performance of diffusion model. In this
article, we present a first comprehensive review of existing variants of the
diffusion models. Specifically, we provide a first taxonomy of diffusion models
and categorize them variants to three types, namely sampling-acceleration
enhancement, likelihood-maximization enhancement and data-generalization
enhancement. We also introduce in detail other five generative models (i.e.,
variational autoencoders, generative adversarial networks, normalizing flow,
autoregressive models, and energy-based models), and clarify the connections
between diffusion models and these generative models. Then we make a thorough
investigation into the applications of diffusion models, including computer
vision, natural language processing, waveform signal processing, multi-modal
modeling, molecular graph generation, time series modeling, and adversarial
purification. Furthermore, we propose new perspectives pertaining to the
development of this generative model.
Related papers
- Energy-Based Diffusion Language Models for Text Generation [126.23425882687195]
Energy-based Diffusion Language Model (EDLM) is an energy-based model operating at the full sequence level for each diffusion step.
Our framework offers a 1.3$times$ sampling speedup over existing diffusion models.
arXiv Detail & Related papers (2024-10-28T17:25:56Z) - Provable Statistical Rates for Consistency Diffusion Models [87.28777947976573]
Despite the state-of-the-art performance, diffusion models are known for their slow sample generation due to the extensive number of steps involved.
This paper contributes towards the first statistical theory for consistency models, formulating their training as a distribution discrepancy minimization problem.
arXiv Detail & Related papers (2024-06-23T20:34:18Z) - Diffusion Models in Low-Level Vision: A Survey [82.77962165415153]
diffusion model-based solutions have emerged as widely acclaimed for their ability to produce samples of superior quality and diversity.
We present three generic diffusion modeling frameworks and explore their correlations with other deep generative models.
We summarize extended diffusion models applied in other tasks, including medical, remote sensing, and video scenarios.
arXiv Detail & Related papers (2024-06-17T01:49:27Z) - An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization [59.63880337156392]
Diffusion models have achieved tremendous success in computer vision, audio, reinforcement learning, and computational biology.
Despite the significant empirical success, theory of diffusion models is very limited.
This paper provides a well-rounded theoretical exposure for stimulating forward-looking theories and methods of diffusion models.
arXiv Detail & Related papers (2024-04-11T14:07:25Z) - A Survey of Diffusion Models in Natural Language Processing [11.233768932957771]
Diffusion models capture the diffusion of information or signals across a network or manifold.
This paper discusses the different formulations of diffusion models used in NLP, their strengths and limitations, and their applications.
arXiv Detail & Related papers (2023-05-24T03:25:32Z) - Diffusion Models in Vision: A Survey [80.82832715884597]
A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and a reverse diffusion stage.
Diffusion models are widely appreciated for the quality and diversity of the generated samples, despite their known computational burdens.
arXiv Detail & Related papers (2022-09-10T22:00:30Z) - A Survey on Generative Diffusion Model [75.93774014861978]
Diffusion models are an emerging class of deep generative models.
They have certain limitations, including a time-consuming iterative generation process and confinement to high-dimensional Euclidean space.
This survey presents a plethora of advanced techniques aimed at enhancing diffusion models.
arXiv Detail & Related papers (2022-09-06T16:56:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.