On the Importance of Noise Scheduling for Diffusion Models
- URL: http://arxiv.org/abs/2301.10972v4
- Date: Sun, 21 May 2023 07:07:55 GMT
- Title: On the Importance of Noise Scheduling for Diffusion Models
- Authors: Ting Chen
- Abstract summary: We study the effect of noise scheduling strategies for denoising diffusion generative models.
This simple recipe yields state-of-the-art pixel-based diffusion models for high-resolution images on ImageNet.
- Score: 8.360383061862844
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We empirically study the effect of noise scheduling strategies for denoising
diffusion generative models. There are three findings: (1) the noise scheduling
is crucial for the performance, and the optimal one depends on the task (e.g.,
image sizes), (2) when increasing the image size, the optimal noise scheduling
shifts towards a noisier one (due to increased redundancy in pixels), and (3)
simply scaling the input data by a factor of $b$ while keeping the noise
schedule function fixed (equivalent to shifting the logSNR by $\log b$) is a
good strategy across image sizes. This simple recipe, when combined with
recently proposed Recurrent Interface Network (RIN), yields state-of-the-art
pixel-based diffusion models for high-resolution images on ImageNet, enabling
single-stage, end-to-end generation of diverse and high-fidelity images at
1024$\times$1024 resolution (without upsampling/cascades).
Related papers
- Zero-Shot Image Denoising for High-Resolution Electron Microscopy [28.34992348748098]
High-resolution electron microscopy (HREM) imaging technique is a powerful tool for directly visualizing a broad range of materials in real-space.
It faces challenges in denoising due to ultra-low signal-to-noise ratio (SNR) and scarce data availability.
We propose Noise2SR, a zero-shot self-supervised learning (ZS-SSL) denoising framework for HREM.
arXiv Detail & Related papers (2024-06-20T12:40:18Z) - NM-FlowGAN: Modeling sRGB Noise without Paired Images using a Hybrid Approach of Normalizing Flows and GAN [9.81778202920426]
NM-FlowGAN is a hybrid approach that exploits the strengths of both GAN and Normalizing Flows.
Our method synthesizes noise using clean images and factors that affect noise characteristics, such as easily obtainable parameters like camera type and ISO settings.
In our experiments, our NM-FlowGAN outperforms other baselines in the sRGB noise synthesis task.
arXiv Detail & Related papers (2023-12-15T09:09:25Z) - Diffusion Model for Generative Image Denoising [17.897180118637856]
In supervised learning for image denoising, usually the paired clean images and noisy images are collected and synthesised to train a denoising model.
In this paper, we regard the denoising task as a problem of estimating the posterior distribution of clean images conditioned on noisy images.
arXiv Detail & Related papers (2023-02-05T14:53:07Z) - Representing Noisy Image Without Denoising [91.73819173191076]
Fractional-order Moments in Radon space (FMR) is designed to derive robust representation directly from noisy images.
Unlike earlier integer-order methods, our work is a more generic design taking such classical methods as special cases.
arXiv Detail & Related papers (2023-01-18T10:13:29Z) - Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware
Adversarial Training [50.018580462619425]
We propose a novel framework, namely Pixel-level Noise-aware Generative Adrial Network (PNGAN)
PNGAN employs a pre-trained real denoiser to map the fake and real noisy images into a nearly noise-free solution space.
For better noise fitting, we present an efficient architecture Simple Multi-versa-scale Network (SMNet) as the generator.
arXiv Detail & Related papers (2022-04-06T14:09:02Z) - Unsupervised Single Image Super-resolution Under Complex Noise [60.566471567837574]
This paper proposes a model-based unsupervised SISR method to deal with the general SISR task with unknown degradations.
The proposed method can evidently surpass the current state of the art (SotA) method (about 1dB PSNR) not only with a slighter model (0.34M vs. 2.40M) but also faster speed.
arXiv Detail & Related papers (2021-07-02T11:55:40Z) - Image Super-Resolution via Iterative Refinement [53.57766722279425]
SR3 is an approach to image Super-Resolution via Repeated Refinement.
It adapts probabilistic denoising diffusion models to conditional image generation.
It exhibits strong performance on super-resolution tasks at different magnification factors.
arXiv Detail & Related papers (2021-04-15T17:50:42Z) - Designing a Practical Degradation Model for Deep Blind Image
Super-Resolution [134.9023380383406]
Single image super-resolution (SISR) methods would not perform well if the assumed degradation model deviates from those in real images.
This paper proposes to design a more complex but practical degradation model that consists of randomly shuffled blur, downsampling and noise degradations.
arXiv Detail & Related papers (2021-03-25T17:40:53Z) - Neighbor2Neighbor: Self-Supervised Denoising from Single Noisy Images [98.82804259905478]
We present Neighbor2Neighbor to train an effective image denoising model with only noisy images.
In detail, input and target used to train a network are images sub-sampled from the same noisy image.
A denoising network is trained on sub-sampled training pairs generated in the first stage, with a proposed regularizer as additional loss for better performance.
arXiv Detail & Related papers (2021-01-08T02:03:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.