PixelPyramids: Exact Inference Models from Lossless Image Pyramids
- URL: http://arxiv.org/abs/2110.08787v1
- Date: Sun, 17 Oct 2021 10:47:29 GMT
- Title: PixelPyramids: Exact Inference Models from Lossless Image Pyramids
- Authors: Shweta Mahajan, Stefan Roth
- Abstract summary: Pixel-Pyramids is a block-autoregressive approach with scale-specific representations to encode the joint distribution of image pixels.
It yields state-of-the-art results for density estimation on various image datasets, especially for high-resolution data.
For CelebA-HQ 1024 x 1024, we observe that the density estimates are improved to 44% of the baseline despite sampling speeds superior even to easily parallelizable flow-based models.
- Score: 58.949070311990916
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Autoregressive models are a class of exact inference approaches with highly
flexible functional forms, yielding state-of-the-art density estimates for
natural images. Yet, the sequential ordering on the dimensions makes these
models computationally expensive and limits their applicability to
low-resolution imagery. In this work, we propose Pixel-Pyramids, a
block-autoregressive approach employing a lossless pyramid decomposition with
scale-specific representations to encode the joint distribution of image
pixels. Crucially, it affords a sparser dependency structure compared to fully
autoregressive approaches. Our PixelPyramids yield state-of-the-art results for
density estimation on various image datasets, especially for high-resolution
data. For CelebA-HQ 1024 x 1024, we observe that the density estimates (in
terms of bits/dim) are improved to ~44% of the baseline despite sampling speeds
superior even to easily parallelizable flow-based models.
Related papers
- Fast constrained sampling in pre-trained diffusion models [77.21486516041391]
Diffusion models have dominated the field of large, generative image models.
We propose an algorithm for fast-constrained sampling in large pre-trained diffusion models.
arXiv Detail & Related papers (2024-10-24T14:52:38Z) - Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization [45.20189929583484]
We decompose the deblurring (regression) task into blur pixel discretization and discrete-to-continuous conversion tasks.
Specifically, we generate the discretized image residual errors by identifying the blur pixels and then transform them to a continuous form.
arXiv Detail & Related papers (2024-04-18T13:22:56Z) - Deep Richardson-Lucy Deconvolution for Low-Light Image Deblurring [48.80983873199214]
We develop a data-driven approach to model the saturated pixels by a learned latent map.
Based on the new model, the non-blind deblurring task can be formulated into a maximum a posterior (MAP) problem.
To estimate high-quality deblurred images without amplified artifacts, we develop a prior estimation network.
arXiv Detail & Related papers (2023-08-10T12:53:30Z) - Hierarchical Integration Diffusion Model for Realistic Image Deblurring [71.76410266003917]
Diffusion models (DMs) have been introduced in image deblurring and exhibited promising performance.
We propose the Hierarchical Integration Diffusion Model (HI-Diff), for realistic image deblurring.
Experiments on synthetic and real-world blur datasets demonstrate that our HI-Diff outperforms state-of-the-art methods.
arXiv Detail & Related papers (2023-05-22T12:18:20Z) - InfinityGAN: Towards Infinite-Resolution Image Synthesis [92.40782797030977]
We present InfinityGAN, a method to generate arbitrary-resolution images.
We show how it trains and infers patch-by-patch seamlessly with low computational resources.
arXiv Detail & Related papers (2021-04-08T17:59:30Z) - Locally Masked Convolution for Autoregressive Models [107.4635841204146]
LMConv is a simple modification to the standard 2D convolution that allows arbitrary masks to be applied to the weights at each location in the image.
We learn an ensemble of distribution estimators that share parameters but differ in generation order, achieving improved performance on whole-image density estimation.
arXiv Detail & Related papers (2020-06-22T17:59:07Z) - Weighted Encoding Based Image Interpolation With Nonlocal Linear
Regression Model [8.013127492678272]
In image super-resolution, the low-resolution image is directly down-sampled from its high-resolution counterpart without blurring and noise.
To address this problem, we propose a novel image model based on sparse representation.
New approach to learn adaptive sub-dictionary online instead of clustering.
arXiv Detail & Related papers (2020-03-04T03:20:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.