Beyond Learned Metadata-based Raw Image Reconstruction
- URL: http://arxiv.org/abs/2306.12058v1
- Date: Wed, 21 Jun 2023 06:59:07 GMT
- Title: Beyond Learned Metadata-based Raw Image Reconstruction
- Authors: Yufei Wang, Yi Yu, Wenhan Yang, Lanqing Guo, Lap-Pui Chau, Alex C.
Kot, Bihan Wen
- Abstract summary: Raw images have distinct advantages over sRGB images, e.g., linearity and fine-grained quantization levels.
They are not widely adopted by general users due to their substantial storage requirements.
We propose a novel framework that learns a compact representation in the latent space, serving as metadata.
- Score: 86.1667769209103
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: While raw images have distinct advantages over sRGB images, e.g., linearity
and fine-grained quantization levels, they are not widely adopted by general
users due to their substantial storage requirements. Very recent studies
propose to compress raw images by designing sampling masks within the pixel
space of the raw image. However, these approaches often leave space for
pursuing more effective image representations and compact metadata. In this
work, we propose a novel framework that learns a compact representation in the
latent space, serving as metadata, in an end-to-end manner. Compared with lossy
image compression, we analyze the intrinsic difference of the raw image
reconstruction task caused by rich information from the sRGB image. Based on
the analysis, a novel backbone design with asymmetric and hybrid spatial
feature resolutions is proposed, which significantly improves the
rate-distortion performance. Besides, we propose a novel design of the context
model, which can better predict the order masks of encoding/decoding based on
both the sRGB image and the masks of already processed features. Benefited from
the better modeling of the correlation between order masks, the already
processed information can be better utilized. Moreover, a novel sRGB-guided
adaptive quantization precision strategy, which dynamically assigns varying
levels of quantization precision to different regions, further enhances the
representation ability of the model. Finally, based on the iterative properties
of the proposed context model, we propose a novel strategy to achieve variable
bit rates using a single model. This strategy allows for the continuous
convergence of a wide range of bit rates. Extensive experimental results
demonstrate that the proposed method can achieve better reconstruction quality
with a smaller metadata size.
Related papers
- You Can Mask More For Extremely Low-Bitrate Image Compression [80.7692466922499]
Learned image compression (LIC) methods have experienced significant progress during recent years.
LIC methods fail to explicitly explore the image structure and texture components crucial for image compression.
We present DA-Mask that samples visible patches based on the structure and texture of original images.
We propose a simple yet effective masked compression model (MCM), the first framework that unifies LIC and LIC end-to-end for extremely low-bitrate compression.
arXiv Detail & Related papers (2023-06-27T15:36:22Z) - Compression with Bayesian Implicit Neural Representations [16.593537431810237]
We propose overfitting variational neural networks to the data and compressing an approximate posterior weight sample using relative entropy coding instead of quantizing and entropy coding it.
Experiments show that our method achieves strong performance on image and audio compression while retaining simplicity.
arXiv Detail & Related papers (2023-05-30T16:29:52Z) - Raw Image Reconstruction with Learned Compact Metadata [61.62454853089346]
We propose a novel framework to learn a compact representation in the latent space serving as the metadata in an end-to-end manner.
We show how the proposed raw image compression scheme can adaptively allocate more bits to image regions that are important from a global perspective.
arXiv Detail & Related papers (2023-02-25T05:29:45Z) - Rank-Enhanced Low-Dimensional Convolution Set for Hyperspectral Image
Denoising [50.039949798156826]
This paper tackles the challenging problem of hyperspectral (HS) image denoising.
We propose rank-enhanced low-dimensional convolution set (Re-ConvSet)
We then incorporate Re-ConvSet into the widely-used U-Net architecture to construct an HS image denoising method.
arXiv Detail & Related papers (2022-07-09T13:35:12Z) - Implicit Neural Representations for Image Compression [103.78615661013623]
Implicit Neural Representations (INRs) have gained attention as a novel and effective representation for various data types.
We propose the first comprehensive compression pipeline based on INRs including quantization, quantization-aware retraining and entropy coding.
We find that our approach to source compression with INRs vastly outperforms similar prior work.
arXiv Detail & Related papers (2021-12-08T13:02:53Z) - Generating Images with Sparse Representations [21.27273495926409]
High dimensionality of images presents architecture and sampling-efficiency challenges for likelihood-based generative models.
We present an alternative approach, inspired by common image compression methods like JPEG, and convert images to quantized discrete cosine transform (DCT) blocks.
We propose a Transformer-based autoregressive architecture, which is trained to sequentially predict the conditional distribution of the next element in such sequences.
arXiv Detail & Related papers (2021-03-05T17:56:03Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.