Universal Deep Image Compression via Content-Adaptive Optimization with
Adapters
- URL: http://arxiv.org/abs/2211.00918v1
- Date: Wed, 2 Nov 2022 07:01:30 GMT
- Title: Universal Deep Image Compression via Content-Adaptive Optimization with
Adapters
- Authors: Koki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa
- Abstract summary: Deep image compression performs better than conventional codecs, such as JPEG, on natural images.
Deep image compression is learning-based and encounters a problem: the compression performance deteriorates significantly for out-of-domain images.
This study aims to compress images belonging to arbitrary domains, such as natural images, line drawings, and comics.
- Score: 43.291753358414255
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep image compression performs better than conventional codecs, such as
JPEG, on natural images. However, deep image compression is learning-based and
encounters a problem: the compression performance deteriorates significantly
for out-of-domain images. In this study, we highlight this problem and address
a novel task: universal deep image compression. This task aims to compress
images belonging to arbitrary domains, such as natural images, line drawings,
and comics. To address this problem, we propose a content-adaptive optimization
framework; this framework uses a pre-trained compression model and adapts the
model to a target image during compression. Adapters are inserted into the
decoder of the model. For each input image, our framework optimizes the latent
representation extracted by the encoder and the adapter parameters in terms of
rate-distortion. The adapter parameters are additionally transmitted per image.
For the experiments, a benchmark dataset containing uncompressed images of four
domains (natural images, line drawings, comics, and vector arts) is constructed
and the proposed universal deep compression is evaluated. Finally, the proposed
model is compared with non-adaptive and existing adaptive compression models.
The comparison reveals that the proposed model outperforms these. The code and
dataset are publicly available at https://github.com/kktsubota/universal-dic.
Related papers
- Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates [47.47031054057152]
Generative models have been explored to compress RS images into extremely low-bitrate streams.
These generative models struggle to reconstruct visually plausible images due to the highly ill-posed nature of extremely low-bitrate image compression.
We propose an image compression framework that utilizes a pre-trained diffusion model with powerful natural image priors to achieve high-realism reconstructions.
arXiv Detail & Related papers (2024-09-03T14:29:54Z) - Estimating the Resize Parameter in End-to-end Learned Image Compression [50.20567320015102]
We describe a search-free resizing framework that can further improve the rate-distortion tradeoff of recent learned image compression models.
Our results show that our new resizing parameter estimation framework can provide Bjontegaard-Delta rate (BD-rate) improvement of about 10% against leading perceptual quality engines.
arXiv Detail & Related papers (2022-04-26T01:35:02Z) - The Devil Is in the Details: Window-based Attention for Image
Compression [58.1577742463617]
Most existing learned image compression models are based on Convolutional Neural Networks (CNNs)
In this paper, we study the effects of multiple kinds of attention mechanisms for local features learning, then introduce a more straightforward yet effective window-based local attention block.
The proposed window-based attention is very flexible which could work as a plug-and-play component to enhance CNN and Transformer models.
arXiv Detail & Related papers (2022-03-16T07:55:49Z) - Variable-Rate Deep Image Compression through Spatially-Adaptive Feature
Transform [58.60004238261117]
We propose a versatile deep image compression network based on Spatial Feature Transform (SFT arXiv:1804.02815)
Our model covers a wide range of compression rates using a single model, which is controlled by arbitrary pixel-wise quality maps.
The proposed framework allows us to perform task-aware image compressions for various tasks.
arXiv Detail & Related papers (2021-08-21T17:30:06Z) - Enhanced Invertible Encoding for Learned Image Compression [40.21904131503064]
In this paper, we propose an enhanced Invertible.
Network with invertible neural networks (INNs) to largely mitigate the information loss problem for better compression.
Experimental results on the Kodak, CLIC, and Tecnick datasets show that our method outperforms the existing learned image compression methods.
arXiv Detail & Related papers (2021-08-08T17:32:10Z) - How to Exploit the Transferability of Learned Image Compression to
Conventional Codecs [25.622863999901874]
We show how learned image coding can be used as a surrogate to optimize an image for encoding.
Our approach can remodel a conventional image to adjust for the MS-SSIM distortion with over 20% rate improvement without any decoding overhead.
arXiv Detail & Related papers (2020-12-03T12:34:51Z) - Quantization Guided JPEG Artifact Correction [69.04777875711646]
We develop a novel architecture for artifact correction using the JPEG files quantization matrix.
This allows our single model to achieve state-of-the-art performance over models trained for specific quality settings.
arXiv Detail & Related papers (2020-04-17T00:10:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.