Machine Perception-Driven Image Compression: A Layered Generative
Approach
- URL: http://arxiv.org/abs/2304.06896v1
- Date: Fri, 14 Apr 2023 02:12:38 GMT
- Title: Machine Perception-Driven Image Compression: A Layered Generative
Approach
- Authors: Yuefeng Zhang, Chuanmin Jia, Jiannhui Chang, Siwei Ma
- Abstract summary: layered generative image compression model is proposed to achieve high human vision-oriented image reconstructed quality.
Task-agnostic learning-based compression model is proposed, which effectively supports various compressed domain-based analytical tasks.
Joint optimization schedule is adopted to acquire best balance point among compression ratio, reconstructed image quality, and downstream perception performance.
- Score: 32.23554195427311
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In this age of information, images are a critical medium for storing and
transmitting information. With the rapid growth of image data amount, visual
compression and visual data perception are two important research topics
attracting a lot attention. However, those two topics are rarely discussed
together and follow separate research path. Due to the compact compressed
domain representation offered by learning-based image compression methods,
there exists possibility to have one stream targeting both efficient data
storage and compression, and machine perception tasks. In this paper, we
propose a layered generative image compression model achieving high human
vision-oriented image reconstructed quality, even at extreme compression
ratios. To obtain analysis efficiency and flexibility, a task-agnostic
learning-based compression model is proposed, which effectively supports
various compressed domain-based analytical tasks while reserves outstanding
reconstructed perceptual quality, compared with traditional and learning-based
codecs. In addition, joint optimization schedule is adopted to acquire best
balance point among compression ratio, reconstructed image quality, and
downstream perception performance. Experimental results verify that our
proposed compressed domain-based multi-task analysis method can achieve
comparable analysis results against the RGB image-based methods with up to
99.6% bit rate saving (i.e., compared with taking original RGB image as the
analysis model input). The practical ability of our model is further justified
from model size and information fidelity aspects.
Related papers
- Implicit Neural Image Field for Biological Microscopy Image Compression [37.0218688308699]
We propose an adaptive compression workflow based on Implicit Neural Representation (INR)
This approach permits application-specific compression objectives, capable of compressing images of any shape and arbitrary pixel-wise decompression.
We demonstrated on a wide range of microscopy images that our workflow not only achieved high, controllable compression ratios but also preserved detailed information critical for downstream analysis.
arXiv Detail & Related papers (2024-05-29T11:51:33Z) - A Rate-Distortion-Classification Approach for Lossy Image Compression [0.0]
In lossy image compression, the objective is to achieve minimal signal distortion while compressing images to a specified bit rate.
To bridge the gap between image compression and visual analysis, we propose a Rate-Distortion-Classification (RDC) model for lossy image compression.
arXiv Detail & Related papers (2024-05-06T14:11:36Z) - Transferable Learned Image Compression-Resistant Adversarial Perturbations [66.46470251521947]
Adversarial attacks can readily disrupt the image classification system, revealing the vulnerability of DNN-based recognition tasks.
We introduce a new pipeline that targets image classification models that utilize learned image compressors as pre-processing modules.
arXiv Detail & Related papers (2024-01-06T03:03:28Z) - Beyond Learned Metadata-based Raw Image Reconstruction [86.1667769209103]
Raw images have distinct advantages over sRGB images, e.g., linearity and fine-grained quantization levels.
They are not widely adopted by general users due to their substantial storage requirements.
We propose a novel framework that learns a compact representation in the latent space, serving as metadata.
arXiv Detail & Related papers (2023-06-21T06:59:07Z) - Estimating the Resize Parameter in End-to-end Learned Image Compression [50.20567320015102]
We describe a search-free resizing framework that can further improve the rate-distortion tradeoff of recent learned image compression models.
Our results show that our new resizing parameter estimation framework can provide Bjontegaard-Delta rate (BD-rate) improvement of about 10% against leading perceptual quality engines.
arXiv Detail & Related papers (2022-04-26T01:35:02Z) - Implicit Neural Representations for Image Compression [103.78615661013623]
Implicit Neural Representations (INRs) have gained attention as a novel and effective representation for various data types.
We propose the first comprehensive compression pipeline based on INRs including quantization, quantization-aware retraining and entropy coding.
We find that our approach to source compression with INRs vastly outperforms similar prior work.
arXiv Detail & Related papers (2021-12-08T13:02:53Z) - Learned Image Compression for Machine Perception [17.40776913809306]
We develop a framework that produces a compression format suitable for both human perception and machine perception.
We show that representations can be learned that simultaneously optimize for compression and performance on core vision tasks.
arXiv Detail & Related papers (2021-11-03T14:39:09Z) - Discernible Image Compression [124.08063151879173]
This paper aims to produce compressed images by pursuing both appearance and perceptual consistency.
Based on the encoder-decoder framework, we propose using a pre-trained CNN to extract features of the original and compressed images.
Experiments on benchmarks demonstrate that images compressed by using the proposed method can also be well recognized by subsequent visual recognition and detection models.
arXiv Detail & Related papers (2020-02-17T07:35:08Z) - Learning End-to-End Lossy Image Compression: A Benchmark [90.35363142246806]
We first conduct a comprehensive literature survey of learned image compression methods.
We describe milestones in cutting-edge learned image-compression methods, review a broad range of existing works, and provide insights into their historical development routes.
By introducing a coarse-to-fine hyperprior model for entropy estimation and signal reconstruction, we achieve improved rate-distortion performance.
arXiv Detail & Related papers (2020-02-10T13:13:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.