Discernible Image Compression
- URL: http://arxiv.org/abs/2002.06810v3
- Date: Tue, 8 Sep 2020 00:44:12 GMT
- Title: Discernible Image Compression
- Authors: Zhaohui Yang, Yunhe Wang, Chang Xu, Peng Du, Chao Xu, Chunjing Xu, Qi
Tian
- Abstract summary: This paper aims to produce compressed images by pursuing both appearance and perceptual consistency.
Based on the encoder-decoder framework, we propose using a pre-trained CNN to extract features of the original and compressed images.
Experiments on benchmarks demonstrate that images compressed by using the proposed method can also be well recognized by subsequent visual recognition and detection models.
- Score: 124.08063151879173
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image compression, as one of the fundamental low-level image processing
tasks, is very essential for computer vision. Tremendous computing and storage
resources can be preserved with a trivial amount of visual information.
Conventional image compression methods tend to obtain compressed images by
minimizing their appearance discrepancy with the corresponding original images,
but pay little attention to their efficacy in downstream perception tasks,
e.g., image recognition and object detection. Thus, some of compressed images
could be recognized with bias. In contrast, this paper aims to produce
compressed images by pursuing both appearance and perceptual consistency. Based
on the encoder-decoder framework, we propose using a pre-trained CNN to extract
features of the original and compressed images, and making them similar. Thus
the compressed images are discernible to subsequent tasks, and we name our
method as Discernible Image Compression (DIC). In addition, the maximum mean
discrepancy (MMD) is employed to minimize the difference between feature
distributions. The resulting compression network can generate images with high
image quality and preserve the consistent perception in the feature domain, so
that these images can be well recognized by pre-trained machine learning
models. Experiments on benchmarks demonstrate that images compressed by using
the proposed method can also be well recognized by subsequent visual
recognition and detection models. For instance, the mAP value of compressed
images by DIC is about 0.6% higher than that of using compressed images by
conventional methods.
Related papers
- Transferable Learned Image Compression-Resistant Adversarial Perturbations [66.46470251521947]
Adversarial attacks can readily disrupt the image classification system, revealing the vulnerability of DNN-based recognition tasks.
We introduce a new pipeline that targets image classification models that utilize learned image compressors as pre-processing modules.
arXiv Detail & Related papers (2024-01-06T03:03:28Z) - You Can Mask More For Extremely Low-Bitrate Image Compression [80.7692466922499]
Learned image compression (LIC) methods have experienced significant progress during recent years.
LIC methods fail to explicitly explore the image structure and texture components crucial for image compression.
We present DA-Mask that samples visible patches based on the structure and texture of original images.
We propose a simple yet effective masked compression model (MCM), the first framework that unifies LIC and LIC end-to-end for extremely low-bitrate compression.
arXiv Detail & Related papers (2023-06-27T15:36:22Z) - Crowd Counting on Heavily Compressed Images with Curriculum Pre-Training [90.76576712433595]
Applying lossy compression on images processed by deep neural networks can lead to significant accuracy degradation.
Inspired by the curriculum learning paradigm, we present a novel training approach called curriculum pre-training (CPT) for crowd counting on compressed images.
arXiv Detail & Related papers (2022-08-15T08:43:21Z) - Perceptual Quality Assessment for Fine-Grained Compressed Images [38.615746092795625]
We propose a full-reference image quality assessment (FR-IQA) method for compressed images of fine-grained levels.
The proposed method is validated on the fine-grained compression image quality assessment (FGIQA) database.
arXiv Detail & Related papers (2022-06-08T12:56:45Z) - SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment [23.87848736166893]
We design a full-reference image quality assessment metric SwinIQA to measure the perceptual quality of compressed images in a learned Swin distance space.
Experimental results show that the proposed metric achieves higher consistency with human's perceptual judgment compared with both traditional methods and learning-based methods on CLIC datasets.
arXiv Detail & Related papers (2022-05-09T13:31:27Z) - Estimating the Resize Parameter in End-to-end Learned Image Compression [50.20567320015102]
We describe a search-free resizing framework that can further improve the rate-distortion tradeoff of recent learned image compression models.
Our results show that our new resizing parameter estimation framework can provide Bjontegaard-Delta rate (BD-rate) improvement of about 10% against leading perceptual quality engines.
arXiv Detail & Related papers (2022-04-26T01:35:02Z) - Metric Learning for Anti-Compression Facial Forgery Detection [32.33501564446107]
We propose a novel anti-compression facial forgery detection framework.
It learns a compression-insensitive embedding feature space utilizing both original and compressed forgeries.
arXiv Detail & Related papers (2021-03-15T14:11:14Z) - Early Exit or Not: Resource-Efficient Blind Quality Enhancement for
Compressed Images [54.40852143927333]
Lossy image compression is pervasively conducted to save communication bandwidth, resulting in undesirable compression artifacts.
We propose a resource-efficient blind quality enhancement (RBQE) approach for compressed images.
Our approach can automatically decide to terminate or continue enhancement according to the assessed quality of enhanced images.
arXiv Detail & Related papers (2020-06-30T07:38:47Z) - What's in the Image? Explorable Decoding of Compressed Images [45.22726784749359]
We develop a novel decoder architecture for the ubiquitous JPEG standard, which allows traversing the set of decompressed images.
We exemplify our framework on graphical, medical and forensic use cases, demonstrating its wide range of potential applications.
arXiv Detail & Related papers (2020-06-16T17:15:44Z) - Saliency Driven Perceptual Image Compression [6.201592931432016]
The paper demonstrates that the popularly used evaluations metrics such as MS-SSIM and PSNR are inadequate for judging the performance of image compression techniques.
A new metric is proposed, which is learned on perceptual similarity data specific to image compression.
The model not only generates images which are visually better but also gives superior performance for subsequent computer vision tasks.
arXiv Detail & Related papers (2020-02-12T13:43:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.