SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment
- URL: http://arxiv.org/abs/2205.04264v1
- Date: Mon, 9 May 2022 13:31:27 GMT
- Title: SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment
- Authors: Jianzhao Liu, Xin Li, Yanding Peng, Tao Yu, Zhibo Chen
- Abstract summary: We design a full-reference image quality assessment metric SwinIQA to measure the perceptual quality of compressed images in a learned Swin distance space.
Experimental results show that the proposed metric achieves higher consistency with human's perceptual judgment compared with both traditional methods and learning-based methods on CLIC datasets.
- Score: 23.87848736166893
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image compression has raised widespread interest recently due to its
significant importance for multimedia storage and transmission. Meanwhile, a
reliable image quality assessment (IQA) for compressed images can not only help
to verify the performance of various compression algorithms but also help to
guide the compression optimization in turn. In this paper, we design a
full-reference image quality assessment metric SwinIQA to measure the
perceptual quality of compressed images in a learned Swin distance space. It is
known that the compression artifacts are usually non-uniformly distributed with
diverse distortion types and degrees. To warp the compressed images into the
shared representation space while maintaining the complex distortion
information, we extract the hierarchical feature representations from each
stage of the Swin Transformer. Besides, we utilize cross attention operation to
map the extracted feature representations into a learned Swin distance space.
Experimental results show that the proposed metric achieves higher consistency
with human's perceptual judgment compared with both traditional methods and
learning-based methods on CLIC datasets.
Related papers
- Transferable Learned Image Compression-Resistant Adversarial Perturbations [66.46470251521947]
Adversarial attacks can readily disrupt the image classification system, revealing the vulnerability of DNN-based recognition tasks.
We introduce a new pipeline that targets image classification models that utilize learned image compressors as pre-processing modules.
arXiv Detail & Related papers (2024-01-06T03:03:28Z) - Progressive Learning with Visual Prompt Tuning for Variable-Rate Image
Compression [60.689646881479064]
We propose a progressive learning paradigm for transformer-based variable-rate image compression.
Inspired by visual prompt tuning, we use LPM to extract prompts for input images and hidden features at the encoder side and decoder side, respectively.
Our model outperforms all current variable image methods in terms of rate-distortion performance and approaches the state-of-the-art fixed image compression methods trained from scratch.
arXiv Detail & Related papers (2023-11-23T08:29:32Z) - Machine Perception-Driven Image Compression: A Layered Generative
Approach [32.23554195427311]
layered generative image compression model is proposed to achieve high human vision-oriented image reconstructed quality.
Task-agnostic learning-based compression model is proposed, which effectively supports various compressed domain-based analytical tasks.
Joint optimization schedule is adopted to acquire best balance point among compression ratio, reconstructed image quality, and downstream perception performance.
arXiv Detail & Related papers (2023-04-14T02:12:38Z) - Crowd Counting on Heavily Compressed Images with Curriculum Pre-Training [90.76576712433595]
Applying lossy compression on images processed by deep neural networks can lead to significant accuracy degradation.
Inspired by the curriculum learning paradigm, we present a novel training approach called curriculum pre-training (CPT) for crowd counting on compressed images.
arXiv Detail & Related papers (2022-08-15T08:43:21Z) - Analysis of the Effect of Low-Overhead Lossy Image Compression on the
Performance of Visual Crowd Counting for Smart City Applications [78.55896581882595]
Lossy image compression techniques can reduce the quality of the images, leading to accuracy degradation.
In this paper, we analyze the effect of applying low-overhead lossy image compression methods on the accuracy of visual crowd counting.
arXiv Detail & Related papers (2022-07-20T19:20:03Z) - Perceptual Quality Assessment for Fine-Grained Compressed Images [38.615746092795625]
We propose a full-reference image quality assessment (FR-IQA) method for compressed images of fine-grained levels.
The proposed method is validated on the fine-grained compression image quality assessment (FGIQA) database.
arXiv Detail & Related papers (2022-06-08T12:56:45Z) - Estimating the Resize Parameter in End-to-end Learned Image Compression [50.20567320015102]
We describe a search-free resizing framework that can further improve the rate-distortion tradeoff of recent learned image compression models.
Our results show that our new resizing parameter estimation framework can provide Bjontegaard-Delta rate (BD-rate) improvement of about 10% against leading perceptual quality engines.
arXiv Detail & Related papers (2022-04-26T01:35:02Z) - Metric Learning for Anti-Compression Facial Forgery Detection [32.33501564446107]
We propose a novel anti-compression facial forgery detection framework.
It learns a compression-insensitive embedding feature space utilizing both original and compressed forgeries.
arXiv Detail & Related papers (2021-03-15T14:11:14Z) - Deep Perceptual Image Quality Assessment for Compression [12.868773445948507]
We propose the largest image compression quality dataset to date with human perceptual preferences.
We show that the proposed model can effectively learn from thousands of examples available in the new dataset.
arXiv Detail & Related papers (2021-03-01T16:31:10Z) - Discernible Image Compression [124.08063151879173]
This paper aims to produce compressed images by pursuing both appearance and perceptual consistency.
Based on the encoder-decoder framework, we propose using a pre-trained CNN to extract features of the original and compressed images.
Experiments on benchmarks demonstrate that images compressed by using the proposed method can also be well recognized by subsequent visual recognition and detection models.
arXiv Detail & Related papers (2020-02-17T07:35:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.