DeepDC: Deep Distance Correlation as a Perceptual Image Quality
Evaluator
- URL: http://arxiv.org/abs/2211.04927v2
- Date: Fri, 24 Nov 2023 12:59:12 GMT
- Title: DeepDC: Deep Distance Correlation as a Perceptual Image Quality
Evaluator
- Authors: Hanwei Zhu, Baoliang Chen, Lingyu Zhu, Shiqi Wang, and Weisi Lin
- Abstract summary: ImageNet pre-trained deep neural networks (DNNs) show notable transferability for building effective image quality assessment (IQA) models.
We develop a novel full-reference IQA (FR-IQA) model based exclusively on pre-trained DNN features.
We conduct comprehensive experiments to demonstrate the superiority of the proposed quality model on five standard IQA datasets.
- Score: 53.57431705309919
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: ImageNet pre-trained deep neural networks (DNNs) show notable transferability
for building effective image quality assessment (IQA) models. Such a remarkable
byproduct has often been identified as an emergent property in previous
studies. In this work, we attribute such capability to the intrinsic
texture-sensitive characteristic that classifies images using texture features.
We fully exploit this characteristic to develop a novel full-reference IQA
(FR-IQA) model based exclusively on pre-trained DNN features. Specifically, we
compute the distance correlation, a highly promising yet relatively
under-investigated statistic, between reference and distorted images in the
deep feature domain. In addition, the distance correlation quantifies both
linear and nonlinear feature relationships, which is far beyond the widely used
first-order and second-order statistics in the feature space. We conduct
comprehensive experiments to demonstrate the superiority of the proposed
quality model on five standard IQA datasets, one perceptual similarity dataset,
two texture similarity datasets, and one geometric transformation dataset.
Moreover, we optimize the proposed model to generate a broad spectrum of
texture patterns, by treating the model as the style loss function for neural
style transfer (NST). Extensive experiments demonstrate that the proposed
texture synthesis and NST methods achieve the best quantitative and qualitative
results. We release our code at https://github.com/h4nwei/DeepDC.
Related papers
- Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - CONVIQT: Contrastive Video Quality Estimator [63.749184706461826]
Perceptual video quality assessment (VQA) is an integral component of many streaming and video sharing platforms.
Here we consider the problem of learning perceptually relevant video quality representations in a self-supervised manner.
Our results indicate that compelling representations with perceptual bearing can be obtained using self-supervised learning.
arXiv Detail & Related papers (2022-06-29T15:22:01Z) - Attentions Help CNNs See Better: Attention-based Hybrid Image Quality
Assessment Network [20.835800149919145]
Image quality assessment (IQA) algorithm aims to quantify the human perception of image quality.
There is a performance drop when assessing distortion images generated by generative adversarial network (GAN) with seemingly realistic texture.
We propose an Attention-based Hybrid Image Quality Assessment Network (AHIQ) to deal with the challenge and get better performance on the GAN-based IQA task.
arXiv Detail & Related papers (2022-04-22T03:59:18Z) - Dynamically-Scaled Deep Canonical Correlation Analysis [77.34726150561087]
Canonical Correlation Analysis (CCA) is a method for feature extraction of two views by finding maximally correlated linear projections of them.
We introduce a novel dynamic scaling method for training an input-dependent canonical correlation model.
arXiv Detail & Related papers (2022-03-23T12:52:49Z) - Generalized Rectifier Wavelet Covariance Models For Texture Synthesis [2.585403833659771]
State-of-the-art maximum entropy models for texture synthesis are built from statistics relying on image representations defined by convolutional neural networks (CNN)
We propose a family of statistics built upon non-linear wavelet based representations, that can be viewed as a particular instance of a one-layer CNN, using a generalized non-linearity.
These statistics significantly improve the visual quality of previous classical wavelet-based models, and allow one to produce syntheses of similar quality to state-of-the-art models, on both grayscale and color textures.
arXiv Detail & Related papers (2022-03-14T17:07:40Z) - Probabilistic Tracking with Deep Factors [8.030212474745879]
We show how to use a deep feature encoding in conjunction with generative densities over the features in a factor-graph based, probabilistic tracking framework.
We present a likelihood model that combines a learned feature encoder with generative densities over them, both trained in a supervised manner.
arXiv Detail & Related papers (2021-12-02T21:31:51Z) - Learning Transformer Features for Image Quality Assessment [53.51379676690971]
We propose a unified IQA framework that utilizes CNN backbone and transformer encoder to extract features.
The proposed framework is compatible with both FR and NR modes and allows for a joint training scheme.
arXiv Detail & Related papers (2021-12-01T13:23:00Z) - Image Quality Assessment using Contrastive Learning [50.265638572116984]
We train a deep Convolutional Neural Network (CNN) using a contrastive pairwise objective to solve the auxiliary problem.
We show through extensive experiments that CONTRIQUE achieves competitive performance when compared to state-of-the-art NR image quality models.
Our results suggest that powerful quality representations with perceptual relevance can be obtained without requiring large labeled subjective image quality datasets.
arXiv Detail & Related papers (2021-10-25T21:01:00Z) - Consumer Image Quality Prediction using Recurrent Neural Networks for
Spatial Pooling [13.750624267664156]
We propose an image quality model that attempts to mimic the attention mechanism of human visual system (HVS) by using a recurrent neural network (RNN)
The experimental study, conducted by using images with different resolutions from two recently published image quality datasets, indicates that the quality prediction accuracy of the proposed method is competitive against benchmark models representing the state-of-the-art, and the proposed method also performs consistently on different resolution versions of the same dataset.
arXiv Detail & Related papers (2021-06-02T03:31:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.