Strategy for Boosting Pair Comparison and Improving Quality Assessment
Accuracy
- URL: http://arxiv.org/abs/2010.00370v1
- Date: Thu, 1 Oct 2020 13:05:09 GMT
- Title: Strategy for Boosting Pair Comparison and Improving Quality Assessment
Accuracy
- Authors: Suiyi Ling, Jing Li, Anne Flore Perrin, Zhi Li, Luk\'a\v{s} Krasula,
Patrick Le Callet
- Abstract summary: Pair Comparison (PC) is of significant advantage over Absolute Category Rating (ACR) in terms of discriminability.
In this study, we employ a generic model to bridge the pair comparison data and ACR data, where the variance term could be recovered and the obtained information is more complete.
In such a way, the proposed methodology could achieve the same accuracy of pair comparison but with the compelxity as low as ACR.
- Score: 29.849156371902943
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The development of rigorous quality assessment model relies on the collection
of reliable subjective data, where the perceived quality of visual multimedia
is rated by the human observers. Different subjective assessment protocols can
be used according to the objectives, which determine the discriminability and
accuracy of the subjective data.
Single stimulus methodology, e.g., the Absolute Category Rating (ACR) has
been widely adopted due to its simplicity and efficiency. However, Pair
Comparison (PC) is of significant advantage over ACR in terms of
discriminability. In addition, PC avoids the influence of observers' bias
regarding their understanding of the quality scale. Nevertheless, full pair
comparison is much more time-consuming. In this study, we therefore 1) employ a
generic model to bridge the pair comparison data and ACR data, where the
variance term could be recovered and the obtained information is more complete;
2) propose a fusion strategy to boost pair comparisons by utilizing the ACR
results as initialization information; 3) develop a novel active batch sampling
strategy based on Minimum Spanning Tree (MST) for PC. In such a way, the
proposed methodology could achieve the same accuracy of pair comparison but
with the compelxity as low as ACR. Extensive experimental results demonstrate
the efficiency and accuracy of the proposed approach, which outperforms the
state of the art approaches.
Related papers
- Towards Calibrated Robust Fine-Tuning of Vision-Language Models [97.19901765814431]
This work proposes a robust fine-tuning method that improves both OOD accuracy and confidence calibration simultaneously in vision language models.
We show that both OOD classification and OOD calibration errors have a shared upper bound consisting of two terms of ID data.
Based on this insight, we design a novel framework that conducts fine-tuning with a constrained multimodal contrastive loss enforcing a larger smallest singular value.
arXiv Detail & Related papers (2023-11-03T05:41:25Z) - GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models [60.48306899271866]
We present a new framework, called GREAT Score, for global robustness evaluation of adversarial perturbation using generative models.
We show high correlation and significantly reduced cost of GREAT Score when compared to the attack-based model ranking on RobustBench.
GREAT Score can be used for remote auditing of privacy-sensitive black-box models.
arXiv Detail & Related papers (2023-04-19T14:58:27Z) - Revisiting the Evaluation of Image Synthesis with GANs [55.72247435112475]
This study presents an empirical investigation into the evaluation of synthesis performance, with generative adversarial networks (GANs) as a representative of generative models.
In particular, we make in-depth analyses of various factors, including how to represent a data point in the representation space, how to calculate a fair distance using selected samples, and how many instances to use from each set.
arXiv Detail & Related papers (2023-04-04T17:54:32Z) - Powering Comparative Classification with Sentiment Analysis via Domain
Adaptive Knowledge Transfer [13.98690716279821]
We study Comparative Preference Classification (CPC) which aims at predicting whether a preference exists between two entities in a given sentence.
High-quality CPC models can significantly benefit applications such as comparative question answering and review-based recommendations.
We proposed sentiment Analysis Enhanced COmparative Network (SAECON) which improves CPC accuracy with a sentiment analyzer.
arXiv Detail & Related papers (2021-09-07T19:17:12Z) - Understanding the Effects of Adversarial Personalized Ranking
Optimization Method on Recommendation Quality [6.197934754799158]
We model the learning characteristics of the Bayesian Personalized Ranking (BPR) and APR optimization frameworks.
We show that APR amplifies the popularity bias more than BPR due to an unbalanced number of received positive updates from short-head items.
arXiv Detail & Related papers (2021-07-29T10:22:20Z) - Scalable Personalised Item Ranking through Parametric Density Estimation [53.44830012414444]
Learning from implicit feedback is challenging because of the difficult nature of the one-class problem.
Most conventional methods use a pairwise ranking approach and negative samplers to cope with the one-class problem.
We propose a learning-to-rank approach, which achieves convergence speed comparable to the pointwise counterpart.
arXiv Detail & Related papers (2021-05-11T03:38:16Z) - Performance Evaluation of Adversarial Attacks: Discrepancies and
Solutions [51.8695223602729]
adversarial attack methods have been developed to challenge the robustness of machine learning models.
We propose a Piece-wise Sampling Curving (PSC) toolkit to effectively address the discrepancy.
PSC toolkit offers options for balancing the computational cost and evaluation effectiveness.
arXiv Detail & Related papers (2021-04-22T14:36:51Z) - Self-supervised Representation Learning with Relative Predictive Coding [102.93854542031396]
Relative Predictive Coding (RPC) is a new contrastive representation learning objective.
RPC maintains a good balance among training stability, minibatch size sensitivity, and downstream task performance.
We empirically verify the effectiveness of RPC on benchmark vision and speech self-supervised learning tasks.
arXiv Detail & Related papers (2021-03-21T01:04:24Z) - Active Sampling for Pairwise Comparisons via Approximate Message Passing
and Information Gain Maximization [5.771869590520189]
We propose ASAP, an active sampling algorithm based on approximate message passing and expected information gain.
We show that ASAP offers the highest accuracy of inferred scores compared to the existing methods.
arXiv Detail & Related papers (2020-04-12T20:48:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.