Modeling, Quantifying, and Predicting Subjectivity of Image Aesthetics
- URL: http://arxiv.org/abs/2208.09666v1
- Date: Sat, 20 Aug 2022 12:16:45 GMT
- Title: Modeling, Quantifying, and Predicting Subjectivity of Image Aesthetics
- Authors: Hyeongnam Jang, Yeejin Lee and Jong-Seok Lee
- Abstract summary: We propose a novel unified probabilistic framework that can model and quantify subjective aesthetic preference based on the subjective logic.
In this framework, the rating distribution is modeled as a beta distribution, from which the probabilities of being definitely pleasing, being definitely unpleasing, and being uncertain can be obtained.
We present a method to learn deep neural networks for prediction of image aesthetics, which is shown to be effective in improving the performance of subjectivity prediction via experiments.
- Score: 21.46956783120668
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Assessing image aesthetics is a challenging computer vision task. One reason
is that aesthetic preference is highly subjective and may vary significantly
among people for certain images. Thus, it is important to properly model and
quantify such \textit{subjectivity}, but there has not been much effort to
resolve this issue. In this paper, we propose a novel unified probabilistic
framework that can model and quantify subjective aesthetic preference based on
the subjective logic. In this framework, the rating distribution is modeled as
a beta distribution, from which the probabilities of being definitely pleasing,
being definitely unpleasing, and being uncertain can be obtained. We use the
probability of being uncertain to define an intuitive metric of subjectivity.
Furthermore, we present a method to learn deep neural networks for prediction
of image aesthetics, which is shown to be effective in improving the
performance of subjectivity prediction via experiments. We also present an
application scenario where the framework is beneficial for aesthetics-based
image recommendation.
Related papers
- Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms [91.19304518033144]
We aim to align vision models with human aesthetic standards in a retrieval system.
We propose a preference-based reinforcement learning method that fine-tunes the vision models to better align the vision models with human aesthetics.
arXiv Detail & Related papers (2024-06-13T17:59:20Z) - Impressions: Understanding Visual Semiotics and Aesthetic Impact [66.40617566253404]
We present Impressions, a novel dataset through which to investigate the semiotics of images.
We show that existing multimodal image captioning and conditional generation models struggle to simulate plausible human responses to images.
This dataset significantly improves their ability to model impressions and aesthetic evaluations of images through fine-tuning and few-shot adaptation.
arXiv Detail & Related papers (2023-10-27T04:30:18Z) - Multi-task convolutional neural network for image aesthetic assessment [0.0]
We present a multi-task convolutional neural network that takes into account aesthetic attributes.
The proposed neural network jointly learns the attributes along with the overall aesthetic scores of images.
We achieve near-human performance in terms of overall aesthetic scores when considering the Spearman's rank correlations.
arXiv Detail & Related papers (2023-05-16T11:56:02Z) - Aesthetic Visual Question Answering of Photographs [15.83390933825182]
We propose a new task of aesthetic language assessment: aesthetic visual question and answering (AVQA) of images.
The objective QA pairs are generated by the proposed aesthetic attributes analysis algorithms.
We build the first aesthetic visual question answering dataset, AesVQA, that contains 72,168 high-quality images and 324,756 pairs of aesthetic questions.
arXiv Detail & Related papers (2022-08-10T07:27:57Z) - Composition and Style Attributes Guided Image Aesthetic Assessment [66.60253358722538]
We propose a method for the automatic prediction of the aesthetics of an image.
The proposed network includes: a pre-trained network for semantic features extraction (the Backbone); a Multi Layer Perceptron (MLP) network that relies on the Backbone features for the prediction of image attributes (the AttributeNet)
Given an image, the proposed multi-network is able to predict: style and composition attributes, and aesthetic score distribution.
arXiv Detail & Related papers (2021-11-08T17:16:38Z) - A Hypothesis for the Aesthetic Appreciation in Neural Networks [17.58003267114874]
This paper proposes a hypothesis for the aesthetic appreciation that aesthetic images make a neural network strengthen salient concepts and discard inessential concepts.
In experiments, we find that the revised images are more aesthetic than the original ones to some extent.
arXiv Detail & Related papers (2021-07-31T06:19:00Z) - What Image Features Boost Housing Market Predictions? [81.32205133298254]
We propose a set of techniques for the extraction of visual features for efficient numerical inclusion in predictive algorithms.
We discuss techniques such as Shannon's entropy, calculating the center of gravity, employing image segmentation, and using Convolutional Neural Networks.
The set of 40 image features selected here carries a significant amount of predictive power and outperforms some of the strongest metadata predictors.
arXiv Detail & Related papers (2021-07-15T06:32:10Z) - User-Guided Personalized Image Aesthetic Assessment based on Deep
Reinforcement Learning [64.07820203919283]
We propose a novel user-guided personalized image aesthetic assessment framework.
It leverages user interactions to retouch and rank images for aesthetic assessment based on deep reinforcement learning (DRL)
It generates personalized aesthetic distribution that is more in line with the aesthetic preferences of different users.
arXiv Detail & Related papers (2021-06-14T15:19:48Z) - A Deep Drift-Diffusion Model for Image Aesthetic Score Distribution
Prediction [68.76594695163386]
We propose a Deep Drift-Diffusion model inspired by psychologists to predict aesthetic score distribution from images.
The DDD model can describe the psychological process of aesthetic perception instead of traditional modeling of the results of assessment.
Our novel DDD model is simple but efficient, which outperforms the state-of-the-art methods in aesthetic score distribution prediction.
arXiv Detail & Related papers (2020-10-15T11:01:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.