Understanding Aesthetics with Language: A Photo Critique Dataset for
Aesthetic Assessment
- URL: http://arxiv.org/abs/2206.08614v1
- Date: Fri, 17 Jun 2022 08:16:20 GMT
- Title: Understanding Aesthetics with Language: A Photo Critique Dataset for
Aesthetic Assessment
- Authors: Daniel Vera Nieto and Luigi Celona and Clara Fernandez-Labrador
- Abstract summary: We propose the Critique Photo Reddit dataset (RPCD), which contains 74K images and 220K comments.
We exploit the polarity of the sentiment of criticism as an indicator of aesthetic judgment.
- Score: 6.201485014848172
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Computational inference of aesthetics is an ill-defined task due to its
subjective nature. Many datasets have been proposed to tackle the problem by
providing pairs of images and aesthetic scores based on human ratings. However,
humans are better at expressing their opinion, taste, and emotions by means of
language rather than summarizing them in a single number. In fact, photo
critiques provide much richer information as they reveal how and why users rate
the aesthetics of visual stimuli. In this regard, we propose the Reddit Photo
Critique Dataset (RPCD), which contains tuples of image and photo critiques.
RPCD consists of 74K images and 220K comments and is collected from a Reddit
community used by hobbyists and professional photographers to improve their
photography skills by leveraging constructive community feedback. The proposed
dataset differs from previous aesthetics datasets mainly in three aspects,
namely (i) the large scale of the dataset and the extension of the comments
criticizing different aspects of the image, (ii) it contains mostly UltraHD
images, and (iii) it can easily be extended to new data as it is collected
through an automatic pipeline. To the best of our knowledge, in this work, we
propose the first attempt to estimate the aesthetic quality of visual stimuli
from the critiques. To this end, we exploit the polarity of the sentiment of
criticism as an indicator of aesthetic judgment. We demonstrate how sentiment
polarity correlates positively with the aesthetic judgment available for two
aesthetic assessment benchmarks. Finally, we experiment with several models by
using the sentiment scores as a target for ranking images. Dataset and
baselines are available (https://github.com/mediatechnologycenter/aestheval).
Related papers
- AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling [11.996211235559866]
Image Content Appeal Assessment (ICAA) is a novel metric that quantifies the level of positive interest an image's content generates for viewers.
ICAA is different from traditional Image-Aesthetics Assessment (IAA), which judges an image's artistic quality.
arXiv Detail & Related papers (2024-07-08T01:40:32Z) - Impressions: Understanding Visual Semiotics and Aesthetic Impact [66.40617566253404]
We present Impressions, a novel dataset through which to investigate the semiotics of images.
We show that existing multimodal image captioning and conditional generation models struggle to simulate plausible human responses to images.
This dataset significantly improves their ability to model impressions and aesthetic evaluations of images through fine-tuning and few-shot adaptation.
arXiv Detail & Related papers (2023-10-27T04:30:18Z) - Image Aesthetics Assessment via Learnable Queries [59.313054821874864]
We propose the Image Aesthetics Assessment via Learnable Queries (IAA-LQ) approach.
It adapts learnable queries to extract aesthetic features from pre-trained image features obtained from a frozen image encoder.
Experiments on real-world data demonstrate the advantages of IAA-LQ, beating the best state-of-the-art method by 2.2% and 2.1% in terms of SRCC and PLCC, respectively.
arXiv Detail & Related papers (2023-09-06T09:42:16Z) - Towards Artistic Image Aesthetics Assessment: a Large-scale Dataset and
a New Method [64.40494830113286]
We first introduce a large-scale AIAA dataset: Boldbrush Artistic Image dataset (BAID), which consists of 60,337 artistic images covering various art forms.
We then propose a new method, SAAN, which can effectively extract and utilize style-specific and generic aesthetic information to evaluate artistic images.
Experiments demonstrate that our proposed approach outperforms existing IAA methods on the proposed BAID dataset.
arXiv Detail & Related papers (2023-03-27T12:59:15Z) - VILA: Learning Image Aesthetics from User Comments with Vision-Language
Pretraining [53.470662123170555]
We propose learning image aesthetics from user comments, and exploring vision-language pretraining methods to learn multimodal aesthetic representations.
Specifically, we pretrain an image-text encoder-decoder model with image-comment pairs, using contrastive and generative objectives to learn rich and generic aesthetic semantics without human labels.
Our results show that our pretrained aesthetic vision-language model outperforms prior works on image aesthetic captioning over the AVA-Captions dataset.
arXiv Detail & Related papers (2023-03-24T23:57:28Z) - Aesthetic Visual Question Answering of Photographs [15.83390933825182]
We propose a new task of aesthetic language assessment: aesthetic visual question and answering (AVQA) of images.
The objective QA pairs are generated by the proposed aesthetic attributes analysis algorithms.
We build the first aesthetic visual question answering dataset, AesVQA, that contains 72,168 high-quality images and 324,756 pairs of aesthetic questions.
arXiv Detail & Related papers (2022-08-10T07:27:57Z) - Personalized Image Aesthetics Assessment with Rich Attributes [35.61053167813472]
We conduct the most comprehensive subjective study of personalized image aesthetics and introduce a new personalized image Aesthetics database with Rich Attributes (PARA)
PARA features wealthy annotations, including 9 image-oriented objective attributes and 4 human-oriented subjective attributes.
We also propose a conditional PIAA model by utilizing subject information as conditional prior.
arXiv Detail & Related papers (2022-03-31T02:23:46Z) - Affective Image Content Analysis: Two Decades Review and New
Perspectives [132.889649256384]
We will comprehensively review the development of affective image content analysis (AICA) in the recent two decades.
We will focus on the state-of-the-art methods with respect to three main challenges -- the affective gap, perception subjectivity, and label noise and absence.
We discuss some challenges and promising research directions in the future, such as image content and context understanding, group emotion clustering, and viewer-image interaction.
arXiv Detail & Related papers (2021-06-30T15:20:56Z) - User-Guided Personalized Image Aesthetic Assessment based on Deep
Reinforcement Learning [64.07820203919283]
We propose a novel user-guided personalized image aesthetic assessment framework.
It leverages user interactions to retouch and rank images for aesthetic assessment based on deep reinforcement learning (DRL)
It generates personalized aesthetic distribution that is more in line with the aesthetic preferences of different users.
arXiv Detail & Related papers (2021-06-14T15:19:48Z) - A Survey of Hand Crafted and Deep Learning Methods for Image Aesthetic
Assessment [2.9005223064604078]
This paper presents a literature review of the recent techniques of automatic image aesthetics assessment.
A large number of traditional hand crafted and deep learning based approaches are reviewed.
arXiv Detail & Related papers (2021-03-22T07:00:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.