User-Guided Personalized Image Aesthetic Assessment based on Deep
Reinforcement Learning
- URL: http://arxiv.org/abs/2106.07488v1
- Date: Mon, 14 Jun 2021 15:19:48 GMT
- Title: User-Guided Personalized Image Aesthetic Assessment based on Deep
Reinforcement Learning
- Authors: Pei Lv, Jianqi Fan, Xixi Nie, Weiming Dong, Xiaoheng Jiang, Bing Zhou,
Mingliang Xu and Changsheng Xu
- Abstract summary: We propose a novel user-guided personalized image aesthetic assessment framework.
It leverages user interactions to retouch and rank images for aesthetic assessment based on deep reinforcement learning (DRL)
It generates personalized aesthetic distribution that is more in line with the aesthetic preferences of different users.
- Score: 64.07820203919283
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Personalized image aesthetic assessment (PIAA) has recently become a hot
topic due to its usefulness in a wide variety of applications such as
photography, film and television, e-commerce, fashion design and so on. This
task is more seriously affected by subjective factors and samples provided by
users. In order to acquire precise personalized aesthetic distribution by small
amount of samples, we propose a novel user-guided personalized image aesthetic
assessment framework. This framework leverages user interactions to retouch and
rank images for aesthetic assessment based on deep reinforcement learning
(DRL), and generates personalized aesthetic distribution that is more in line
with the aesthetic preferences of different users. It mainly consists of two
stages. In the first stage, personalized aesthetic ranking is generated by
interactive image enhancement and manual ranking, meanwhile two policy networks
will be trained. The images will be pushed to the user for manual retouching
and simultaneously to the enhancement policy network. The enhancement network
utilizes the manual retouching results as the optimization goals of DRL. After
that, the ranking process performs the similar operations like the retouching
mentioned before. These two networks will be trained iteratively and
alternatively to help to complete the final personalized aesthetic assessment
automatically. In the second stage, these modified images are labeled with
aesthetic attributes by one style-specific classifier, and then the
personalized aesthetic distribution is generated based on the multiple
aesthetic attributes of these images, which conforms to the aesthetic
preference of users better.
Related papers
- AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling [11.996211235559866]
Image Content Appeal Assessment (ICAA) is a novel metric that quantifies the level of positive interest an image's content generates for viewers.
ICAA is different from traditional Image-Aesthetics Assessment (IAA), which judges an image's artistic quality.
arXiv Detail & Related papers (2024-07-08T01:40:32Z) - Predicting Scores of Various Aesthetic Attribute Sets by Learning from
Overall Score Labels [54.63611854474985]
In this paper, we propose to replace image attribute labels with feature extractors.
We use networks from different tasks to provide attribute features to our F2S model.
Our method makes it feasible to learn meaningful attribute scores for various aesthetic attribute sets in different types of images with only overall aesthetic scores.
arXiv Detail & Related papers (2023-12-06T01:41:49Z) - Image Aesthetics Assessment via Learnable Queries [59.313054821874864]
We propose the Image Aesthetics Assessment via Learnable Queries (IAA-LQ) approach.
It adapts learnable queries to extract aesthetic features from pre-trained image features obtained from a frozen image encoder.
Experiments on real-world data demonstrate the advantages of IAA-LQ, beating the best state-of-the-art method by 2.2% and 2.1% in terms of SRCC and PLCC, respectively.
arXiv Detail & Related papers (2023-09-06T09:42:16Z) - VILA: Learning Image Aesthetics from User Comments with Vision-Language
Pretraining [53.470662123170555]
We propose learning image aesthetics from user comments, and exploring vision-language pretraining methods to learn multimodal aesthetic representations.
Specifically, we pretrain an image-text encoder-decoder model with image-comment pairs, using contrastive and generative objectives to learn rich and generic aesthetic semantics without human labels.
Our results show that our pretrained aesthetic vision-language model outperforms prior works on image aesthetic captioning over the AVA-Captions dataset.
arXiv Detail & Related papers (2023-03-24T23:57:28Z) - Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2 [65.5524793975387]
We construct a novel dataset, named DPC-CaptionsV2, by a semi-automatic way.
Images of DPC-CaptionsV2 contain comments up to 4 aesthetic attributes: composition, lighting, color, and subject.
Our method can predict the comments on 4 aesthetic attributes, which are closer to aesthetic topics than those produced by the previous AMAN model.
arXiv Detail & Related papers (2022-08-09T03:20:59Z) - Aesthetic Attribute Assessment of Images Numerically on Mixed
Multi-attribute Datasets [16.120684660965978]
We construct an image attribute dataset called aesthetic mixed dataset with attributes(AMD-A) and design external attribute features for fusion.
Our model can achieve aesthetic classification, overall scoring and attribute scoring.
Experimental results, using the MindSpore, show that our proposed method can effectively improve the performance of the aesthetic overall and attribute assessment.
arXiv Detail & Related papers (2022-07-05T04:42:10Z) - Composition and Style Attributes Guided Image Aesthetic Assessment [66.60253358722538]
We propose a method for the automatic prediction of the aesthetics of an image.
The proposed network includes: a pre-trained network for semantic features extraction (the Backbone); a Multi Layer Perceptron (MLP) network that relies on the Backbone features for the prediction of image attributes (the AttributeNet)
Given an image, the proposed multi-network is able to predict: style and composition attributes, and aesthetic score distribution.
arXiv Detail & Related papers (2021-11-08T17:16:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.