Identifying Professional Photographers Through Image Quality and
Aesthetics in Flickr
- URL: http://arxiv.org/abs/2307.01756v1
- Date: Tue, 4 Jul 2023 14:55:37 GMT
- Title: Identifying Professional Photographers Through Image Quality and
Aesthetics in Flickr
- Authors: Sofia Strukova, Rub\'en Gaspar Marco, Jos\'e A. Ruip\'erez-Valiente,
F\'elix G\'omez M\'armol
- Abstract summary: This study reveals the lack of suitable data sets in photo and video sharing platforms.
We created one of the largest labelled data sets in Flickr with the multimodal data which has been open sourced.
We examined the relationship between the aesthetics and technical quality of a picture and the social activity of that picture.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In our generation, there is an undoubted rise in the use of social media and
specifically photo and video sharing platforms. These sites have proved their
ability to yield rich data sets through the users' interaction which can be
used to perform a data-driven evaluation of capabilities. Nevertheless, this
study reveals the lack of suitable data sets in photo and video sharing
platforms and evaluation processes across them. In this way, our first
contribution is the creation of one of the largest labelled data sets in Flickr
with the multimodal data which has been open sourced as part of this
contribution. Predicated on these data, we explored machine learning models and
concluded that it is feasible to properly predict whether a user is a
professional photographer or not based on self-reported occupation labels and
several feature representations out of the user, photo and crowdsourced sets.
We also examined the relationship between the aesthetics and technical quality
of a picture and the social activity of that picture. Finally, we depicted
which characteristics differentiate professional photographers from
non-professionals. As far as we know, the results presented in this work
represent an important novelty for the users' expertise identification which
researchers from various domains can use for different applications.
Related papers
- Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications [3.941600320957518]
This project seeks to understand how the application design affects a user's willingness to contribute and the quantity and quality of the data they capture.
We designed three versions of a camera-based mobile crowdsourcing application, which varied in the amount of labeling effort requested of the user.
The results suggest that higher levels of user labeling do not lead to reduced contribution.
arXiv Detail & Related papers (2024-09-04T18:10:35Z) - AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling [11.996211235559866]
Image Content Appeal Assessment (ICAA) is a novel metric that quantifies the level of positive interest an image's content generates for viewers.
ICAA is different from traditional Image-Aesthetics Assessment (IAA), which judges an image's artistic quality.
arXiv Detail & Related papers (2024-07-08T01:40:32Z) - Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model [80.61157097223058]
A prevalent strategy to bolster image classification performance is through augmenting the training set with synthetic images generated by T2I models.
In this study, we scrutinize the shortcomings of both current generative and conventional data augmentation techniques.
We introduce an innovative inter-class data augmentation method known as Diff-Mix, which enriches the dataset by performing image translations between classes.
arXiv Detail & Related papers (2024-03-28T17:23:45Z) - Blind Dates: Examining the Expression of Temporality in Historical
Photographs [57.07335632641355]
We investigate the dating of images using OpenCLIP, an open-source implementation of CLIP, a multi-modal language and vision model.
We use the textitDe Boer Scene Detection dataset, containing 39,866 gray-scale historical press photographs from 1950 to 1999.
Our analysis reveals that images featuring buses, cars, cats, dogs, and people are more accurately dated, suggesting the presence of temporal markers.
arXiv Detail & Related papers (2023-10-10T13:51:24Z) - Empowering Visually Impaired Individuals: A Novel Use of Apple Live
Photos and Android Motion Photos [3.66237529322911]
We advocate for the use of Apple Live Photos and Android Motion Photos technologies.
Our findings reveal that both Live Photos and Motion Photos outperform single-frame images in common visual assisting tasks.
arXiv Detail & Related papers (2023-09-14T20:46:35Z) - ConfounderGAN: Protecting Image Data Privacy with Causal Confounder [85.6757153033139]
We propose ConfounderGAN, a generative adversarial network (GAN) that can make personal image data unlearnable to protect the data privacy of its owners.
Experiments are conducted in six image classification datasets, consisting of three natural object datasets and three medical datasets.
arXiv Detail & Related papers (2022-12-04T08:49:14Z) - Structuring User-Generated Content on Social Media with Multimodal
Aspect-Based Sentiment Analysis [2.023920009396818]
This paper shows to what extent machine learning can analyze and structure these databases.
An automated data analysis pipeline is deployed to provide insights into user-generated content for researchers in other domains.
arXiv Detail & Related papers (2022-10-27T12:38:10Z) - There is a Time and Place for Reasoning Beyond the Image [63.96498435923328]
Images often more significant than only the pixels to human eyes, as we can infer, associate, and reason with contextual information from other sources to establish a more complete picture.
We introduce TARA: a dataset with 16k images with their associated news, time and location automatically extracted from New York Times (NYT), and an additional 61k examples as distant supervision from WIT.
We show that there exists a 70% gap between a state-of-the-art joint model and human performance, which is slightly filled by our proposed model that uses segment-wise reasoning, motivating higher-level vision-language joint models that
arXiv Detail & Related papers (2022-03-01T21:52:08Z) - ImageSubject: A Large-scale Dataset for Subject Detection [9.430492045581534]
Main subjects usually exist in the images or videos, as they are the objects that the photographer wants to highlight.
Detecting the main subjects is an important technique to help machines understand the content of images and videos.
We present a new dataset with the goal of training models to understand the layout of the objects then to find the main subjects among them.
arXiv Detail & Related papers (2022-01-09T22:49:59Z) - Personalized Image Semantic Segmentation [58.980245748434]
We generate more accurate segmentation results on unlabeled personalized images by investigating the data's personalized traits.
We propose a baseline method that incorporates the inter-image context when segmenting certain images.
The code and the PIS dataset will be made publicly available.
arXiv Detail & Related papers (2021-07-24T04:03:11Z) - From ImageNet to Image Classification: Contextualizing Progress on
Benchmarks [99.19183528305598]
We study how specific design choices in the ImageNet creation process impact the fidelity of the resulting dataset.
Our analysis pinpoints how a noisy data collection pipeline can lead to a systematic misalignment between the resulting benchmark and the real-world task it serves as a proxy for.
arXiv Detail & Related papers (2020-05-22T17:39:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.