Poses of People in Art: A Data Set for Human Pose Estimation in Digital
Art History
- URL: http://arxiv.org/abs/2301.05124v1
- Date: Thu, 12 Jan 2023 16:23:58 GMT
- Title: Poses of People in Art: A Data Set for Human Pose Estimation in Digital
Art History
- Authors: Stefanie Schneider and Ricarda Vollmer
- Abstract summary: We introduce the first openly licensed data set for estimating human poses in art.
The Poses of People in Art data set consists of 2,454 images from 22 art-historical depiction styles.
A total of 10,749 human figures are precisely enclosed by rectangular bounding boxes, with a maximum of four per image labeled by up to 17 keypoints.
- Score: 0.6345523830122167
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Throughout the history of art, the pose, as the holistic abstraction of the
human body's expression, has proven to be a constant in numerous studies.
However, due to the enormous amount of data that so far had to be processed by
hand, its crucial role to the formulaic recapitulation of art-historical motifs
since antiquity could only be highlighted selectively. This is true even for
the now automated estimation of human poses, as domain-specific, sufficiently
large data sets required for training computational models are either not
publicly available or not indexed at a fine enough granularity. With the Poses
of People in Art data set, we introduce the first openly licensed data set for
estimating human poses in art and validating human pose estimators. It consists
of 2,454 images from 22 art-historical depiction styles, including those that
have increasingly turned away from lifelike representations of the body since
the 19th century. A total of 10,749 human figures are precisely enclosed by
rectangular bounding boxes, with a maximum of four per image labeled by up to
17 keypoints; among these are mainly joints such as elbows and knees. For
machine learning purposes, the data set is divided into three subsets,
training, validation, and testing, that follow the established JSON-based
Microsoft COCO format, respectively. Each image annotation, in addition to
mandatory fields, provides metadata from the art-historical online encyclopedia
WikiArt. With this paper, we elaborate on the acquisition and constitution of
the data set, address various application scenarios, and discuss prospects for
a digitally supported art history. We show that the data set enables the
investigation of body phenomena in art, whether at the level of individual
figures, which can be captured in their subtleties, or entire figure
constellations, whose position, distance, or proximity to one another is
considered.
Related papers
- Evaluating Multiview Object Consistency in Humans and Image Models [68.36073530804296]
We leverage an experimental design from the cognitive sciences which requires zero-shot visual inferences about object shape.
We collect 35K trials of behavioral data from over 500 participants.
We then evaluate the performance of common vision models.
arXiv Detail & Related papers (2024-09-09T17:59:13Z) - Data Augmentation in Human-Centric Vision [54.97327269866757]
This survey presents a comprehensive analysis of data augmentation techniques in human-centric vision tasks.
It delves into a wide range of research areas including person ReID, human parsing, human pose estimation, and pedestrian detection.
Our work categorizes data augmentation methods into two main types: data generation and data perturbation.
arXiv Detail & Related papers (2024-03-13T16:05:18Z) - SniffyArt: The Dataset of Smelling Persons [6.088288090431471]
This paper introduces the SniffyArt dataset, consisting of 1941 individuals represented in 441 historical artworks.
The dataset enables the development of hybrid classification approaches for smell gesture recognition.
arXiv Detail & Related papers (2023-11-20T16:21:37Z) - Learning to Evaluate the Artness of AI-generated Images [64.48229009396186]
ArtScore is a metric designed to evaluate the degree to which an image resembles authentic artworks by artists.
We employ pre-trained models for photo and artwork generation, resulting in a series of mixed models.
This dataset is then employed to train a neural network that learns to estimate quantized artness levels of arbitrary images.
arXiv Detail & Related papers (2023-05-08T17:58:27Z) - DEArt: Dataset of European Art [0.17188280334580194]
We propose DEArt, an object detection and pose classification dataset meant to be a reference for paintings between the XIIth and the XVIIIth centuries.
It contains more than 15000 images, about 80% non-iconic, aligned with manual annotations for the bounding boxes identifying all instances of 69 classes as well as 12 possible poses for boxes identifying human-like objects.
Our results show that object detectors for the cultural heritage domain can achieve a level of precision comparable to state-of-art models for generic images via transfer learning.
arXiv Detail & Related papers (2022-11-02T16:05:35Z) - Automatic Analysis of Human Body Representations in Western Art [0.0]
We propose a computer vision pipeline to analyse human pose and representations in paintings.
For normalisation, we map the detected poses and contours to Leonardo da Vinci's Vitruvian Man.
Our approach improves over purely skeleton based analyses of human body in paintings.
arXiv Detail & Related papers (2022-10-17T08:56:22Z) - Semi-supervised Human Pose Estimation in Art-historical Images [9.633949256082763]
We propose a novel approach to estimate human poses in art-language images.
Our approach achieves significantly better results than methods that use pre-trained models or style transfer.
arXiv Detail & Related papers (2022-07-06T21:20:58Z) - StyleGAN-Human: A Data-Centric Odyssey of Human Generation [96.7080874757475]
This work takes a data-centric perspective and investigates multiple critical aspects in "data engineering"
We collect and annotate a large-scale human image dataset with over 230K samples capturing diverse poses and textures.
We rigorously investigate three essential factors in data engineering for StyleGAN-based human generation, namely data size, data distribution, and data alignment.
arXiv Detail & Related papers (2022-04-25T17:55:08Z) - Deep Learning-Based Human Pose Estimation: A Survey [66.01917727294163]
Human pose estimation has drawn increasing attention during the past decade.
It has been utilized in a wide range of applications including human-computer interaction, motion analysis, augmented reality, and virtual reality.
Recent deep learning-based solutions have achieved high performance in human pose estimation.
arXiv Detail & Related papers (2020-12-24T18:49:06Z) - Demographic Influences on Contemporary Art with Unsupervised Style
Embeddings [25.107166631583212]
contempArt is a collection of paintings and drawings, a detailed graph network based on social connections on Instagram and additional socio-demographic information.
We evaluate three methods suited for generating unsupervised style embeddings of images and correlate them with the remaining data.
arXiv Detail & Related papers (2020-09-30T10:13:18Z) - Whole-Body Human Pose Estimation in the Wild [88.09875133989155]
COCO-WholeBody extends COCO dataset with whole-body annotations.
It is the first benchmark that has manual annotations on the entire human body.
A single-network model, named ZoomNet, is devised to take into account the hierarchical structure of the full human body.
arXiv Detail & Related papers (2020-07-23T08:35:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.