Yoga-82: A New Dataset for Fine-grained Classification of Human Poses
- URL: http://arxiv.org/abs/2004.10362v1
- Date: Wed, 22 Apr 2020 01:43:44 GMT
- Title: Yoga-82: A New Dataset for Fine-grained Classification of Human Poses
- Authors: Manisha Verma, Sudhakar Kumawat, Yuta Nakashima, Shanmuganathan Raman
- Abstract summary: We present a dataset, Yoga-82, for large-scale yoga pose recognition with 82 classes.
Yoga-82 consists of complex poses where fine annotations may not be possible.
The dataset contains a three-level hierarchy including body positions, variations in body positions, and the actual pose names.
- Score: 46.319423568714505
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Human pose estimation is a well-known problem in computer vision to locate
joint positions. Existing datasets for the learning of poses are observed to be
not challenging enough in terms of pose diversity, object occlusion, and
viewpoints. This makes the pose annotation process relatively simple and
restricts the application of the models that have been trained on them. To
handle more variety in human poses, we propose the concept of fine-grained
hierarchical pose classification, in which we formulate the pose estimation as
a classification task, and propose a dataset, Yoga-82, for large-scale yoga
pose recognition with 82 classes. Yoga-82 consists of complex poses where fine
annotations may not be possible. To resolve this, we provide hierarchical
labels for yoga poses based on the body configuration of the pose. The dataset
contains a three-level hierarchy including body positions, variations in body
positions, and the actual pose names. We present the classification accuracy of
the state-of-the-art convolutional neural network architectures on Yoga-82. We
also present several hierarchical variants of DenseNet in order to utilize the
hierarchical labels.
Related papers
- 3DYoga90: A Hierarchical Video Dataset for Yoga Pose Understanding [0.0]
3DYoga901 is organized within a three-level label hierarchy.
Our dataset includes meticulously curated RGB yoga pose videos and 3D skeleton sequences.
arXiv Detail & Related papers (2023-10-16T07:15:31Z) - Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark
Analysis [24.276782804825846]
Human body-pose estimation is a complex problem in computer vision.
Recent research interests have been widened specifically on the Sports, Yoga, and Dance postures.
CNNs have attained significantly improved performance in solving various human body-pose estimation problems.
arXiv Detail & Related papers (2023-08-01T07:00:13Z) - VINECS: Video-based Neural Character Skinning [82.39776643541383]
We propose a fully automated approach for creating a fully rigged character with pose-dependent skinning weights.
We show that our approach outperforms state-of-the-art while not relying on dense 4D scans.
arXiv Detail & Related papers (2023-07-03T08:35:53Z) - An Efficient Deep Convolutional Neural Network Model For Yoga Pose
Recognition Using Single Images [2.6717276381722033]
This paper presents YPose, an efficient deep convolutional neural network (CNN) model to recognize yoga asanas from RGB images.
The proposed model has been tested on the Yoga-82 dataset.
arXiv Detail & Related papers (2023-06-27T19:34:46Z) - PoseVocab: Learning Joint-structured Pose Embeddings for Human Avatar
Modeling [30.93155530590843]
We present PoseVocab, a novel pose encoding method that can encode high-fidelity human details.
Given multi-view RGB videos of a character, PoseVocab constructs key poses and latent embeddings based on the training poses.
Experiments show that our method outperforms other state-of-the-art baselines.
arXiv Detail & Related papers (2023-04-25T17:25:36Z) - PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose
Estimation [83.50127973254538]
Existing 3D human pose estimators suffer poor generalization performance to new datasets.
We present PoseAug, a new auto-augmentation framework that learns to augment the available training poses towards a greater diversity.
arXiv Detail & Related papers (2021-05-06T06:57:42Z) - FixMyPose: Pose Correctional Captioning and Retrieval [67.20888060019028]
We introduce a new captioning dataset named FixMyPose to address automated pose correction systems.
We collect descriptions of correcting a "current" pose to look like a "target" pose.
To avoid ML biases, we maintain a balance across characters with diverse demographics.
arXiv Detail & Related papers (2021-04-04T21:45:44Z) - Unsupervised 3D Human Pose Representation with Viewpoint and Pose
Disentanglement [63.853412753242615]
Learning a good 3D human pose representation is important for human pose related tasks.
We propose a novel Siamese denoising autoencoder to learn a 3D pose representation.
Our approach achieves state-of-the-art performance on two inherently different tasks.
arXiv Detail & Related papers (2020-07-14T14:25:22Z) - Transferring Dense Pose to Proximal Animal Classes [83.84439508978126]
We show that it is possible to transfer the knowledge existing in dense pose recognition for humans, as well as in more general object detectors and segmenters, to the problem of dense pose recognition in other classes.
We do this by establishing a DensePose model for the new animal which is also geometrically aligned to humans.
We also introduce two benchmark datasets labelled in the manner of DensePose for the class chimpanzee and use them to evaluate our approach.
arXiv Detail & Related papers (2020-02-28T21:43:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.