3DYoga90: A Hierarchical Video Dataset for Yoga Pose Understanding
- URL: http://arxiv.org/abs/2310.10131v1
- Date: Mon, 16 Oct 2023 07:15:31 GMT
- Title: 3DYoga90: A Hierarchical Video Dataset for Yoga Pose Understanding
- Authors: Seonok Kim
- Abstract summary: 3DYoga901 is organized within a three-level label hierarchy.
Our dataset includes meticulously curated RGB yoga pose videos and 3D skeleton sequences.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The increasing popularity of exercises including yoga and Pilates has created
a greater demand for professional exercise video datasets in the realm of
artificial intelligence. In this study, we developed 3DYoga901, which is
organized within a three-level label hierarchy. We have expanded the number of
poses from an existing state-of-the-art dataset, increasing it from 82 to 90
poses. Our dataset includes meticulously curated RGB yoga pose videos and 3D
skeleton sequences. This dataset was created by a dedicated team of six
individuals, including yoga instructors. It stands out as one of the most
comprehensive open datasets, featuring the largest collection of RGB videos and
3D skeleton sequences among publicly available resources. This contribution has
the potential to significantly advance the field of yoga action recognition and
pose assessment. Additionally, we conducted experiments to evaluate the
practicality of our proposed dataset. We employed three different model
variants for benchmarking purposes.
Related papers
- An Efficient Deep Convolutional Neural Network Model For Yoga Pose
Recognition Using Single Images [2.6717276381722033]
This paper presents YPose, an efficient deep convolutional neural network (CNN) model to recognize yoga asanas from RGB images.
The proposed model has been tested on the Yoga-82 dataset.
arXiv Detail & Related papers (2023-06-27T19:34:46Z) - Learning 3D Human Pose Estimation from Dozens of Datasets using a
Geometry-Aware Autoencoder to Bridge Between Skeleton Formats [80.12253291709673]
We propose a novel affine-combining autoencoder (ACAE) method to perform dimensionality reduction on the number of landmarks.
Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model.
arXiv Detail & Related papers (2022-12-29T22:22:49Z) - FLAG3D: A 3D Fitness Activity Dataset with Language Instruction [89.60371681477791]
We present FLAG3D, a large-scale 3D fitness activity dataset with language instruction containing 180K sequences of 60 categories.
We show that FLAG3D contributes great research value for various challenges, such as cross-domain human action recognition, dynamic human mesh recovery, and language-guided human action generation.
arXiv Detail & Related papers (2022-12-09T02:33:33Z) - A View Independent Classification Framework for Yoga Postures [2.922683311119656]
We employ transfer learning from Human Pose Estimation models for extracting 136 key-points spread all over the body to train a Random Forest classifier.
Results are evaluated on an in-house collected extensive yoga video database of 51 subjects recorded from 4 different camera angles.
arXiv Detail & Related papers (2022-06-27T18:40:34Z) - StyleGAN-Human: A Data-Centric Odyssey of Human Generation [96.7080874757475]
This work takes a data-centric perspective and investigates multiple critical aspects in "data engineering"
We collect and annotate a large-scale human image dataset with over 230K samples capturing diverse poses and textures.
We rigorously investigate three essential factors in data engineering for StyleGAN-based human generation, namely data size, data distribution, and data alignment.
arXiv Detail & Related papers (2022-04-25T17:55:08Z) - Real-time Recognition of Yoga Poses using computer Vision for Smart
Health Care [13.709260905038624]
The work also presents Yoga-hand mudra (hand gestures) identification.
The YOGI dataset has been developed which include 10 Yoga postures with around 400-900 images of each pose.
XGBoost with RandomSearch CV is most accurate and gives 99.2% accuracy.
arXiv Detail & Related papers (2022-01-19T13:41:58Z) - REGRAD: A Large-Scale Relational Grasp Dataset for Safe and
Object-Specific Robotic Grasping in Clutter [52.117388513480435]
We present a new dataset named regrad to sustain the modeling of relationships among objects and grasps.
Our dataset is collected in both forms of 2D images and 3D point clouds.
Users are free to import their own object models for the generation of as many data as they want.
arXiv Detail & Related papers (2021-04-29T05:31:21Z) - AcinoSet: A 3D Pose Estimation Dataset and Baseline Models for Cheetahs
in the Wild [51.35013619649463]
We present an extensive dataset of free-running cheetahs in the wild, called AcinoSet.
The dataset contains 119,490 frames of multi-view synchronized high-speed video footage, camera calibration files and 7,588 human-annotated frames.
The resulting 3D trajectories, human-checked 3D ground truth, and an interactive tool to inspect the data is also provided.
arXiv Detail & Related papers (2021-03-24T15:54:11Z) - Yoga-82: A New Dataset for Fine-grained Classification of Human Poses [46.319423568714505]
We present a dataset, Yoga-82, for large-scale yoga pose recognition with 82 classes.
Yoga-82 consists of complex poses where fine annotations may not be possible.
The dataset contains a three-level hierarchy including body positions, variations in body positions, and the actual pose names.
arXiv Detail & Related papers (2020-04-22T01:43:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.