A View Independent Classification Framework for Yoga Postures
- URL: http://arxiv.org/abs/2206.13577v1
- Date: Mon, 27 Jun 2022 18:40:34 GMT
- Title: A View Independent Classification Framework for Yoga Postures
- Authors: Mustafa Chasmai, Nirjhar Das, Aman Bhardwaj, Rahul Garg
- Abstract summary: We employ transfer learning from Human Pose Estimation models for extracting 136 key-points spread all over the body to train a Random Forest classifier.
Results are evaluated on an in-house collected extensive yoga video database of 51 subjects recorded from 4 different camera angles.
- Score: 2.922683311119656
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Yoga is a globally acclaimed and widely recommended practice for healthy
living. Maintaining correct posture while performing a Yogasana is of utmost
importance. In this work, we employ transfer learning from Human Pose
Estimation models for extracting 136 key-points spread all over the body to
train a Random Forest classifier which is used for estimation of the Yogasanas.
The results are evaluated on an in-house collected extensive yoga video
database of 51 subjects recorded from 4 different camera angles. We propose a
3-step scheme for evaluating the generalizability of a Yoga classifier by testing
it on 1) unseen frames, 2) unseen subjects, and 3) unseen camera angles. We
argue that for most of the applications, validation accuracies on unseen
subjects and unseen camera angles would be most important. We empirically
analyze, over three public datasets, the advantage of transfer learning and the
possibilities of target leakage. We further demonstrate that the classification
accuracies critically depend on the cross validation method employed and can
often be misleading. To promote further research, we have made the key-points
dataset and code publicly available.
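The 3-step evaluation scheme in the abstract (unseen frames, unseen subjects, unseen camera angles) can be illustrated with a minimal sketch. This is not the authors' released code: the `Frame` record, the helper names, and the toy sizes (4 subjects, 4 cameras, 5 frames each) are all assumptions made for illustration. The point is only that a random frame-level split can place the same subject and camera on both sides of the split, whereas group-wise splits hold out a subject or camera entirely.

```python
import random
from collections import namedtuple

# Hypothetical per-frame metadata: subject id, camera id, and pose label.
Frame = namedtuple("Frame", ["subject", "camera", "label"])

def split_unseen_frames(frames, test_fraction=0.25, seed=0):
    """Step 1: random frame-level split; subjects and cameras may appear on both sides."""
    idx = list(range(len(frames)))
    random.Random(seed).shuffle(idx)
    n_test = int(len(idx) * test_fraction)
    return idx[n_test:], idx[:n_test]

def split_by_group(frames, key, held_out):
    """Steps 2 and 3: hold out every frame whose `key` ('subject' or 'camera') is in held_out."""
    train = [i for i, f in enumerate(frames) if getattr(f, key) not in held_out]
    test = [i for i, f in enumerate(frames) if getattr(f, key) in held_out]
    return train, test

def overlap(frames, train, test, key):
    """Return the set of `key` values that occur in both train and test."""
    in_train = {getattr(frames[i], key) for i in train}
    in_test = {getattr(frames[i], key) for i in test}
    return in_train & in_test

# Toy data (illustrative sizes only, not the paper's dataset).
frames = [Frame(s, c, (s + k) % 3)
          for s in range(4) for c in range(4) for k in range(5)]

frame_train, frame_test = split_unseen_frames(frames)                    # unseen frames
subj_train, subj_test = split_by_group(frames, "subject", held_out={3})  # unseen subjects
cam_train, cam_test = split_by_group(frames, "camera", held_out={0})     # unseen camera angles

print("subjects shared, frame split:  ", overlap(frames, frame_train, frame_test, "subject"))
print("subjects shared, subject split:", overlap(frames, subj_train, subj_test, "subject"))
print("cameras shared, camera split:  ", overlap(frames, cam_train, cam_test, "camera"))
```

In a real pipeline, the index lists would select rows of the key-point feature matrix before fitting a classifier. An accuracy measured on the frame-level split alone can look optimistic, because near-identical frames of the same subject may sit in both train and test.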
Related papers
- Which Viewpoint Shows it Best? Language for Weakly Supervising View Selection in Multi-view Videos [66.1935609072708]
The key hypothesis is that the more accurately an individual view can predict a view-agnostic text summary, the more informative it is.
We propose a framework that uses the relative accuracy of view-dependent caption predictions as a proxy for best view pseudo-labels.
During inference, our model takes as input only a multi-view video -- no language or camera poses -- and returns the best viewpoint to watch at each timestep.
arXiv Detail & Related papers (2024-11-13T16:31:08Z)
- Yoga Pose Classification Using Transfer Learning [0.0]
Yoga-82, a benchmark dataset for large-scale yoga pose recognition with 82 classes, has challenging positions that could make precise annotations impossible.
We used VGG-16, ResNet-50, ResNet-101, and DenseNet-121 and fine-tuned them in different ways to get better results.
The experimental results show that DenseNet-121 performs best, with a top-1 accuracy of 85% and a top-5 accuracy of 96%.
arXiv Detail & Related papers (2024-10-29T14:34:18Z)
- 3DYoga90: A Hierarchical Video Dataset for Yoga Pose Understanding [0.0]
3DYoga90 is organized within a three-level label hierarchy.
Our dataset includes meticulously curated RGB yoga pose videos and 3D skeleton sequences.
arXiv Detail & Related papers (2023-10-16T07:15:31Z)
- Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark Analysis [24.276782804825846]
Human body-pose estimation is a complex problem in computer vision.
Recent research interest has widened specifically to sports, yoga, and dance postures.
CNNs have attained significantly improved performance in solving various human body-pose estimation problems.
arXiv Detail & Related papers (2023-08-01T07:00:13Z)
- Self-Supervised Learning for Videos: A Survey [70.37277191524755]
Self-supervised learning has shown promise in both image and video domains.
In this survey, we review existing approaches to self-supervised learning, focusing on the video domain.
arXiv Detail & Related papers (2022-06-18T00:26:52Z)
- FixMyPose: Pose Correctional Captioning and Retrieval [67.20888060019028]
We introduce a new captioning dataset named FixMyPose to address automated pose correction systems.
We collect descriptions of correcting a "current" pose to look like a "target" pose.
To avoid ML biases, we maintain a balance across characters with diverse demographics.
arXiv Detail & Related papers (2021-04-04T21:45:44Z)
- CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning [78.3857991931479]
We present GROLLA, an evaluation framework for Grounded Language Learning with Attributes.
We also propose a new dataset CompGuessWhat?! as an instance of this framework for evaluating the quality of learned neural representations.
arXiv Detail & Related papers (2020-06-03T11:21:42Z)
- Yoga-82: A New Dataset for Fine-grained Classification of Human Poses [46.319423568714505]
We present a dataset, Yoga-82, for large-scale yoga pose recognition with 82 classes.
Yoga-82 consists of complex poses where fine annotations may not be possible.
The dataset contains a three-level hierarchy including body positions, variations in body positions, and the actual pose names.
arXiv Detail & Related papers (2020-04-22T01:43:44Z)
- Transferring Dense Pose to Proximal Animal Classes [83.84439508978126]
We show that it is possible to transfer the knowledge existing in dense pose recognition for humans, as well as in more general object detectors and segmenters, to the problem of dense pose recognition in other classes.
We do this by establishing a DensePose model for the new animal which is also geometrically aligned to humans.
We also introduce two benchmark datasets labelled in the manner of DensePose for the class chimpanzee and use them to evaluate our approach.
arXiv Detail & Related papers (2020-02-28T21:43:53Z)