Related papers: A View Independent Classification Framework for Yoga Postures

A View Independent Classification Framework for Yoga Postures

URL: http://arxiv.org/abs/2206.13577v1
Date: Mon, 27 Jun 2022 18:40:34 GMT
Title: A View Independent Classification Framework for Yoga Postures
Authors: Mustafa Chasmai, Nirjhar Das, Aman Bhardwaj, Rahul Garg
Abstract summary: We employ transfer learning from Human Pose Estimation models for extracting 136 key-points spread all over the body to train a Random Forest classifier. Results are evaluated on an in-house collected extensive yoga video database of 51 subjects recorded from 4 different camera angles.
Score: 2.922683311119656
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Yoga is a globally acclaimed and widely recommended practice for a healthy living. Maintaining correct posture while performing a Yogasana is of utmost importance. In this work, we employ transfer learning from Human Pose Estimation models for extracting 136 key-points spread all over the body to train a Random Forest classifier which is used for estimation of the Yogasanas. The results are evaluated on an in-house collected extensive yoga video database of 51 subjects recorded from 4 different camera angles. We propose a 3 step scheme for evaluating the generalizability of a Yoga classifier by testing it on 1) unseen frames, 2) unseen subjects, and 3) unseen camera angles. We argue that for most of the applications, validation accuracies on unseen subjects and unseen camera angles would be most important. We empirically analyze over three public datasets, the advantage of transfer learning and the possibilities of target leakage. We further demonstrate that the classification accuracies critically depend on the cross validation method employed and can often be misleading. To promote further research, we have made key-points dataset and code publicly available.

Related papers

Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis [0.6524460254566905]
This study aims to assess the effectiveness of Contrastive Language-Image Pretraining (CLIP) in classifying human postures. Applying transfer learning on 15,301 images (real and synthetic) with 82 classes has shown promising results. The fine-tuned CLIP model, tested on 3826 images, achieves an accuracy of over 85%.
arXiv Detail & Related papers (2025-01-13T11:20:44Z)
Which Viewpoint Shows it Best? Language for Weakly Supervising View Selection in Multi-view Videos [66.1935609072708]
Key hypothesis is that the more accurately an individual view can predict a view-agnostic text summary, the more informative it is. We propose a framework that uses the relative accuracy of view-dependent caption predictions as a proxy for best view pseudo-labels. During inference, our model takes as input only a multi-view video -- no language or camera poses -- and returns the best viewpoint to watch at each timestep.
arXiv Detail & Related papers (2024-11-13T16:31:08Z)
Yoga Pose Classification Using Transfer Learning [0.0]
Yoga-82, a benchmark dataset for large-scale yoga pose recognition with 82 classes, has challenging positions that could make precise annotations impossible. We have used VGG-16, ResNet-50, ResNet-101, and DenseNet-121 and finetuned them in different ways to get better results. The experimental result shows the best performance of DenseNet-121 having the top-1 accuracy of 85% and top-5 accuracy of 96%.
arXiv Detail & Related papers (2024-10-29T14:34:18Z)
3DYoga90: A Hierarchical Video Dataset for Yoga Pose Understanding [0.0]
3DYoga901 is organized within a three-level label hierarchy. Our dataset includes meticulously curated RGB yoga pose videos and 3D skeleton sequences.
arXiv Detail & Related papers (2023-10-16T07:15:31Z)
Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark Analysis [24.276782804825846]
Human body-pose estimation is a complex problem in computer vision. Recent research interests have been widened specifically on the Sports, Yoga, and Dance postures. CNNs have attained significantly improved performance in solving various human body-pose estimation problems.
arXiv Detail & Related papers (2023-08-01T07:00:13Z)
Self-Supervised Learning for Videos: A Survey [70.37277191524755]
Self-supervised learning has shown promise in both image and video domains. In this survey, we provide a review of existing approaches on self-supervised learning focusing on the video domain.
arXiv Detail & Related papers (2022-06-18T00:26:52Z)
FixMyPose: Pose Correctional Captioning and Retrieval [67.20888060019028]
We introduce a new captioning dataset named FixMyPose to address automated pose correction systems. We collect descriptions of correcting a "current" pose to look like a "target" pose. To avoid ML biases, we maintain a balance across characters with diverse demographics.
arXiv Detail & Related papers (2021-04-04T21:45:44Z)
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning [78.3857991931479]
We present GROLLA, an evaluation framework for Grounded Language Learning with Attributes. We also propose a new dataset CompGuessWhat?! as an instance of this framework for evaluating the quality of learned neural representations.
arXiv Detail & Related papers (2020-06-03T11:21:42Z)
Yoga-82: A New Dataset for Fine-grained Classification of Human Poses [46.319423568714505]
We present a dataset, Yoga-82, for large-scale yoga pose recognition with 82 classes. Yoga-82 consists of complex poses where fine annotations may not be possible. The dataset contains a three-level hierarchy including body positions, variations in body positions, and the actual pose names.
arXiv Detail & Related papers (2020-04-22T01:43:44Z)
Transferring Dense Pose to Proximal Animal Classes [83.84439508978126]
We show that it is possible to transfer the knowledge existing in dense pose recognition for humans, as well as in more general object detectors and segmenters, to the problem of dense pose recognition in other classes. We do this by establishing a DensePose model for the new animal which is also geometrically aligned to humans. We also introduce two benchmark datasets labelled in the manner of DensePose for the class chimpanzee and use them to evaluate our approach.
arXiv Detail & Related papers (2020-02-28T21:43:53Z)

This list is automatically generated from the titles and abstracts of the papers in this site.