Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark
Analysis
- URL: http://arxiv.org/abs/2308.00323v1
- Date: Tue, 1 Aug 2023 07:00:13 GMT
- Title: Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark
Analysis
- Authors: Asish Bera, Mita Nasipuri, Ondrej Krejcar, and Debotosh Bhattacharjee
- Abstract summary: Human body-pose estimation is a complex problem in computer vision.
Recent research interests have been widened specifically on the Sports, Yoga, and Dance postures.
CNNs have attained significantly improved performance in solving various human body-pose estimation problems.
- Score: 24.276782804825846
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Human body-pose estimation is a complex problem in computer vision. Recent
research interests have been widened specifically on the Sports, Yoga, and
Dance (SYD) postures for maintaining health conditions. The SYD pose categories
are regarded as a fine-grained image classification task due to the complex
movement of body parts. Deep Convolutional Neural Networks (CNNs) have attained
significantly improved performance in solving various human body-pose
estimation problems. Though decent progress has been achieved in yoga postures
recognition using deep learning techniques, fine-grained sports, and dance
recognition necessitates ample research attention. However, no benchmark public
image dataset with sufficient inter-class and intra-class variations is
available yet to address sports and dance postures classification. To solve
this limitation, we have proposed two image datasets, one for 102 sport
categories and another for 12 dance styles. Two public datasets, Yoga-82 which
contains 82 classes and Yoga-107 represents 107 classes are collected for yoga
postures. These four SYD datasets are experimented with the proposed deep
model, SYD-Net, which integrates a patch-based attention (PbA) mechanism on top
of standard backbone CNNs. The PbA module leverages the self-attention
mechanism that learns contextual information from a set of uniform and
multi-scale patches and emphasizes discriminative features to understand the
semantic correlation among patches. Moreover, random erasing data augmentation
is applied to improve performance. The proposed SYD-Net has achieved
state-of-the-art accuracy on Yoga-82 using five base CNNs. SYD-Net's accuracy
on other datasets is remarkable, implying its efficiency. Our Sports-102 and
Dance-12 datasets are publicly available at
https://sites.google.com/view/syd-net/home.
Related papers
- Yoga Pose Classification Using Transfer Learning [0.0]
Yoga-82, a benchmark dataset for large-scale yoga pose recognition with 82 classes, has challenging positions that could make precise annotations impossible.
We have used VGG-16, ResNet-50, ResNet-101, and DenseNet-121 and finetuned them in different ways to get better results.
The experimental result shows the best performance of DenseNet-121 having the top-1 accuracy of 85% and top-5 accuracy of 96%.
arXiv Detail & Related papers (2024-10-29T14:34:18Z) - Improved Generalization of Weight Space Networks via Augmentations [56.571475005291035]
Learning in deep weight spaces (DWS) is an emerging research direction, with applications to 2D and 3D neural fields (INRs, NeRFs)
We empirically analyze the reasons for this overfitting and find that a key reason is the lack of diversity in DWS datasets.
To address this, we explore strategies for data augmentation in weight spaces and propose a MixUp method adapted for weight spaces.
arXiv Detail & Related papers (2024-02-06T15:34:44Z) - Physics Inspired Hybrid Attention for SAR Target Recognition [61.01086031364307]
We propose a physics inspired hybrid attention (PIHA) mechanism and the once-for-all (OFA) evaluation protocol to address the issues.
PIHA leverages the high-level semantics of physical information to activate and guide the feature group aware of local semantics of target.
Our method outperforms other state-of-the-art approaches in 12 test scenarios with same ASC parameters.
arXiv Detail & Related papers (2023-09-27T14:39:41Z) - An Efficient Deep Convolutional Neural Network Model For Yoga Pose
Recognition Using Single Images [2.6717276381722033]
This paper presents YPose, an efficient deep convolutional neural network (CNN) model to recognize yoga asanas from RGB images.
The proposed model has been tested on the Yoga-82 dataset.
arXiv Detail & Related papers (2023-06-27T19:34:46Z) - Dataset Bias in Human Activity Recognition [57.91018542715725]
This contribution statistically curates the training data to assess to what degree the physical characteristics of humans influence HAR performance.
We evaluate the performance of a state-of-the-art convolutional neural network on two HAR datasets that vary in the sensors, activities, and recording for time-series HAR.
arXiv Detail & Related papers (2023-01-19T12:33:50Z) - Posture Prediction for Healthy Sitting using a Smart Chair [0.0]
Poor sitting habits have been identified as a risk factor to musculoskeletal disorders and lower back pain.
This study builds Machine Learning models for classifying sitting posture of a person.
arXiv Detail & Related papers (2022-01-05T20:31:28Z) - Learning Dynamics via Graph Neural Networks for Human Pose Estimation
and Tracking [98.91894395941766]
We propose a novel online approach to learning the pose dynamics, which are independent of pose detections in current fame.
Specifically, we derive this prediction of dynamics through a graph neural network(GNN) that explicitly accounts for both spatial-temporal and visual information.
Experiments on PoseTrack 2017 and PoseTrack 2018 datasets demonstrate that the proposed method achieves results superior to the state of the art on both human pose estimation and tracking tasks.
arXiv Detail & Related papers (2021-06-07T16:36:50Z) - Table Tennis Stroke Recognition Using Two-Dimensional Human Pose
Estimation [0.0]
We introduce a novel method for collecting table tennis video data and perform stroke detection and classification.
A diverse dataset containing video data of 11 basic strokes obtained from 14 professional table tennis players has been collected.
A temporal convolutional neural network model developed using 2D pose estimation performs multiclass classification of these 11 table tennis strokes.
arXiv Detail & Related papers (2021-04-20T11:32:43Z) - FixMyPose: Pose Correctional Captioning and Retrieval [67.20888060019028]
We introduce a new captioning dataset named FixMyPose to address automated pose correction systems.
We collect descriptions of correcting a "current" pose to look like a "target" pose.
To avoid ML biases, we maintain a balance across characters with diverse demographics.
arXiv Detail & Related papers (2021-04-04T21:45:44Z) - Yoga-82: A New Dataset for Fine-grained Classification of Human Poses [46.319423568714505]
We present a dataset, Yoga-82, for large-scale yoga pose recognition with 82 classes.
Yoga-82 consists of complex poses where fine annotations may not be possible.
The dataset contains a three-level hierarchy including body positions, variations in body positions, and the actual pose names.
arXiv Detail & Related papers (2020-04-22T01:43:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.