Hand Gesture Classification on Praxis Dataset: Trading Accuracy for
Expense
- URL: http://arxiv.org/abs/2311.00767v1
- Date: Wed, 1 Nov 2023 18:18:09 GMT
- Title: Hand Gesture Classification on Praxis Dataset: Trading Accuracy for
Expense
- Authors: Rahat Islam, Kenneth Lai, and Svetlana Yanushkevich
- Abstract summary: We focus on'skeletal' data represented by the body joint coordinates, from the Praxis dataset.
The PRAXIS dataset contains recordings of patients with cortical pathologies such as Alzheimer's disease.
Using a combination of windowing techniques with deep learning architecture such as a Recurrent Neural Network (RNN), we achieved an overall accuracy of 70.8%.
- Score: 0.6390468088226495
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we investigate hand gesture classifiers that rely upon the
abstracted 'skeletal' data recorded using the RGB-Depth sensor. We focus on
'skeletal' data represented by the body joint coordinates, from the Praxis
dataset. The PRAXIS dataset contains recordings of patients with cortical
pathologies such as Alzheimer's disease, performing a Praxis test under the
direction of a clinician. In this paper, we propose hand gesture classifiers
that are more effective with the PRAXIS dataset than previously proposed
models. Body joint data offers a compressed form of data that can be analyzed
specifically for hand gesture recognition. Using a combination of windowing
techniques with deep learning architecture such as a Recurrent Neural Network
(RNN), we achieved an overall accuracy of 70.8% using only body joint data. In
addition, we investigated a long-short-term-memory (LSTM) to extract and
analyze the movement of the joints through time to recognize the hand gestures
being performed and achieved a gesture recognition rate of 74.3% and 67.3% for
static and dynamic gestures, respectively. The proposed approach contributed to
the task of developing an automated, accurate, and inexpensive approach to
diagnosing cortical pathologies for multiple healthcare applications.
Related papers
- SMILE-UHURA Challenge -- Small Vessel Segmentation at Mesoscopic Scale from Ultra-High Resolution 7T Magnetic Resonance Angiograms [60.35639972035727]
The lack of publicly available annotated datasets has impeded the development of robust, machine learning-driven segmentation algorithms.
The SMILE-UHURA challenge addresses the gap in publicly available annotated datasets by providing an annotated dataset of Time-of-Flight angiography acquired with 7T MRI.
Dice scores reached up to 0.838 $pm$ 0.066 and 0.716 $pm$ 0.125 on the respective datasets, with an average performance of up to 0.804 $pm$ 0.15.
arXiv Detail & Related papers (2024-11-14T17:06:00Z) - A novel open-source ultrasound dataset with deep learning benchmarks for
spinal cord injury localization and anatomical segmentation [1.02101998415327]
We present an ultrasound dataset of 10,223-mode (B-mode) images consisting of sagittal slices of porcine spinal cords.
We benchmark the performance metrics of several state-of-the-art object detection algorithms to localize the site of injury.
We evaluate the zero-shot generalization capabilities of the segmentation models on human ultrasound spinal cord images.
arXiv Detail & Related papers (2024-09-24T20:22:59Z) - Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images [67.66644395272075]
We present first analysis of state-of-the-art semantic segmentation models when faced with geometric out-of-distribution data.
We propose an augmentation technique called "Organ Transplantation" to enhance generalizability.
Our augmentation technique improves SOA model performance by up to 67 % for RGB data and 90 % for HSI data, achieving performance at the level of in-distribution performance on real OOD test data.
arXiv Detail & Related papers (2024-08-27T19:13:15Z) - Integrative Deep Learning Framework for Parkinson's Disease Early Detection using Gait Cycle Data Measured by Wearable Sensors: A CNN-GRU-GNN Approach [0.3222802562733786]
We present a pioneering deep learning architecture tailored for the binary classification of subjects.
Our model harnesses the power of 1D-Convolutional Neural Networks (CNN), Gated Recurrent Units (GRU), and Graph Neural Network (GNN) layers.
Our proposed model achieves exceptional performance metrics, boasting accuracy, precision, recall, and F1 score values of 99.51%, 99.57%, 99.71%, and 99.64%, respectively.
arXiv Detail & Related papers (2024-04-09T15:19:13Z) - After-Stroke Arm Paresis Detection using Kinematic Data [2.375665889100906]
This paper presents an approach for detecting unilateral arm paralysis/weakness using kinematic data.
Our method employs temporal convolution networks and recurrent neural networks, guided by knowledge distillation.
The results suggest that our method could be a useful tool for clinicians and healthcare professionals working with patients with this condition.
arXiv Detail & Related papers (2023-11-03T16:56:02Z) - Data-Driven Goal Recognition in Transhumeral Prostheses Using Process
Mining Techniques [7.95507524742396]
Active prostheses utilize real-valued, continuous sensor data to recognize patient target poses, or goals, and proactively move the artificial limb.
Previous studies have examined how well the data collected in stationary poses, without considering the time steps, can help discriminate the goals.
Our approach involves transforming the data into discrete events and training an existing process mining-based goal recognition system.
arXiv Detail & Related papers (2023-09-15T02:03:59Z) - Towards Unifying Anatomy Segmentation: Automated Generation of a
Full-body CT Dataset via Knowledge Aggregation and Anatomical Guidelines [113.08940153125616]
We generate a dataset of whole-body CT scans with $142$ voxel-level labels for 533 volumes providing comprehensive anatomical coverage.
Our proposed procedure does not rely on manual annotation during the label aggregation stage.
We release our trained unified anatomical segmentation model capable of predicting $142$ anatomical structures on CT data.
arXiv Detail & Related papers (2023-07-25T09:48:13Z) - Online Recognition of Incomplete Gesture Data to Interface Collaborative
Robots [0.0]
This paper introduces an HRI framework to classify large vocabularies of interwoven static gestures (SGs) and dynamic gestures (DGs) captured with wearable sensors.
The recognized gestures are used to teleoperate a robot in a collaborative process that consists of preparing a breakfast meal.
arXiv Detail & Related papers (2023-04-13T18:49:08Z) - Joint-bone Fusion Graph Convolutional Network for Semi-supervised
Skeleton Action Recognition [65.78703941973183]
We propose a novel correlation-driven joint-bone fusion graph convolutional network (CD-JBF-GCN) as an encoder and use a pose prediction head as a decoder.
Specifically, the CD-JBF-GC can explore the motion transmission between the joint stream and the bone stream.
The pose prediction based auto-encoder in the self-supervised training stage allows the network to learn motion representation from unlabeled data.
arXiv Detail & Related papers (2022-02-08T16:03:15Z) - Chest x-ray automated triage: a semiologic approach designed for
clinical implementation, exploiting different types of labels through a
combination of four Deep Learning architectures [83.48996461770017]
This work presents a Deep Learning method based on the late fusion of different convolutional architectures.
We built four training datasets combining images from public chest x-ray datasets and our institutional archive.
We trained four different Deep Learning architectures and combined their outputs with a late fusion strategy, obtaining a unified tool.
arXiv Detail & Related papers (2020-12-23T14:38:35Z) - ECG-DelNet: Delineation of Ambulatory Electrocardiograms with Mixed
Quality Labeling Using Neural Networks [69.25956542388653]
Deep learning (DL) algorithms are gaining weight in academic and industrial settings.
We demonstrate DL can be successfully applied to low interpretative tasks by embedding ECG detection and delineation onto a segmentation framework.
The model was trained using PhysioNet's QT database, comprised of 105 ambulatory ECG recordings.
arXiv Detail & Related papers (2020-05-11T16:29:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.