A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities
- URL: http://arxiv.org/abs/2408.05436v1
- Date: Sat, 10 Aug 2024 04:40:01 GMT
- Title: A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities
- Authors: Jungpil Shin, Abu Saleh Musa Miah, Md. Humaun Kabir, Md. Abdur Rahim, Abdullah Al Shiam,
- Abstract summary: Hand Gesture Recognition (HGR) systems enhance natural, efficient, and authentic human-computer interaction.
Despite significant progress, automatic and precise identification of hand gestures remains a considerable challenge in computer vision.
This paper provides a comprehensive review of HGR techniques and data modalities from 2014 to 2024, exploring advancements in sensor technology and computer vision.
- Score: 1.6144710323800757
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Researchers have been developing Hand Gesture Recognition (HGR) systems to enhance natural, efficient, and authentic human-computer interaction, especially benefiting those who rely solely on hand gestures for communication. Despite significant progress, the automatic and precise identification of hand gestures remains a considerable challenge in computer vision. Recent studies have focused on specific modalities like RGB images, skeleton data, and spatiotemporal interest points. This paper provides a comprehensive review of HGR techniques and data modalities from 2014 to 2024, exploring advancements in sensor technology and computer vision. We highlight accomplishments using various modalities, including RGB, Skeleton, Depth, Audio, EMG, EEG, and Multimodal approaches, and identify areas needing further research. We reviewed over 200 articles from prominent databases, focusing on data collection, data settings, and gesture representation. Our review assesses the efficacy of HGR systems through their recognition accuracy and identifies a gap in research on continuous gesture recognition, indicating the need for improved vision-based gesture systems. The field has experienced steady research progress, including advancements in hand-crafted features and deep learning (DL) techniques. Additionally, we report on the promising developments in HGR methods and the area of multimodal approaches. We hope this survey will serve as a potential guideline for diverse data modality-based HGR research.
Related papers
- A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities [2.916558661202724]
Human Activity Recognition (HAR) systems aim to understand human behaviour and assign a label to each action.
HAR can leverage various data modalities, such as RGB images and video, skeleton, depth, infrared, point cloud, event stream, audio, acceleration, and radar signals.
This paper presents a comprehensive survey of the latest advancements in HAR from 2014 to 2024.
arXiv Detail & Related papers (2024-09-15T10:04:44Z)
- A Survey of Deep Learning for Group-level Emotion Recognition [21.542551233204065]
Group-level emotion recognition (GER) has emerged as an important area in analyzing human behavior.
With the proliferation of Deep Learning (DL) techniques, neural networks have garnered increasing interest in GER.
We present a comprehensive review of DL techniques applied to GER, proposing a new taxonomy for the field.
arXiv Detail & Related papers (2024-08-13T11:54:09Z)
- Feature Fusion for Human Activity Recognition using Parameter-Optimized Multi-Stage Graph Convolutional Network and Transformer Models [0.6157382820537721]
The study uses sensory data from HuGaDB, PKU-MMD, LARa, and TUG datasets.
Two models, the PO-MS-GCN and a Transformer, were trained and evaluated, with PO-MS-GCN outperforming state-of-the-art models.
HuGaDB and TUG achieved high accuracies and F1-scores, while LARa and PKU-MMD had lower scores.
arXiv Detail & Related papers (2024-06-24T13:44:06Z)
- A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning [51.7818820745221]
Underwater image enhancement (UIE) presents a significant challenge within computer vision research.
Despite the development of numerous UIE algorithms, a thorough and systematic review is still absent.
arXiv Detail & Related papers (2024-05-30T04:46:40Z)
- Deepfake Generation and Detection: A Benchmark and Survey [134.19054491600832]
Deepfake is a technology dedicated to creating highly realistic facial images and videos under specific conditions.
This survey comprehensively reviews the latest developments in deepfake generation and detection.
We focus on researching four representative deepfake fields: face swapping, face reenactment, talking face generation, and facial attribute editing.
arXiv Detail & Related papers (2024-03-26T17:12:34Z)
- Study and Survey on Gesture Recognition Systems [0.0]
This paper discusses the implementation of gesture recognition systems in multiple sectors such as gaming, healthcare, home appliances, industrial robots, and virtual reality.
The role of gestures in sign language has been studied and existing approaches have been reviewed.
Common challenges faced while building gesture recognition systems have also been explored.
arXiv Detail & Related papers (2023-12-01T07:29:30Z)
- TMHOI: Translational Model for Human-Object Interaction Detection [18.804647133922195]
We propose an innovative graph-based approach to detect human-object interactions (HOIs).
Our method effectively captures the sentiment representation of HOIs by integrating both spatial and semantic knowledge.
Our approach outperformed existing state-of-the-art graph-based methods by a significant margin.
arXiv Detail & Related papers (2023-03-07T21:52:10Z)
- A Survey on Heterogeneous Graph Embedding: Methods, Techniques, Applications and Sources [79.48829365560788]
Heterogeneous graphs (HGs), also known as heterogeneous information networks, have become ubiquitous in real-world scenarios.
HG embedding aims to learn representations in a lower-dimension space while preserving the heterogeneous structures and semantics for downstream tasks.
arXiv Detail & Related papers (2020-11-30T15:03:47Z)
- Recent Progress in Appearance-based Action Recognition [73.6405863243707]
Action recognition is a task to identify various human actions in a video.
Recent appearance-based methods have achieved promising progress towards accurate action recognition.
arXiv Detail & Related papers (2020-11-25T10:18:12Z)
- Relational Graph Learning on Visual and Kinematics Embeddings for Accurate Gesture Recognition in Robotic Surgery [84.73764603474413]
We propose a novel online approach of multi-modal graph network (i.e., MRG-Net) to dynamically integrate visual and kinematics information.
The effectiveness of our method is demonstrated with state-of-the-art results on the public JIGSAWS dataset.
arXiv Detail & Related papers (2020-11-03T11:00:10Z)
- Survey on the Analysis and Modeling of Visual Kinship: A Decade in the Making [66.72253432908693]
Kinship recognition is a challenging problem with many practical applications.
We review the public resources and data challenges that enabled and inspired many to hone-in on the views.
For the tenth anniversary, the demo code is provided for the various kin-based tasks.
arXiv Detail & Related papers (2020-06-29T13:25:45Z)
This list is automatically generated from the titles and abstracts of the papers on this site.