A novel shape matching descriptor for real-time hand gesture recognition
- URL: http://arxiv.org/abs/2101.03923v2
- Date: Wed, 10 Mar 2021 20:37:38 GMT
- Title: A novel shape matching descriptor for real-time hand gesture recognition
- Authors: Michalis Lazarou, Bo Li, Tania Stathaki
- Abstract summary: We present a novel shape matching methodology for real-time hand gesture recognition.
Our method outperforms the other methods and provides a good combination of accuracy and computational efficiency for real-time applications.
- Score: 11.798555201744596
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The current state-of-the-art hand gesture recognition methodologies heavily
rely in the use of machine learning. However there are scenarios that machine
learning cannot be applied successfully, for example in situations where data
is scarce. This is the case when one-to-one matching is required between a
query and a dataset of hand gestures where each gesture represents a unique
class. In situations where learning algorithms cannot be trained, classic
computer vision techniques such as feature extraction can be used to identify
similarities between objects. Shape is one of the most important features that
can be extracted from images, however the most accurate shape matching
algorithms tend to be computationally inefficient for real-time applications.
In this work we present a novel shape matching methodology for real-time hand
gesture recognition. Extensive experiments were carried out comparing our
method with other shape matching methods with respect to accuracy and
computational complexity using our own collected hand gesture dataset and a
modified version of the MPEG-7 dataset.%that is widely used for comparing 2D
shape matching algorithms. Our method outperforms the other methods and
provides a good combination of accuracy and computational efficiency for
real-time applications.
Related papers
- Leveraging Interpolation Models and Error Bounds for Verifiable Scientific Machine Learning [0.0]
We present a best-of-both-worlds approach to verifiable scientific machine learning.
We show that multiple standard techniques have informative error bounds that can be computed or estimated efficiently.
We present a case study of our approach for predicting lift-drag ratios from airfoil images.
arXiv Detail & Related papers (2024-04-04T16:52:17Z) - On the Utility of Probing Trajectories for Algorithm-Selection [0.24475591916185496]
Machine-learning approaches to algorithm-selection typically take data describing an instance as input.
We argue that viewing algorithm-selection purely from an instance perspective can be misleading.
We propose a novel algorithm-centric' method for describing instances that can be used to train models for algorithm-selection.
arXiv Detail & Related papers (2024-01-23T13:23:59Z) - Multi-Dimensional Ability Diagnosis for Machine Learning Algorithms [88.93372675846123]
We propose a task-agnostic evaluation framework Camilla for evaluating machine learning algorithms.
We use cognitive diagnosis assumptions and neural networks to learn the complex interactions among algorithms, samples and the skills of each sample.
In our experiments, Camilla outperforms state-of-the-art baselines on the metric reliability, rank consistency and rank stability.
arXiv Detail & Related papers (2023-07-14T03:15:56Z) - Benchmarking Learning Efficiency in Deep Reservoir Computing [23.753943709362794]
We introduce a benchmark of increasingly difficult tasks together with a data efficiency metric to measure how quickly machine learning models learn from training data.
We compare the learning speed of some established sequential supervised models, such as RNNs, LSTMs, or Transformers, with relatively less known alternative models based on reservoir computing.
arXiv Detail & Related papers (2022-09-29T08:16:52Z) - Human-in-the-Loop Disinformation Detection: Stance, Sentiment, or
Something Else? [93.91375268580806]
Both politics and pandemics have recently provided ample motivation for the development of machine learning-enabled disinformation (a.k.a. fake news) detection algorithms.
Existing literature has focused primarily on the fully-automated case, but the resulting techniques cannot reliably detect disinformation on the varied topics, sources, and time scales required for military applications.
By leveraging an already-available analyst as a human-in-the-loop, canonical machine learning techniques of sentiment analysis, aspect-based sentiment analysis, and stance detection become plausible methods to use for a partially-automated disinformation detection system.
arXiv Detail & Related papers (2021-11-09T13:30:34Z) - Towards Interpretable Deep Metric Learning with Structural Matching [86.16700459215383]
We present a deep interpretable metric learning (DIML) method for more transparent embedding learning.
Our method is model-agnostic, which can be applied to off-the-shelf backbone networks and metric learning methods.
We evaluate our method on three major benchmarks of deep metric learning including CUB200-2011, Cars196, and Stanford Online Products.
arXiv Detail & Related papers (2021-08-12T17:59:09Z) - Progressive Spatio-Temporal Bilinear Network with Monte Carlo Dropout
for Landmark-based Facial Expression Recognition with Uncertainty Estimation [93.73198973454944]
The performance of our method is evaluated on three widely used datasets.
It is comparable to that of video-based state-of-the-art methods while it has much less complexity.
arXiv Detail & Related papers (2021-06-08T13:40:30Z) - Unsupervised Feature Learning for Event Data: Direct vs Inverse Problem
Formulation [53.850686395708905]
Event-based cameras record an asynchronous stream of per-pixel brightness changes.
In this paper, we focus on single-layer architectures for representation learning from event data.
We show improvements of up to 9 % in the recognition accuracy compared to the state-of-the-art methods.
arXiv Detail & Related papers (2020-09-23T10:40:03Z) - Region Comparison Network for Interpretable Few-shot Image
Classification [97.97902360117368]
Few-shot image classification has been proposed to effectively use only a limited number of labeled examples to train models for new classes.
We propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works.
We also present a new way to generalize the interpretability from the level of tasks to categories.
arXiv Detail & Related papers (2020-09-08T07:29:05Z) - Deep Geometric Functional Maps: Robust Feature Learning for Shape
Correspondence [31.840880075039944]
We present a novel learning-based approach for computing correspondences between non-rigid 3D shapes.
Key to our method is a feature-extraction network that learns directly from raw shape geometry.
arXiv Detail & Related papers (2020-03-31T15:20:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.