Homogenization of Existing Inertial-Based Datasets to Support Human
Activity Recognition
- URL: http://arxiv.org/abs/2201.07891v1
- Date: Mon, 17 Jan 2022 14:29:48 GMT
- Title: Homogenization of Existing Inertial-Based Datasets to Support Human
Activity Recognition
- Authors: Hamza Amrani, Daniela Micucci, Marco Mobilio, Paolo Napoletano
- Abstract summary: Several techniques have been proposed to address the problem of recognizing activities of daily living from signals.
Deep learning techniques applied to inertial signals have proven to be effective, achieving significant classification accuracy.
Research in human activity recognition models has been almost totally model-centric.
- Score: 8.076841611508486
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Several techniques have been proposed to address the problem of recognizing
activities of daily living from signals. Deep learning techniques applied to
inertial signals have proven to be effective, achieving significant
classification accuracy. Recently, research in human activity recognition (HAR)
models has been almost totally model-centric. It has been proven that the
number of training samples and their quality are critical for obtaining deep
learning models that both perform well independently of their architecture, and
that are more robust to intraclass variability and interclass similarity.
Unfortunately, publicly available datasets do not always contain hight quality
data and a sufficiently large and diverse number of samples (e.g., number of
subjects, type of activity performed, and duration of trials). Furthermore,
datasets are heterogeneous among them and therefore cannot be trivially
combined to obtain a larger set. The final aim of our work is the definition
and implementation of a platform that integrates datasets of inertial signals
in order to make available to the scientific community large datasets of
homogeneous signals, enriched, when possible, with context information (e.g.,
characteristics of the subjects and device position). The main focus of our
platform is to emphasise data quality, which is essential for training
efficient models.
Related papers
- Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution [62.71425232332837]
We show that training amortized models with noisy labels is inexpensive and surprisingly effective.
This approach significantly accelerates several feature attribution and data valuation methods, often yielding an order of magnitude speedup over existing approaches.
arXiv Detail & Related papers (2024-01-29T03:42:37Z) - Amplifying Pathological Detection in EEG Signaling Pathways through
Cross-Dataset Transfer Learning [10.212217551908525]
We study the effectiveness of data and model scaling and cross-dataset knowledge transfer in a real-world pathology classification task.
We identify the challenges of possible negative transfer and emphasize the significance of some key components.
Our findings indicate a small and generic model (e.g. ShallowNet) performs well on a single dataset, however, a larger model (e.g. TCN) performs better on transfer and learning from a larger and diverse dataset.
arXiv Detail & Related papers (2023-09-19T20:09:15Z) - Exploring Data Redundancy in Real-world Image Classification through
Data Selection [20.389636181891515]
Deep learning models often require large amounts of data for training, leading to increased costs.
We present two data valuation metrics based on Synaptic Intelligence and gradient norms, respectively, to study redundancy in real-world image data.
Online and offline data selection algorithms are then proposed via clustering and grouping based on the examined data values.
arXiv Detail & Related papers (2023-06-25T03:31:05Z) - Combining Public Human Activity Recognition Datasets to Mitigate Labeled
Data Scarcity [1.274578243851308]
We propose a novel strategy to combine publicly available datasets with the goal of learning a generalized HAR model.
Our experimental evaluation, which includes experimenting with different state-of-the-art neural network architectures, shows that combining public datasets can significantly reduce the number of labeled samples.
arXiv Detail & Related papers (2023-06-23T18:51:22Z) - ALP: Action-Aware Embodied Learning for Perception [60.64801970249279]
We introduce Action-Aware Embodied Learning for Perception (ALP)
ALP incorporates action information into representation learning through a combination of optimizing a reinforcement learning policy and an inverse dynamics prediction objective.
We show that ALP outperforms existing baselines in several downstream perception tasks.
arXiv Detail & Related papers (2023-06-16T21:51:04Z) - Dataset Bias in Human Activity Recognition [57.91018542715725]
This contribution statistically curates the training data to assess to what degree the physical characteristics of humans influence HAR performance.
We evaluate the performance of a state-of-the-art convolutional neural network on two HAR datasets that vary in the sensors, activities, and recording for time-series HAR.
arXiv Detail & Related papers (2023-01-19T12:33:50Z) - Unsupervised Deep Learning-based clustering for Human Activity
Recognition [8.716606664673982]
The paper proposes DISC (Deep Inertial Sensory Clustering), a DL-based clustering architecture that automatically labels multi-dimensional inertial signals.
The architecture combines a recurrent AutoEncoder and a clustering criterion to predict unlabelled human activities-related signals.
The experiments demonstrate the effectiveness of DISC on both clustering accuracy and normalized mutual information metrics.
arXiv Detail & Related papers (2022-11-10T10:56:47Z) - A Survey of Learning on Small Data: Generalization, Optimization, and
Challenge [101.27154181792567]
Learning on small data that approximates the generalization ability of big data is one of the ultimate purposes of AI.
This survey follows the active sampling theory under a PAC framework to analyze the generalization error and label complexity of learning on small data.
Multiple data applications that may benefit from efficient small data representation are surveyed.
arXiv Detail & Related papers (2022-07-29T02:34:19Z) - Understanding the World Through Action [91.3755431537592]
I will argue that a general, principled, and powerful framework for utilizing unlabeled data can be derived from reinforcement learning.
I will discuss how such a procedure is more closely aligned with potential downstream tasks.
arXiv Detail & Related papers (2021-10-24T22:33:52Z) - Similarity Embedding Networks for Robust Human Activity Recognition [19.162857787656247]
We design a similarity embedding neural network that maps input sensor signals onto real vectors through carefully designed convolutional and LSTM layers.
The embedding network is trained with a pairwise similarity loss, encouraging the clustering of samples from the same class in the embedded real space.
Extensive evaluation based on two public datasets has shown that the proposed similarity embedding network significantly outperforms state-of-the-art deep models on HAR classification tasks.
arXiv Detail & Related papers (2021-05-31T11:52:32Z) - Diverse Complexity Measures for Dataset Curation in Self-driving [80.55417232642124]
We propose a new data selection method that exploits a diverse set of criteria that quantize interestingness of traffic scenes.
Our experiments show that the proposed curation pipeline is able to select datasets that lead to better generalization and higher performance.
arXiv Detail & Related papers (2021-01-16T23:45:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.