VALERIAN: Invariant Feature Learning for IMU Sensor-based Human Activity
Recognition in the Wild
- URL: http://arxiv.org/abs/2303.06048v1
- Date: Fri, 3 Mar 2023 18:22:14 GMT
- Title: VALERIAN: Invariant Feature Learning for IMU Sensor-based Human Activity
Recognition in the Wild
- Authors: Yujiao Hao, Boyu Wang, Rong Zheng
- Abstract summary: VALERIAN is an invariant feature learning method for in-the-wild wearable sensor-based HAR.
By training a multi-task model with separate task-specific layers for each subject, VALERIAN allows noisy labels to be dealt with individually while benefiting from shared feature representation across subjects.
- Score: 7.50015216403068
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Deep neural network models for IMU sensor-based human activity recognition
(HAR) that are trained from controlled, well-curated datasets suffer from poor
generalizability in practical deployments. However, data collected from
naturalistic settings often contains significant label noise. In this work, we
examine two in-the-wild HAR datasets and DivideMix, a state-of-the-art learning
with noise labels (LNL) method to understand the extent and impacts of noisy
labels in training data. Our empirical analysis reveals that the substantial
domain gaps among diverse subjects cause LNL methods to violate a key
underlying assumption, namely, neural networks tend to fit simpler (and thus
clean) data in early training epochs. Motivated by the insights, we design
VALERIAN, an invariant feature learning method for in-the-wild wearable
sensor-based HAR. By training a multi-task model with separate task-specific
layers for each subject, VALERIAN allows noisy labels to be dealt with
individually while benefiting from shared feature representation across
subjects. We evaluated VALERIAN on four datasets, two collected in a controlled
environment and two in the wild.
Related papers
- Downstream-Pretext Domain Knowledge Traceback for Active Learning [138.02530777915362]
We propose a downstream-pretext domain knowledge traceback (DOKT) method that traces the data interactions of downstream knowledge and pre-training guidance.
DOKT consists of a traceback diversity indicator and a domain-based uncertainty estimator.
Experiments conducted on ten datasets show that our model outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2024-07-20T01:34:13Z) - Generalizable Low-Resource Activity Recognition with Diverse and
Discriminative Representation Learning [24.36351102003414]
Human activity recognition (HAR) is a time series classification task that focuses on identifying the motion patterns from human sensor readings.
We propose a novel approach called Diverse and Discriminative representation Learning (DDLearn) for generalizable lowresource HAR.
Our method significantly outperforms state-of-art methods by an average accuracy improvement of 9.5%.
arXiv Detail & Related papers (2023-05-25T08:24:22Z) - Cluster-level pseudo-labelling for source-free cross-domain facial
expression recognition [94.56304526014875]
We propose the first Source-Free Unsupervised Domain Adaptation (SFUDA) method for Facial Expression Recognition (FER)
Our method exploits self-supervised pretraining to learn good feature representations from the target data.
We validate the effectiveness of our method in four adaptation setups, proving that it consistently outperforms existing SFUDA methods when applied to FER.
arXiv Detail & Related papers (2022-10-11T08:24:50Z) - Learning with Neighbor Consistency for Noisy Labels [69.83857578836769]
We present a method for learning from noisy labels that leverages similarities between training examples in feature space.
We evaluate our method on datasets evaluating both synthetic (CIFAR-10, CIFAR-100) and realistic (mini-WebVision, Clothing1M, mini-ImageNet-Red) noise.
arXiv Detail & Related papers (2022-02-04T15:46:27Z) - Similarity Embedding Networks for Robust Human Activity Recognition [19.162857787656247]
We design a similarity embedding neural network that maps input sensor signals onto real vectors through carefully designed convolutional and LSTM layers.
The embedding network is trained with a pairwise similarity loss, encouraging the clustering of samples from the same class in the embedded real space.
Extensive evaluation based on two public datasets has shown that the proposed similarity embedding network significantly outperforms state-of-the-art deep models on HAR classification tasks.
arXiv Detail & Related papers (2021-05-31T11:52:32Z) - Meta-HAR: Federated Representation Learning for Human Activity
Recognition [21.749861229805727]
Human activity recognition (HAR) based on mobile sensors plays an important role in ubiquitous computing.
We propose Meta-HAR, a federated representation learning framework, in which a signal embedding network is meta-learned in a federated manner.
In order to boost the representation ability of the embedding network, we treat the HAR problem at each user as a different task and train the shared embedding network through a Model-Agnostic Meta-learning framework.
arXiv Detail & Related papers (2021-05-31T11:04:39Z) - On Deep Learning with Label Differential Privacy [54.45348348861426]
We study the multi-class classification setting where the labels are considered sensitive and ought to be protected.
We propose a new algorithm for training deep neural networks with label differential privacy, and run evaluations on several datasets.
arXiv Detail & Related papers (2021-02-11T15:09:06Z) - Invariant Feature Learning for Sensor-based Human Activity Recognition [11.334750079923428]
We present an invariant feature learning framework (IFLF) that extracts common information shared across subjects and devices.
Experiments demonstrated that IFLF is effective in handling both subject and device diversion across popular open datasets and an in-house dataset.
arXiv Detail & Related papers (2020-12-14T21:56:17Z) - Local Additivity Based Data Augmentation for Semi-supervised NER [59.90773003737093]
Named Entity Recognition (NER) is one of the first stages in deep language understanding.
Current NER models heavily rely on human-annotated data.
We propose a Local Additivity based Data Augmentation (LADA) method for semi-supervised NER.
arXiv Detail & Related papers (2020-10-04T20:46:26Z) - Omni-supervised Facial Expression Recognition via Distilled Data [120.11782405714234]
We propose omni-supervised learning to exploit reliable samples in a large amount of unlabeled data for network training.
We experimentally verify that the new dataset can significantly improve the ability of the learned FER model.
To tackle this, we propose to apply a dataset distillation strategy to compress the created dataset into several informative class-wise images.
arXiv Detail & Related papers (2020-05-18T09:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.