"Filling the Blanks'': Identifying Micro-activities that Compose Complex
Human Activities of Daily Living
- URL: http://arxiv.org/abs/2306.13149v2
- Date: Wed, 7 Feb 2024 23:08:29 GMT
- Title: "Filling the Blanks'': Identifying Micro-activities that Compose Complex
Human Activities of Daily Living
- Authors: Soumyajit Chatterjee, Bivas Mitra and Sandip Chakraborty
- Abstract summary: AmicroN adapts a top-down'' approach by exploiting coarse-grained annotated data to expand the macro-activities into constituent micro-activities.
In the backend, AmicroN uses textitunsupervised change-point detection to search for the micro-activity boundaries across a complex ADL.
We evaluate AmicroN on two real-life publicly available datasets and observe that AmicroN can identify the micro-activities with micro Ftextsubscript1-score $>0.75$ for both datasets.
- Score: 6.841115530838644
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Complex activities of daily living (ADLs) often consist of multiple
micro-activities. When performed sequentially, these micro-activities help the
user accomplish the broad macro-activity. Naturally, a deeper understanding of
these micro-activities can help develop more sophisticated human activity
recognition (HAR) models and add explainability to their inferred conclusions.
Previous research has attempted to achieve this by utilizing fine-grained
annotated data that provided the required supervision and rules for associating
the micro-activities to identify the macro-activity. However, this
``bottom-up'' approach is unrealistic as getting such high-quality,
fine-grained annotated sensor datasets is challenging, costly, and
time-consuming. Understanding this, in this paper, we develop AmicroN, which
adapts a ``top-down'' approach by exploiting coarse-grained annotated data to
expand the macro-activities into their constituent micro-activities without any
external supervision. In the backend, AmicroN uses \textit{unsupervised}
change-point detection to search for the micro-activity boundaries across a
complex ADL. Then, it applies a \textit{generalized zero-shot} approach to
characterize it. We evaluate AmicroN on two real-life publicly available
datasets and observe that AmicroN can identify the micro-activities with micro
F\textsubscript{1}-score $>0.75$ for both datasets. Additionally, we also
perform an initial proof-of-concept on leveraging the state-of-the-art (SOTA)
large language models (LLMs) with attribute embeddings predicted by AmicroN to
enhance further the explainability surrounding the detection of
micro-activities.
Related papers
- MMAD: Multi-label Micro-Action Detection in Videos [23.508563348306534]
We propose a new task named Multi-label Micro-Action Detection (MMAD)
MMAD involves identifying all micro-actions in a given short video, determining their start and end times, and categorizing them.
To support the MMAD task, we introduce a new dataset named Multi-label Micro-Action-52 (MMA-52), specifically designed to facilitate the detailed analysis and exploration of complex human micro-actions.
arXiv Detail & Related papers (2024-07-07T09:45:14Z) - VCHAR:Variance-Driven Complex Human Activity Recognition framework with Generative Representation [6.278293754210117]
VCHAR (Variance-Driven Complex Human Activity Recognition) is a novel framework that treats the outputs of atomic activities as a distribution over specified intervals.
We show that VCHAR enhances the accuracy of complex activity recognition without necessitating precise temporal or sequential labeling of atomic activities.
arXiv Detail & Related papers (2024-07-03T17:24:36Z) - Adaptive Temporal Motion Guided Graph Convolution Network for Micro-expression Recognition [48.21696443824074]
We propose a novel framework for micro-expression recognition, named the Adaptive Temporal Motion Guided Graph Convolution Network (ATM-GCN)
Our framework excels at capturing temporal dependencies between frames across the entire clip, thereby enhancing micro-expression recognition at the clip level.
arXiv Detail & Related papers (2024-06-13T10:57:24Z) - Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding [21.94739567923136]
We focus on a special group of human body language -- the micro-gesture (MG)
MG differs from the range of ordinary illustrative gestures in that they are not intentional behaviors performed to convey information to others, but rather unintentional behaviors driven by inner feelings.
We explore various augmentation strategies that take into account the subtle spatial and brief temporal characteristics of micro-gestures, often accompanied by repetitiveness, to determine more suitable augmentation methods.
arXiv Detail & Related papers (2024-05-21T21:16:55Z) - Three-dimensional microstructure generation using generative adversarial
neural networks in the context of continuum micromechanics [77.34726150561087]
This work proposes a generative adversarial network tailored towards three-dimensional microstructure generation.
The lightweight algorithm is able to learn the underlying properties of the material from a single microCT-scan without the need of explicit descriptors.
arXiv Detail & Related papers (2022-05-31T13:26:51Z) - Video-based Facial Micro-Expression Analysis: A Survey of Datasets,
Features and Algorithms [52.58031087639394]
micro-expressions are involuntary and transient facial expressions.
They can provide important information in a broad range of applications such as lie detection, criminal detection, etc.
Since micro-expressions are transient and of low intensity, their detection and recognition is difficult and relies heavily on expert experiences.
arXiv Detail & Related papers (2022-01-30T05:14:13Z) - Short and Long Range Relation Based Spatio-Temporal Transformer for
Micro-Expression Recognition [61.374467942519374]
We propose a novel a-temporal transformer architecture -- to the best of our knowledge, the first purely transformer based approach for micro-expression recognition.
The architecture comprises a spatial encoder which learns spatial patterns, a temporal dimension classification for temporal analysis, and a head.
A comprehensive evaluation on three widely used spontaneous micro-expression data sets, shows that the proposed approach consistently outperforms the state of the art.
arXiv Detail & Related papers (2021-12-10T22:10:31Z) - Deep neural networks approach to microbial colony detection -- a
comparative analysis [52.77024349608834]
This study investigates the performance of three deep learning approaches for object detection on the AGAR dataset.
The achieved results may serve as a benchmark for future experiments.
arXiv Detail & Related papers (2021-08-23T12:06:00Z) - DisenKGAT: Knowledge Graph Embedding with Disentangled Graph Attention
Network [48.38954651216983]
We propose a novel Disentangled Knowledge Graph Attention Network (DisenKGAT) for Knowledge graphs.
DisenKGAT uses both micro-disentanglement and macro-disentanglement to exploit representations behind Knowledge graphs.
Our work has strong robustness and flexibility to adapt to various score functions.
arXiv Detail & Related papers (2021-08-22T04:10:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.