Online Recognition of Actions Involving Objects
- URL: http://arxiv.org/abs/2104.06070v1
- Date: Tue, 13 Apr 2021 10:08:20 GMT
- Title: Online Recognition of Actions Involving Objects
- Authors: Zahra Gharaee and Peter G\"ardenfors and Magnus Johnsson
- Abstract summary: We present an online system for real time recognition of actions involving objects working in online mode.
The system merges two streams of information processing running in parallel.
The presented method combines the two information processing streams to determine what action the agent performed and on what object.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: We present an online system for real time recognition of actions involving
objects working in online mode. The system merges two streams of information
processing running in parallel. One is carried out by a hierarchical
self-organizing map (SOM) system that recognizes the performed actions by
analysing the spatial trajectories of the agent's movements. It consists of two
layers of SOMs and a custom made supervised neural network. The activation
sequences in the first layer SOM represent the sequences of significant
postures of the agent during the performance of actions. These activation
sequences are subsequently recoded and clustered in the second layer SOM, and
then labeled by the activity in the third layer custom made supervised neural
network. The second information processing stream is carried out by a second
system that determines which object among several in the agent's vicinity the
action is applied to. This is achieved by applying a proximity measure. The
presented method combines the two information processing streams to determine
what action the agent performed and on what object. The action recognition
system has been tested with excellent performance.
Related papers
- DOAD: Decoupled One Stage Action Detection Network [77.14883592642782]
Localizing people and recognizing their actions from videos is a challenging task towards high-level video understanding.
Existing methods are mostly two-stage based, with one stage for person bounding box generation and the other stage for action recognition.
We present a decoupled one-stage network dubbed DOAD, to improve the efficiency for-temporal action detection.
arXiv Detail & Related papers (2023-04-01T08:06:43Z) - 3DMODT: Attention-Guided Affinities for Joint Detection & Tracking in 3D
Point Clouds [95.54285993019843]
We propose a method for joint detection and tracking of multiple objects in 3D point clouds.
Our model exploits temporal information employing multiple frames to detect objects and track them in a single network.
arXiv Detail & Related papers (2022-11-01T20:59:38Z) - Joint-bone Fusion Graph Convolutional Network for Semi-supervised
Skeleton Action Recognition [65.78703941973183]
We propose a novel correlation-driven joint-bone fusion graph convolutional network (CD-JBF-GCN) as an encoder and use a pose prediction head as a decoder.
Specifically, the CD-JBF-GC can explore the motion transmission between the joint stream and the bone stream.
The pose prediction based auto-encoder in the self-supervised training stage allows the network to learn motion representation from unlabeled data.
arXiv Detail & Related papers (2022-02-08T16:03:15Z) - Human Activity Recognition using Attribute-Based Neural Networks and
Context Information [61.67246055629366]
We consider human activity recognition (HAR) from wearable sensor data in manual-work processes.
We show how context information can be integrated systematically into a deep neural network-based HAR system.
We empirically show that our proposed architecture increases HAR performance, compared to state-of-the-art methods.
arXiv Detail & Related papers (2021-10-28T06:08:25Z) - Cross-modal Consensus Network for Weakly Supervised Temporal Action
Localization [74.34699679568818]
Weakly supervised temporal action localization (WS-TAL) is a challenging task that aims to localize action instances in the given video with video-level categorical supervision.
We propose a cross-modal consensus network (CO2-Net) to tackle this problem.
arXiv Detail & Related papers (2021-07-27T04:21:01Z) - Online recognition of unsegmented actions with hierarchical SOM
architecture [0.0]
A novel approach for recognizing unsegmented actions in online test experiments is proposed.
The unique features of an action sequence are represented as a series of elicited key activations by the first-layer self-organizing map.
The experiment results show that although the performance drops slightly in online experiments compared to the offline tests, the ability of the proposed architecture to deal with the unsegmented action sequences as well as the online performance makes the system more plausible and practical in real-case scenarios.
arXiv Detail & Related papers (2021-04-23T14:41:46Z) - Hierarchical growing grid networks for skeleton based action recognition [0.0]
A novel cognitive architecture for action recognition is developed by applying layers of growing grid neural networks.
The system is provided with a prior knowledge of the input space, which increases the processing speed of the learning phase.
The performance of the growing grid architecture is com-pared with the results from a system based on Self-Organizing Maps.
arXiv Detail & Related papers (2021-04-22T16:35:32Z) - First and Second Order Dynamics in a Hierarchical SOM system for Action
Recognition [0.0]
We present a novel action recognition system that employs a hierarchy of Self-Organizing Maps together with a custom supervised neural network that learns to categorize actions.
The system preprocesses the input from a Kinect like 3D camera to exploit the information not only about joint positions, but also their first and second order dynamics.
arXiv Detail & Related papers (2021-04-13T09:46:40Z) - Weakly Supervised Temporal Action Localization Through Learning Explicit
Subspaces for Action and Context [151.23835595907596]
Methods learn to localize temporal starts and ends of action instances in a video under only video-level supervision.
We introduce a framework that learns two feature subspaces respectively for actions and their context.
The proposed approach outperforms state-of-the-art WS-TAL methods on three benchmarks.
arXiv Detail & Related papers (2021-03-30T08:26:53Z) - Two-Stream AMTnet for Action Detection [12.581710073789848]
We propose a new deep neural network architecture for online action detection, termed ream to the original appearance one in AMTnet.
Two-Stream AMTnet exhibits superior action detection performance over state-of-the-art approaches on the standard action detection benchmarks.
arXiv Detail & Related papers (2020-04-03T12:16:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.