LSTA-Net: Long short-term Spatio-Temporal Aggregation Network for
Skeleton-based Action Recognition
- URL: http://arxiv.org/abs/2111.00823v1
- Date: Mon, 1 Nov 2021 10:53:35 GMT
- Title: LSTA-Net: Long short-term Spatio-Temporal Aggregation Network for
Skeleton-based Action Recognition
- Authors: Tailin Chen, Shidong Wang, Desen Zhou, Yu Guan
- Abstract summary: LSTA-Net: a novel short-term Spatio-Temporal Network.
Long/short-term temporal information is not well explored in existing works.
Experiments were conducted on three public benchmark datasets.
- Score: 14.078419675904446
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Modelling various spatio-temporal dependencies is the key to recognising
human actions in skeleton sequences. Most existing methods excessively relied
on the design of traversal rules or graph topologies to draw the dependencies
of the dynamic joints, which is inadequate to reflect the relationships of the
distant yet important joints. Furthermore, due to the locally adopted
operations, the important long-range temporal information is therefore not well
explored in existing works. To address this issue, in this work we propose
LSTA-Net: a novel Long short-term Spatio-Temporal Aggregation Network, which
can effectively capture the long/short-range dependencies in a spatio-temporal
manner. We devise our model into a pure factorised architecture which can
alternately perform spatial feature aggregation and temporal feature
aggregation. To improve the feature aggregation effect, a channel-wise
attention mechanism is also designed and employed. Extensive experiments were
conducted on three public benchmark datasets, and the results suggest that our
approach can capture both long-and-short range dependencies in the space and
time domain, yielding higher results than other state-of-the-art methods. Code
available at https://github.com/tailin1009/LSTA-Net.
Related papers
- Spatial-Temporal Multi-level Association for Video Object Segmentation [89.32226483171047]
This paper proposes spatial-temporal multi-level association, which jointly associates reference frame, test frame, and object features.
Specifically, we construct a spatial-temporal multi-level feature association module to learn better target-aware features.
arXiv Detail & Related papers (2024-04-09T12:44:34Z) - Multi-Scale Spatial-Temporal Self-Attention Graph Convolutional Networks for Skeleton-based Action Recognition [0.0]
In this paper, we propose self-attention GCN hybrid model, Multi-Scale Spatial-Temporal self-attention (MSST)-GCN.
We utilize spatial self-attention module with adaptive topology to understand intra-frame interactions within a frame among different body parts, and temporal self-attention module to examine correlations between frames of a node.
arXiv Detail & Related papers (2024-04-03T10:25:45Z) - A Decoupled Spatio-Temporal Framework for Skeleton-based Action
Segmentation [89.86345494602642]
Existing methods are limited in weak-temporal modeling capability.
We propose a Decoupled Scoupled Framework (DeST) to address the issues.
DeST significantly outperforms current state-of-the-art methods with less computational complexity.
arXiv Detail & Related papers (2023-12-10T09:11:39Z) - Spatial-Temporal Graph Convolutional Gated Recurrent Network for Traffic
Forecasting [3.9761027576939414]
We propose a novel framework for traffic forecasting, named Spatial-Temporal Graph Convolutional Gated Recurrent Network (STGCGRN)
We design an attention module to capture long-term dependency by mining periodic information in traffic data.
Experiments on four datasets demonstrate the superior performance of our model.
arXiv Detail & Related papers (2022-10-06T08:02:20Z) - TraverseNet: Unifying Space and Time in Message Passing [46.12086583451224]
This paper aims to unify spatial dependency and temporal dependency in a non-Euclidean space.
We propose TraverseNet, a novel spatial-temporal graph neural network, viewing space and time as an inseparable whole.
arXiv Detail & Related papers (2021-08-25T04:35:08Z) - Spatial-Temporal Graph ODE Networks for Traffic Flow Forecasting [22.421667339552467]
Spatial-temporal forecasting has attracted tremendous attention in a wide range of applications, and traffic flow prediction is a canonical and typical example.
Existing works typically utilize shallow graph convolution networks (GNNs) and temporal extracting modules to model spatial and temporal dependencies respectively.
We propose Spatial-Temporal Graph Ordinary Differential Equation Networks (STGODE), which captures spatial-temporal dynamics through a tensor-based ordinary differential equation (ODE)
We evaluate our model on multiple real-world traffic datasets and superior performance is achieved over state-of-the-art baselines.
arXiv Detail & Related papers (2021-06-24T11:48:45Z) - ORDNet: Capturing Omni-Range Dependencies for Scene Parsing [135.11360962062957]
We build an Omni-Range Dependencies Network (ORDNet) which can effectively capture short-, middle- and long-range dependencies.
Our ORDNet is able to extract more comprehensive context information and well adapt to complex spatial variance in scene images.
arXiv Detail & Related papers (2021-01-11T14:51:11Z) - Spatial-Temporal Fusion Graph Neural Networks for Traffic Flow
Forecasting [35.072979313851235]
spatial-temporal data forecasting of traffic flow is a challenging task because of complicated spatial dependencies and dynamical trends of temporal pattern between different roads.
Existing frameworks typically utilize given spatial adjacency graph and sophisticated mechanisms for modeling spatial and temporal correlations.
This paper proposes Spatial-Temporal Fusion Graph Neural Networks (STFGNN) for traffic flow forecasting.
arXiv Detail & Related papers (2020-12-15T14:03:17Z) - Supporting Optimal Phase Space Reconstructions Using Neural Network
Architecture for Time Series Modeling [68.8204255655161]
We propose an artificial neural network with a mechanism to implicitly learn the phase spaces properties.
Our approach is either as competitive as or better than most state-of-the-art strategies.
arXiv Detail & Related papers (2020-06-19T21:04:47Z) - Disentangling and Unifying Graph Convolutions for Skeleton-Based Action
Recognition [79.33539539956186]
We propose a simple method to disentangle multi-scale graph convolutions and a unified spatial-temporal graph convolutional operator named G3D.
By coupling these proposals, we develop a powerful feature extractor named MS-G3D based on which our model outperforms previous state-of-the-art methods on three large-scale datasets.
arXiv Detail & Related papers (2020-03-31T11:28:25Z) - Spatial-Temporal Transformer Networks for Traffic Flow Forecasting [74.76852538940746]
We propose a novel paradigm of Spatial-Temporal Transformer Networks (STTNs) to improve the accuracy of long-term traffic forecasting.
Specifically, we present a new variant of graph neural networks, named spatial transformer, by dynamically modeling directed spatial dependencies.
The proposed model enables fast and scalable training over a long range spatial-temporal dependencies.
arXiv Detail & Related papers (2020-01-09T10:21:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.