DMS-GCN: Dynamic Mutiscale Spatiotemporal Graph Convolutional Networks
for Human Motion Prediction
- URL: http://arxiv.org/abs/2112.10365v1
- Date: Mon, 20 Dec 2021 07:07:03 GMT
- Title: DMS-GCN: Dynamic Mutiscale Spatiotemporal Graph Convolutional Networks
for Human Motion Prediction
- Authors: Zigeng Yan, Di-Hua Zhai, Yuanqing Xia
- Abstract summary: We propose a feed-forward deep neural network for motion prediction.
The entire model is suitable for all actions and follows a framework of encoder-decoder.
Our approach outperforms SOTA methods on the datasets of Human3.6M and CMU Mocap.
- Score: 8.142947808507365
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Human motion prediction is an important and challenging task in many computer
vision application domains. Recent work concentrates on utilizing the timing
processing ability of recurrent neural networks (RNNs) to achieve smooth and
reliable results in short-term prediction. However, as evidenced by previous
work, RNNs suffer from errors accumulation, leading to unreliable results. In
this paper, we propose a simple feed-forward deep neural network for motion
prediction, which takes into account temporal smoothness and spatial
dependencies between human body joints. We design a Multi-scale Spatio-temporal
graph convolutional networks (GCNs) to implicitly establish the Spatio-temporal
dependence in the process of human movement, where different scales fused
dynamically during training. The entire model is suitable for all actions and
follows a framework of encoder-decoder. The encoder consists of temporal GCNs
to capture motion features between frames and semi-autonomous learned spatial
GCNs to extract spatial structure among joint trajectories. The decoder uses
temporal convolution networks (TCNs) to maintain its extensive ability.
Extensive experiments show that our approach outperforms SOTA methods on the
datasets of Human3.6M and CMU Mocap while only requiring much lesser
parameters. Code will be available at https://github.com/yzg9353/DMSGCN.
Related papers
- Signal-SGN: A Spiking Graph Convolutional Network for Skeletal Action Recognition via Learning Temporal-Frequency Dynamics [2.9578022754506605]
In skeletal-based action recognition, Graph Convolutional Networks (GCNs) face limitations due to their complexity and high energy consumption.
We propose a Signal-SGN(Spiking Graph Convolutional Network), which leverages the temporal dimension of skeletal sequences as the spiking timestep.
Our experiments show that the proposed models not only surpass existing SNN-based methods in accuracy but also reduce computational storage costs during training.
arXiv Detail & Related papers (2024-08-03T07:47:16Z) - Uncovering the Missing Pattern: Unified Framework Towards Trajectory
Imputation and Prediction [60.60223171143206]
Trajectory prediction is a crucial undertaking in understanding entity movement or human behavior from observed sequences.
Current methods often assume that the observed sequences are complete while ignoring the potential for missing values.
This paper presents a unified framework, the Graph-based Conditional Variational Recurrent Neural Network (GC-VRNN), which can perform trajectory imputation and prediction simultaneously.
arXiv Detail & Related papers (2023-03-28T14:27:27Z) - Intelligence Processing Units Accelerate Neuromorphic Learning [52.952192990802345]
Spiking neural networks (SNNs) have achieved orders of magnitude improvement in terms of energy consumption and latency.
We present an IPU-optimized release of our custom SNN Python package, snnTorch.
arXiv Detail & Related papers (2022-11-19T15:44:08Z) - Scalable Spatiotemporal Graph Neural Networks [14.415967477487692]
Graph neural networks (GNNs) are often the core component of the forecasting architecture.
In most pretemporal GNNs, the computational complexity scales up to a quadratic factor with the length of the sequence times the number of links in the graph.
We propose a scalable architecture that exploits an efficient encoding of both temporal and spatial dynamics.
arXiv Detail & Related papers (2022-09-14T09:47:38Z) - Hybrid SNN-ANN: Energy-Efficient Classification and Object Detection for
Event-Based Vision [64.71260357476602]
Event-based vision sensors encode local pixel-wise brightness changes in streams of events rather than image frames.
Recent progress in object recognition from event-based sensors has come from conversions of deep neural networks.
We propose a hybrid architecture for end-to-end training of deep neural networks for event-based pattern recognition and object detection.
arXiv Detail & Related papers (2021-12-06T23:45:58Z) - Multiscale Spatio-Temporal Graph Neural Networks for 3D Skeleton-Based
Motion Prediction [92.16318571149553]
We propose a multiscale-temporal graph neural network (MST-GNN) to predict the future 3D-based skeleton human poses.
The MST-GNN outperforms state-of-the-art methods in both short and long-term motion prediction.
arXiv Detail & Related papers (2021-08-25T14:05:37Z) - Spatiotemporal convolutional network for time-series prediction and
causal inference [21.895413699349966]
A neural network computing framework, i.N.N., was developed to efficiently and accurately render a multistep-ahead prediction of a time series.
The framework has great potential in practical applications in artificial intelligence (AI) or machine learning fields.
arXiv Detail & Related papers (2021-07-03T06:20:43Z) - Exploiting Spiking Dynamics with Spatial-temporal Feature Normalization
in Graph Learning [9.88508686848173]
Biological spiking neurons with intrinsic dynamics underlie the powerful representation and learning capabilities of the brain.
Despite recent tremendous progress in spiking neural networks (SNNs) for handling Euclidean-space tasks, it still remains challenging to exploit SNNs in processing non-Euclidean-space data.
Here we present a general spike-based modeling framework that enables the direct training of SNNs for graph learning.
arXiv Detail & Related papers (2021-06-30T11:20:16Z) - Progressive Tandem Learning for Pattern Recognition with Deep Spiking
Neural Networks [80.15411508088522]
Spiking neural networks (SNNs) have shown advantages over traditional artificial neural networks (ANNs) for low latency and high computational efficiency.
We propose a novel ANN-to-SNN conversion and layer-wise learning framework for rapid and efficient pattern recognition.
arXiv Detail & Related papers (2020-07-02T15:38:44Z) - Dynamic Multiscale Graph Neural Networks for 3D Skeleton-Based Human
Motion Prediction [102.9787019197379]
We propose novel dynamic multiscale graph neural networks (DMGNN) to predict 3D skeleton-based human motions.
The model is action-category-agnostic and follows an encoder-decoder framework.
The proposed DMGNN outperforms state-of-the-art methods in both short and long-term predictions.
arXiv Detail & Related papers (2020-03-17T02:49:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.