Investigating Pose Representations and Motion Contexts Modeling for 3D
Motion Prediction
- URL: http://arxiv.org/abs/2112.15012v1
- Date: Thu, 30 Dec 2021 10:45:22 GMT
- Title: Investigating Pose Representations and Motion Contexts Modeling for 3D
Motion Prediction
- Authors: Zhenguang Liu, Shuang Wu, Shuyuan Jin, Shouling Ji, Qi Liu, Shijian
Lu, and Li Cheng
- Abstract summary: We conduct an in-depth study on various pose representations, with a focus on their effects on the motion prediction task.
We propose a novel RNN architecture termed AHMR (Attentive Hierarchical Motion Recurrent network) for motion prediction.
Our approach outperforms state-of-the-art methods in short-term prediction and achieves markedly better long-term prediction.
- Score: 63.62263239934777
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Predicting human motion from a historical pose sequence is crucial for a
machine to succeed in intelligent interactions with humans. One aspect that has
been overlooked so far is that how we represent the skeletal pose has a
critical impact on the prediction results, yet no prior effort investigates
across different pose representation schemes. We conduct an in-depth study on
various pose representations, with a focus on their effects on the motion
prediction task. Moreover, recent approaches build upon off-the-shelf RNN units
for motion prediction. These approaches process the input pose sequence
sequentially and inherently have difficulties in capturing long-term
dependencies. In this paper, we propose a novel RNN architecture termed AHMR
(Attentive Hierarchical Motion Recurrent network) for motion prediction, which
simultaneously models local motion contexts and a global context. We further
explore a geodesic loss and a forward kinematics loss for the motion prediction
task, which have more geometric significance than the widely employed L2 loss.
We also apply our method to a range of articulated objects, including humans,
fish, and mice. Empirical results show that our approach outperforms
state-of-the-art methods in short-term prediction and achieves markedly better
long-term prediction, retaining natural, human-like motion over prediction
horizons of 50 seconds. Our code is released.
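The abstract names a geodesic loss and a forward kinematics loss without defining them here. Below is a minimal PyTorch sketch of how the two are commonly realized, assuming rotation-matrix pose parameters and hypothetical `offsets`/`parents` skeleton descriptors; it illustrates the standard formulations, not the authors' released code.

```python
import torch

def geodesic_loss(R_pred, R_gt, eps=1e-7):
    """Mean geodesic distance on SO(3) between predicted and ground-truth
    per-joint rotation matrices of shape (..., 3, 3)."""
    R_rel = R_pred @ R_gt.transpose(-1, -2)            # relative rotation
    trace = R_rel.diagonal(dim1=-2, dim2=-1).sum(-1)   # batched trace
    # Rotation angle of R_rel: arccos((trace - 1) / 2), clamped for stability.
    cos = ((trace - 1.0) / 2.0).clamp(-1.0 + eps, 1.0 - eps)
    return torch.acos(cos).mean()

def forward_kinematics(R, offsets, parents):
    """Compose local joint rotations into global joint positions.
    R: (B, J, 3, 3) local rotations; offsets: (J, 3) bone vectors in the
    parent frame; parents[j]: parent joint index (-1 for the root).
    Assumes joints are ordered so every parent precedes its children;
    the root translation is omitted for brevity."""
    B, J = R.shape[:2]
    glob_R, pos = [None] * J, [None] * J
    for j in range(J):
        p = parents[j]
        if p < 0:  # root joint sits at the origin
            glob_R[j] = R[:, j]
            pos[j] = torch.zeros(B, 3, device=R.device)
        else:      # rotate the bone offset into the world frame
            glob_R[j] = glob_R[p] @ R[:, j]
            pos[j] = pos[p] + glob_R[p] @ offsets[j]
    return torch.stack(pos, dim=1)                     # (B, J, 3)

def fk_loss(R_pred, R_gt, offsets, parents):
    """L2 loss in joint-position space rather than in parameter space."""
    return torch.nn.functional.mse_loss(
        forward_kinematics(R_pred, offsets, parents),
        forward_kinematics(R_gt, offsets, parents))
```

Both losses are differentiable compositions of standard ops, so they can replace an L2 parameter loss in any training loop.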
Related papers
- SPOTR: Spatio-temporal Pose Transformers for Human Motion Prediction [12.248428883804763]
3D human motion prediction is a research area of high significance and a challenging problem in computer vision.
Traditionally, autoregressive models have been used to predict human motion.
We present a non-autoregressive model for human motion prediction.
arXiv Detail & Related papers (2023-03-11T01:44:29Z)
- 3D Skeleton-based Human Motion Prediction with Manifold-Aware GAN [3.1313293632309827]
We propose a novel solution for 3D skeleton-based human motion prediction.
We build a manifold-aware Wasserstein generative adversarial model that captures the temporal and spatial dependencies of human motion.
Experiments have been conducted on the CMU MoCap and Human 3.6M datasets.
arXiv Detail & Related papers (2022-03-01T20:49:13Z)
- Dyadic Human Motion Prediction [119.3376964777803]
We introduce a motion prediction framework that explicitly reasons about the interactions of two observed subjects.
Specifically, we achieve this by introducing a pairwise attention mechanism that models the mutual dependencies in the motion history of the two subjects (a minimal sketch follows this entry).
This allows us to preserve the long-term motion dynamics in a more realistic way and more robustly predict unusual and fast-paced movements.
arXiv Detail & Related papers (2021-12-01T10:30:40Z)
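The entry above only names the pairwise attention mechanism; the sketch below shows one plausible reading, cross-attention from one subject's motion history to the other's, where all dimensions, names, and wiring are assumptions rather than the paper's actual architecture.

```python
import torch
import torch.nn as nn

class PairwiseAttention(nn.Module):
    """Cross-attention between the motion histories of two subjects
    (illustrative sketch; not the paper's exact design)."""
    def __init__(self, pose_dim, d_model=128, n_heads=4):
        super().__init__()
        self.embed = nn.Linear(pose_dim, d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, hist_a, hist_b):
        # hist_a, hist_b: (B, T, pose_dim) pose histories of subjects A and B.
        qa, kb = self.embed(hist_a), self.embed(hist_b)
        # Every timestep of A attends over B's full history, so mutual
        # dependencies between the two motions become explicit features.
        ctx_a, _ = self.attn(query=qa, key=kb, value=kb)
        return ctx_a  # (B, T, d_model): B-aware context for predicting A
```

A symmetric call with the arguments swapped yields the context for subject B; both contexts would then feed whatever decoder predicts each subject's future poses.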
- Generating Smooth Pose Sequences for Diverse Human Motion Prediction [90.45823619796674]
We introduce a unified deep generative network for both diverse and controllable motion prediction.
Our experiments on two standard benchmark datasets, Human3.6M and HumanEva-I, demonstrate that our approach outperforms the state-of-the-art baselines in terms of both sample diversity and accuracy.
arXiv Detail & Related papers (2021-08-19T00:58:00Z)
- Multi-level Motion Attention for Human Motion Prediction [132.29963836262394]
We study the use of different types of attention, computed at joint, body part, and full pose levels.
Our experiments on Human3.6M, AMASS and 3DPW validate the benefits of our approach for both periodic and non-periodic actions.
arXiv Detail & Related papers (2021-06-17T08:08:11Z)
- Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking [98.91894395941766]
We propose a novel online approach to learning the pose dynamics, which are independent of pose detections in the current frame.
Specifically, we derive the prediction of dynamics through a graph neural network (GNN) that explicitly accounts for both spatial-temporal and visual information.
Experiments on PoseTrack 2017 and PoseTrack 2018 datasets demonstrate that the proposed method achieves results superior to the state of the art on both human pose estimation and tracking tasks.
arXiv Detail & Related papers (2021-06-07T16:36:50Z)
- Human Motion Prediction Using Manifold-Aware Wasserstein GAN [4.771549505875783]
We build a manifold-aware Wasserstein generative adversarial model that captures the temporal and spatial dependencies of human motion.
Our approach outperforms the state-of-the-art on the CMU MoCap and Human 3.6M datasets.
arXiv Detail & Related papers (2021-05-18T17:56:10Z)
- Long Term Motion Prediction Using Keyposes [122.22758311506588]
We argue that, to achieve long-term forecasting, predicting human pose at every time instant is unnecessary.
We call such poses "keyposes" and approximate complex motions by linearly interpolating between subsequent keyposes (see the interpolation sketch after this entry).
We show that learning the sequence of such keyposes allows us to predict very long-term motion, up to 5 seconds into the future.
arXiv Detail & Related papers (2020-12-08T20:45:51Z)
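As referenced in the keypose entry above, here is a minimal sketch of reconstructing a dense motion by linearly interpolating between consecutive keyposes; tensor names and shapes are illustrative assumptions, not the paper's code.

```python
import torch

def interpolate_keyposes(keyposes, key_times, query_times):
    """Linearly interpolate a dense pose sequence from sparse keyposes.
    keyposes: (K, D) pose vectors; key_times: (K,) increasing timestamps;
    query_times: (T,) timestamps at which dense poses are wanted."""
    # Locate the keypose interval [idx - 1, idx] containing each query time.
    idx = torch.searchsorted(key_times, query_times).clamp(1, len(key_times) - 1)
    t0, t1 = key_times[idx - 1], key_times[idx]
    # Blend weight within the interval, clamped so out-of-range queries
    # saturate at the nearest keypose instead of extrapolating.
    w = ((query_times - t0) / (t1 - t0)).clamp(0.0, 1.0).unsqueeze(-1)
    return (1.0 - w) * keyposes[idx - 1] + w * keyposes[idx]
```

For example, `interpolate_keyposes(k, torch.tensor([0., 2., 5.]), torch.linspace(0, 5, 126))` would densify three keyposes into a 126-frame, 5-second motion at 25 fps.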