AnyPose: Anytime 3D Human Pose Forecasting via Neural Ordinary
Differential Equations
- URL: http://arxiv.org/abs/2309.04840v1
- Date: Sat, 9 Sep 2023 16:59:57 GMT
- Title: AnyPose: Anytime 3D Human Pose Forecasting via Neural Ordinary
Differential Equations
- Authors: Zixing Wang, Ahmed H. Qureshi
- Abstract summary: AnyPose is a lightweight continuous-time neural architecture that models human behavior dynamics with neural ordinary differential equations.
Our results demonstrate that AnyPose exhibits high-performance accuracy in predicting future poses and takes significantly lower computational time than traditional methods.
- Score: 2.7195102129095003
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Anytime 3D human pose forecasting is crucial to synchronous real-world
human-machine interaction, where the term ``anytime" corresponds to predicting
human pose at any real-valued time step. However, to the best of our knowledge,
all the existing methods in human pose forecasting perform predictions at
preset, discrete time intervals. Therefore, we introduce AnyPose, a lightweight
continuous-time neural architecture that models human behavior dynamics with
neural ordinary differential equations. We validate our framework on the
Human3.6M, AMASS, and 3DPW dataset and conduct a series of comprehensive
analyses towards comparison with existing methods and the intersection of human
pose and neural ordinary differential equations. Our results demonstrate that
AnyPose exhibits high-performance accuracy in predicting future poses and takes
significantly lower computational time than traditional methods in solving
anytime prediction tasks.
Related papers
- Personalized Pose Forecasting [28.46838162184121]
We reformulate the human motion forecasting problem and present a model-agnostic personalization method.
Motion forecasting personalization can be performed efficiently online by utilizing a low-parametric time-series analysis model.
arXiv Detail & Related papers (2023-12-06T14:43:38Z) - TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting [27.3359362364858]
We present an efficient multi-view pose estimation model that learns a robust temporal representation.
Our model is able to generalize across datasets without fine-tuning.
arXiv Detail & Related papers (2023-09-14T17:56:30Z) - A Neuro-Symbolic Approach for Enhanced Human Motion Prediction [5.742409080817885]
We propose a neuro-symbolic approach for human motion prediction (NeuroSyM)
NeuroSyM weights differently the interactions in the neighbourhood by leveraging an intuitive technique for spatial representation called qualitative Trajectory Calculus (QTC)
Experimental results show that the NeuroSyM approach outperforms in most cases the baseline architectures in terms of prediction accuracy.
arXiv Detail & Related papers (2023-04-23T20:11:40Z) - Learning signatures of decision making from many individuals playing the
same game [54.33783158658077]
We design a predictive framework that learns representations to encode an individual's 'behavioral style'
We apply our method to a large-scale behavioral dataset from 1,000 humans playing a 3-armed bandit task.
arXiv Detail & Related papers (2023-02-21T21:41:53Z) - Using Features at Multiple Temporal and Spatial Resolutions to Predict
Human Behavior in Real Time [2.955419572714387]
We present an approach for integrating high and low-resolution spatial and temporal information to predict human behavior in real time.
Our model composes neural networks for high and low-resolution feature extraction with a neural network for behavior prediction, with all three networks trained simultaneously.
arXiv Detail & Related papers (2022-11-12T18:41:33Z) - A generic diffusion-based approach for 3D human pose prediction in the
wild [68.00961210467479]
3D human pose forecasting, i.e., predicting a sequence of future human 3D poses given a sequence of past observed ones, is a challenging-temporal task.
We provide a unified formulation in which incomplete elements (no matter in the prediction or observation) are treated as noise and propose a conditional diffusion model that denoises them and forecasts plausible poses.
We investigate our findings on four standard datasets and obtain significant improvements over the state-of-the-art.
arXiv Detail & Related papers (2022-10-11T17:59:54Z) - Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation [13.40702053084305]
We present a temporally embedded 3D human body pose and shape estimation (TePose) method to improve the accuracy and temporal consistency pose in live stream videos.
A multi-scale convolutional network is presented as the motion discriminator for adversarial training using datasets without any 3D labeling.
arXiv Detail & Related papers (2022-07-25T21:21:59Z) - Investigating Pose Representations and Motion Contexts Modeling for 3D
Motion Prediction [63.62263239934777]
We conduct an indepth study on various pose representations with a focus on their effects on the motion prediction task.
We propose a novel RNN architecture termed AHMR (Attentive Hierarchical Motion Recurrent network) for motion prediction.
Our approach outperforms the state-of-the-art methods in short-term prediction and achieves much enhanced long-term prediction proficiency.
arXiv Detail & Related papers (2021-12-30T10:45:22Z) - LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human
Bodies [78.17425779503047]
We propose a novel neural implicit representation for the human body.
It is fully differentiable and optimizable with disentangled shape and pose latent spaces.
Our model can be trained and fine-tuned directly on non-watertight raw data with well-designed losses.
arXiv Detail & Related papers (2021-11-30T04:10:57Z) - Scene Synthesis via Uncertainty-Driven Attribute Synchronization [52.31834816911887]
This paper introduces a novel neural scene synthesis approach that can capture diverse feature patterns of 3D scenes.
Our method combines the strength of both neural network-based and conventional scene synthesis approaches.
arXiv Detail & Related papers (2021-08-30T19:45:07Z) - Self-Attentive 3D Human Pose and Shape Estimation from Videos [82.63503361008607]
We present a video-based learning algorithm for 3D human pose and shape estimation.
We exploit temporal information in videos and propose a self-attention module.
We evaluate our method on the 3DPW, MPI-INF-3DHP, and Human3.6M datasets.
arXiv Detail & Related papers (2021-03-26T00:02:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.