Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction
- URL: http://arxiv.org/abs/2503.24272v1
- Date: Mon, 31 Mar 2025 16:17:45 GMT
- Title: Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction
- Authors: Yizhou Huang, Yihua Cheng, Kezhi Wang,
- Abstract summary: We propose a self-supervised pedestrian trajectory prediction framework.<n>We leverage velocity and acceleration information to enhance position prediction.<n>We conduct experiments on the ETH-UCY and Stanford Drone datasets.
- Score: 16.532357621144342
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Understanding human motion is crucial for accurate pedestrian trajectory prediction. Conventional methods typically rely on supervised learning, where ground-truth labels are directly optimized against predicted trajectories. This amplifies the limitations caused by long-tailed data distributions, making it difficult for the model to capture abnormal behaviors. In this work, we propose a self-supervised pedestrian trajectory prediction framework that explicitly models position, velocity, and acceleration. We leverage velocity and acceleration information to enhance position prediction through feature injection and a self-supervised motion consistency mechanism. Our model hierarchically injects velocity features into the position stream. Acceleration features are injected into the velocity stream. This enables the model to predict position, velocity, and acceleration jointly. From the predicted position, we compute corresponding pseudo velocity and acceleration, allowing the model to learn from data-generated pseudo labels and thus achieve self-supervised learning. We further design a motion consistency evaluation strategy grounded in physical principles; it selects the most reasonable predicted motion trend by comparing it with historical dynamics and uses this trend to guide and constrain trajectory generation. We conduct experiments on the ETH-UCY and Stanford Drone datasets, demonstrating that our method achieves state-of-the-art performance on both datasets.
Related papers
- Dynamic Intent Queries for Motion Transformer-based Trajectory Prediction [36.287188668060075]
In autonomous driving, accurately predicting the movements of other traffic participants is crucial.
Our research addresses this limitation by integrating scene-specific dynamic intention points into the MTR model.
Our findings demonstrate that incorporating dynamic intention points has a significant positive impact on trajectory accuracy.
arXiv Detail & Related papers (2025-04-22T10:20:35Z) - Physical spline for denoising object trajectory data by combining splines, ML feature regression and model knowledge [0.0]
This article presents a method for estimating the dynamic driving states (position, velocity, acceleration and heading) from noisy measurement data.
The proposed approach is effective with both complete and partial observations.
arXiv Detail & Related papers (2025-04-08T19:53:57Z) - Multi-agent Traffic Prediction via Denoised Endpoint Distribution [23.767783008524678]
Trajectory prediction at high speeds requires historical features and interactions with surrounding entities.
We present the Denoised Distribution model for trajectory prediction.
Our approach significantly reduces model complexity and performance through endpoint information.
arXiv Detail & Related papers (2024-05-11T15:41:32Z) - ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties [6.865435680843742]
We propose a novel diffusion-based, acceleratable framework that adeptly predicts future trajectories of agents with enhanced resistance to noise.
Our method meets the rigorous real-time operational standards essential for autonomous vehicles.
It achieves significant improvement in multi-agent motion prediction on the Argoverse 1 motion forecasting dataset.
arXiv Detail & Related papers (2024-05-01T18:16:55Z) - AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving [59.94343412438211]
We introduce the GPT style next token motion prediction into motion prediction.
Different from language data which is composed of homogeneous units -words, the elements in the driving scene could have complex spatial-temporal and semantic relations.
We propose to adopt three factorized attention modules with different neighbors for information aggregation and different position encoding styles to capture their relations.
arXiv Detail & Related papers (2024-03-20T06:22:37Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - MSTFormer: Motion Inspired Spatial-temporal Transformer with
Dynamic-aware Attention for long-term Vessel Trajectory Prediction [0.6451914896767135]
MSTFormer is a motion inspired vessel trajectory prediction method based on Transformer.
We propose a data augmentation method to describe the spatial features and motion features of the trajectory.
Second, we propose a Multi-headed Dynamic-aware Self-attention mechanism to focus on trajectory points with frequent motion transformations.
Third, we construct a knowledge-inspired loss function to further boost the performance of the model.
arXiv Detail & Related papers (2023-03-21T02:11:37Z) - STGlow: A Flow-based Generative Framework with Dual Graphormer for
Pedestrian Trajectory Prediction [22.553356096143734]
We propose a novel generative flow based framework with dual graphormer for pedestrian trajectory prediction (STGlow)
Our method can more precisely model the underlying data distribution by optimizing the exact log-likelihood of motion behaviors.
Experimental results on several benchmarks demonstrate that our method achieves much better performance compared to previous state-of-the-art approaches.
arXiv Detail & Related papers (2022-11-21T07:29:24Z) - StreamYOLO: Real-time Object Detection for Streaming Perception [84.2559631820007]
We endow the models with the capacity of predicting the future, significantly improving the results for streaming perception.
We consider multiple velocities driving scene and propose Velocity-awared streaming AP (VsAP) to jointly evaluate the accuracy.
Our simple method achieves the state-of-the-art performance on Argoverse-HD dataset and improves the sAP and VsAP by 4.7% and 8.2% respectively.
arXiv Detail & Related papers (2022-07-21T12:03:02Z) - Physics-Inspired Temporal Learning of Quadrotor Dynamics for Accurate
Model Predictive Trajectory Tracking [76.27433308688592]
Accurately modeling quadrotor's system dynamics is critical for guaranteeing agile, safe, and stable navigation.
We present a novel Physics-Inspired Temporal Convolutional Network (PI-TCN) approach to learning quadrotor's system dynamics purely from robot experience.
Our approach combines the expressive power of sparse temporal convolutions and dense feed-forward connections to make accurate system predictions.
arXiv Detail & Related papers (2022-06-07T13:51:35Z) - Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion [88.45326906116165]
We present a new framework to formulate the trajectory prediction task as a reverse process of motion indeterminacy diffusion (MID)
We encode the history behavior information and the social interactions as a state embedding and devise a Transformer-based diffusion model to capture the temporal dependencies of trajectories.
Experiments on the human trajectory prediction benchmarks including the Stanford Drone and ETH/UCY datasets demonstrate the superiority of our method.
arXiv Detail & Related papers (2022-03-25T16:59:08Z) - SGCN:Sparse Graph Convolution Network for Pedestrian Trajectory
Prediction [64.16212996247943]
We present a Sparse Graph Convolution Network(SGCN) for pedestrian trajectory prediction.
Specifically, the SGCN explicitly models the sparse directed interaction with a sparse directed spatial graph to capture adaptive interaction pedestrians.
visualizations indicate that our method can capture adaptive interactions between pedestrians and their effective motion tendencies.
arXiv Detail & Related papers (2021-04-04T03:17:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.