Controllable Diverse Sampling for Diffusion Based Motion Behavior
Forecasting
- URL: http://arxiv.org/abs/2402.03981v1
- Date: Tue, 6 Feb 2024 13:16:54 GMT
- Title: Controllable Diverse Sampling for Diffusion Based Motion Behavior
Forecasting
- Authors: Yiming Xu, Hao Cheng, Monika Sester
- Abstract summary: We introduce a novel trajectory generator named Controllable Diffusion Trajectory (CDT)
CDT integrates information and social interactions into a Transformer-based conditional denoising diffusion model to guide the prediction of future trajectories.
To ensure multimodality, we incorporate behavioral tokens to direct the trajectory's modes, such as going straight, turning right or left.
- Score: 11.106812447960186
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In autonomous driving tasks, trajectory prediction in complex traffic
environments requires adherence to real-world context conditions and behavior
multimodalities. Existing methods predominantly rely on prior assumptions or
generative models trained on curated data to learn road agents' stochastic
behavior bounded by scene constraints. However, they often face mode averaging
issues due to data imbalance and simplistic priors, and could even suffer from
mode collapse due to unstable training and single ground truth supervision.
These issues lead the existing methods to a loss of predictive diversity and
adherence to the scene constraints. To address these challenges, we introduce a
novel trajectory generator named Controllable Diffusion Trajectory (CDT), which
integrates map information and social interactions into a Transformer-based
conditional denoising diffusion model to guide the prediction of future
trajectories. To ensure multimodality, we incorporate behavioral tokens to
direct the trajectory's modes, such as going straight, turning right or left.
Moreover, we incorporate the predicted endpoints as an alternative behavioral
token into the CDT model to facilitate the prediction of accurate trajectories.
Extensive experiments on the Argoverse 2 benchmark demonstrate that CDT excels
in generating diverse and scene-compliant trajectories in complex urban
settings.
Related papers
- Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - Tractable Joint Prediction and Planning over Discrete Behavior Modes for
Urban Driving [15.671811785579118]
We show that we can parameterize autoregressive closed-loop models without retraining.
We propose fully reactive closed-loop planning over discrete latent modes.
Our approach also outperforms the previous state-of-the-art in CARLA on challenging dense traffic scenarios.
arXiv Detail & Related papers (2024-03-12T01:00:52Z) - Time-series Generation by Contrastive Imitation [87.51882102248395]
We study a generative framework that seeks to combine the strengths of both: Motivated by a moment-matching objective to mitigate compounding error, we optimize a local (but forward-looking) transition policy.
At inference, the learned policy serves as the generator for iterative sampling, and the learned energy serves as a trajectory-level measure for evaluating sample quality.
arXiv Detail & Related papers (2023-11-02T16:45:25Z) - Layout Sequence Prediction From Noisy Mobile Modality [53.49649231056857]
Trajectory prediction plays a vital role in understanding pedestrian movement for applications such as autonomous driving and robotics.
Current trajectory prediction models depend on long, complete, and accurately observed sequences from visual modalities.
We propose LTrajDiff, a novel approach that treats objects obstructed or out of sight as equally important as those with fully visible trajectories.
arXiv Detail & Related papers (2023-10-09T20:32:49Z) - A Diffusion-Model of Joint Interactive Navigation [14.689298253430568]
We present DJINN - a diffusion based method of generating traffic scenarios.
Our approach jointly diffuses the trajectories of all agents, conditioned on a flexible set of state observations from the past, present, or future.
We show how DJINN flexibly enables direct test-time sampling from a variety of valuable conditional distributions.
arXiv Detail & Related papers (2023-09-21T22:10:20Z) - Planning with Occluded Traffic Agents using Bi-Level Variational
Occlusion Models [37.09462631862042]
Reasoning with occluded traffic agents is a significant open challenge for planning for autonomous vehicles.
Recent deep learning models have shown impressive results for predicting occluded agents based on the behaviour of nearby visible agents.
We propose Bi-level Variational Occlusion Models (BiVO), a two-step generative model that first predicts likely locations of occluded agents, and then generates likely trajectories for the occluded agents.
arXiv Detail & Related papers (2022-10-26T09:39:31Z) - Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion [88.45326906116165]
We present a new framework to formulate the trajectory prediction task as a reverse process of motion indeterminacy diffusion (MID)
We encode the history behavior information and the social interactions as a state embedding and devise a Transformer-based diffusion model to capture the temporal dependencies of trajectories.
Experiments on the human trajectory prediction benchmarks including the Stanford Drone and ETH/UCY datasets demonstrate the superiority of our method.
arXiv Detail & Related papers (2022-03-25T16:59:08Z) - Congestion-aware Multi-agent Trajectory Prediction for Collision
Avoidance [110.63037190641414]
We propose to learn congestion patterns explicitly and devise a novel "Sense--Learn--Reason--Predict" framework.
By decomposing the learning phases into two stages, a "student" can learn contextual cues from a "teacher" while generating collision-free trajectories.
In experiments, we demonstrate that the proposed model is able to generate collision-free trajectory predictions in a synthetic dataset.
arXiv Detail & Related papers (2021-03-26T02:42:33Z) - Multi-modal anticipation of stochastic trajectories in a dynamic
environment with Conditional Variational Autoencoders [0.12183405753834559]
Short-term motion of nearby vehicles is not strictly limited to a set of single trajectories.
We propose to account for the multi-modality of the problem with use of Conditional Conditional Autoencoder (C-VAE) conditioned on an agent's past motion as well as a scene encoded with Capsule Network (CapsNet)
In addition, we demonstrate advantages of employing the Minimum over N generated samples and tries to minimise the loss with respect to the closest sample, effectively leading to more diverse predictions.
arXiv Detail & Related papers (2021-03-05T19:38:26Z) - Multimodal Trajectory Prediction via Topological Invariance for
Navigation at Uncontrolled Intersections [45.508973373913946]
We focus on decentralized navigation among multiple non-communicating rational agents at street intersections without traffic signs or signals.
Our key insight is that the geometric structure of the intersection and the incentive of agents to move efficiently and avoid collisions (rationality) reduces the space of likely behaviors.
We design Multiple Topologies Prediction (MTP), a data-driven trajectory-prediction mechanism that reconstructs trajectory representations of high-likelihood modes in multiagent intersection scenes.
arXiv Detail & Related papers (2020-11-08T02:56:42Z) - SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction [72.37440317774556]
We propose advances that address two key challenges in future trajectory prediction.
multimodality in both training data and predictions and constant time inference regardless of number of agents.
arXiv Detail & Related papers (2020-07-26T08:17:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.