SceneDM: Scene-level Multi-agent Trajectory Generation with Consistent
Diffusion Models
- URL: http://arxiv.org/abs/2311.15736v1
- Date: Mon, 27 Nov 2023 11:39:27 GMT
- Title: SceneDM: Scene-level Multi-agent Trajectory Generation with Consistent
Diffusion Models
- Authors: Zhiming Guo, Xing Gao, Jianlan Zhou, Xinyu Cai, Botian Shi
- Abstract summary: We propose a novel framework based on diffusion models, called SceneDM, to generate joint and consistent future motions of all the agents in a scene.
SceneDM achieves state-of-the-art results on the Sim Agents Benchmark.
- Score: 10.057312592344507
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Realistic scene-level multi-agent motion simulations are crucial for
developing and evaluating self-driving algorithms. However, most existing works
focus on generating trajectories for a certain single agent type, and typically
ignore the consistency of generated trajectories. In this paper, we propose a
novel framework based on diffusion models, called SceneDM, to generate joint
and consistent future motions of all the agents, including vehicles, bicycles,
pedestrians, etc., in a scene. To enhance the consistency of the generated
trajectories, we resort to a new Transformer-based network to effectively
handle agent-agent interactions in the inverse process of motion diffusion. In
consideration of the smoothness of agent trajectories, we further design a
simple yet effective consistent diffusion approach, to improve the model in
exploiting short-term temporal dependencies. Furthermore, a scene-level scoring
function is attached to evaluate the safety and road-adherence of the generated
agent's motions and help filter out unrealistic simulations. Finally, SceneDM
achieves state-of-the-art results on the Waymo Sim Agents Benchmark. Project
webpage is available at https://alperen-hub.github.io/SceneDM.
Related papers
- KiGRAS: Kinematic-Driven Generative Model for Realistic Agent Simulation [17.095651262950568]
Trajectory generation is a pivotal task in autonomous driving.
Recent studies have introduced the autoregressive paradigm.
We propose the Kinematic-Driven Generative Model for Realistic Agent Simulation.
arXiv Detail & Related papers (2024-07-17T18:12:11Z) - Motion Flow Matching for Human Motion Synthesis and Editing [75.13665467944314]
We propose emphMotion Flow Matching, a novel generative model for human motion generation featuring efficient sampling and effectiveness in motion editing applications.
Our method reduces the sampling complexity from thousand steps in previous diffusion models to just ten steps, while achieving comparable performance in text-to-motion and action-to-motion generation benchmarks.
arXiv Detail & Related papers (2023-12-14T12:57:35Z) - Trajeglish: Traffic Modeling as Next-Token Prediction [67.28197954427638]
A longstanding challenge for self-driving development is simulating dynamic driving scenarios seeded from recorded driving logs.
We apply tools from discrete sequence modeling to model how vehicles, pedestrians and cyclists interact in driving scenarios.
Our model tops the Sim Agents Benchmark, surpassing prior work along the realism meta metric by 3.3% and along the interaction metric by 9.9%.
arXiv Detail & Related papers (2023-12-07T18:53:27Z) - Real-Time Motion Prediction via Heterogeneous Polyline Transformer with
Relative Pose Encoding [121.08841110022607]
Existing agent-centric methods have demonstrated outstanding performance on public benchmarks.
We introduce the K-nearest neighbor attention with relative pose encoding (KNARPE), a novel attention mechanism allowing the pairwise-relative representation to be used by Transformers.
By sharing contexts among agents and reusing the unchanged contexts, our approach is as efficient as scene-centric methods, while performing on par with state-of-the-art agent-centric methods.
arXiv Detail & Related papers (2023-10-19T17:59:01Z) - MotionLM: Multi-Agent Motion Forecasting as Language Modeling [15.317827804763699]
We present MotionLM, a language model for multi-agent motion prediction.
Our approach bypasses post-hoc interactions where individual agent trajectory generation is conducted prior to interactive scoring.
The model's sequential factorization enables temporally causal conditional rollouts.
arXiv Detail & Related papers (2023-09-28T15:46:25Z) - MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and
Guided Intention Querying [110.83590008788745]
Motion prediction is crucial for autonomous driving systems to understand complex driving scenarios and make informed decisions.
In this paper, we propose Motion TRansformer (MTR) frameworks to address these challenges.
The initial MTR framework utilizes a transformer encoder-decoder structure with learnable intention queries.
We introduce an advanced MTR++ framework, extending the capability of MTR to simultaneously predict multimodal motion for multiple agents.
arXiv Detail & Related papers (2023-06-30T16:23:04Z) - JFP: Joint Future Prediction with Interactive Multi-Agent Modeling for
Autonomous Driving [12.460224193998362]
We propose an end-to-end trainable model that learns directly the interaction between pairs of agents in a structured, graphical model formulation.
Our approach improves significantly on the trajectory overlap metrics while obtaining on-par or better performance on single-agent trajectory metrics.
arXiv Detail & Related papers (2022-12-16T20:59:21Z) - TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors [74.67698916175614]
We propose TrafficSim, a multi-agent behavior model for realistic traffic simulation.
In particular, we leverage an implicit latent variable model to parameterize a joint actor policy.
We show TrafficSim generates significantly more realistic and diverse traffic scenarios as compared to a diverse set of baselines.
arXiv Detail & Related papers (2021-01-17T00:29:30Z) - Exploring Dynamic Context for Multi-path Trajectory Prediction [33.66335553588001]
We propose a novel framework, named Dynamic Context Network (DCENet)
In our framework, the spatial context between agents is explored by using self-attention architectures.
A set of future trajectories for each agent is predicted conditioned on the learned spatial-temporal context.
arXiv Detail & Related papers (2020-10-30T13:39:20Z) - Traffic Agent Trajectory Prediction Using Social Convolution and
Attention Mechanism [57.68557165836806]
We propose a model to predict the trajectories of target agents around an autonomous vehicle.
We encode the target agent history trajectories as an attention mask and construct a social map to encode the interactive relationship between the target agent and its surrounding agents.
To verify the effectiveness of our method, we widely compare with several methods on a public dataset, achieving a 20% error decrease.
arXiv Detail & Related papers (2020-07-06T03:48:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.