Symphony: Learning Realistic and Diverse Agents for Autonomous Driving
Simulation
- URL: http://arxiv.org/abs/2205.03195v1
- Date: Fri, 6 May 2022 13:21:40 GMT
- Title: Symphony: Learning Realistic and Diverse Agents for Autonomous Driving
Simulation
- Authors: Maximilian Igl, Daewoo Kim, Alex Kuefler, Paul Mougin, Punit Shah,
Kyriacos Shiarlis, Dragomir Anguelov, Mark Palatucci, Brandyn White, Shimon
Whiteson
- Abstract summary: We propose Symphony, which greatly improves realism by combining conventional policies with a parallel beam search.
Symphony addresses this issue with a hierarchical approach, factoring agent behaviour into goal generation and goal conditioning.
Experiments confirm that Symphony agents learn more realistic and diverse behaviour than several baselines.
- Score: 45.09881984441893
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Simulation is a crucial tool for accelerating the development of autonomous
vehicles. Making simulation realistic requires models of the human road users
who interact with such cars. Such models can be obtained by applying learning
from demonstration (LfD) to trajectories observed by cars already on the road.
However, existing LfD methods are typically insufficient, yielding policies
that frequently collide or drive off the road. To address this problem, we
propose Symphony, which greatly improves realism by combining conventional
policies with a parallel beam search. The beam search refines these policies on
the fly by pruning branches that are unfavourably evaluated by a discriminator.
However, it can also harm diversity, i.e., how well the agents cover the entire
distribution of realistic behaviour, as pruning can encourage mode collapse.
Symphony addresses this issue with a hierarchical approach, factoring agent
behaviour into goal generation and goal conditioning. The use of such goals
ensures that agent diversity neither disappears during adversarial training nor
is pruned away by the beam search. Experiments on both proprietary and open
Waymo datasets confirm that Symphony agents learn more realistic and diverse
behaviour than several baselines.
Related papers
- Autonomous Vehicle Controllers From End-to-End Differentiable Simulation [60.05963742334746]
We propose a differentiable simulator and design an analytic policy gradients (APG) approach to training AV controllers.
Our proposed framework brings the differentiable simulator into an end-to-end training loop, where gradients of environment dynamics serve as a useful prior to help the agent learn a more grounded policy.
We find significant improvements in performance and robustness to noise in the dynamics, as well as overall more intuitive human-like handling.
arXiv Detail & Related papers (2024-09-12T11:50:06Z) - SAFE-SIM: Safety-Critical Closed-Loop Traffic Simulation with Diffusion-Controllable Adversaries [94.84458417662407]
We introduce SAFE-SIM, a controllable closed-loop safety-critical simulation framework.
Our approach yields two distinct advantages: 1) generating realistic long-tail safety-critical scenarios that closely reflect real-world conditions, and 2) providing controllable adversarial behavior for more comprehensive and interactive evaluations.
We validate our framework empirically using the nuScenes and nuPlan datasets across multiple planners, demonstrating improvements in both realism and controllability.
arXiv Detail & Related papers (2023-12-31T04:14:43Z) - Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with
Prompt Learning [4.195122359359966]
Large Language Models (LLMs) are trained on mass knowledge and proved to be equipped with astonishing inference abilities.
In this work, we leverage LLMs to understand and profile the system dynamics by a prompt-based grounded action transformation.
arXiv Detail & Related papers (2023-08-28T03:49:13Z) - Optimizing Trajectories for Highway Driving with Offline Reinforcement
Learning [11.970409518725491]
We propose a Reinforcement Learning-based approach to autonomous driving.
We compare the performance of our agent against four other highway driving agents.
We demonstrate that our offline trained agent, with randomly collected data, learns to drive smoothly, achieving as close as possible to the desired velocity, while outperforming the other agents.
arXiv Detail & Related papers (2022-03-21T13:13:08Z) - Learning Interactive Driving Policies via Data-driven Simulation [125.97811179463542]
Data-driven simulators promise high data-efficiency for driving policy learning.
Small underlying datasets often lack interesting and challenging edge cases for learning interactive driving.
We propose a simulation method that uses in-painted ado vehicles for learning robust driving policies.
arXiv Detail & Related papers (2021-11-23T20:14:02Z) - Multi-Agent Variational Occlusion Inference Using People as Sensors [28.831182328958477]
Inferring occupancy from agent behaviors is an inherently multimodal problem.
We propose an occlusion inference method that characterizes observed behaviors of human agents as sensor measurements.
Our approach is validated on a real-world dataset, outperforming baselines and demonstrating real-time capable performance.
arXiv Detail & Related papers (2021-09-05T21:56:54Z) - Building Safer Autonomous Agents by Leveraging Risky Driving Behavior
Knowledge [1.52292571922932]
This study focuses on creating risk prone scenarios with heavy traffic and unexpected random behavior for creating better model-free learning agents.
We generate multiple autonomous driving scenarios by creating new custom Markov Decision Process (MDP) environment iterations in highway-env simulation package.
We train model free learning agents with supplement information of risk prone driving scenarios and compare their performance with baseline agents.
arXiv Detail & Related papers (2021-03-16T23:39:33Z) - TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors [74.67698916175614]
We propose TrafficSim, a multi-agent behavior model for realistic traffic simulation.
In particular, we leverage an implicit latent variable model to parameterize a joint actor policy.
We show TrafficSim generates significantly more realistic and diverse traffic scenarios as compared to a diverse set of baselines.
arXiv Detail & Related papers (2021-01-17T00:29:30Z) - SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for
Autonomous Driving [96.50297622371457]
Multi-agent interaction is a fundamental aspect of autonomous driving in the real world.
Despite more than a decade of research and development, the problem of how to interact with diverse road users in diverse scenarios remains largely unsolved.
We develop a dedicated simulation platform called SMARTS that generates diverse and competent driving interactions.
arXiv Detail & Related papers (2020-10-19T18:26:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.