A Multi-Agent Reinforcement Learning Approach For Safe and Efficient
Behavior Planning Of Connected Autonomous Vehicles
- URL: http://arxiv.org/abs/2003.04371v3
- Date: Sun, 4 Sep 2022 00:11:55 GMT
- Title: A Multi-Agent Reinforcement Learning Approach For Safe and Efficient
Behavior Planning Of Connected Autonomous Vehicles
- Authors: Songyang Han, Shanglin Zhou, Jiangwei Wang, Lynn Pepin, Caiwen Ding,
Jie Fu, Fei Miao
- Abstract summary: We design an information-sharing-based reinforcement learning framework for connected autonomous vehicles.
We show that our approach can improve the CAV system's efficiency in terms of average velocity and comfort.
We construct an obstacle-at-corner scenario to show that the shared vision can help CAVs to observe obstacles earlier and take action to avoid traffic jams.
- Score: 21.132777568170702
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The recent advancements in wireless technology enable connected autonomous
vehicles (CAVs) to gather information about their environment by
vehicle-to-vehicle (V2V) communication. In this work, we design an
information-sharing-based multi-agent reinforcement learning (MARL) framework
for CAVs, to take advantage of the extra information when making decisions to
improve traffic efficiency and safety. The safe actor-critic algorithm we
propose has two new techniques: the truncated Q-function and safe action
mapping. The truncated Q-function utilizes the shared information from
neighboring CAVs such that the joint state and action spaces of the Q-function
do not grow in our algorithm for a large-scale CAV system. We prove the bound
of the approximation error between the truncated-Q and global Q-functions. The
safe action mapping provides a provable safety guarantee for both the training
and execution based on control barrier functions. Using the CARLA simulator for
experiments, we show that our approach can improve the CAV system's efficiency
in terms of average velocity and comfort under different CAV ratios and
different traffic densities. We also show that our approach avoids the
execution of unsafe actions and always maintains a safe distance from other
vehicles. We construct an obstacle-at-corner scenario to show that the shared
vision can help CAVs to observe obstacles earlier and take action to avoid
traffic jams.
Related papers
- Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework [79.088116316919]
Connected Autonomous Vehicles (CAVs) have begun to open road testing around the world, but their safety and efficiency performance in complex scenarios is still not satisfactory.
This paper proposes CoDrivingLLM, an interactive and learnable LLM-driven cooperative driving framework.
arXiv Detail & Related papers (2024-09-19T14:36:00Z) - Empowering Autonomous Driving with Large Language Models: A Safety Perspective [82.90376711290808]
This paper explores the integration of Large Language Models (LLMs) into Autonomous Driving systems.
LLMs are intelligent decision-makers in behavioral planning, augmented with a safety verifier shield for contextual safety learning.
We present two key studies in a simulated environment: an adaptive LLM-conditioned Model Predictive Control (MPC) and an LLM-enabled interactive behavior planning scheme with a state machine.
arXiv Detail & Related papers (2023-11-28T03:13:09Z) - CAT: Closed-loop Adversarial Training for Safe End-to-End Driving [54.60865656161679]
Adversarial Training (CAT) is a framework for safe end-to-end driving in autonomous vehicles.
Cat aims to continuously improve the safety of driving agents by training the agent on safety-critical scenarios.
Cat can effectively generate adversarial scenarios countering the agent being trained.
arXiv Detail & Related papers (2023-10-19T02:49:31Z) - Convergence of Communications, Control, and Machine Learning for Secure
and Autonomous Vehicle Navigation [78.60496411542549]
Connected and autonomous vehicles (CAVs) can reduce human errors in traffic accidents, increase road efficiency, and execute various tasks. Reaping these benefits requires CAVs to autonomously navigate to target destinations.
This article proposes solutions using the convergence of communication theory, control theory, and machine learning to enable effective and secure CAV navigation.
arXiv Detail & Related papers (2023-07-05T21:38:36Z) - A Survey of Federated Learning for Connected and Automated Vehicles [2.348805691644086]
Connected and Automated Vehicles (CAVs) are one of the emerging technologies in the automotive domain.
Federated learning (FL) is an effective solution for CAVs that enables a collaborative model development with multiple vehicles.
arXiv Detail & Related papers (2023-03-19T14:44:37Z) - Shared Information-Based Safe And Efficient Behavior Planning For
Connected Autonomous Vehicles [6.896682830421197]
We design an integrated information sharing and safe multi-agent reinforcement learning framework for connected autonomous vehicles.
We first use weight pruned convolutional neural networks (CNN) to process the raw image and point cloud LIDAR data locally at each autonomous vehicle.
We then design a safe actor-critic algorithm that utilizes both a vehicle's local observation and the information received via V2V communication.
arXiv Detail & Related papers (2023-02-08T20:31:41Z) - Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of
Connected Autonomous Vehicles in Challenging Scenarios [10.37986799561165]
Communication technologies enable coordination among connected and autonomous vehicles (CAVs)
We propose a framework of constrained multi-agent reinforcement learning (MARL) with a parallel safety shield for CAVs.
Results show that our proposed methodology significantly increases system safety and efficiency in challenging scenarios.
arXiv Detail & Related papers (2022-10-05T14:39:07Z) - Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement
Learning [9.48157144651867]
We present a novel approach for Robust on-ramp merge of CAVs via Augmented and Multi-modal Reinforcement Learning.
Specifically, we formulate the on-ramp merging problem as a Markov decision process (MDP) by taking driving safety, comfort driving behavior, and traffic efficiency into account.
To provide reliable merging maneuvers, we simultaneously leverage BSM and surveillance images for multi-modal observation.
arXiv Detail & Related papers (2022-07-21T16:34:57Z) - Differentiable Control Barrier Functions for Vision-based End-to-End
Autonomous Driving [100.57791628642624]
We introduce a safety guaranteed learning framework for vision-based end-to-end autonomous driving.
We design a learning system equipped with differentiable control barrier functions (dCBFs) that is trained end-to-end by gradient descent.
arXiv Detail & Related papers (2022-03-04T16:14:33Z) - Transferable Deep Reinforcement Learning Framework for Autonomous
Vehicles with Joint Radar-Data Communications [69.24726496448713]
We propose an intelligent optimization framework based on the Markov Decision Process (MDP) to help the AV make optimal decisions.
We then develop an effective learning algorithm leveraging recent advances of deep reinforcement learning techniques to find the optimal policy for the AV.
We show that the proposed transferable deep reinforcement learning framework reduces the obstacle miss detection probability by the AV up to 67% compared to other conventional deep reinforcement learning approaches.
arXiv Detail & Related papers (2021-05-28T08:45:37Z) - Facilitating Connected Autonomous Vehicle Operations Using
Space-weighted Information Fusion and Deep Reinforcement Learning Based
Control [6.463332275753283]
This paper describes a Deep Reinforcement Learning based approach that integrates the data collected through sensing and connectivity capabilities from other vehicles.
It is expected that implementation of the algorithm in CAVs can enhance the safety and mobility associated with CAV driving operations.
arXiv Detail & Related papers (2020-09-30T13:38:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.