Integrated Decision and Control for High-Level Automated Vehicles by
Mixed Policy Gradient and Its Experiment Verification
- URL: http://arxiv.org/abs/2210.10613v1
- Date: Wed, 19 Oct 2022 14:58:41 GMT
- Title: Integrated Decision and Control for High-Level Automated Vehicles by
Mixed Policy Gradient and Its Experiment Verification
- Authors: Yang Guan, Liye Tang, Chuanxiao Li, Shengbo Eben Li, Yangang Ren,
Junqing Wei, Bo Zhang, Keqiang Li
- Abstract summary: This paper presents a self-evolving decision-making system based on the Integrated Decision and Control (IDC)
An RL algorithm called constrained mixed policy gradient (CMPG) is proposed to consistently upgrade the driving policy of the IDC.
Experiment results show that boosting by data, the system can achieve better driving ability over model-based methods.
- Score: 10.393343763237452
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Self-evolution is indispensable to realize full autonomous driving. This
paper presents a self-evolving decision-making system based on the Integrated
Decision and Control (IDC), an advanced framework built on reinforcement
learning (RL). First, an RL algorithm called constrained mixed policy gradient
(CMPG) is proposed to consistently upgrade the driving policy of the IDC. It
adapts the MPG under the penalty method so that it can solve constrained
optimization problems using both the data and model. Second, an attention-based
encoding (ABE) method is designed to tackle the state representation issue. It
introduces an embedding network for feature extraction and a weighting network
for feature fusion, fulfilling order-insensitive encoding and importance
distinguishing of road users. Finally, by fusing CMPG and ABE, we develop the
first data-driven decision and control system under the IDC architecture, and
deploy the system on a fully-functional self-driving vehicle running in daily
operation. Experiment results show that boosting by data, the system can
achieve better driving ability over model-based methods. It also demonstrates
safe, efficient and smart driving behavior in various complex scenes at a
signalized intersection with real mixed traffic flow.
Related papers
- SPformer: A Transformer Based DRL Decision Making Method for Connected Automated Vehicles [9.840325772591024]
We propose a CAV decision-making architecture based on transformer and reinforcement learning algorithms.
A learnable policy token is used as the learning medium of the multi-vehicle joint policy.
Our model can make good use of all the state information of vehicles in traffic scenario.
arXiv Detail & Related papers (2024-09-23T15:16:35Z) - DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving [81.04174379726251]
This paper collects a comprehensive end-to-end driving dataset named DriveCoT.
It contains sensor data, control decisions, and chain-of-thought labels to indicate the reasoning process.
We propose a baseline model called DriveCoT-Agent, trained on our dataset, to generate chain-of-thought predictions and final decisions.
arXiv Detail & Related papers (2024-03-25T17:59:01Z) - Unified Automatic Control of Vehicular Systems with Reinforcement
Learning [64.63619662693068]
This article contributes a streamlined methodology for vehicular microsimulation.
It discovers high performance control strategies with minimal manual design.
The study reveals numerous emergent behaviors resembling wave mitigation, traffic signaling, and ramp metering.
arXiv Detail & Related papers (2022-07-30T16:23:45Z) - Tackling Real-World Autonomous Driving using Deep Reinforcement Learning [63.3756530844707]
In this work, we propose a model-free Deep Reinforcement Learning Planner training a neural network that predicts acceleration and steering angle.
In order to deploy the system on board the real self-driving car, we also develop a module represented by a tiny neural network.
arXiv Detail & Related papers (2022-07-05T16:33:20Z) - AI-aided Traffic Control Scheme for M2M Communications in the Internet
of Vehicles [61.21359293642559]
The dynamics of traffic and the heterogeneous requirements of different IoV applications are not considered in most existing studies.
We consider a hybrid traffic control scheme and use proximal policy optimization (PPO) method to tackle it.
arXiv Detail & Related papers (2022-03-05T10:54:05Z) - Parallelized and Randomized Adversarial Imitation Learning for
Safety-Critical Self-Driving Vehicles [11.463476667274051]
It is essential to consider reliable ADAS function coordination to control the driving system, safely.
This paper proposes a randomized adversarial imitation learning (RAIL) algorithm.
The proposed method is able to train the decision maker that deals with the LIDAR data and controls the autonomous driving in multi-lane complex highway environments.
arXiv Detail & Related papers (2021-12-26T23:42:49Z) - Encoding Integrated Decision and Control for Autonomous Driving with
Mixed Traffic Flow [5.7440882048331705]
Reinforcement learning (RL) has been widely adopted to make intelligent driving policy in autonomous driving.
This paper proposes the encoding integrated decision and control (E-IDC) to handle complicated driving tasks with mixed traffic flows.
arXiv Detail & Related papers (2021-10-24T06:06:27Z) - End-to-End Intersection Handling using Multi-Agent Deep Reinforcement
Learning [63.56464608571663]
Navigating through intersections is one of the main challenging tasks for an autonomous vehicle.
In this work, we focus on the implementation of a system able to navigate through intersections where only traffic signs are provided.
We propose a multi-agent system using a continuous, model-free Deep Reinforcement Learning algorithm used to train a neural network for predicting both the acceleration and the steering angle at each time step.
arXiv Detail & Related papers (2021-04-28T07:54:40Z) - A Driving Behavior Recognition Model with Bi-LSTM and Multi-Scale CNN [59.57221522897815]
We propose a neural network model based on trajectories information for driving behavior recognition.
We evaluate the proposed model on the public BLVD dataset, achieving a satisfying performance.
arXiv Detail & Related papers (2021-03-01T06:47:29Z) - Decision-making for Autonomous Vehicles on Highway: Deep Reinforcement
Learning with Continuous Action Horizon [14.059728921828938]
This paper utilizes the deep reinforcement learning (DRL) method to address the continuous-horizon decision-making problem on the highway.
The running objective of the ego automated vehicle is to execute an efficient and smooth policy without collision.
The PPO-DRL-based decision-making strategy is estimated from multiple perspectives, including the optimality, learning efficiency, and adaptability.
arXiv Detail & Related papers (2020-08-26T22:49:27Z) - Decision-making Strategy on Highway for Autonomous Vehicles using Deep
Reinforcement Learning [6.298084785377199]
A deep reinforcement learning (DRL)-enabled decision-making policy is constructed for autonomous vehicles to address the overtaking behaviors on the highway.
A hierarchical control framework is presented to control these vehicles, which indicates the upper-level manages the driving decisions.
The DDQN-based overtaking policy could accomplish highway driving tasks efficiently and safely.
arXiv Detail & Related papers (2020-07-16T23:41:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.