Adaptive Kalman-based hybrid car following strategy using TD3 and CACC
- URL: http://arxiv.org/abs/2312.15993v1
- Date: Tue, 26 Dec 2023 10:51:46 GMT
- Title: Adaptive Kalman-based hybrid car following strategy using TD3 and CACC
- Authors: Yuqi Zheng, Ruidong Yan, Bin Jia, Rui Jiang, Adriana TAPUS, Xiaojing
Chen, Shiteng Zheng, Ying Shang
- Abstract summary: In autonomous driving, the hybrid strategy of deep reinforcement learning and cooperative adaptive cruise control (CACC) can significantly improve the performance of car following.
It is challenging for the traditional hybrid strategy based on fixed coefficients to adapt to mixed traffic flow scenarios.
A hybrid car following strategy based on an adaptive Kalman Filter is proposed by regarding CACC and Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithms.
- Score: 5.052960220478617
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In autonomous driving, the hybrid strategy of deep reinforcement learning and
cooperative adaptive cruise control (CACC) can fully utilize the advantages of
the two algorithms and significantly improve the performance of car following.
However, it is challenging for the traditional hybrid strategy based on fixed
coefficients to adapt to mixed traffic flow scenarios, which may decrease the
performance and even lead to accidents. To address the above problems, a hybrid
car following strategy based on an adaptive Kalman Filter is proposed by
regarding CACC and Twin Delayed Deep Deterministic Policy Gradient (TD3)
algorithms. Different from traditional hybrid strategy based on fixed
coefficients, the Kalman gain H, using as an adaptive coefficient, is derived
from multi-timestep predictions and Monte Carlo Tree Search. At the end of
study, simulation results with 4157745 timesteps indicate that, compared with
the TD3 and HCFS algorithms, the proposed algorithm in this study can
substantially enhance the safety of car following in mixed traffic flow without
compromising the comfort and efficiency.
Related papers
- A Value Based Parallel Update MCTS Method for Multi-Agent Cooperative Decision Making of Connected and Automated Vehicles [9.840325772591024]
This paper proposes a Monte Carlo tree search (MCTS) method with parallel update for multi-agent Markov game with limited horizon rationality and time discounted setting.
By analyzing the parallel actions in the multi-vehicle joint action space in the partial-steady-state traffic flow, the parallel update method can quickly exclude potential dangerous actions.
arXiv Detail & Related papers (2024-09-20T03:13:01Z) - Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework [79.088116316919]
Connected Autonomous Vehicles (CAVs) have begun to open road testing around the world, but their safety and efficiency performance in complex scenarios is still not satisfactory.
This paper proposes CoDrivingLLM, an interactive and learnable LLM-driven cooperative driving framework.
arXiv Detail & Related papers (2024-09-19T14:36:00Z) - Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach [0.0]
This research introduces an innovative method for adaptive traffic signal control (ATSC) through the utilization of multi-objective deep reinforcement learning (DRL) techniques.
The proposed approach aims to enhance control strategies at intersections while simultaneously addressing safety, efficiency, and decarbonization objectives.
arXiv Detail & Related papers (2024-08-01T13:10:41Z) - Performance Comparison of Deep RL Algorithms for Mixed Traffic Cooperative Lane-Changing [3.4761212729163304]
Lane-changing is a challenging scenario for connected and automated vehicles (CAVs) because of the complex dynamics and high uncertainty of the traffic environment.
This study enhances the current CLCMT mechanism by considering both the uncertainty of the human-driven vehicles (HVs) and the microscopic interactions between HVs and CAVs.
Performance comparison among the four DRL algorithms demonstrates that DDPG, TD3, SAC, and PPO algorithms can deal with uncertainty in traffic environments.
arXiv Detail & Related papers (2024-06-25T07:49:25Z) - Roulette-Wheel Selection-Based PSO Algorithm for Solving the Vehicle
Routing Problem with Time Windows [58.891409372784516]
This paper presents a novel form of the PSO methodology that uses the Roulette Wheel Method (RWPSO)
Experiments using the Solomon VRPTW benchmark datasets on the RWPSO demonstrate that RWPSO is competitive with other state-of-the-art algorithms from the literature.
arXiv Detail & Related papers (2023-06-04T09:18:02Z) - Unified Automatic Control of Vehicular Systems with Reinforcement
Learning [64.63619662693068]
This article contributes a streamlined methodology for vehicular microsimulation.
It discovers high performance control strategies with minimal manual design.
The study reveals numerous emergent behaviors resembling wave mitigation, traffic signaling, and ramp metering.
arXiv Detail & Related papers (2022-07-30T16:23:45Z) - Comparative analysis of machine learning methods for active flow control [60.53767050487434]
Genetic Programming (GP) and Reinforcement Learning (RL) are gaining popularity in flow control.
This work presents a comparative analysis of the two, bench-marking some of their most representative algorithms against global optimization techniques.
arXiv Detail & Related papers (2022-02-23T18:11:19Z) - Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in
Connected and Automated Hybrid Electric Vehicles [3.5259944260228977]
This work proposes a Safe Off-policy Model-Based Reinforcement Learning algorithm for the eco-driving problem.
The proposed algorithm leads to a policy with a higher average speed and a better fuel economy compared to the model-free agent.
arXiv Detail & Related papers (2021-05-25T03:41:29Z) - Using Kalman Filter The Right Way: Noise Estimation Is Not Optimal [46.556605821252276]
We show that even a seemingly small violation of KF assumptions can significantly modify the effective noise.
We suggest a method to apply gradient-based optimization efficiently to the symmetric and positive-definite (SPD) parameters of KF.
arXiv Detail & Related papers (2021-04-06T08:59:15Z) - Hybrid Car-Following Strategy based on Deep Deterministic Policy
Gradient and Cooperative Adaptive Cruise Control [7.016756906859412]
A hybrid car-following strategy based on deep deterministic policy gradient (DDPG) and cooperative adaptive cruise control (CACC) is proposed.
The proposed strategy guarantees the basic performance of car-following through CACC, but also makes full use of the advantages of exploration on complex environments via DDPG.
arXiv Detail & Related papers (2021-02-24T17:37:47Z) - Reinforcement Learning Based Vehicle-cell Association Algorithm for
Highly Mobile Millimeter Wave Communication [53.47785498477648]
This paper investigates the problem of vehicle-cell association in millimeter wave (mmWave) communication networks.
We first formulate the user state (VU) problem as a discrete non-vehicle association optimization problem.
The proposed solution achieves up to 15% gains in terms sum of user complexity and 20% reduction in VUE compared to several baseline designs.
arXiv Detail & Related papers (2020-01-22T08:51:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.