A Deep Reinforcement Learning Approach for Traffic Signal Control
Optimization
- URL: http://arxiv.org/abs/2107.06115v1
- Date: Tue, 13 Jul 2021 14:11:04 GMT
- Title: A Deep Reinforcement Learning Approach for Traffic Signal Control
Optimization
- Authors: Zhenning Li, Chengzhong Xu, Guohui Zhang
- Abstract summary: Inefficient traffic signal control methods may cause numerous problems, such as traffic congestion and wasted energy.
This paper proposes a multi-agent deep deterministic policy gradient (MADDPG) method that extends actor-critic policy gradient algorithms.
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Inefficient traffic signal control methods may cause numerous problems, such
as traffic congestion and wasted energy. Reinforcement learning (RL) is a
trending data-driven approach for adaptive traffic signal control in complex
urban traffic networks. Although the development of deep neural networks (DNNs)
further enhances its learning capability, several challenges remain in
applying deep RL to transportation networks with multiple signalized
intersections, including non-stationary environments, the exploration-exploitation
dilemma, multi-agent training schemes, and continuous action spaces. To
address these issues, this paper proposes a multi-agent deep
deterministic policy gradient (MADDPG) method that extends actor-critic
policy gradient algorithms. MADDPG follows a centralized-learning,
decentralized-execution paradigm in which critics use additional information to
streamline the training process, while actors act on their own local
observations. The model is evaluated via simulation on the Simulation of Urban
MObility (SUMO) platform. Model comparison results demonstrate the efficiency
of the proposed algorithm in controlling traffic lights.
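The centralized-learning, decentralized-execution structure described in the abstract can be made concrete with a short sketch. The code below is a minimal illustration, not the authors' implementation: the class names, network sizes, and dimensions (four intersections, 10-dimensional local observations, 2-dimensional continuous actions) are all assumptions chosen for readability. It shows only the structural point of MADDPG: the critic conditions on the joint observations and actions of all agents during training, while each actor acts on its own local observation at execution time.

```python
# Sketch of MADDPG's centralized-critic / decentralized-actor structure.
# NOT the paper's code; all sizes and names are illustrative assumptions.
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Decentralized actor: maps one agent's LOCAL observation to a
    bounded continuous action (e.g., a signal-timing adjustment)."""
    def __init__(self, obs_dim, act_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 64), nn.ReLU(),
            nn.Linear(64, act_dim), nn.Tanh(),  # Tanh keeps actions in [-1, 1]
        )

    def forward(self, obs):
        return self.net(obs)

class CentralizedCritic(nn.Module):
    """Centralized critic: during training it sees the observations and
    actions of ALL agents, which mitigates the non-stationarity that each
    agent would otherwise face from its peers' changing policies."""
    def __init__(self, n_agents, obs_dim, act_dim):
        super().__init__()
        joint_dim = n_agents * (obs_dim + act_dim)
        self.net = nn.Sequential(
            nn.Linear(joint_dim, 128), nn.ReLU(),
            nn.Linear(128, 1),  # scalar Q-value for the joint state-action
        )

    def forward(self, all_obs, all_acts):
        # all_obs: (batch, n_agents, obs_dim); all_acts: (batch, n_agents, act_dim)
        x = torch.cat([all_obs.flatten(1), all_acts.flatten(1)], dim=-1)
        return self.net(x)

# Assumed sizes for illustration: 4 intersections, 10-dim local observations,
# 2-dim continuous actions per intersection.
n_agents, obs_dim, act_dim = 4, 10, 2
actors = [Actor(obs_dim, act_dim) for _ in range(n_agents)]
critic = CentralizedCritic(n_agents, obs_dim, act_dim)

obs = torch.randn(1, n_agents, obs_dim)  # one batch of joint observations
# Execution is decentralized: each actor only sees its own observation slice.
acts = torch.stack([a(obs[:, i]) for i, a in enumerate(actors)], dim=1)
q = critic(obs, acts)  # centralized Q-value, used only during training
print(q.shape)  # torch.Size([1, 1])
```

At execution time only the actors are deployed, one per intersection; the critic (and the extra joint information it consumes) is discarded after training.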