Design of Intersection Control Algorithm Based on Deep Reinforcement Learning

Yi Zheng; Xiaohu Yang; Anping Wang; Fei Li; Chen Li; Yaojun Gui

doi:10.2991/978-94-6239-689-0_27

<Previous Article In Volume

Next Article In Volume>

Design of Intersection Control Algorithm Based on Deep Reinforcement Learning

Authors

Yi Zheng¹, Xiaohu Yang¹, Anping Wang², Fei Li³, Chen Li⁴^{, *}, Yaojun Gui⁴

¹Yunnan Communications Investment Group Investment Co., Ltd., Kunming, 650228, China

²Yunnan Xuanhui Expressway Co., Ltd., Xunwei, 655400, China

³Yunnan Communications Investment Group Public Construction & Bridge Engineering Co., Ltd., Kunming, 650228, China

⁴YCIC Broadvision Engineering Cousultants, Kunming, 650200, China

^*Corresponding author. Email: ynkm_leec@foxmail.com

Corresponding Author

Chen Li

Available Online 28 May 2026.

DOI: 10.2991/978-94-6239-689-0_27 How to use a DOI?
Keywords: Intelligent Traffic Signal Control; Deep Reinforcement Learning; Proximal Policy Optimization; Generalized Advantage Estimation; SUMO; Urban Traffic Efficiency
Abstract: Traditional traffic signal control (TSC) schemes rely on fixed timing or rule‑based adaptation, which lack real‑time responsiveness to stochastic and highly dynamic traffic flows. Deep reinforcement learning (DRL) has become a promising paradigm for adaptive signal control, yet value‑based methods such as DQN and DDQN suffer from overestimation bias, unstable training, and low sampling efficiency in complex urban intersection environments. This paper proposes a PPO-GAE control framework, which combines Proximal Policy Optimization (PPO) with Generalized Advantage Estimation (GAE) to achieve stable, efficient, and robust traffic signal timing optimization. PPO constrains policy updates within a trust region to avoid destructive updates and ensure training stability, while GAE effectively reduces gradient variance and improves the accuracy of advantage estimation for continuous traffic states. Experiments are carried out on the SUMO simulation platform using a real intersection in Xi’an. Results show that the proposed PPO-GAE method reduces cumulative delay by 41.2% and average queue length by 33.7% compared with the fixed‑time baseline, and outperforms DQN, DDQN-PER, and traditional adaptive methods.
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the 2026 2nd International Conference on Data Mining and Project Management (DMPM 2026)
Series: Advances in Economics, Business and Management Research
Publication Date: 28 May 2026
ISBN: 978-94-6239-689-0
ISSN: 2352-5428
DOI: 10.2991/978-94-6239-689-0_27 How to use a DOI?
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

ris enw bib

TY  - CONF
AU  - Yi Zheng
AU  - Xiaohu Yang
AU  - Anping Wang
AU  - Fei Li
AU  - Chen Li
AU  - Yaojun Gui
PY  - 2026
DA  - 2026/05/28
TI  - Design of Intersection Control Algorithm Based on Deep Reinforcement Learning
BT  - Proceedings of the 2026 2nd International Conference on Data Mining and Project Management (DMPM 2026)
PB  - Atlantis Press
SP  - 295
EP  - 301
SN  - 2352-5428
UR  - https://doi.org/10.2991/978-94-6239-689-0_27
DO  - 10.2991/978-94-6239-689-0_27
ID  - Zheng2026
ER  -

download .riscopy to clipboard