Abstract
With the continuous growth of urban traffic flow, intelligent traffic signal control (TSC) has become an important means of improving traffic efficiency. In particular, the deep reinforcement learning (DRL) algorithm Deep Q-Network (DQN) has been successfully applied to TSC. We focus on three problems: the complex state representations of existing traffic models, the insufficient performance of DQN when a multilayer perceptron (MLP) is used as the action network, and the over-estimation of Q-values, which degrades convergence. To mine latent traffic-state information from limited features and improve model efficiency, we propose a DQN softmax cross-entropy (DQN-SCE) TSC algorithm. First, the model uses the current phase and the queue length as the state representation and defines the reward function solely in terms of queue length. Second, a multi-head self-attention mechanism is used to fuse the state features. Finally, we propose an improved DRL algorithm, DQN-SCE, which augments DQN with a cross-entropy loss between the current-action outputs of the target network and the action network. Experimental results on CityFlow show that the proposed TSC algorithm outperforms several traditional and reinforcement learning methods on the metric of average travel time, and that it also performs well compared with the standard DQN and several of its improved variants.
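To make the core idea of DQN-SCE concrete, the following is a minimal sketch of what the augmented loss might look like. The abstract only states that a cross-entropy term over current actions is added between the target network and the action network; the exact formulation, the function name `dqn_sce_loss`, and the weighting parameter `sce_weight` are assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def dqn_sce_loss(online_net, target_net, batch, gamma=0.99, sce_weight=1.0):
    """Hypothetical sketch: standard DQN TD loss plus a softmax
    cross-entropy (SCE) term between the action distributions of the
    online (action) network and the target network on current states."""
    states, actions, rewards, next_states, dones = batch

    # Standard DQN temporal-difference loss.
    q_online = online_net(states)                              # (B, num_actions)
    q_taken = q_online.gather(1, actions.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        q_next = target_net(next_states).max(dim=1).values
        td_target = rewards + gamma * (1.0 - dones) * q_next
    td_loss = F.smooth_l1_loss(q_taken, td_target)

    # Assumed SCE term: cross-entropy between the softmax action
    # distributions produced by the two networks on the current states.
    with torch.no_grad():
        target_dist = F.softmax(target_net(states), dim=1)
    log_online_dist = F.log_softmax(q_online, dim=1)
    sce_loss = -(target_dist * log_online_dist).sum(dim=1).mean()

    return td_loss + sce_weight * sce_loss
```

Intuitively, such a term would pull the action network's softmax distribution toward that of the slower-moving target network, which is one plausible way to damp the Q-value over-estimation the abstract identifies.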
