Abstract
As the number of vehicles continues to rise, traffic congestion has become a significant factor limiting travel efficiency. The development of intelligent connected technology offers new approaches to the coordinated optimization of road traffic. This study proposes a deep reinforcement learning-based vehicle–infrastructure cooperation framework for the cooperative control of vehicle navigation and traffic signals, aiming to minimize the impact of traffic congestion on road traffic efficiency. The vehicle navigation and signal control tasks are modeled as a partially observable Markov decision process: based on real-time road condition information, roadside agents flexibly adjust signal phases, while vehicle agents choose the next routing step as vehicles approach intersections. Through cooperative communication, on-board units and roadside units acquire more comprehensive and accurate traffic state information. A spatiotemporal reward for the vehicle navigation task and a pressure-based reward for the signal control task drive the reinforcement learning agents to continuously improve their ability to optimize traffic efficiency during training. The proposed method was implemented and evaluated in the SUMO (Simulation of Urban MObility) simulator under low, medium, and high traffic demand and compared against baseline methods. The study also analyzes the impact of reward design, training sequence, and intelligent-connectivity penetration rate on framework performance, demonstrating the effectiveness and robustness of the proposed vehicle–infrastructure collaborative optimization framework in improving network-wide traffic efficiency.
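The abstract mentions a pressure-based reward for the signal control task but does not spell out its form. As a minimal sketch, the snippet below shows one common "max-pressure"-style formulation computed through SUMO's TraCI interface: the reward penalizes the imbalance between vehicles queued on a junction's incoming lanes and those on its outgoing lanes. The config file name ("net.sumocfg") and the traffic-light ID ("J1") are hypothetical placeholders, and this is an illustrative assumption rather than the paper's exact reward definition.

```python
# Sketch of a pressure-style signal-control reward via SUMO's TraCI API.
# Assumes SUMO is installed and a simulation config exists; names below
# ("net.sumocfg", "J1") are hypothetical.
import traci


def pressure_reward(tls_id: str) -> float:
    """Negative intersection pressure: the imbalance between vehicle
    counts on incoming lanes and on the corresponding outgoing lanes."""
    # Each entry of getControlledLinks is a list of (inLane, outLane, viaLane).
    links = traci.trafficlight.getControlledLinks(tls_id)
    in_lanes = {link[0][0] for link in links if link}
    out_lanes = {link[0][1] for link in links if link}
    inflow = sum(traci.lane.getLastStepVehicleNumber(l) for l in in_lanes)
    outflow = sum(traci.lane.getLastStepVehicleNumber(l) for l in out_lanes)
    return -abs(inflow - outflow)  # agent is rewarded for balancing flows


if __name__ == "__main__":
    traci.start(["sumo", "-c", "net.sumocfg"])  # hypothetical config file
    for _ in range(100):
        traci.simulationStep()
        r = pressure_reward("J1")  # hypothetical traffic-light ID
        # ...feed r into the RL update for the roadside agent...
    traci.close()
```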
