A multi-agent reinforcement learning-based approach with lateral-longitudinal coupling for highway on-ramp merging control of autonomous vehicle

Abstract

To improve the safety and efficiency of autonomous vehicles when lane changing occur in the parallel-type on-ramp merging condition on the highway, this paper proposes a multi-agent deep reinforcement learning-based control model in considering the lateral and longitudinal dynamic-coupling to deal with complex high-density traffic flow scenarios. The model takes dynamic traffic state as input, and accurately controls the longitudinal acceleration and lateral front wheel angle of the autonomous vehicle through the dynamics coupling and state interaction between lateral and longitudinal agents. The kernel of this model is a Multi-Head Adaptive Attention Proximal Policy Optimization (MAAPPO) algorithm, which improves the network learning efficiency and driving policy stability. To satisfy the vehicle safety, comfort, and efficient driving requirements, this paper proposes a Multi-Dimensional Balanced Optimization of Reward Scheme (MDBORS), which achieves multi-dimensional control objective cooperation. To train and test the model, we constructed a highway ramp merging simulation experiment by using different traffic flow inputs. Results show that the proposed approach can generate efficient and safe control instructions. We also compare our approach with a state-of-the-art approach, our model has significant advantages in key metrics such as lane change time, trajectory length, and Time-To-Collision (TTC), verifying its excellent performance in the field of ramp merging control strategy for autonomous vehicles.

Keywords

Parallel-type ramp condition MDBORS multi-agent coupling control model MAAPPO algorithm

Get full access to this article

View all access options for this article.

References

Wang

Zhang

Huang

, et al. Safety of autonomous vehicles. J Adv Transp 2020; 2020: 8867757.

Faghihian

Sargolzaei

. Energy efficiency of connected autonomous vehicles: a review. Electronics 2023; 12(19): 4086.

Guo

Zhang

Cai

, et al. Effects of level 3 automated vehicle drivers’ fatigue on their take-over behaviour: a literature review. J Adv Transp 2021; 2021: 1–12.

Wang

Hadiuzzaman

Qiu

, et al. Sensitivity analysis of freeway capacity at a complex weaving segment. In: CICTP 2014: safe, smart, and sustainable multimodal transportation systems, 2014, pp.596–608. Reston, VA: American Society of Civil Engineers.

Ahammed

Hassan

Sayed

. Modeling driver behavior and safety on freeway merging areas. J Transp Eng 2008; 134(9): 370–377.

Wang

Jiang

, et al. A multi-agent reinforcement learning-based longitudinal and lateral control of CAVs to improve traffic efficiency in a mandatory lane change scenario. Transp Res Part C Emerg Technol 2024; 158: 104445.

Tajalli

Niroumand

Hajbabaie

. Distributed cooperative trajectory and lane changing optimization of connected automated vehicles: freeway segments with lane drop. Transp Res Part C Emerg Technol 2022; 143: 103761.

Chen

Dong

, et al. Graph neural network and reinforcement learning for multi-agent cooperative control of connected autonomous vehicles. Comput Aided Civ Infrastruct Eng 2021; 36(7): 838–857.

Altché

de La Fortelle

. An LSTM network for highway trajectory prediction. In: Proceedings of the IEEE international conference on intelligent transport systems (ITSC), Yokohama, Japan, 16–19 October 2017, pp.353–359. New York: IEEE.

10.

Devlin

Chang

Lee

, et al. BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein

Doran

Solorio

(eds) Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers). Minneapolis, MN: Association for Computational Linguistics, 2019, pp.4171–4186.

11.

Dankwa

Zheng

. Twin-delayed DDPG: a deep reinforcement learning technique to model a continuous movement of an intelligent robot agent. In: Proceedings of the 3rd international conference of vision, image and signal process, Vancouver, BC, Canada, 26–28 August 2019, pp.1–5. New York: ACM.

12.

Ding

Peng

, et al. A rule-based cooperative merging strategy for connected and automated vehicles. IEEE Trans Intell Transp Syst 2020; 21(8): 3436–3446.

13.

Kesting

Treiber

Helbing

. MOBIL: general lane changing model for car-following models. Transp Res Record 2007; 199(1): 86–94.

14.

Urmson

Anhalt

Bagnell

, et al. Autonomous driving in urban environments: boss and the urban challenge. J Field Robot 2008; 25(8): 425–466.

15.

Shen

Sun

, et al. Heuristics based cooperative planning for highway on-ramp merge. In: 2018 21st international conference on intelligent transportation systems (ITSC), Maui, HI, USA, 4–7 November 2018, pp.1266–1272. New York: IEEE.

16.

Schwarting

Alonso-Mora

Rus

. Planning and decision-making for autonomous vehicles. Annu Rev Control Robot Auton Syst 2018; 1(1): 187–210.

17.

Zhu

Tasic

. Flow-level coordination of connected and autonomous vehicles in multilane freeway ramp merging areas. Multimodal Transp 2022; 1(1): 100005.

18.

Karimi

Roncoli

Alecsandru

, et al. Cooperative merging control via trajectory optimization in mixed vehicular traffic. Transp Res Part C Emerg Technol 2020; 116: 102663.

19.

Tang

Zhu

Zhang

, et al. A novel hierarchical cooperative merging control model of connected and automated vehicles featuring flexible merging positions in system optimization. Transp Res Part C Emerg Technol 2022; 138: 103650.

20.

Sun

Huang

Zhang

. Cooperative decision-making for mixed traffic: a ramp merging example. Transp Res Part C Emerg Technol 2020; 120: 102764.

21.

Lubars

Gupta

Chinchali

, et al. Combining reinforcement learning with model predictive control for on-ramp merging. In: 2021 IEEE international intelligent transportation systems conference (ITSC), Indianapolis, IN, USA, 19–22 September 2021, pp.942–947. New York: IEEE.

22.

Mahabal

Fang

Wang

. On-ramp merging for connected autonomous vehicles using deep reinforcement learning. In: 2022 IEEE international conferences on Internet of Things (IThings) and IEEE green computing & communications (GreenCom) and IEEE cyber, physical & social computing (CPSCom) and IEEE smart data (SmartData) and IEEE congress on cybermatics (Cybermatics), Espoo, Finland, 22–25 August 2022, pp.56–61. New York: IEEE.

23.

Lin

Mcphee

Azad

. Anti-jerk on-ramp merging using deep reinforcement learning. arXiv preprint arXiv:1909.12967, 2019.

24.

Chen

Jiang

, et al. Deep reinforcement learning algorithm based ramp merging decision model. Proc IMechE, Part D: J Automobile Engineering 2024; 239(1): 70–84.

25.

Pei

Chen

, et al. A safe and efficient lane change decision-making strategy of autonomous driving based on deep reinforcement learning. Mathematics 2022; 10(9): 1551.

26.

Zhao

Sun

. A deep reinforcement learning approach for automated on-ramp merging. In: 2022 IEEE 25th international conference on intelligent transportation systems (ITSC), Macau, China, 8–12 October 2022, pp.3800–3806. New York: IEEE.

27.

Nishitani

Yang

Guo

, et al. Deep merging: vehicle merging controller based on deep reinforcement learning with embedding network. In: IEEE international conference on robotics and automation (ICRA), Paris, France, 31 May–31 August 2020, pp.216–221. New York: IEEE.

28.

Kerner

Rehborn

. Experimental properties of phase transitions in traffic flow. Phys Rev Lett 1997; 79: 4030.

29.

Wang

Yuan

Guo

, et al. A deep reinforcement learning-based approach for autonomous driving in highway on-ramp merge. Proc IMechE, Part D: J Automobile Engineering 2021; 235(10–11): 2726–2739.