Autonomous Heterogeneous Mining Fleet Control at Nonorthogonal Intersections by Hippopotamus Optimization Algorithm-Based Adaptive Proximal Policy Optimization

Abstract

This work focuses on the autonomous heterogeneous fleet cooperative control problem in nonorthogonal intersections and proposes a fleet control algorithm to give the fleet target acceleration to achieve longitudinal control of the fleet with the objective of minimizing travel delay. We propose a hybrid hippopotamus optimization algorithm (HOA)-based adaptive proximal policy optimization (PPO) algorithm that leverages HOA for adaptive learning rate tuning in PPO. It addresses the challenge of controlling the operation of a heterogeneous fleet of trucks by calculating the priorities of all trucks in the intersection area and incorporating them into the state space of the reinforcement learning. Furthermore, our framework’s state-action space design is universally adaptable, functioning robustly across all intersection types, including those with non-orthogonal angles. Finally, the simulation experiments demonstrate that our proposed method can be adapted to different operational scenarios and obtain better policy with fewer training episodes than traditional learning rate setting methods.

Keywords

data and data science artificial intelligence and advanced computing applications machine learning (artificial intelligence)reinforcement learning

Get full access to this article

View all access options for this article.

References

Wang

Dai

Bian

Xie

Yang

Real-Time Truck Dispatching in Open-Pit Mines. International Journal of Mining, Reclamation and Environment, Vol. 12, 2023, pp. 1–20.

Yuan

Yang

Tang

X. F.

Chen

A. W.

Autonomous Vehicle Motion Planning Based on Improved RRT* Algorithm and Trajectory Optimization (in Chinese). Acta Automatica Sinica, Vol. 48, No. 12, 2022, pp. 2941–2950.

Paden

Cap

Yong

S. Z.

Yershov

Frazzoli

A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles. IEEE Transactions on Intelligent Vehicles, Vol. 1, No. 1, 2016, pp. 33–55.

Yuan

Shu

Huang

Zhang

Khajepour

Zhang

Mixed Local Motion Planning and Tracking Control Framework for Autonomous Vehicles Based on Model Predictive Control. IET Intelligent Transport Systems, Vol. 13, No. 6, 2019, pp. 950–959.

Amer

N. H.

Zamzuri

Hudha

Kadir

Z. A.

Modelling and Control Strategies in Path Tracking Control for Autonomous Ground Vehicles: A Review of State of the Art and Challenges. Journal of Intelligent & Robotic Systems, Vol. 86, No. 2, 2017, pp. 225–254.

Dresner

Stone

A Multiagent Approach to Autonomous Intersection Management. Journal of Artificial Intelligence Research, Vol. 31, No. 1, 2008, pp. 591–656.

Krishnan

Govind Aadithya

Ramakrishnan

Arvindh

Sivanathan

A Look at Motion Planning for AVs at an Intersection. Proc., 21st International Conference on Intelligent Transportation Systems (ITSC), Maui, HI, IEEE, New York, NY, 2018, pp. 333–340.

Onieva

Hernandez-Jayo

Osaba

Perallos

Zhang

A Multi-Objective Evolutionary Algorithm for the Tuning of Fuzzy Rule Bases for Uncoordinated Intersections in Autonomous Driving. Information Sciences, Vol. 321, 2015, pp. 14–30.

Nam Bui

K. H.

Jung

J. J.

Cooperative Game-Theoretic Approach to Traffic Flow Optimization for Multiple Intersections. Computers & Electrical Engineering, Vol. 71, 2018, pp. 1012–1024.

10.

Wang

Yang

Sequential Distributed Model Predictive Control for Automated Mining Fleet. IEEE Access, Vol. 11, 2023, pp. 109776–109792.

11.

Katriniok

Kleibaum

Joševski

Distributed Model Predictive Control for Intersection Automation Using a Parallelized Optimization Approach. IFAC-PapersOnLine, Vol. 50, No. 1, 2017, pp. 5940–5946.

12.

Xing

Cao

Driver Anomaly Quantification for Intelligent Vehicles: A Contrastive Learning Approach with Representation Clustering. IEEE Transactions on Intelligent Vehicles, Vol. 8, No. 1, 2023, pp. 37–47.

13.

Isele

Rahimi

Cosgun

Subramanian

Fujimura

Navigating Occluded Intersections with Autonomous Vehicles Using Deep Reinforcement Learning. Proc., IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, IEEE, New York, NY, 2018, pp. 2034–2039.

14.

Abdulhai

Pringle

Karakoulas

G. J.

Reinforcement Learning for True Adaptive Traffic Signal Control. Journal of Transportation Engineering, Vol. 129, No. 3, 2003, pp. 278–285.

15.

Hernandez-Leal

Kartal

Taylor

M. E.

A Survey and Critique of Multiagent Deep Reinforcement Learning. Autonomous Agents and Multi-Agent Systems, 2019, Vol. 33, No. 6, pp. 750–797.

16.

Chen

Zhang

Liu

Zhang

Bennis

Age of Information Aware Radio Resource Management in Vehicular Networks: A Proactive Deep Reinforcement Learning Perspective. IEEE Transactions on Wireless Communications, 2020, Vol. 19, No. 4, pp. 2268–2281.

17.

Guo

Wang

Zhang

Coordination for Connected and Automated Vehicles at Non-Signalized Intersections: A Value Decomposition-Based Multiagent Deep Reinforcement Learning Approach. IEEE Transactions on Vehicular Technology, Vol. 72, No. 3, 2023, pp. 3025–3034.

18.

Mousa

S. R.

Ishak

Mousa

R. M.

Codjoe

Elhenawy

Deep Reinforcement Learning Agent with Varying Actions Strategy for Solving the Eco-Approach and Departure Problem at Signalized Intersections. Transportation Research Record: Journal of the Transportation Research Board, 2020. 2674: 119–131.

19.

Gorges

Ecological Adaptive Cruise Control for Vehicles with Step-Gear Transmission Based on Reinforcement Learning. IEEE Transactions on Intelligent Transportation Systems, Vol. 21, No. 11, 2020, pp. 4895–4905.

20.

Guo

Angah

Liu

Ban (Jeff)

Hybrid Deep Reinforcement Learning Based Eco-Driving for Low-Level Connected and Automated Vehicles Along Signalized Corridors. Transportation Research Part C: Emerging Technologies, Vol. 124, 2021, p. 102980.

21.

Antonio

G. P.

Maria-Dolores

Multi-Agent Deep Reinforcement Learning to Manage Connected Autonomous Vehicles at Tomorrow’s Intersections. IEEE Transactions on Vehicular Technology, Vol. 71, No. 7, 2022, pp. 7033–7043.

22.

Zamfirache

I. A.

Precup

R. E.

Petriu

E. M.

Adaptive Reinforcement Learning-Based Control Using Proximal Policy Optimization and Slime Mould Algorithm with Experimental Tower Crane System Validation. Applied Soft Computing, Vol. 160, 2024, p. 111687.

23.

Schulman

Wolski

Dhariwal

Radford

Klimov

Proximal Policy Optimization Algorithms. arXiv preprint arXiv:1707.06347, 2017.

24.

Schulman

Levine

Moritz

Jordan

M. I.

Abbeel

Trust Region Policy Optimization. arXiv, 2017.

25.

Yin

Xiong

Fast-Apply Deep Autoregressive Recurrent Proximal Policy Optimization for Controlling Hot Water Systems. Applied Energy, Vol. 367, 2024, p. 123348.

26.

Rehman

A. U.

Ullah

Qazi

H. S.

Hasanien

H. M.

Khalid

H. M.

Reinforcement Learning-Driven Proximal Policy Optimization-Based Voltage Control for PV and WT Integrated Power System. Renewable Energy, Vol. 227, 2024, p. 120590.

27.

Banar

Mohammadi

SeismoNet: A Proximal Policy Optimization-Based Earthquake Early Warning System Using Dilated Convolution Layers and Online Data Augmentation. Expert Systems with Applications, Vol. 253, 2024, p. 124337.

28.

Pamucar

Gokasar

Ebadi Torkayesh

Deveci

Martínez

Prioritization of Unmanned Aerial Vehicles in Transportation Systems Using the Integrated Stratified Fuzzy Rough Decision-Making Approach with the Hamacher Operator. Information Sciences, Vol. 622, 2023, pp. 374–404.

29.

Klein

Zelinka

Seidl

Optimizing Parameters in Swarm Intelligence Using Reinforcement Learning: An Application of Proximal Policy Optimization to the iSOMA Algorithm. Swarm and Evolutionary Computation, Vol. 85, 2024, p. 101487.

30.

George

A. P.

Powell

W. B.

Adaptive Stepsizes for Recursive Estimation with Applications in Approximate Dynamic Programming. Machine Learning, Vol. 65, No. 1, 2006, pp. 167–198.

31.

Amiri

M. H.

Mehrabi Hashjin

Montazeri

Mirjalili

Khodadadi

Hippopotamus Optimization Algorithm: A Novel Nature-Inspired Optimization Algorithm. Scientific Report, Vol. 14, No. 1, 2024, p. 5032.

32.

D. F.

Chen

X. D.

Jin

Mixed-Coordinated Decision-Making Method for Arterial Signals Based on Reinforcement Learning (in Chinese). Journal of Transportation Systems Engineering and Information Technology, Vol. 22, No. 2, 2022, pp. 145–153.

33.

Wang

Z. J.

Z. X.

Zhao

S. Y.

Wang

P. J.

Gao

H. B.

Adaptive Learning Rate Algorithms Based on the Improved Barzilai–Borwein Method. Pattern Recognition, Vol. 160, 2025, p. 111179.

34.

Luo

Xiong

Liu

Sun

Adaptive Gradient Methods with Dynamic Bound of Learning Rate. arXiv preprint arXiv:1902.09843, 2019.