Abstract
Excavator path planning in modern smart ports enables efficient and safe access to work areas by accurately locating bulk storage sites. This not only improves operational efficiency and reduces costs but also safeguards the movement of large machinery. However, port bulk cargo areas often contain irregular and dynamic obstacles, posing significant challenges to traditional path planning algorithms. This study addresses local optima and training instability in excavator path planning by introducing AS-TD3, an enhanced deep reinforcement learning (DRL) algorithm based on TD3. The algorithm is evaluated in a continuous-map simulation of a port bulk cargo environment. By integrating an A* heuristic function into the reward mechanism of the TD3 algorithm, AS-TD3 improves the discovery of globally optimal solutions by accounting for distance, time, and state variations during path planning. The A* component provides an efficient heuristic search, while TD3 refines decision-making through reinforcement learning. Additionally, an epsilon-greedy strategy balances exploration and exploitation, yielding smooth convergence of the reward curve. Experimental results show that AS-TD3 reduces the steps required to find the optimal path by 5.7% and accelerates convergence by 58.75%.
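The abstract does not include implementation details, but the two ideas it names, shaping the TD3 reward with an A*-style heuristic and using epsilon-greedy exploration, can be sketched as follows. This is a minimal illustrative sketch under assumed names and parameters (`GOAL`, `STEP_PENALTY`, `EPSILON`), not the authors' actual code.

```python
import math
import random

# Illustrative sketch only: reward shaping with an A*-style heuristic
# (straight-line distance to goal) plus epsilon-greedy exploration for a
# continuous-action policy such as TD3. All names and constants here are
# assumptions for demonstration, not the paper's actual parameters.

GOAL = (9.0, 9.0)
STEP_PENALTY = -0.01   # small per-step cost, encourages shorter paths
EPSILON = 0.1          # probability of taking a random exploratory action

def heuristic(pos):
    """A*-style admissible heuristic: Euclidean distance to the goal."""
    return math.hypot(GOAL[0] - pos[0], GOAL[1] - pos[1])

def shaped_reward(prev_pos, new_pos):
    """Reward progress: the decrease in heuristic distance to the goal,
    minus a small penalty for every step taken (distance + time terms)."""
    return (heuristic(prev_pos) - heuristic(new_pos)) + STEP_PENALTY

def select_action(policy_action, action_low=-1.0, action_high=1.0):
    """Epsilon-greedy wrapper: mostly follow the learned policy's action,
    occasionally sample a uniformly random continuous action."""
    if random.random() < EPSILON:
        return [random.uniform(action_low, action_high) for _ in range(2)]
    return policy_action
```

In this sketch, moving closer to the goal yields a positive shaped reward and moving away yields a negative one, which is one common way to encode the distance and time considerations the abstract mentions.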
