Abstract
To address the challenges of ship path planning and collision avoidance in complex marine environments, this study proposes a deep reinforcement learning (DRL)-based local path planning algorithm that integrates spatiotemporal feature modeling, aiming to unify ship collision avoidance and motion control. First, an end-to-end state space and action space framework is established, enabling the system to output control commands directly from perceptual information and thereby bridging the entire pipeline from perception to control. Second, a network architecture incorporating spatial position encoding, temporal context modeling, and an attention mechanism is designed to enhance feature modeling and decision-making in complex environments. Finally, a reward function combining navigation objectives and collision avoidance requirements is constructed, alleviating the sparse reward problem in reinforcement learning and accelerating training convergence. Comparative experiments in a range of typical scenarios demonstrate the superiority of the proposed algorithm in terms of path efficiency, safety, and control stability, providing robust technical support for the autonomous navigation of intelligent ships.
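To make the reward design concrete, the sketch below illustrates one common way such a combined navigation-and-avoidance reward can be shaped. This is a minimal illustrative example, not the paper's actual formulation: the function name, weights, distance thresholds, and terminal bonus are all hypothetical assumptions.

```python
import math

def shaped_reward(pos, goal, obstacles, prev_dist, safe_radius=50.0):
    """Dense shaped reward combining navigation progress and collision avoidance.

    pos, goal: (x, y) positions; obstacles: list of (x, y) obstacle positions.
    prev_dist: distance to goal at the previous timestep (rewards progress,
    mitigating reward sparsity). All weights/thresholds are illustrative only.
    Returns (reward, current distance to goal).
    """
    dist = math.hypot(goal[0] - pos[0], goal[1] - pos[1])
    r_progress = prev_dist - dist              # positive when the ship closes on the goal

    r_safety = 0.0
    for ox, oy in obstacles:
        d = math.hypot(ox - pos[0], oy - pos[1])
        if d < safe_radius:                    # penalize intrusion into the safety zone
            r_safety -= (safe_radius - d) / safe_radius

    r_goal = 10.0 if dist < 5.0 else 0.0       # terminal bonus on reaching the goal
    return r_progress + r_safety + r_goal, dist
```

Because the progress term is dense (nonzero at every step), the agent receives a learning signal well before it first reaches the goal, which is the usual remedy for sparse rewards mentioned in the abstract.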