Abstract
Reinforcement learning (RL) has achieved significant progress in UAV autonomous navigation. However, existing methods typically rely on either purely discrete or purely continuous action spaces to define the UAV's maneuver mode. A discrete action space is simple to implement and converges quickly, but lacks sufficient control granularity. In contrast, a continuous action space provides higher control resolution but often leads to inefficient training and susceptibility to local optima. Existing RL methods cannot adaptively switch maneuver modes within a unified framework because discrete and continuous action spaces differ fundamentally in structure and control objectives. To address this issue, we propose a hierarchical reinforcement learning framework with a hybrid action space (HAS-HRL). Specifically, the high-level policy adaptively selects the maneuver mode according to the environment context, while the low-level policy consists of a set of primitive navigation skills associated with the hybrid maneuver modes. These skills generate executable control commands, enabling the UAV to perform smooth maneuvers in dense obstacle regions while cruising efficiently in open spaces. Furthermore, an event-triggered control rule is introduced to provide structured prior guidance during the early training stage, thereby improving exploration efficiency and convergence stability. Experiments in various simulation environments demonstrate that the proposed HAS-HRL framework consistently outperforms single-layer RL and HRL baselines in terms of success rate, obstacle-avoidance performance, and training stability. The results show that the hybrid maneuver modes effectively balance flight safety and navigation efficiency, offering a robust and efficient solution for UAV autonomous navigation in complex scenarios.
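The two-level decision loop described above can be sketched as follows. This is a minimal illustrative sketch, not the paper's actual interface: the mode set, the observation fields, and the hand-coded switching rule (which in HAS-HRL would be a learned high-level policy) are all assumptions made for the example.

```python
import numpy as np

MODES = ["cruise", "avoid"]  # hypothetical discrete maneuver modes

def high_level_policy(obs):
    """Select a maneuver mode from the environment context.
    Placeholder rule standing in for the learned high-level policy:
    switch to 'avoid' when the nearest obstacle is close."""
    return "avoid" if obs["nearest_obstacle_dist"] < 5.0 else "cruise"

def low_level_skill(mode, obs):
    """Primitive navigation skill for the chosen mode, emitting a
    continuous control command (heading change, speed)."""
    if mode == "cruise":
        # open space: fly straight at high speed
        return {"d_heading": 0.0, "speed": 10.0}
    # dense obstacles: steer away from the obstacle at reduced speed
    return {"d_heading": -0.3 * np.sign(obs["obstacle_bearing"]),
            "speed": 4.0}

# One step of the hierarchy: high level picks the mode,
# the mode-specific skill produces the executable command.
obs = {"nearest_obstacle_dist": 3.2, "obstacle_bearing": 0.8}
mode = high_level_policy(obs)
cmd = low_level_skill(mode, obs)
print(mode, cmd)
```

The hybrid structure appears in the action's two parts: a discrete mode choice at the top and a continuous command at the bottom, which is what lets the agent cruise efficiently in open space yet maneuver smoothly near obstacles.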