Robot target tracking control considering obstacle avoidance based on combination of deep reinforcement learning and PID

Abstract

When simultaneously addressing the challenges of dynamic target tracking and obstacle avoidance for robots, conventional control and control only based on reinforcement learning cannot deal with the complex scenarios effectively. The purpose of this study is to design a robot control algorithm that combines deep reinforcement learning (Soft Actor-Critic, SAC) with PID to achieve real-time tracking of a moving object and effectively avoid single or multiple obstacles. The control of the robot is divided into two key components: initially, the first joint of the 6-degree-of-freedom robot is controlled by PID algorithm, which makes the working plane (the plane coincident with the axis of the first joint and parallel to the linkage) quickly approach the target until it overlaps. Subsequently, the task of reinforcement learning is simplified to control the planar robot to track the target projection in working plane while avoiding the obstacle projection, ultimately achieve target tracking and obstacle avoidance in 3D space. The simulation and experiment results show that the proposed method has good efficiency and convergence speed. The SAC-PID strategy effectively controls the Universal-Robots UR5 to complete dynamic target tracking while accomplishing obstacle avoidance in both virtual and real-world environments.

Keywords

Robot target tracking collision avoidance deep reinforcement learning trajectory planning

Get full access to this article

View all access options for this article.

References

Wei

Ren

A method on dynamic path planning for robotic manipulator autonomous obstacle avoidance based on an improved RRT algorithm. Sensors 2018; 18(2): 71.

Zhou

, et al. Trajectory optimization of pickup manipulator in obstacle environment based on improved artificial potential field method. Appl Sci 2020; 10(3): 935.

Yang

Merkt

Ivan

, et al. HDRM: a resolution complete dynamic roadmap for real-time motion planning in complex scenes. IEEE Robot Autom Lett 2018; 3(1): 551–558.

Sutton

Barto

(eds). Reinforcement learning: an Introduction. 2nd ed. Cambridge, MA: The MIT Press, 2018.

Analysis of space manipulator route planning based on Sarsa (λ) reinforcement learning. J Astronaut 2019; 40: 435–443.

Chen

Bai

Huang

, et al. Double-task deep Q-learning with multiple views. In: 2017 IEEE international conference on computer vision workshops, 22–29 October 2017, pp.1050–1058. New York: IEEE.

Peng

Liao

Guan

, et al. A pushing-grasping collaborative method based on deep Q-network algorithm in dual viewpoints. Sci Rep 2022; 12(1): 3927.

Sangiovanni

Rendiniello

Incremona

, et al. Deep reinforcement learning for collision avoidance of robotic manipulators. In: European control conference (ECC), Limassol, Cyprus, 12–15 June 2018, pp.2063–2068. New York: IEEE.

Kober

Peters

Reinforcement learning in robotics: a survey. Berlin, Heidelberg: Springer, 2012. pp.579–610.

10.

Degris

White

Sutton

RS.

Off-policy actor-critic, https://arxiv.org/abs/1205.4839 (2012, accessed 20 June 2013).

11.

Liu

Niu

Mahyuddin

, et al. A model-free deep reinforcement learning approach for robotic manipulators path planning. In: 2021 21st International conference on control, automation and systems (ICCAS)South Korea, 2021, pp.512–517. New York: IEEE.

12.

Chen

Deep reinforcement learning based moving object grasping. Inf Sci 2021; 565: 62–76.

13.

Choi

Lee

Reinforcement learning-based dynamic obstacle avoidance and integration of path planning. Intell Serv Robot 2021; 14(5): 663–677.

14.

Haarnoja

Zhou

Abbeel

, et al. Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor. In: 35th International conference on machine learning (ICML), Stockholm, Sweden, 10–15 July 2018, p.15. San Diego: Jmlr-Journal Machine Learning Research.

15.

Zhang

Reinforcement learning for robot research: a comprehensive review and open issues. Int J Adv Robot Syst 2021; 18(3): 22.

16.

Ibarz

Tan

Finn

, et al. How to train your robot with deep reinforcement learning: lessons we have learned. Int J Rob Res 2021; 40(4-5): 698–721.

17.

Liu

Dong

A general framework of motion planning for redundant robot manipulator based on deep reinforcement learning. IEEE Trans Ind Inform 2022; 18(8): 5253–5263.

18.

Zhong

Wang

Cheng

Collision-free path planning for welding manipulator via hybrid algorithm of deep reinforcement learning and inverse kinematics. Complex Intell Syst 2022; 8(3): 1899–1912.

19.

Zhu

, et al. Constrained Motion planning of 7-DOF space manipulator via deep reinforcement learning combined with artificial potential field. Aerospace 2022; 9(3): 163.

20.

Lillicrap

Hunt

Pritzel

, et al. Continuous control with deep reinforcement learning, https://arxiv.org/abs/1509.02971 (2015, accessed 5 July 2019).

21.

Liu

Jiang

, et al. Research on robot dynamic target tracking and obstacle avoidance control based on DDPG-PID. J Nanjing Univ Aeronaut Astronaut 2022; 54: 41–50.

22.

Fujimoto

van Hoof

Meger

. Addressing function approximation error in actor-critic methods. In: 35th International conference on machine learning (ICML), Stockholm, Sweden, 10–15 July 2018. San Diego: Journal Machine Learning Research.

23.

Schulman

Wolski

Dhariwal

, et al. Proximal policy optimization algorithms, https://arxiv.org/abs/1707.06347 (2017, accessed 28 August 2017).

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB