Abstract
This paper introduces a novel adaptive tuning strategy for a neural network-based PID (NNPID) controller that leverages reinforcement learning (RL) to enhance control performance in nonlinear systems. Unlike conventional NNPID approaches that rely on fixed or manually defined learning rates, the proposed method embeds the learning rate as part of a policy-based RL framework, enabling dynamic and autonomous adjustment during training. This adaptive mechanism allows the controller to better cope with system uncertainties, external disturbances, and nonlinear behaviors while ensuring fast and stable convergence. The RL agent is trained using a reward function designed solely from the tracking error, promoting improved trajectory tracking and reduced steady-state error. By continuously adapting the learning rate through interaction with the environment, the controller achieves superior robustness and generalization across varying operating conditions. Simulation studies and validation on a nonlinear transesterification reactor confirm that the proposed RL-NNPID outperforms traditional gradient descent methods and fixed learning rate strategies in terms of tracking accuracy, convergence speed, and disturbance rejection capability.
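The core idea — a policy-based RL agent that picks the NNPID learning rate on the fly, rewarded solely by tracking error — can be illustrated with a minimal sketch. Everything below (the toy nonlinear plant, the gain-update rule, the discrete learning-rate candidates, and all hyperparameters) is an illustrative assumption, not the paper's actual model or algorithm:

```python
import numpy as np

def plant_step(y, u, dt=0.05):
    # Toy nonlinear first-order plant (stand-in for the transesterification reactor)
    return y + dt * (-y - 0.5 * y**3 + u)

def run_episode(lr, setpoint=1.0, steps=200, dt=0.05):
    """One closed-loop episode: PID gains tuned online by gradient descent
    at a fixed learning rate `lr`; returns the mean absolute tracking error."""
    Kp, Ki, Kd = 0.5, 0.1, 0.01            # initial PID gains (illustrative)
    y, integ, prev_e = 0.0, 0.0, 0.0
    total_err = 0.0
    for _ in range(steps):
        e = setpoint - y
        integ += e * dt
        deriv = (e - prev_e) / dt
        u = float(np.clip(Kp * e + Ki * integ + Kd * deriv, -10.0, 10.0))
        y = plant_step(y, u, dt)
        # Delta-rule gain updates driven by the tracking error
        Kp += lr * e * e
        Ki += lr * e * integ
        Kd += lr * e * deriv
        prev_e = e
        total_err += abs(e)
    return total_err / steps

# REINFORCE-style policy over a discrete set of candidate learning rates:
# the agent's reward is built solely from the tracking error, as in the abstract.
rng = np.random.default_rng(0)
lr_choices = np.array([1e-4, 1e-3, 1e-2])
prefs = np.zeros(3)                        # softmax action preferences
baseline = None
for episode in range(60):
    probs = np.exp(prefs - prefs.max()); probs /= probs.sum()
    a = rng.choice(3, p=probs)
    reward = -run_episode(lr_choices[a])   # lower tracking error -> higher reward
    baseline = reward if baseline is None else 0.9 * baseline + 0.1 * reward
    grad = -probs; grad[a] += 1.0          # d log pi(a) / d prefs
    prefs += 0.5 * (reward - baseline) * grad

best_lr = lr_choices[np.argmax(prefs)]
```

Here the "adaptation" is reduced to a bandit over three fixed rates for brevity; the paper's method adjusts the rate continuously during training, but the feedback loop — act, observe tracking error, update the policy — is the same.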
