Sage Journals: Discover world-class research

Abstract

A new triple-loop recurrent neural network (TLRNN) super-twisting terminal sliding mode control (TSMC) scheme based on the barrier function is proposed for the attitude tracking control of flight systems with unknown disturbances and parameter uncertainties. Compared with the traditional recurrent neural network (RNN), the newly designed TLRNN incorporates three different feedback loops, which improves the performance of approximation and enhances the capacity to save information. Additionally, the barrier function-based variable gain super-twisting algorithm is used to reduce chattering and dynamically adjust control gain. The finite-time convergence of a quad-rotor UAV system is derived using the Lyapunov function technique. The experiments, compared with existing methods, demonstrate the effectiveness of the proposed control strategy.

Keywords

quad-rotor UAV sliding mode control super-twisting barrier function triple-loop recurrent neural network

Introduction

In recent years, quad-rotor UAVs have attracted a great deal of attention from researchers¹ due to their lightweight and high flexibility, which allows them to be used for specialized aerial operations, including the fighting forest fires,² inspecting crops,³ monitoring routes,⁴ tracking moving objects⁵ and delivering material for military operations or disaster relief.⁶ For the control of a quad-rotor UAV, we have to consider the multi-input and multi-output structure and the dynamics with parametric uncertainty. These characteristics have been studied, for example: adaptive tracking control of quad-rotor UAVs,⁷ robust adaptive attitude tracking of quad-rotor UAVs.⁸

In past research, several strategies have existed for attitude control of quad-rotor UAVs. These include linear and nonlinear control methods, neural network control, adaptive control and fuzzy logic control.^9,10 In quad-rotor UAVs with multiple attitude control strategies, there are unmodelled disturbances and parameter variations as uncertainties, in addition to considering that external factors have a significant impact on the stability of the UAVs, such as nonlinear friction and payload variations.^11–13 This requires the development of a control methodology for UAVs with adaptive laws that can rapidly converge and exhibit enhanced robustness.

The sliding mode control (SMC) technique is an effective tool for designing robust control laws in nonlinear systems with uncertain conditions.¹⁴ For example, an adaptive sliding mode controller in Mehmood et al.¹⁵ is designed to compensate for the effects of external disturbances and parametric uncertainties. However, the aforementioned control scheme doesn’t consider problems associated with finite time convergence. Compared with SMC, the terminal sliding mode control technique enables the implementation of convergence in finite time.¹⁶ Therefore, a novel sliding mode controller is proposed in Mofid and Mobayen,¹⁷ which is able to not only establish a fast convergence rate but also adaptively tune the control parameters to guesstimate the unknown parameters of the quad-rotor system. Building upon the concept of finite-time convergence, some researchers have explored predefined-time stability. A predefined-time adaptive distributed controller is designed in Qiu et al.¹⁸ using the backstepping method. This proposed control scheme achieves shorter settling time and smaller tracking errors within a predefined time.

However, the implementation of TSMC is not without challenges, most notably the chattering phenomenon, which causes high-frequency oscillations in control signals that can impair performance and cause increased wear to the actuators.^19,20 To address the chattering issue in traditional sliding mode control, a gain optimization scheme for sliding mode control systems based on nonlinear adaptive particle swarm optimization was proposed in Bosera et al.,²¹ which demonstrated high control performance. The super-twisting algorithm exhibits enhanced robustness in comparison with traditional sliding mode control through the implementation of a smooth control law, which enables more effective management of system uncertainties and external perturbations and achieves a more robust controller in practical applications. The super-twisting control algorithm is designed in Kahouadji et al.,²² which drives the sliding variable, and it’s derivative to zero in finite time. An improved singular-free adaptive super-twisting sliding mode control method was proposed in Abera et al.,²³ which utilized the particle swarm optimization (PSO) algorithm to adjust the controller gains. However, the aforementioned super-twisting controller usually requires the assumption that the external disturbances of the system have a known boundary. To overcome the above difficulty, a barrier function in Obeid et al.²⁴ is used to dynamically adjust the control gain, and ignore the disturbances upper boundary, ensuring sliding mode is achieved. The control gain compensates for the difference between the filtered estimate and the actual disturbance. The method enforces that the system reaches the sliding mode by increasing the gain and then decreasing the gain until it leaves the sliding mode, which for the second-order system used ensures that the sliding mode surfaces towards zero in finite time.

For attitude tracking control of UAVs with external disturbances, the mentioned TSMC can improve control of effects. However, implementing this method can be challenging due to the requirement of a good understanding of the system structure. In Zhao et al. and Zhi et al.,^25,26 the observer-based H_$\infty$ synchronization and terminal sliding mode observer (TSMO) are investigated to address the problem of unknown parameters. In other research, neural networks are often used to approximate unknown functions and enhance control. For instance, an adaptive control method based on neural network approximation characteristics is proposed in Jiang et al.²⁷ A radial basis function (RBF) neural network (NN) has been used to approximate the system dynamics in Al Aela et al.²⁸ According to the different NN structures, feedforward NN (FNN) and recurrent NN (RNN) are defined as two categories of NN.²⁹ Compared to FNNs, RNNs are dynamic mappings that can be used to store information and use the structure of their feedback loops to improve their approximation capabilities. A controller based on recurrent neural network is proposed in Flores and Flores,³⁰ and the purpose of this controller is to stabilize the flight transition manoeuvres of UAVs. Liu et al.³¹ conducted an overview of the stability analysis of recurrent neural networks. Compared to the dynamic characteristics of RNN, deep reinforcement learning (DRL) serves as a more effective strategy for highly nonlinear dynamic models. Therefore, an adaptive controller based on deep reinforcement learning (DRL) in Jiang et al.³² is proposed to generate a dynamic controller in continuous action space, where the trajectory planning function is simultaneously integrated into the dynamic solution. In Ghadiri et al.,³³ a novel integral fast terminal sliding mode controller (IFTSMC) is designed and proposed as a new control method for trajectory tracking of quadrotor UAV systems under external disturbances and parameter uncertainties.

To implement adaptive neural sliding mode controller and attitude tracking control of a quad-rotor UAV, inspired by the design structure of different RNNs, this paper proposes a novel triple-loop RNN structure. Compared to the traditional RNN, the triple-loop RNN calculates firing and output weights by neurons in the hidden and output layers. To reduce uncertain disturbance and eliminate the impact of neural approximation errors, a switching controller and a compensator controller are implemented in the control system.

The innovations of this article are described below:

A new neural network structure with a triple-loop has been designed, and the hidden layer neurons receive feedback from themselves and other neurons. Additionally, the input layer receives feedback from both the hidden layer and the output layer neurons. This strengthens the neural network’s ability to capture dynamics and store information and effectively improves the approximation speed of the neural network.

The neural network uses the enhanced sigmoid function as the activation function to avoid gradient saturation, and the continuous function also simplifies the derivation.

The super-twisting algorithm is based on the barrier function and is not affected by the upper bound of the unknown disturbance. This guarantees that the output values remain within a predefined neighbourhood.

The rest of the article is organized as follows. In Section II, the attitude dynamics model of a quad-rotor UAV is given. Section III presents the complete design process of the controller with a triple-loop RNN, and performs stability analyses. Section IV verifies the effectiveness of the proposed controller through experiments. Finally, conclusions are given in Section IV.

Problem formulation and preliminaries

To obtain the attitude dynamics equation of the quad-rotor, the roll, pitch and yaw angles are defined as $Θ = {[\begin{matrix} ϕ, θ, ψ \end{matrix}]}^{T}$ , the angular velocities in these three directions are defined as $\overset{\cdot}{Θ} = {[\begin{matrix} \overset{\cdot}{ϕ}, \overset{\cdot}{Θ}, \overset{\cdot}{ψ} \end{matrix}]}^{T}$ , and accelerations $\overset{\cdot\cdot}{Θ} = {[\begin{matrix} \overset{\cdot\cdot}{ϕ}, \overset{\cdot\cdot}{θ}, \overset{\cdot\cdot}{ψ} \end{matrix}]}^{T}$ . Based on the above description of the angles, angular velocity and angular acceleration, we can write the attitude dynamics equation of the UAV²²:

\overset{\cdot\cdot}{Θ} = [M (Θ)]^{- 1} [Ψ^{T} (Θ) u - C (Θ, \overset{\cdot}{Θ}) \overset{\cdot}{Θ}] + d (t)

(1)

with $M (Θ) = Ψ^{T} (Θ) J Ψ (Θ) \in R^{3 \times 3}$ is an auxiliary positive inertia matrix and $C (Θ, \overset{\cdot}{Θ}) = - M (Θ) {\overset{\cdot}{Ψ}}^{- 1} (Θ) Ψ (Θ) - Ψ^{T} (Θ) sk [\begin{matrix} J Ψ (Θ) \overset{\cdot}{Θ} \end{matrix}] Ψ (Θ)$ where $J \in R^{3 \times 3}$ is the total inertia matrix, The Euler matrix $Ψ (Θ) \in R^{3 \times 3}$ and the cross-product operator $sk [β]$ for a vector $[β] = {[\begin{matrix} β_{1}, β_{2}, β_{3} \end{matrix}]}^{T} \in R^{3}$ are given by:

Ψ (Θ) = [\begin{matrix} 1 & 0 & - \sin θ \\ 0 & \cos ϕ & \sin ϕ \cos θ \\ 0 & - \sin ϕ & \cos ϕ \cos θ \end{matrix}]

(2)

sk [β] = [\begin{matrix} 0 & - β_{3} & β_{2} \\ β_{3} & 0 & - β_{1} \\ - β_{2} & β_{1} & 0 \end{matrix}]

(3)

As we consider that the quad-rotor UAV is axisymmetric, the matrix $J$ can be simplified as:

J = [\begin{matrix} J_{xx} & 0 & 0 \\ 0 & J_{yy} & 0 \\ 0 & 0 & J_{zz} \end{matrix}]

(4)

with the input from the controller, the matrix M is given by:

M (Θ) = [\begin{matrix} M_{1} (Θ) M_{2} (Θ) M_{3} (Θ) \end{matrix}]

(5)

which the $M_{i} (Θ) (i = 1, 2, 3)$ be interpreted as the $i$ th column of matrix $M$ . Using equation (5), the above-mentioned $M (Θ_{1})$ , $M (Θ_{2})$ and $M (Θ_{3})$ can be expressed as follows:

M_{1} (Θ) = [\begin{matrix} J_{xx} \\ 0 \\ - J_{xx} \sin θ \end{matrix}]

(6)

M_{2} (Θ) = [\begin{matrix} 0 \\ J_{yy} \sin^{2} ϕ + J_{zz} \cos^{2} ϕ \\ (J_{yy} - J_{zz}) \cos θ \cos ϕ \sin ϕ \end{matrix}]

(7)

M_{3} (Θ) = [\begin{matrix} - J_{xx} \sin θ \\ (J_{yy} - J_{zz}) \cos θ \cos ϕ \sin ϕ \\ J_{xx} \sin^{2} θ + \cos^{2} θ (J_{yy} \sin^{2} ϕ + J_{zz} \cos^{2} ϕ) \end{matrix}]

(8)

Consider a 3-DOF quad-rotor dynamical system, which simplified state-space form of this model is given:

\begin{matrix} {\begin{matrix} {\overset{\cdot}{x}}_{1} = x_{2} \\ {\overset{\cdot}{x}}_{2} = f (x, t) + g (x, t) u + d (t) \end{matrix} \end{matrix}

(9)

where $x_{1} = [ϕ, θ, ψ]^{T}$ represents Euler angles, that is, roll, pitch and yaw and the control inputs $u = [u_{ϕ}, u_{θ}, u_{ψ}]^{T}$ . The $f (x, t)$ and $g (x, t)$ are, respectively, systems matrix and control matrix in system and $g (x, t)$ is reversible. In addition, we consider that $d (t) \in R^{3 \times 1}$ is uncertainty disturbance and $u$ is the input of the control. $f (x, t)$ and $g (x, t)$ can be given:

f (x, t) = [M (Θ)]^{- 1} [- C (Θ, \overset{\cdot}{Θ}) \overset{\cdot}{Θ}]

(10)

g (x, t) = [J Ψ (Θ)]^{- 1}

(11)

Assumption 1. The external disturbance $d (t)$ is bounded and the upper bound is unknown.

Therefore, the objective can be summed up as follows: for the design of the control laws $u$ of the attitude system, in case where the dynamics of $f (x, t)$ is unknown, the quad-rotor is driven to follow the reference trajectory by the estimated control force in the finite time $t_{s}$ . Hence, it can be written as follows:

\lim_{t \to t_{s}} ϕ = ϕ_{d}, \lim_{t \to t_{s}} θ = θ_{d}, \lim_{t \to t_{s}} ψ = ψ_{d} .

Definition 1. Suppose^34,35 that a $ε > 0$ is fixed. For any positive real number b, the barrier function can be defined as an even continuous function $L_{b}$ : $x \in (- ε, ε) \to L_{b} (x) \in [b, \infty]$ consistently increasing on $[0, ε]$ .

1. $\lim_{| x | \to ε} L_{b} (x) = + \infty$

2. The function $L_{b} (x)$ has a distinctive minimum at zero, that is, $L_{b} (0)$ = $b > 0$

In this study, the barrier function as follows:

L_{b} (x) = \frac{ε b}{ε - | x |}, x \in (- ε, ε)

(12)

Lemma 1. Consider a continuous system³⁶:

\overset{\cdot}{x} = f (x), f (0) = 0, x \in R^{n}

(13)

Suppose a continuous positive definite function $V : R^{n} \Rightarrow R$ , with $γ > 0$ and $δ \in (0, 1)$ . The system which positive Lyapunov function satisfies:

\overset{\cdot}{V} (x) + γ V^{δ} (x) \leq 0,

(14)

Then, the origin of the system is a finite time stable equilibrium. The settling time $t_{R}$ satisfies:

t_{R} \leq \frac{1}{γ (1 - δ)} V^{1 - δ} (0)

(15)

Design of adaptive recurrent neural network sliding mode control

Terminal sliding mode controller

Consider the attitude system (equation (9)):

\begin{matrix} {\begin{matrix} {\overset{\cdot}{x}}_{1} = x_{2} \\ {\overset{\cdot}{x}}_{2} = f + gu + d \end{matrix} \end{matrix}

(16)

where $x_{1} = [ϕ, θ, ψ]^{T}$ , and the desired attitude is $x_{d} = [ϕ_{d}, θ_{d}, ψ_{d}]^{T}$ . The tracking problem for a UAV is to make the state trajectory $x_{1}$ track the desired trajectory $x_{d}$ in finite time.

Define the attitude tracking error as follows:

e = x_{1} - x_{d}

(17)

The time derivative of the error can be expressed as:

\begin{matrix} \overset{\cdot}{e} = {\overset{\cdot}{x}}_{1} - {\overset{\cdot}{x}}_{d} = x_{2} - {\overset{\cdot}{x}}_{d} \\ \overset{\cdot\cdot}{e} = {\overset{\cdot}{x}}_{2} - {\overset{\cdot\cdot}{x}}_{d} \end{matrix}

(18)

In order to improve the fast convergence characteristics of the system, inspired by Yang et al.,³⁷ consider the terminal sliding mode as:

s = \overset{\cdot}{e} + β e^{\frac{p}{q}}

(19)

where the $β \in R^{1 \times 3}$ , $p \in R^{3 \times 1}$ , $q \in R^{3 \times 1}$ are positive parameter matrices, and $0 < p_{i} / q_{i} < 1 (i = 1, 2, 3)$ . The time derivative of the sliding mode can be expressed as:

\overset{\cdot}{s} = \overset{\cdot\cdot}{e} + β \frac{p}{q} e^{\frac{p}{q} - 1} \overset{\cdot}{e}

(20)

According to equation (20), the equivalent controller $u_{eq}$ can be given by:

\begin{matrix} u_{eq} = g^{- 1} [{\overset{\cdot\cdot}{x}}_{d} - f - β \frac{p}{q} e^{\frac{p}{q} - 1} \overset{\cdot}{e}] \end{matrix}

(21)

We can obtain the closed-loop dynamical equation:

\begin{matrix} \overset{\cdot}{s} = f - {\overset{\cdot\cdot}{Γ}}_{d} + β \frac{p}{q} e^{\frac{p}{q} - 1} \overset{\cdot}{e} + g u_{eq} \\ = - K_{1} {| s |}^{\frac{1}{2}} sign (s) - \int_{0}^{t} K_{2} sign (s (τ)) d (t) \end{matrix}

(22)

The $u_{sw}$ can be written as follows:

u_{sw} = - K_{1} {| s |}^{\frac{1}{2}} sign (s) - \int_{0}^{t} K_{2} sign (s (τ)) d τ

(23)

which the $K_{1} \in R^{3 \times 1}$ and $K_{2} \in R^{3 \times 1}$ in equation (23) are control gains. Consider the following variable gain of the switching controller that solves this problem:

u_{sw} = - L (t, s) {| s |}^{\frac{1}{2}} sign (s) - \int_{0}^{t} L {(t, s)}^{2} sign (s (τ)) d τ

(24)

where the $L (t, s)$ is the variable gain. Considering the implementation of the barrier function, we define the $L (t, s)$ as:

\begin{matrix} L (t, s) = {\begin{matrix} L_{1} t + L_{0}, 0 \leq t < t_{1} \\ L_{b} (x), t \geq t_{1} \end{matrix} \end{matrix}

(25)

For the definition of this barrier function, $L_{0}$ and $L_{1}$ are arbitrary positive constants. According to equations (21) and (24), the proposed controller can be represented as follows:

\begin{matrix} u = u_{eq} + u_{sw} \end{matrix}

(26)

Substituting equation (26) to obtain the derivative of equation (19):

\begin{matrix} \overset{\cdot}{s} = f + g u_{eq} - {\overset{\cdot\cdot}{x}}_{d} + β \frac{p}{q} e^{\frac{p}{q} - 1} \overset{\cdot}{e} + d \\ = - L (t, s) {| s |}^{\frac{1}{2}} sign (s) - \int_{0}^{t} L {(t, s)}^{2} sign (s) d (t) \end{matrix}

(27)

Rewrite of the above equation, let us take:

\begin{matrix} \overset{\cdot}{s_{1}} = - L (t, s) {| s_{1} |}^{\frac{1}{2}} sign (s_{1}) + s_{2} \\ \overset{\cdot}{s_{2}} = - L {(t, s)}^{2} sign (s_{1}) + \overset{\cdot}{d} \end{matrix}

(28)

Theorem 1. Considering the quad-rotor attitude system (equation (16)), if the control laws are designed as equation (26), then the state variables of the attitude system converge to $s$ in a finite time, Furthermore, the tracking error variables can converge to zero in finite time.

Proof. Choose the Lyapunov equation follows:

V_{1} = ζ^{T} P ζ

(29)

where $ζ = [{| s_{1} |}^{\frac{1}{2}} sign (s_{1}), s_{2}]$ and $P$ is positive definite matrix. The $V$ satisfies:

λ_{\min} P ∥ ζ ∥_{2}^{2} \leq V \leq λ_{\max} P ∥ ζ ∥_{2}^{2}

(30)

where $∥ ζ ∥_{2}^{2}$ represents the Euclidean norm of $ζ$ . According to the Assumption 1, the external variable $d (t)$ is bounded and satisfies $| \overset{\cdot}{d} | \leq d_{*}$ , the transformed disturbance satisfies $| \tilde{d} | \leq d_{*} | ζ |$ . As in Moreno,³⁸ the positive definite matrix $P$ is constructed:

[\begin{matrix} A^{T} P + PA + ϵ P + {d_{*}}^{2} C^{T} C & PB \\ B^{T} P & - 1 \end{matrix}] \leq 0

(31)

where $ϵ$ is a positive constant:

A = [\begin{matrix} - \frac{1}{2} L (t, s) & \frac{1}{2} \\ - L {(t, s)}^{2} & 0 \end{matrix}]; B = [\begin{matrix} 0 \\ 1 \end{matrix}]; C = [1 0];

Taking the derivative of $V_{1}$ as follows:

\begin{matrix} {\overset{\cdot}{V}}_{1} = \frac{1}{| ζ |} {[\begin{matrix} ζ \\ \tilde{d} \end{matrix}]}^{T} [\begin{matrix} - A^{T} + PA & PB \\ B^{T} P & 0 \end{matrix}] [\begin{matrix} ζ \\ \tilde{d} \end{matrix}] \\ \leq \frac{1}{| ζ |} {[\begin{matrix} ζ \\ \tilde{d} \end{matrix}]}^{T} [\begin{matrix} - A^{T} + PA + d_{*}^{2} C^{T} C & PB \\ B^{T} P & 0 \end{matrix}] [\begin{matrix} ζ \\ \tilde{d} \end{matrix}] \\ \leq \frac{1}{| ζ |} {[\begin{matrix} ζ \\ \tilde{d} \end{matrix}]}^{T} [\begin{matrix} - A^{T} + PA + ϵ P - ϵ P + d_{*}^{2} C^{T} C & PB \\ B^{T} P & 0 \end{matrix}] [\begin{matrix} ζ \\ \tilde{d} \end{matrix}] \\ \leq - \frac{ϵ}{| ζ |} ζ^{T} P ζ \end{matrix}

(32)

From equation (30): $∥ ζ ∥_{2}^{2} \leq \frac{V^{\frac{1}{2}} (ζ)}{λ_{\min}^{\frac{1}{2}} {P}}$ . This shows that:

{\overset{\cdot}{V}}_{1} \leq - ϵ λ_{\min}^{\frac{1}{2}} {P} V^{\frac{1}{2}} (ζ)

(33)

According to the Song et al.,³⁶ we can know $(s_{1}, s_{2})$ will converge in a predefined neighbourhood of zero in a finite time $t_{1} \leq \frac{1}{ϵ λ_{\min}^{\frac{1}{2}} {P} \frac{1}{2}} V^{\frac{1}{2}}$ .

To guarantee that the attitude system tracks the desired trajectory in finite time, consider the Lyapunov function as:

V_{2} = \frac{1}{2} e^{2}

(34)

From equation (19), taking the derivative of $V_{2}$ as:

\begin{matrix} {\overset{\cdot}{V}}_{2} = e \overset{\cdot}{e} \\ = e (- β e^{\frac{p}{q}}) \\ \leq - β {(\frac{1}{2})}^{\frac{p + q}{4 q}} {V_{2}}^{\frac{p + q}{2 q}} \end{matrix}

(35)

Hence, the attitude system will converge to the desired trajectory in finite time. From Song et al.,³⁶ the finite time $t_{2} \leq \frac{1}{β {(\frac{1}{2})}^{\frac{p + q}{4 q}} (1 - \frac{p + q}{2 q})} V^{(1 - \frac{p + q}{2 q})}$ . Proof is finished.

TLRNN structure

The newly proposed neural network architecture is illustrated by Figure 1. This neural network is composed of three layers, which are the input layer, the hidden layer with self-feedback capability and the output layer. In addition, the developed neural network has $m$ neurons in the input layer, $n$ neurons in the hidden layer and $q$ neurons in the output layer. Among some architectures, the RNN incorporates self-feedback connections within its hidden layer. The DLRNN extends the structure of the RNN by introducing two additional feedback loops: one from the output layer to the hidden layer and another from the hidden layer to the input layer. The proposed TLRNN further enhances this design by reintroducing a feedback connection from the output layer directly to the input layer. This modification more effectively improves the accuracy of information estimation by facilitating richer information flow and stronger temporal dependency modelling. The structural comparison diagrams of RNN, DLRNN and TLRNN are shown in Figure 2. To improve the approximation ability of the neural network, we construct three one-step delayed feedback loops in the network structure. The weights and capabilities of the three layers are described as follows:

$w^{I}$ : Weight of the input layer to the input layer.

$w^{D}$ : Weight of the self-feedback signal of the hidden layer.

$w^{OI}$ : Weight of the feedback signal from the output layer to the input layer.

$w^{OH}$ : Weight of the feedback signal from the output layer to the hidden layer.

$w^{HI}$ : Weight of feedback signals from the hidden layer to the input layer.

1. Input layer: Neurons in this layer receive input signals, and feedback signals from neurons in the other two layers. Therefore, the input signal of this layer can be written as follows:

Z (i) = x_{i} + \sum_{k = 1}^{q} e x_{y} (k) w_{ik}^{oI} + \sum_{j = 1}^{n} h (j) w_{ik}^{HI}

(36)

the signal $x_{i} \in R^{m}$ is considered as the input and $i = [1, 2, . . ., m]$ . $e x_{y} (k) = Z^{- 1} (y_{k}) \in R^{q}$ is a feedback signal with one-step delay of the output layer, $h (j) \in R^{n}$ is a feedback signal with one-step delay from the hidden layer, $w^{oI} \in R^{m \times q}$ and $w^{HI} \in R^{m \times n}$ .

2. Hidden layer: The neurons in this layer receive feedback from themselves and signals from all other neurons. Therefore, the input signal of this layer can be given as follows:

\begin{matrix} h (i) = \sum_{j = 1}^{m} Z (j) w_{ij}^{I} + \sum_{k = 1}^{n} e x_{h (i)} w_{ik}^{D} \\ + \sum_{l = 1}^{q} e x_{y} (l) w_{il}^{oH} \end{matrix}

(37)

where $i = [1, 2, . . ., n]$ , $w^{I} \in R^{n \times m}$ , $w^{D} \in R^{n \times n}$ and $e x_{h} = Z^{- 1} (h)$ is a feedback signal with a one-step delay of the hidden layer. Through the activation function, the output of this hidden layer is written as:

Φ (i) = \ln \frac{1}{1 + e^{- h (i)}}

(38)

3. Output layer: The output layer receives the signals of the hidden layer and provides delayed feedback signals to the hidden layer and the input layer, which output can be written as:

y = v^{T} Φ (Z, w^{I}, w^{D}, w^{oI}, w^{oH}, w^{HI}) = v^{T} Φ

(39)

where $v \in R^{n \times q}$ is the weight that the hidden layer transmits to all neurons in the output layer.

Figure 1.

Structure of the TLRNN.

Figure 2.

The difference of structure for RNN, DLRNN and TLRNN.

Design of the attitude controller

In this section, the main work is to present the design process for the adaptive TLRNN sliding mode controller. The dedicated control method proposed in this study is introduced in Figure 3. The attitude controller consists of a TLRNN, a compensator and a switching control, and the parameters of each unit are updated in time, which performs the estimation of three input signals, that is, $u 1$ , $u 2$ and $u 3$ of the attitude controller. The desired attitude $Γ_{d} = {[ϕ_{d}, θ_{d}, ψ_{d}]}^{T}$ is input as expectation trajectory, and according to the control scheme, the attitude of the quad-rotor will be adjusted to the desired value.

Figure 3.

Block diagram of the TLRNN.

Remark 1. There exist parameters $w^{I}$ , $w^{D}$ , $w^{oI}$ , $w^{oH}$ , $w^{HI}$ are the optimal weights for the TLRNN, and these weight values are evaluated with the $\hat{\cdot}$ operator. Thus, estimated weights for the TLRNN are ${\hat{w}}^{I}$ , ${\hat{w}}^{D}$ , ${\hat{w}}^{oI}$ , ${\hat{w}}^{oH}$ , ${\hat{w}}^{HI}$ .

Consider the equivalent controller of the control system obtained by TLRNN approximated:

u_{eq} = v^{T} Φ (x_{p}, w^{I}, w^{D}, w^{oI}, w^{oH}, w^{HI}) + ϵ_{p}

(40)

where the $ϵ_{p}$ is an approximation error. To reduce this error, we can increase the number of neurons in the hidden layer. The input signal for the TLRNN is $x = {[e^{T}, {\overset{\cdot}{e}}^{T}, {\overset{\cdot\cdot}{Γ}}_{d}]}^{T}$ . Since the ideal weights $w^{I}, w^{D}, w^{oI}, w^{oH}, w^{HI}$ are unknown, we can only use estimates to replace them, and estimated control ${\hat{u}}_{eq}$ as follows:

{\hat{u}}_{eq} = {\hat{v}}^{T} \hat{Φ} (x_{p}, {\hat{w}}^{I}, {\hat{w}}^{D}, {\hat{w}}^{oI}, {\hat{w}}^{oH}, {\hat{w}}^{HI})

(41)

Remark 2. The main advantage of using neural networks to directly approximate the entire controller is that it reduces reliance on precise models, enhances the system’s adaptability and robustness, and improves the automation of the design process. This makes neural networks often more advantageous than traditional methods when addressing the complex, nonlinear, and high-dimensional control problems associated with UAVs.

Based on the description above, the controller $u$ is proposed to be:

u = {\hat{u}}_{eq} + u_{com} + u_{sw}

(42)

where $u_{com}$ is the compensator controller, the expression for $u_{com}$ consists of the boundary of approximation error due to the neural network, which will be presented at a later stage. By using equation (21) in equation (42), we can get the derivative of time:

\begin{matrix} \overset{\cdot}{S} = \frac{d}{dt} (\overset{\cdot}{e} + β e^{\frac{p}{q}}) \\ = \overset{\cdot\cdot}{e} + β \frac{p}{q} e^{\frac{p}{q} - 1} \overset{\cdot}{e} \\ = f + gu + d - {\overset{\cdot\cdot}{x}}_{d} + β \frac{p}{q} e^{\frac{p}{q} - 1} \overset{\cdot}{e} \\ = f + g ({\hat{u}}_{eq} + u_{com} + u_{sw} + u_{eq} - u_{eq}) \\ + d - {\overset{\cdot\cdot}{x}}_{d} + β \frac{p}{q} e^{\frac{p}{q} - 1} \overset{\cdot}{e} \\ = - g {\tilde{u}}_{eq} + g u_{com} + g u_{sw} \end{matrix}

(43)

where the ${\tilde{u}}_{eq} = u_{eq} - {\hat{u}}_{eq}$ is the estimation error, and ${\tilde{w}}^{I} = w^{I} - {\hat{w}}^{I}$ , ${\tilde{w}}^{D} = w^{D} - {\hat{w}}^{D}$ , ${\tilde{w}}^{oI} = w^{oI} - {\hat{w}}^{oI}$ , ${\tilde{w}}^{oH} = w^{oH} - {\hat{w}}^{oH}$ , ${\tilde{w}}^{HI} = w^{HI} - {\hat{w}}^{HI}$ , $\tilde{Z} = Z - \hat{Z}$ . To derive the neural network TLRNN, we define the weight vectors $w^{I}$ , $w^{D} w^{oI} w^{oH} w^{HI}$ as:

\begin{matrix} w^{I} = {[\begin{matrix} w_{11}^{I}, . . ., w_{1 m}^{I}, . . ., w_{n 1}^{I}, . . ., w_{nm}^{I} \end{matrix}]}^{T} \in R^{mn} \\ w^{D} = {[\begin{matrix} w_{11}^{D}, . . ., w_{1 n}^{D}, . . ., w_{n 1}^{D}, . . ., w_{nn}^{D} \end{matrix}]}^{T} \in R^{nn} \\ w^{oI} = {[\begin{matrix} w_{11}^{oI}, . . ., w_{1 q}^{oI}, . . ., w_{m 1}^{oI}, . . ., w_{mq}^{oI} \end{matrix}]}^{T} \in R^{qm} \\ w^{oH} = {[\begin{matrix} w_{11}^{oH}, . . ., w_{1 q}^{oH}, . . ., w_{n 1}^{oH}, . . ., w_{nq}^{oH} \end{matrix}]}^{T} \in R^{qn} \\ w^{HI} = {[\begin{matrix} w_{11}^{HI}, . . ., w_{1 q}^{HI}, . . ., w_{n 1}^{HI}, . . ., w_{nq}^{HI} \end{matrix}]}^{T} \in R^{nm} \end{matrix}

(44)

From the Taylor expansion, we obtain the following function expression:

\begin{matrix} \tilde{Z} = D_{w}^{I} {\tilde{w}}^{I} + D_{w}^{D} {\tilde{w}}^{D} + D_{w}^{oI} {\tilde{w}}^{oI} \\ + D_{w}^{oH} {\tilde{w}}^{oH} + D_{w}^{HI} {\tilde{w}}^{HI} + O \end{matrix}

(45)

with $O$ for all higher order terms and these coefficient matrixes in equation (45) can be written as:

\begin{matrix} D_{w}^{I} = {[\begin{matrix} \frac{\partial {\tilde{Z}}_{1}}{\partial w^{I}}, \frac{\partial {\tilde{Z}}_{2}}{\partial w^{I}}, . . ., \frac{\partial {\tilde{Z}}_{n}}{\partial w^{D}} \end{matrix}]}^{T} \\ D_{w}^{D} = {[\begin{matrix} \frac{\partial {\tilde{Z}}_{1}}{\partial w^{I}}, \frac{\partial {\tilde{Z}}_{2}}{\partial w^{I}}, . . ., \frac{\partial {\tilde{Z}}_{n}}{\partial w^{D}} \end{matrix}]}^{T}, \\ D_{w}^{oI} = {[\begin{matrix} \frac{\partial {\tilde{Z}}_{1}}{\partial w^{I}}, \frac{\partial {\tilde{Z}}_{1}}{\partial w^{I}}, . . ., \frac{\partial {\tilde{Z}}_{n}}{\partial w^{oI}} \end{matrix}]}^{T} \\ D_{w}^{oH} = {[\begin{matrix} \frac{\partial {\tilde{Z}}_{1}}{\partial w^{I}}, \frac{\partial {\tilde{Z}}_{2}}{\partial w^{I}}, . . ., \frac{\partial {\tilde{Z}}_{n}}{\partial w^{oH}} \end{matrix}]}^{T} \\ D_{w}^{HI} = {[\begin{matrix} \frac{\partial {\tilde{Z}}_{1}}{\partial w^{I}}, \frac{\partial {\tilde{Z}}_{2}}{\partial w^{I}}, . . ., \frac{\partial {\tilde{Z}}_{n}}{\partial w^{HI}} \end{matrix}]}^{T} \end{matrix}

(46)

Substituting equation (41) from equation (40) and using equation (46), the following differences can be obtained:

\begin{matrix} {\tilde{u}}_{eq} = u_{eq} - {\hat{u}}_{eq} \\ = v^{T} Φ + ϵ - {\hat{v}}^{T} \hat{Φ} \\ = {\hat{v}}^{T} \hat{Φ} + {\hat{v}}^{T} \tilde{Φ} + {\hat{v}}^{T} \tilde{Φ} + ϵ \\ = {\hat{v}}^{T} (D_{w}^{I} {\tilde{w}}^{I} + D_{w}^{D} {\tilde{w}}^{D} + D_{w}^{oI} {\tilde{w}}^{oI} + D_{w}^{oH} {\tilde{w}}^{oH} \\ + D_{w}^{HI} {\tilde{w}}^{HI}) + {\hat{v}}^{T} (\hat{H} - D_{w}^{I} {\hat{w}}^{I} - D_{w}^{D} {\hat{w}}^{D} - D_{w}^{oI} {\hat{w}}^{oI} \\ - D_{w}^{oH} {\hat{w}}^{oH} - D_{w}^{HI} {\hat{w}}^{HI}) + Δ \end{matrix}

(47)

where the $Δ = {\tilde{v}}^{T} (D_{w}^{I} w^{I} + D_{w}^{D} w^{D} + D_{w}^{oI} w^{oI} + D_{w}^{oH} w^{oH} + D_{w}^{HI} w^{HI} + O) + ϵ$ is an estimation error, which causes by the unknown value of the ideal weights and approximation error.

Assumption 2. $Δ$ is bounded and $Δ_{p}$ is the maximum boundary ( $| Δ | \leq Δ_{p}$ ).

The $Δ_{p}$ compensates for the approximation error. Considering the above neural network, we can write the control $u$ as:

u = {\hat{v}}^{T} \hat{Φ} + u_{com} + u_{sw}

(48)

where $u_{com} = - {\hat{Δ}}_{p} - K_{c} s$ , ${\overset{\cdot}{\hat{Δ}}}_{p} = η^{p} s^{T}$ and $η^{p}$ is a positive matrix, $K_{c} \in R^{3 \times 1}$ is a control gain of the compensator controller, which is a position matrix.

The real-time updated neural network weights are expressed as follows:

{\begin{matrix} \overset{\cdot}{\hat{v}} = η^{v} (\hat{Φ} - D_{w}^{I} {\hat{w}}^{I} - D_{w}^{D} {\hat{w}}^{D} - D_{w}^{oI} {\hat{w}}^{oI} \\ - D_{w}^{oH} {\hat{w}}^{oH} - D_{w}^{HI} {\hat{w}}^{HI}) s^{T} \\ {\overset{\cdot}{\hat{w}}}^{I} = η^{I} D_{w}^{IT} \hat{v} s \\ {\overset{\cdot}{\hat{w}}}^{D} = η^{D} D_{w}^{DT} \hat{v} s \\ {\overset{\cdot}{\hat{w}}}^{oI} = η^{oI} D_{w}^{oIT} \hat{v} s \\ {\overset{\cdot}{\hat{w}}}^{oH} = η^{oH} D_{w}^{oHT} \hat{v} s \\ {\overset{\cdot}{\hat{w}}}^{HI} = η^{HI} D_{w}^{HIT} \hat{v} s \end{matrix}

(49)

where $η^{I}, η^{D}, η^{oI}, η^{oH}, η^{HI}$ are positive rates of learning and all the weights are bounded.

Theorem 2. Considering the application of the controller (equation (42)) and the updated weights (equation (49)) in the attitude system (equation (16)), the state variables of the attitude system converge to $s (t)$ in a finite time, Furthermore, the tracking error variables can converge to zero in finite time.

Proof. Consider the following Lyapunov function:

\begin{matrix} V_{3} = \frac{1}{2} s^{T} g^{- 1} s + \frac{1}{2 η^{v}} tr ({\tilde{v}}^{T} \tilde{v}) + \frac{1}{2 η^{I}} {({\tilde{w}}^{I})}^{T} {\tilde{w}}^{I} + \frac{1}{2 η^{D}} \\ {({\tilde{w}}^{D})}^{T} {\tilde{w}}^{D} + \frac{1}{2 η^{oI}} {({\tilde{w}}^{oI})}^{T} {\tilde{w}}^{oI} + \frac{1}{2 η^{oH}} {({\tilde{w}}^{oH})}^{T} \\ {\tilde{w}}^{oH} + \frac{1}{2 η^{HI}} {({\tilde{w}}^{HI})}^{T} {\tilde{w}}^{HI} + \frac{1}{2 η^{p}} {({\tilde{Δ}}_{p})}^{T} {\tilde{Δ}}_{p} \end{matrix}

(50)

Taking the derivative of $V_{3}$ :

\begin{matrix} {\overset{\cdot}{V}}_{3} = s^{T} g^{- 1} \overset{\cdot}{s} + \frac{1}{η^{v}} tr ({\tilde{v}}^{T} \overset{\cdot}{\tilde{v}}) + \frac{1}{η^{I}} {({\tilde{w}}^{I})}^{T} {\overset{\cdot}{\tilde{w}}}^{I} + \frac{1}{η^{D}} \\ {({\tilde{w}}^{D})}^{T} {\overset{\cdot}{\tilde{w}}}^{D} + \frac{1}{η^{oI}} {({\tilde{w}}^{oI})}^{T} {\overset{\cdot}{\tilde{w}}}^{oI} + \frac{1}{η^{oH}} {({\tilde{w}}^{oH})}^{T} \\ {\overset{\cdot}{\tilde{w}}}^{oH} + \frac{1}{η^{HI}} {({\tilde{w}}^{HI})}^{T} {\overset{\cdot}{\tilde{w}}}^{HI} + \frac{1}{η^{p}} {({\tilde{Δ}}_{p})}^{T} {\overset{\cdot}{\tilde{Δ}}}_{p} \end{matrix}

(51)

Substituting equations (43), (47) and (49) into equation (51), we can get:

{\overset{\cdot}{V}}_{3} = s^{T} (u_{com} + u_{sw} - Δ) - \frac{1}{η^{p}} {\tilde{Δ}}_{p}^{T} {\overset{\cdot}{\hat{Δ}}}_{p}

(52)

When $0 \leq t \leq t_{1}$ :

\begin{matrix} {\overset{\cdot}{V}}_{3} = - s^{T} [K_{c} sign (s) + (L_{1} t + L_{0}) {| s |}^{\frac{1}{2}} sign (s) \\ + \int {(L_{1} t + L_{0})}^{2} sign (s) dt - Δ] - \frac{{\tilde{Δ}}_{p}^{T} {\overset{\cdot}{\tilde{Δ}}}_{p}}{η^{p}} \\ = - K_{c} ∥ s ∥^{2} - s^{T} (Δ_{p} - | Δ |) \\ - (L_{1} t + L_{0}) {| s |}^{\frac{1}{2}} | s^{T} | - \int^{{(L_{1} t + L_{0})}^{2}} dt | s^{T} | \\ \leq - λ_{\min} K_{c} ∥ s ∥^{2} - Γ_{1} \end{matrix}

(53)

where $Γ_{1} = - (L_{1} t + L_{0}) {| s |}^{\frac{1}{2}} | s^{T} | - \int {(L_{1} t + L_{0})}^{2} dt | s^{T} |$ . According to the definition of barrier function and Assumption 2, we can calculate that ${\overset{\cdot}{V}}_{3} \leq$ 0.

2. When $t \geq t_{1}$ :

\begin{matrix} {\overset{\cdot}{V}}_{3} = - s^{T} [K_{c} sign (s) - \frac{ε b}{ε - | s |} {| s |}^{\frac{1}{2}} sign (s) - \int (\frac{ε^{2} b^{2}}{{(ε - | s |)}^{2}}) sign (s) dt - Δ] \\ - \frac{{\tilde{Δ}}_{p}^{T} {\overset{\cdot}{\tilde{Δ}}}_{p}}{η^{p}} = - K_{c} ∥ s ∥^{2} - s^{T} (Δ_{p} - | Δ |) \\ - (\frac{ε^{2} b^{2}}{{(ε - | s |)}^{2}}) {| s |}^{\frac{1}{2}} | s^{T} | - \int (\frac{ε^{2} b^{2}}{{(ε - | s |)}^{2}}) dt | s^{T} | \\ \leq - λ_{\min} K_{c} ∥ s ∥^{2} - Γ_{2} \end{matrix}

(54)

where $Γ_{2} = - (\frac{ε^{2} b^{2}}{{(ε - | s |)}^{2}}) {| s |}^{\frac{1}{2}} | s^{T} | - \int (\frac{ε^{2} b^{2}}{{(ε - | s |)}^{2}}) dt | s^{T} |$ . According to the definition of barrier function and Assumption 2, we can calculate that ${\overset{\cdot}{V}}_{3} \leq$ 0.

Due to $V_{3}$ is negative semidefinite and $V_{3}$ is bounded, we ensure that $s \to 0$ as $t \to \infty$ . Moreover, to show that the tracking error converges in finite time, we should guarantee that $s \to 0$ in finite time. Choose the following Lyapunov equation:

V_{4} = \frac{1}{2} s^{T} g^{- 1} s

(55)

The derivative of $V_{3}$ is as follows:

{\overset{\cdot}{V}}_{4} = s^{T} g^{- 1} \overset{\cdot}{s} = s^{T} (u_{eq} + u_{com} + u_{sw})

(56)

According to the definition of barrier function in equation (25), we divide the proof into two parts:

When $0 \leq t \leq t_{1}$ :

\begin{matrix} {\overset{\cdot}{V}}_{4} = - s^{T} [- {\tilde{u}}_{eq} + {\hat{Δ}}_{p} + K_{c} s + (L_{1} t + L_{0}) {| s |}^{\frac{1}{2}} sign (s) \\ + \int {(L_{1} t + L_{0})}^{2} sign (s) dt] \\ \leq - λ_{\min} K_{c} ∥ s ∥^{2} + \frac{∥ s ∥^{2}}{2} + Λ \\ \leq - ℑ_{1} V_{4} + Λ \end{matrix}

(57)

where $ℑ_{1} = \frac{2 λ_{\min} K_{c} - 1}{λ_{\max} (g^{- 1})}$ and $Λ = \frac{1}{2} {(∥ {\tilde{u}}_{eq} - {\hat{Δ}}_{p} ∥)}^{2}$ is a positive constant, which can be considered a very small quantity. According to Bhat and Bernstein,³⁹ concluding that $s$ converges in a predefined neighbourhood of zero in finite time.

2. When $t \geq t_{1}$ :

\begin{matrix} {\overset{\cdot}{V}}_{4} = - s^{T} [- {\tilde{u}}_{eq} + {\hat{Δ}}_{p} + K_{c} s + \frac{ε b}{ε - | s |} {| s |}^{\frac{1}{2}} sign (s) \\ + \int \frac{ε^{2} b^{2}}{{(ε - | s |)}^{2}} sign (s) dt] \\ \leq - λ_{\min} K_{c} ∥ s ∥^{2} + \frac{∥ s ∥^{2}}{2} + Λ \\ \leq - ℑ_{2} V_{4} + Λ \end{matrix}

(58)

where $ℑ_{1} = \frac{2 λ_{\min} K_{c} - 1}{λ_{\max} (g^{- 1})}$ and $Λ = \frac{1}{2} {(∥ {\tilde{u}}_{eq} - {\hat{Δ}}_{p} ∥)}^{2}$ is a positive constant, which can be considered a very small quantity. According to Bhat and Bernstein,³⁹ the $s$ will converge in a predefined neighbourhood of zero in finite time.

According to the proof of Theorem 1, the attitude system will converge to the desired trajectory in a finite time. Therefore, the proof is finished.

Simulation results

To demonstrate the superiority, three experiments are considered to validate the proposed TLRNN-based sliding mode controller on a quad-rotor. Firstly, comparison with a double loop recurrent neural network in Fei and Lu,⁴⁰ that is, DLRNN. Secondly, compared with traditional neural network architectures, that is, RBF. Finally, compared with an integral fast terminal sliding mode controller in Ghadiri et al.,³³ that is, IFTSMC.

Parameter section

$Modelling and controller parameters$ : The parameters of the quad-rotor are selected as $m = 2.3 kg$ , $g = 9.8 N / kg$ and $jxx = 0.0552 {kg}^{*} m^{2}$ , $jyy = 0.0552 {kg}^{*} m^{2}$ , $jzz = 0.1104 {kg}^{*} m^{2}$ . In order to obtain a specific representation of the quad-rotor system, we choose the initial value of the system $[ϕ, θ, ψ]^{T} = [1, 1, 1]^{T}$ , and different external disturbances are selected in three attitude directions, $d (t)$ = 0.5 ${[\sin (t), \cos (t), \sin (2 t)]}^{T}$ . The parameter matrices of the proposed sliding mode are $β$ , $p$ and $q$ , which need to be satisfied that $β$ is sufficiently large and these matrices are all positive. Three parameter matrices are designed as $β = [6.3, 5.8, 5.7]^{T}, p = [3.99, 3.5, 4.89]^{T}$ and $q = [4, 4, 5]^{T}$ . For the mentioned variable gain $L (t, s)$ in equation (25), where the parameters are $L_{0} = 10, L_{1} = 10$ , $b = 1$ and $ε = 0.03$ . $u_{com}$ is used to compensate for the approximation error due to the TLRNN, and the gain matrix $K_{C}$ is set as $[0.01, 0.01, 0.01]^{T}$ . Besides, three signals $[ϕ_{d}, θ_{d}, ψ_{d}]^{T}$ = ${[\sin (2 t), \cos (t), \sin (t)]}^{T}$ are designed as the desired attitude trajectories.

Sliding mode parameters: The parameters $p$ and $q$ were chosen to satisfy the finite-time convergence condition 0 ¡ $p_{i}$ / $q_{i}$ ¡ 1. The values were selected to provide a smooth yet sufficiently aggressive convergence profile. The parameter $β$ primarily governs the convergence rate in the sliding phase. A larger $β$ leads to faster convergence but may induce overshoot. Its value was incrementally increased in simulations until a satisfactory response was obtained without significant overshoot.

Super-twisting gains: The initial linear gain $L_{0}$ was chosen to be large enough to ensure the initial establishment of the sliding mode. The ramp gain $L_{1}$ was then tuned to ensure a smooth transition to the barrier function-based gain. The barrier function parameters b and $ε$ define the adaptive gain’s behaviour. $ε$ defines the boundary layer and b defines the minimum gain inside it. These were tuned to effectively suppress chattering while maintaining robustness against the estimated disturbances.

NN learning rates: The learning rates for the TLRNN were set to a common, relatively small value (0.001) to ensure stable and smooth weight adaptation without causing overshoot or oscillation in the control signal. This is a standard practice in adaptive control to ensure the separation of dynamics between the fast controller and the slower parameter update laws.

Compensator gain: This gain provides additional robustness. It was set to a small value (0.01) to minimally influence the nominal controller performance while ensuring steady-state error elimination.

$TLRNN parameters$ : The inputs to the TLRNN are nine different signals: $e \in R^{3}, \overset{\cdot}{e} \in R^{3}$ and $\overset{\cdot\cdot}{Γ} \in R^{3}$ . TLRNN has the structure of $m - n - q$ , to ensure that the neural network performs better in tracking the desired trajectory and to minimize the increase in complexity, we select the best parameter values after several comparisons, and $m$ = 9, $n$ = 7, $q$ = 3. The weights of TLRNN are updated online, and we design the weights at the initial time as $1.0 \times 10^{- 10}$ . The value of $η^{I}, η^{D}, η^{oI}, η^{oH}, η^{HI}$ can change the weight parameters, which can lead to changes in the control action, and we have chosen their values to be $η^{I} = η^{D} = η^{oI} = η^{oH} = η^{HI} = η^{v} = η^{p} = 1.0 \times 10^{- 3}$ .

The limits of input saturation are: $u_{\min} = - 1$ and $u_{\max} = 1$ for $DLRNN$ ; $u_{\min} = - 0.25$ and $u_{\max} = 0.25$ for $RBF$ ; $u_{\min} = - 0.25$ and $u_{\max} = 0.25$ for $TLRNN$ .

Comparative analysis

Figure 4 shows the tracking results of the actual attitude compared to the desired attitude, which is obtained using the three different methods. In the figure, we can see that roll, pitch and yaw signals are successful in tracking the desired trajectory, and that the proposed method is the fastest. Therefore, we can also conclude that the proposed method is better than the other two methods in the presence of perturbations.

Figure 4.

$ϕ, θ,$ and $ψ$ tracking results.

After incorporating multiple rounds of simulation experiments, the comparison of stabilization times under the stabilization error threshold of 0.01 is presented in Table 1 below.

Table 1.

Stabilization time comparison under different neural network estimators.

Error term	DLRNN	RBF	IFTSMC	TLRNN
$e_{ϕ}$	1.38 s	0.95 s	2.01 s	0.68 s
$e_{θ}$	0.61 s	0.18 s	0.38 s	0.07 s
$e_{ψ}$	1.63 s	1.05 s	2.27 s	0.75 s

To demonstrate the performance comparison under different payload conditions, we doubled the mass parameter $m$ of the UAV. Due to space limitations, the superiority of the proposed scheme is illustrated through the system stabilization time. The comparison under a stabilization error threshold of 0.01 is presented in the Table 2 below:

Table 2.

Stabilization time comparison under different neural network estimators with $2 m$ .

Error term	DLRNN	RBF	IFTSMC	TLRNN
$e_{ϕ}$	1.76 s	1.03 s	2.23 s	0.78 s
$e_{θ}$	0.69 s	0.19 s	0.59 s	0.07 s
$e_{ψ}$	2.07 s	1.31 s	2.71 s	0.92 s

The Figure 5 shows a comparison of the tracking errors for the three attitude angles. From the figure, it can be seen that the three tracking errors obtained by the proposed method converge to zero in the shortest possible time.

Figure 5.

$ϕ$ , $θ$ , $ψ$ tracking errors.

The comparison of the sliding surfaces of the three methods can be shown in Figure 6. The sliding surfaces under each method converge to the origin in finite time, and the proposed method is superior to other methods in convergence time.

Figure 6.

$ϕ, θ,$ and $ψ$ sliding surfaces.

In addition, the control inputs of three attitudes are shown in Figure 7. The proposed method has more stable control inputs and less chattering compared to using RBF and DLRNN. Based on the described barrier function, an effective reduction of the control input can be observed after the time $t_{1}$ . Moreover, the time-varying disturbance used in the text is shown in Figure 8.

Figure 7.

Control inputs $u_{ϕ}$ , $u_{θ}$ , $u_{ψ}$ .

Figure 8.

Time-varying disturbance.

Furthermore, several performance metrics related to tracking errors are utilized to demonstrate the effectiveness of the proposed methodology.

Integral absolute error (IAE): IAE = $\int_{0}^{T} ∣ e (t) ∣ dt$ . According to Figure 9(a), the proposed controller has the smallest average IAE, indicating that the controller proposed in this study demonstrates superior control performance.

Integral time absolute error (ITAE): ITAE = $\int_{0}^{T} t ∣ e (t) ∣ dt$ . A comparison of the ITSE of the three control strategies, as illustrated in Figure 9(b), reveals that the proposed control scheme continues to exhibit the smallest average value.

The triple-loop feedback structure of the TLRNN enables a more dynamic memory and richer representation of the system’s uncertain dynamics ( $f (x, t) + d (t)$ ). This leads to a more precise estimation of the equivalent control law ( ${\hat{u}}_{eq}$ ), minimizing the approximation error ( ${\hat{u}}_{eq}$ ). The TLRNN converges ∼30% faster and to a more stable value. Consequently, the compensator controller ( $u_{com}$ ) and the switching controller ( $u_{sw}$ ) are required to handle a much smaller residual uncertainty.

Figure 9.

Performance metrics comparison: (a) IAE and (b) ITAE.

The ITAE metric specifically penalizes errors that persist over time. The barrier function-based gain $L (t, s)$ , is pivotal in minimizing ITAE. As shown in the new Figure 9, when the tracking error $| e (t) |$ is large, the gain $L (t, s)$ rapidly increases to its maximum allowable value ( $L_{1} t + L_{0}$ ), injecting strong control effort to drive the error down aggressively. This rapid initial response directly reduces the time component of the ITAE integral. As the error approaches zero and enters the predefined boundary layer ( $| s | \to ε$ ), the gain adapts according to $L_{b} (s) = \frac{ε b}{ε - | s |}$ . This provides just-sufficient gain to maintain performance without introducing excessive chattering, which would increase the IAE. In contrast, the fixed-gain controllers (RBF) or less adaptive structures (DLRNN) must use conservatively high gains for worst-case scenarios, leading to more chattering (higher IAE) or slower convergence (higher ITAE).

In conclusion, according to the tracking trajectories, tracking errors, sliding mode surfaces and control inputs for roll, pitch and yaw angle under four different methods, the finite time tracking of quad-rotor UAV attitude by the proposed method can be obtained, which also proves the fast and accurate estimation of unknown dynamics by TLRNN and ensures the good stability of the system.

Additionally, we have deployed a quadrotor UAV hardware platform to validate the proposed algorithm in Figure 10. The UAV is equipped with an integrated flight control system embedded in the onboard computer (avionics computer), which is powered by a dedicated battery module. A real-time data communication module enables bidirectional interaction with the ground control station (GCS), facilitating the transmission of flight commands from the GCS to the UAV and the feedback of flight status data to the ground. The entire system operates through the coordinated execution of the power module, actuator, flight control unit and data communication unit, forming a closed-loop control structure that ensures stable and reliable autonomous flight.

Figure 10.

The quadrotor’s hardware structure.

The proposed TLRNN controller in this work indeed introduces higher computational complexity compared to linear controllers (e.g. PID) or feedforward neural networks (e.g. RBF). This complexity primarily stems from the forward computation of the triple feedback loops and the online parameter update laws (equation (49)). A preliminary analysis indicates that for a network structure with m = 9, n = 7, q = 3, the computational load per control cycle is estimated to be within 400 floating point operations (FLOPs). Although this exceeds the requirements of simpler controllers, such a computational demand remains well within the capabilities of modern mainstream embedded flight control processors (e.g. STM32H7 series), allowing stable operation at control frequencies of 1 kHz or even higher. The real-time feasibility of the algorithm has been validated through simulations with a control period of 1 ms (1 kHz).

Conclusion

A triple-loop recurrent neural network super-twisting terminal sliding mode control scheme based on the barrier function is proposed for the attitude tracking control problem of a quad-rotor UAV with model uncertainty. The new neural network TLRNN is mainly used to approximate the unknown part of the equivalent controller. Compared with the conventional neural network, the newly designed TLRNN better estimates the unknown dynamics and can achieve better approximation performance. The parameters of the newly designed neural network are mediated online by adaptive laws. In addition, using a variable gain super-twisting algorithm based on a barrier function, the scheme also effectively reduces chattering. The proposed technique is compared with the conventional neural networks RBF and DLRNN, and the experimental results show that the proposed technique is superior to RBF and DLRNN in the attitude tracking control of quad-rotor UAVs.

Limitations and future work

Although the proposed TLRNN-based controller demonstrates superior performance in simulations, several challenges remain. Firstly, the computational burden of the triple-loop structure may limit its application in real-time systems with limited processing power. Future work will focus on optimizing the network structure and implementing it on embedded platforms. Secondly, the performance relies on the initial weights and learning rates, which may require careful tuning. Automated hyperparameter optimization techniques could be explored. Lastly, the proposed TLRNN introduces additional computational load compared to traditional controllers due to its triple-loop feedback structure and online parameter updates. A theoretical analysis estimates the complexity to be approximately $O (mn + nn + nq + mq)$ for network evaluation and a similar order for weight updates. For the selected architecture $(m = 9, n = 7, q = 3)$ , this translates to an estimated load of under 400 FLOPs per control cycle. This is well within the capabilities of modern embedded flight control processors.

Footnotes

Handling Editor: Chaofang Hu

ORCID iD

Qinglei Li

Funding

The author received no financial support for the research, authorship and/or publication of this article.

Declaration of conflicting interests

The author declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

References

Alzahrani

Oubbati

Barnawi

, et al. UAV assistance paradigm: state-of-the-art in applications and challenges. J Netw Comput Appl 2020; 166: 102706.

Mohsan

SAH

Othman

NQH

, et al. Unmanned aerial vehicles (UAVs): practical aspects, applications, open challenges, security issues, and future trends. Intell Serv Robot 2023; 16(1): 109–137.

Gedefaw

Abera

Abdissa

CM.

A review of modeling and control techniques for unmanned aerial vehicles. Eng Rep 2025; 7(6): e70215.

Fang

Han

Wang

, et al. Routing UAVs in landslides monitoring: a neural network heuristic for team orienteering with mandatory visits. Transp Res E Logist Transp Rev 2023; 175: 103172.

Liu

, et al. Multi-object tracking meets moving UAV. In: Proceedings of the IEEE/CVF confernce on computer vision and pattern recognition, New Orleans, LA, USA, 21–24 June 2022, pp. 8876–8885.

Bravo

RZB

Leiras

Cyrino Oliveira

. The use of UAVs in humanitarian relief: an application of POMDP-based methodology for finding victims. Prod Oper Manag 2019; 28(2): 421–440.

Ullah

Sami

Shaoping

, et al. A computationally efficient adaptive robust control scheme for a quad-rotor transporting cable-suspended payloads. Proc Inst Mech Eng G J Aerosp Eng 2022; 236(2): 379–395.

Dou

Wen

An adaptive robust attitude tracking control of quadrotor UAV with the modified Rodrigues parameter. Meas Control 2022; 55(9–10): 1167–1179.

Menyechel Eneyew

Ayalew Asfaw

Merga Abdissa

. Optimized backstepping fuzzy sliding mode controller for trajectory tracking of mobile manipulator. Eng Rep 2025; 7(7): e70269.

10.

Dirara

Yareshe

Abdissa

CM.

Design and analysis of adaptive fuzzy super-twisting sliding mode controller for uncertain 2-DOF robotic manipulator. IEEE Access 2025; 2025: 99.

11.

Shraim

Awada

Youness

A survey on quadrotors: configurations, modeling and identification, control, collision avoidance, fault diagnosis and tolerant control. IEEE Aerosp Electron Syst Mag 2018; 33(7): 14–33.

12.

Tian

Ouyang

, et al. Sway and disturbance rejection control for varying rope tower cranes suffering from friction and unknown payload mass. Nonlinear Dyn 2021; 105(4): 3149–3165.

13.

Labbadi

Cherkaoui

Robust adaptive nonsingular fast terminal sliding-mode tracking control for an uncertain quadrotor UAV subjected to disturbances. ISA Trans 2020; 99: 290–304.

14.

Zhao

Zhang

, et al. Terminal sliding mode control with self-tuning for coronary artery system synchronization. Int J Biomath 2017; 10(3): 1750041.

15.

Mehmood

Aslam

Ullah

, et al. Adaptive robust trajectory tracking control of multiple quad-rotor UAVs with parametric uncertainties and disturbances. Sensors 2021; 21(7): 2401.

16.

Nian

Zhou

, et al. 2-D path following for fixed wing UAV using global fast terminal sliding mode control. ISA Trans 2023; 136: 162–172.

17.

Mofid

Mobayen

Adaptive sliding mode control for finite-time stability of quad-rotor UAVs with parametric uncertainties. ISA Trans 2018; 72: 1–14.

18.

Qiu

Cai

Peng

. Predefined time consensus control of nonlinear multi-agent systems for Industry 5.0. IEEE Trans Consum Electron 2023; 70(1): 1913–1922.

19.

Zhao

Jia

She

, et al. Robust model-free super-twisting sliding-mode control method based on extended sliding-mode disturbance observer for PMSM drive system. Control Eng Pract 2023; 139: 105657.

20.

Tan

Jing

Gao

, et al. Adaptive improved super-twisting integral sliding mode guidance law against maneuvering target with terminal angle constraint. Aerosp Sci Technol 2022; 129: 107820.

21.

Bosera

Olana

Merga

, et al. Adaptive PSO based gain optimization of sliding mode control for position tracking control of magnetic levitation systems. In: 2022 International conference on information and communication technology for development for africa (ICT4DA). Bahir Dar, Ethiopia, 25–27 November 2022, pp. 157–162. IEEE.

22.

Kahouadji

Mokhtari

Choukchou-Braham

, et al. Real-time attitude control of 3 DOF quadrotor UAV using modified super twisting algorithm. J Franklin Inst 2020; 357(5): 2681–2695.

23.

Abera

Abdissa

Lemma

LN.

An improved nonsingular adaptive super twisting sliding mode controller for quadcopter. PLoS One 2024; 19(10): e0309098.

24.

Obeid

Laghrouche

Fridman

, et al. Barrier function-based adaptive super-twisting controller. IEEE Trans Autom Control 2020; 65(11): 4928–4933.

25.

Zhao

Zhang

, et al. Observer-based

h_{\infty}

synchronization control for input and output time-delays coronary artery system. Asian J Control 2019; 21(3): 1142–1152.

26.

Zhi

Zhao

Event-triggered finite-time consensus control of leader–follower multi-agent systems with unknown velocities. Trans Inst Meas Control 2023; 45(13): 2515–2525.

27.

Jiang

Zhou

, et al. Neural network based model predictive control for a quadrotor UAV. Aerospace 2022; 9(8): 460.

28.

Al Aela

Kenne

Mintsa

. Adaptive neural network and nonlinear electrohydraulic active suspension control system. J Vib Control 2020; 28(3–4): 243–259.

29.

Zhuang

Multiscale computation on feedforward neural network and recurrent neural network. Front Struct Civ Eng 2020; 14(6): 1285–1298.

30.

Flores

. Transition control of a tail-sitter UAV using recurrent neural networks. In: 2009 6th International conference on electrical engineering, computing science and automatic control (CCE). Toluca, Mexico, 10–13 January 2009, pp. 1–6. IEEE.

31.

Liu

Wang

Zeng

An overview of the stability analysis of recurrent neural networks with multiple equilibria. IEEE Trans Neural Netw Learn Syst 2021; 34(3): 1098–1111.

32.

Jiang

Cai

Liu

, et al. An integrated tracking control approach based on reinforcement learning for a continuum robot in space capture missions. J Aerosp Eng 2022; 35(5): 04022065.

33.

Ghadiri

Khodadadi

Hazareh

GA.

Finite-time integral fast terminal sliding mode control for uncertain quadrotor UAV based on state-dependent Riccati equation observer subjected to disturbances. J Vib Control 2024; 30(11–12): 2528–2548.

34.

Tee

SS.

Control of nonlinear systems with partial state constraints using a barrier Lyapunov function. Int J Control 2011; 84(12): 2008–2023.

35.

Tee

Tay

EH.

Barrier Lyapunov functions for the control of output-constrained nonlinear systems. Automatica 2009; 45(4): 918–927.

36.

Song

Sun

Finite-time control for nonlinear spacecraft attitude based on terminal sliding mode technique. ISA Trans 2014; 53(1): 117–124.

37.

Yang

, et al. Continuous nonsingular terminal sliding mode control for systems with mismatched disturbances. Automatica 2013; 49(7): 2287–2291.

38.

Moreno

. A linear framework for the robust stability analysis of a generalized super-twisting algorithm. In: 2009 6th International conference on electrical engineering, computing science and automatic control (CCE). Toluca, Mexico, 10–13 January 2009, pp.1–6. IEEE.

39.

Bhat

Bernstein

DS.

Finite-time stability of continuous autonomous systems. SIAM J Control Optim 2000; 38(3): 751–766.

40.

Fei

Adaptive sliding mode control of dynamic systems using double loop recurrent neural network structure. IEEE Trans Neural Netw Learn Syst 2017; 29(4): 1275–1286.

A novel RNN terminal sliding mode controller design based on barrier function for uncertain quad-rotor UAV systems

Abstract

Keywords

Introduction

Problem formulation and preliminaries

Design of adaptive recurrent neural network sliding mode control

Terminal sliding mode controller

TLRNN structure

Design of the attitude controller

Simulation results

Parameter section

Comparative analysis

Conclusion

Limitations and future work

Footnotes

ORCID iD

Funding

Declaration of conflicting interests

References