Abstract
The control of nonlinear dynamics is gaining increasing attention since many practical systems are with such kind of characteristics. To deal with the system uncertainty, in this paper, the efficient learning control using neural network is proposed for the nonlinear strict-feedback system. The whole scheme is with the back-stepping design, while the novel learning is proposed for the neural network weights update. To deal with the approximation error, the robust item is added. The stability of the closed-loop dynamics is analysed and the effectiveness of the design is verified through flight simulation.
Introduction
Nonlinear dynamics exists in many practical systems such as robots, 1 manipulators, 2 flight vehicle, 3 quadrotors, 4 and MEMS (microelectromechanical system) gyroscope. 5 Control of nonlinear dynamics6–8 is challenging since the design should be according to the structure while the nonlinearity is difficult to deal with. For the nonlinear design, the method is based on Lyapunov theory to make the energy decreasing. One important concern is on how to deal with the unknown nonlinear function. Some designs are using the knowledge of upper bound, while some works are based on the linearized parametric model. The other concern is on the form of the dynamics. For example, the controllability canonical form is different from the strict-feedback system and the pure-feedback system. For the dynamics in controllability canonical form, the main design can be with the error surface and then the robust design can be used. The control of the strict-feedback system is well studied using back-stepping and dynamic surface design. Though the design might be efficient, how to deal with the unknown dynamics is not easy since there might be not enough information to construct the linearized parametric model.
The intelligent control9–11 can provide learning-based structure and the design is more convenient. Many works can be found as approximate control, reinforcement learning control, adaptive dynamic programming control,
9
fault tolerant control,
12
and so on. Though the motivation might be quite different, the idea is clear that the neural network (NN) can be used as bridge between known and unknown. Some works can be further included with disturbance observer,13,14 sliding mode design, and
For the system with unknown nonlinear functions, the NN can be used for approximation. Typically, two kinds of designs are widely designed. One way is to approximate the ideal control input, while the other is towards the nonlinear function approximation. The designs can be found in the literature,16–18 while the application can be found in robot system, ship system, flight system, and mechanical systems. If the nonlinear function can be approximated as precise as possible, the tracking performance can be better. 19 However, most neural control is on the closed-loop stability using the tracking error to tune the NN weights. In this way, the system can be stable but after checking carefully, the approximation is far from the true value of the nonlinear function. Recently, some works have been towards the learning improvement using composite learning,20,21 In the design, the theoretical analysis is rigorously presented, while the prediction error is constructed using the dynamics and the approximation. In practice, more practical designs are expected if more system information can be obtained such as the derivative of the system state.
Based on the above-mentioned discussion, there exist many works on intelligent control. But most designs are towards the function approximation, and then the controller is constructed to obtain the system stability. During the process, the attention is on closed-loop system stability using tracking error to tune the weights. But the adaptation of the intelligent system is not sufficiently considered. Thus, in this paper, the approximation performance is considered and the new prediction error is given using the derivative of the system state. Furthermore, the efficient learning update law is constructed and the closed-loop system stability is analysed.
The structure of the paper is given as follows. Section ‘Model dynamics and problem formulation’ presents the nonlinear strict-feedback dynamics. Sections ‘Efficient learning control’ and ‘Stability analysis’ present the learning control and the closed-loop stability analysis, respectively. The verification is presented in section ‘Simulation’. Section ‘Conclusions and future works’ gives the conclusions and the future discussions.
Model dynamics and problem formulation
In this paper, the following dynamics with strict-feedback form is considered
where
Assumption 1
The system states
Assumption 2
The functions
Efficient learning control
For the strict-feedback design, the back-stepping scheme is of great interest since the design can break the complex dynamics into several simple dynamics. The main difficulty is the so-called ‘explosion of complexity’. Several designs can be introduced for simplification such as dynamic surface control and the command filtered back-stepping. In this paper, the derivative of the virtual control signal is obtained using
Step 1. As the first equation shown in dynamics (1), it is known that
where
Define the tracking error
where
Design virtual control
where
Define
where
Define the prediction error as
Since
Then, the following equality can be obtained
The NN weight update is given as
where
Step i. As the ith equation shown in dynamics (1) and using NN to approximate
where
Design virtual control
where
The derivative of
where
The prediction error is constructed as
Since
Then, the following equality can be obtained
The NN weight update is given as
where
Step n. As the nth equation shown in dynamics (1) and using NN to approximate
where
The final control signal
where
The derivative of
where
The prediction error is constructed as
Since
Then, the following equality can be obtained
The NN weight update is given as
where
Stability analysis
Theorem 1
Consider the dynamics (1) with the virtual signals (4), (11), (18) and the neural weights update (9), (16), (23). Then, the tracking errors are bounded.
Proof
The Lyapunov function is selected as
The derivative of
Then it is calculated as
The equation is further obtained as
where
Simulation
The flight dynamics
22
are presented with attack angle α, flight path angle (FPA)
Define
The way of using the tracking error to update the neural weight is denoted as ‘Method 1’, while the design in this paper is named as ‘Proposed Method’ which means the predictor-based update design. To show the performance, the index is selected as
In the simulation, the altitude will climb from 86,000 to 87,000 ft in 50 s, while the altitude will decrease from 87,000 to 85,000 ft in the next 50 s. Given the reference signal of altitude, the flight path angle is generated through the similar way as in Zhang et al. 23 In Example 1, there is no noise, while in Example 2 there exist noise for a and q.
Example 1
The simulation results are presented in Figures 1–6. It is clear that the proposed method obtains better tracking performance with high tracking precision for system states tracking in Figures 1 and 2. The control input responds smoother as shown in Figure 3. The neural approximation is depicted in Figure 5, while the trajectory of NN weights is shown in Figure 6. The tracking performance is demonstrated in Figure 4. Overall, the proposed method achieves the better convergence and the higher tracking accuracy.

Altitude tracking.

System states.

Elevator deflection.

Performance index.

NN approximation.

NN weights.
Example 2
The random noises with amplitude 0.0001 and 0.001 are added for a and q. In Figure 7, the system response is demonstrated, while in Figure 8 the elevator deflection is depicted. Furthermore, the NN response is shown in Figure 9. It is interesting to see that the proposed method can achieve much better performance in case of measurement noise. Also from the response of the elevator deflection and the NN weights, more chattering occurs in case of noise.

System states with random noises.

Elevator defection.

NN weights with random noises.
Conclusions and future works
The efficient learning-based control is designed for the strict-feedback systems. The design constructs the signal to obtain the prediction error for the neural weight update. The system stability is analysed and the control performance is verified through nonlinear dynamic simulation.
For the future work, the output-feedback design can be studied. In reality, the time-varying disturbance exists in the dynamics and the new estimation design can be analyzed. For practical applications, the method can be applied to manipulators, underwater vehicles, quadrotor, and automobile dynamics for experimental purpose.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
