Decentralized Neural Backstepping Control Applied to a Robot Manipulator

Abstract

This paper presents a discrete-time decentralized control scheme for trajectory tracking of a two degrees of freedom (DOF) robot manipulator. A high order neural network (HONN) is used to approximate a decentralized control law designed by the backstepping technique as applied to a block strict feedback form (BSFF). The weights for each neural network are adapted online by an extended Kalman filter training algorithm. The motion for each joint is controlled independently using only local angular position and velocity measurements. The stability analysis for the closed-loop system via the Lyapunov approach is included. Finally, the real-time results show the feasibility of the proposed control scheme using a robot manipulator.

Keywords

Decentralized control High-Order Neural Networks Extended Kalman Filter Backstepping

1. Introduction

Robotic arm control has become a significant research area for different applications due to the relevancy that they have acquired in performing tasks classified as dangerous or which require higher accuracy. In this context, different control schemes have been proposed to guarantee efficient trajectory tracking and stability [1, 2]. Fast advances in computational technology offer different possibilities for implementing control algorithms within the approach of a centralized control design [3]. However, it is a great challenge to obtain an efficient control for these systems, due to their highly nonlinear complex dynamics with strong interconnections, parameters which are difficult to measure and unmodelled dynamics. Considering only the most important terms on the mathematical model, control algorithms with a great number of mathematical operations are required, which affect real-time implementation feasibility.

In [4], the authors present a control approach based on open-loop optimization using a genealogical decision tree (GDT), which can be used for solving both tracking and regulation. A novel task-space robust control approach with suitable tracking performance under imperfect transformation is presented in [5]; the proposed control law is derived based on Lyapunov's direct method to guarantee stability in the presence of both structured and unstructured uncertainties. On the other hand, within the area of control systems theory, for more than three decades, an alternative approach has been developed considering a global system as a set of interconnected subsystems, for which it is possible to design independent controllers, considering only local variables to each subsystem: decentralized control [6, 7]. This type of control has been applied in robotics, mainly in cooperative multiple mobile robots and robot manipulators, where it is natural to consider each mobile robot or each manipulator as a subsystem of the whole system. For robot manipulators, each joint and the respective link is considered as a subsystem in order to develop local controllers, which consider only local angular position and angular velocity measurements, and compensate for interconnections effects, usually assumed as disturbances. The resulting controllers are easily implemented for real-time applications [8]. In [9], a decentralized control for robot manipulators is developed, decoupling the dynamic model of the manipulator in a set of linear subsystems with uncertainties and simulations for a two joint robot are included.

In [10], a decentralized control for robot manipulators is reported; it is based on the estimation of each joint dynamics using feedforward neural networks. A decentralized control scheme, on the basis of a recurrent neural identifier and the block control structure, is shown in [11]. This approach was tested only via simulation, with a two degrees of freedom robot manipulator, and with an Articulated Nimble Adaptable Trunk (ANAT) manipulator, with seven degrees of freedom. In [12], the authors propose a robust adaptive decentralized control algorithm for trajectory tracking of robot manipulators. The controller, which consists of a Proportional plus Derivative (PD) feedback part and a dynamic compensation part, is designed based on the Lyapunov methodology.

In recent literature about adaptive and robust control, numerous approaches have been proposed for the design of nonlinear control systems. Among these, adaptive backstepping constitutes a major design methodology [13]. The idea behind the backstepping approach is that some appropriate functions of state variables are selected recursively as virtual control inputs for lower dimension subsystems of the overall system. Each backstepping stage results in a new virtual control design from the preceding design stages; when the procedure ends, a feedback design for the true control input results, which achieves the original design objective.

In this paper, the authors present a decentralized neural backstepping approach in order to design a suitable controller for each subsystem. Afterwards, each local controller is approximated by a high order neural network (HONN) [14]. The neural network learning is performed online by means of an extended Kalman filter (EKF) [15] and the controllers are designed for each joint, using only local angular position and velocity measurements. Real-time results for the proposed scheme using a two DOF robot manipulator are presented.

2. Discrete-time decentralized systems

Let consider a class of discrete-time nonlinear perturbed and interconnected system which can be presented in the block strict feedback form (BSFF) [13] consisting of r blocks

\begin{matrix} χ_{i}^{1} (k + 1) & = & f_{i}^{1} (χ_{i}^{1}) + B_{i}^{1} (χ_{i}^{1}) χ_{i}^{2} + Γ_{i ℓ}^{1} \\ χ_{i}^{2} (k + 1) & = & f_{i}^{1} (χ_{i}^{1}, χ_{i}^{2}) + B_{i}^{2} (χ_{i}^{1}, χ_{i}^{2}) χ_{i}^{3} + Γ_{i ℓ}^{2} \\ ​ & ⋮ & ​ \\ χ_{i}^{r - 1} (k + 1) & = & f_{i}^{r - 1} (χ_{i}^{1}, χ_{i}^{2}, \dots, χ_{i}^{r - 1}) \\ ​ & ​ & + B_{i}^{r - 1} (χ_{i}^{1}, χ_{i}^{2}, \dots, χ_{i}^{r - 1}) χ_{i}^{r} + Γ_{i ℓ}^{r - 1} \\ χ_{i}^{r} (k + 1) & = & f_{i}^{r} (χ_{i}) + B_{i}^{r} (χ_{i}) u_{i} + Γ_{i ℓ}^{r} \end{matrix}

(1)

where $χ_{i} \in ℜ^{n_{i}}$ , $χ_{i} = {[\begin{matrix} χ_{i}^{1 T} & χ_{i}^{2 T} & \dots & χ_{i}^{r T} \end{matrix}]}^{T}$ and $χ_{i}^{j} \in ℜ^{n_{i j} \times 1}$ , $χ_{i}^{j} = {[\begin{matrix} χ_{i 1}^{j} & χ_{i 2}^{j} & \dots & χ_{i l}^{j} \end{matrix}]}^{T}$ , i = 1,…, N; j = 1,…, r; l = 1,…, n_ij; N is the number of subsystems, $u_{i} \in ℜ^{m_{i}}$ is the input vector, the rank of $B_{i}^{j} = n_{i j}$ , $\sum_{j = 1}^{r} n_{i j} = n_{i}, \forall χ_{i}^{j} \in D_{χ_{i}^{j}} \subset ℜ^{n_{i j}}$ . We assume that $f_{i}^{j}$ , $B_{i}^{j}$ y $Γ_{i}^{j}$ are smooth and bounded functions, $f_{i}^{j} (0) = 0$ and $B_{i}^{j} (0) = 0$ . The integers $n_{i 1} \leq n_{i 2} \leq \dots \leq n_{i j} \leq m_{i}$ define the different subsystem structures. The interconnection terms are given by

\begin{matrix} Γ_{i ℓ}^{1} & = & \sum_{ℓ = 1, ℓ \neq i}^{N} γ_{i ℓ}^{1} (χ_{ℓ}^{1}) \\ Γ_{i ℓ}^{2} & = & \sum_{ℓ = 1, ℓ \neq i}^{N} γ_{i ℓ}^{2} (χ_{ℓ}^{1}, χ_{ℓ}^{2}) \\ ​ & ⋮ & ​ \\ Γ_{i ℓ}^{r - 1} & = & \sum_{ℓ = 1, ℓ \neq i}^{N} γ_{i ℓ}^{r - 1} (χ_{ℓ}^{1}, χ_{ℓ, \dots,}^{2} χ_{ℓ}^{r - 1}) \\ Γ_{i ℓ}^{r} & = & \sum_{ℓ = 1, ℓ \neq i}^{N} γ_{i ℓ}^{r} (χ_{ℓ}) \end{matrix}

(2)

where $χ_{ℓ}$ represents the state vector of the ℓ-th subsystem with $1 \leq ℓ \leq N$ and $ℓ \neq i$ . Interconnection terms (2) reflect the interaction between the i-th subsystem and the other ones.

3. High-order neural networks (HONN)

This section described some preliminaries about discrete-time high order neural networks (HONN) and extended Kalman filter algorithm (EKF).

3.1 Discrete-time HONNs

Considering the HONN described by

\begin{array}{l} ϕ (w, z) = ω^{T} S (z) \\ S (z) = [\begin{matrix} s_{1}^{T} (z), & s_{2}^{T} (z), & \dots, & s_{m}^{T} (z) \end{matrix}] \\ s_{i} (z) = [\begin{matrix} \prod_{j \in I_{1}} {[s (z_{j})]}^{d_{j} (i_{1})} & \dots & \prod_{j \in I_{m}} {[s (z_{j})]}^{d_{j} (i_{m})} \end{matrix}] \\ i = 1, 2, \dots, L \end{array}

(3)

where $z = {[z_{1}, z_{2}, \dots, z_{p}]}^{T} \in Ω_{z} \subset ℜ^{p}$ , p is a positive integer which denotes the number of external inputs, L denotes the neural network (NN) node number, $φ \in ℜ^{m}, {I_{1}, I_{2}, \dots, I_{L}}$ is a collection of not ordered subsets of ${1, 2, \dots, p}, S (z) \in ℜ^{L \times m}$ , $d_{j} (i_{j})$ is a nonnegative integer, $w \in ℜ^{L}$ is an adjustable synaptic weight vector, and $s (z_{j})$ is chosen as the hyperbolic tangent function:

s (z_{j}) = \frac{e^{z_{j}} - e^{- z_{j}}}{e^{z_{j}} + e^{- z_{j}}}

(4)

For a desired function $w \in ℜ^{L}$ , assume that an ideal weight vector $w \in ℜ^{L}$ exists such that the smooth function vector $w \in ℜ^{L}$ can be approximated by an ideal NN on a compact subset $w \in ℜ^{L}$

u^{*} (z) = w^{* T} S (z) + ε_{z}

(5)

where $ε_{z} \subset ℜ^{m}$ is the bounded NN approximation error vector; note that $‖ ε_{z} ‖$ can be reduced by increasing the number of the adjustable weights. The ideal weight vector $w^{*}$ is an artificial quantity required only for analytical purposes [14, 16]. In general, it is assumed that there exists an unknown but constant weight vector $w^{*},$ whose estimate is $w \in ℜ^{L}$ . Hence, it is possible to define:

\tilde{w} (k) = w (k) - w^{*}

(6)

as the estimation error.

3.2 Extended Kalman filter algorithm

It is known that Kalman filtering (KF) estimates the state of a linear system with additive state and output white noises [18]. For KF-based NN training, the network weights become the states to be estimated. In this case, the error between the NN output and the measured plant output can be considered as additive white noise. Because NN mapping is nonlinear, an EKF-type is required. The training goal is to find the optimal weight values w_i^j(k) which minimize the prediction error. We use an EKF-based training algorithm described by:

\begin{aligned} K (k) = & P (k) H (k) [R (k)_{+} H^{T} (k) P (k) P (k) H (k)]^{- 1} \\ w (k + 1) = & w (k)_{+} η K (k) [x (k) - \hat{x} (k)] \\ p (k + 1) = & P (k) - K (k) H^{T} (k) P (k) + Q (k) \end{aligned}

(7)

where P∈ℜ^LxL is the prediction error covariance matrix, w∈ℜ^L is the weight (state) vector, η is a design parameter such that 0 ≤ η ≤ 1, L is the respective number of NN weights, x∈ℜ^m is the measured plant state, x̂∈ℜ^m is the NN output, K∈ℜ^Lxm is the Kalman gain matrix, Q∈ℜ^LxL is the state noise associated covariance matrix, R∈ℜ^mxm is the measurement noise associated covariance matrix and H∈ℜ^mxm is a matrix, for which each entry (H_ij) is the derivative of one of the NN output (x̂C_j), with respect to one NN weight (w_j), as follows

H_{i}^{j} (k) = {[\frac{\partial {\hat{x}}_{i}^{j} (k)}{\partial w_{i}^{j} (k)}]}_{w_{i}^{j} (k) = w_{i}^{j} (k + 1)}^{T}

(8)

where i=1,…,m and j=1,…, L. Usually P and Q are initialized as diagonal matrices, with entries P(0) and Q(0), respectively. It is important to remark that H(k), K(k) and P(k) for the EKF are bounded [18].

4. Controller design

The model of many practical nonlinear systems can be expressed in (or transformed into) a special state-space form named the block strict feedback form (BSFF) [13] as follows:

\begin{array}{l} x_{i}^{j} (k + 1) = f_{i}^{j} ({\bar{x}}_{i}^{j} (k)) + g_{i}^{j} ({\bar{x}}_{i}^{j} (k)) x_{i}^{j + 1} (k) + d_{i}^{j} (k) \\ x_{i}^{r} (k + 1) = f_{i}^{r} (x_{i} (k)) + g_{i}^{r} (x_{i} (k)) u_{i} (k) + d_{i}^{r} (k) \\ y (k) = x_{i}^{1} (k), j = 1, 2, &, r - 1 \end{array}

(9)

where $x_{i} (k) = {[x_{i}^{1 T} (k), \dots, x_{i}^{r T} (k)]}^{T}$ are the state variables, ${\bar{x}}_{i}^{j} (k) = {[x_{i}^{1 T} (k), x_{i}^{2 T} (k), ..., x_{i}^{j T} (k)]}^{T},$ $x_{i}^{j} (k) \in ℜ^{n}, i = 1, ..., N$ , $r \geq 2,$ r is the number of blocks, N is the number of subsystems, $u_{i} (k) \in ℜ^{m}$ are the system inputs, $y (k) \in ℜ^{m}$ is the system output, $d_{i}^{j} \in ℜ^{n_{i}}$ is the bounded unknown disturbance vector which includes all the effects of the other connected subsystems; then, a constant ${\bar{d}}_{i}^{j}$ exists such that $‖ d_{i}^{j} (k) ‖ \leq {\bar{d}}_{i}^{j}$ , for $0 < k < \infty$ . $f_{i}^{j} (\cdot)$ and $g_{i}^{j} (\cdot)$ are unknown smooth nonlinear functions. If we consider the original system (9) as a one-step ahead predictor, we can transform it into an equivalent maximum r-step ahead one, which can predict the future states $x_{i}^{1} (k + r), x_{i}^{2} (k + r - 1), ..., x_{i}^{r} (k + 1)$ ; then, the causality contradiction is avoided when the controller is constructed based on the maximum r-step ahead prediction by backstepping [14, 19]. Then, system (9) can be rewritten as

\begin{array}{l} x_{i}^{1} (k + r) = F_{i}^{1} ({\bar{x}}_{i}^{1} (k)) + G_{i}^{1} ({\bar{x}}_{i}^{1} (k)) x_{i}^{2} (k + r + 1) + d_{i}^{1} (k + r) \\ ⋮ \\ x_{i}^{r - 1} (k + 2) = F_{i}^{r - 1} ({\bar{x}}_{i}^{r - 1} (k)) + G_{i}^{r - 1} ({\bar{x}}_{i}^{r - 1} (k)) \\ x_{i}^{r - 1} (k + 1) + d_{i}^{r + 1} (k + 2) \\ x_{i}^{r} (k + 1) = F_{i}^{r} (x_{i} (k)) + G_{i}^{r} (x_{i} (k)) u_{i} (k) + d_{i}^{r} (k) \\ y (k) = x_{i}^{1} (k) \end{array}

(10)

where F_i^j(·) and G_i^j(·) are unknown smooth functions of f_i^j(x̄^j_i(k)) and g_i^j(g^j_i(x̄^j_i(k)) respectively. To simplify the analysis, let us define 1 ≤ j ≤ r−1.

The objective is to design a control u_i(k) to force the system output χ_i(k) to follow a desired trajectory x_id(k). Once (10) is defined, we apply the well-known backstepping technique [13]. For system (10), we can define the desired virtual controls (α^j*(k), j = 1,…,r−1) and the ideal practical control (u*(k)) as follows:

\begin{array}{l} α_{i}^{1 *} (k) ≅ x_{i}^{2} (k) = φ_{i}^{1} ({\bar{x}}_{i}^{1} (k), x_{i d} (k + r)) \\ α_{i}^{2 *} (k) ≜ x_{i}^{3} (k) = φ_{i}^{2} ({\bar{x}}_{i}^{2} (k), α_{i}^{1 *} (k)) \\ ⋮ \\ α_{i}^{r - 1 *} (k) ≜ x_{i}^{r} (k) = φ_{i}^{r - 1} ({\bar{x}}_{i}^{r - 1} (k), α_{i}^{r - 2 *} (k)) \\ u_{i}^{*} (k) = φ_{i}^{r} (x_{i} (k), α_{i}^{r - 1 *} (k)) \\ χ_{i} (k) = x_{i}^{1} (k) \end{array}

(11)

φ^j_i(γ) with 1 ≤ j ≤ r are nonlinear smooth functions. It is obvious that the desired virtual controls α*_i(k) and the ideal control u_i*(k) will drive the output χ*_i(k) to track the desired signal x*_i_d(k) only if the exact system model is known and there are no unknown disturbances. However, in practical applications, these two conditions cannot be satisfied. In the following, neural networks will be used to approximate the desired virtual controls, as well as the desired practical control, when the conditions established above are not satisfied. As in [14], we construct the virtual and practical controls via backstepping without the causality contradiction [17]. Let us approximate the virtual controls and practical control by using the following HONN:

\begin{array}{l} α_{i}^{j} (k) = w_{i}^{j} ​^{T} S^{j} (z_{i}^{j} (k)) \\ u_{i} (k) = w_{i}^{r} ​^{T} S^{r} (z_{i}^{r} (k)), j = 1, \dots, r - 1 \end{array}

(12)

with

\begin{array}{l} z_{i}^{j} (k) = {[\begin{matrix} x_{i}^{1} (k), & x_{i d}^{1} (k + r) \end{matrix}]}^{T} \\ z_{i}^{j} (k) = {[\begin{matrix} {\bar{x}}_{i}^{j} (k), & α_{i}^{j - 1} (k) \end{matrix}]}^{T}, j = 1, \dots, r - 1 \\ z_{i}^{r} (k) = {[\begin{matrix} x_{i} (k), & α_{i}^{r - 1} (k) \end{matrix}]}^{T} \end{array}

where w^j_i∈ℜ^Lj are the estimates of ideal constant weights w^j*_i and S^j∈ℜ^Ljxnj with j = 1,…, r. Define the weight estimation error as

{\tilde{w}}_{i}^{j} (k) = w_{i}^{j} (k) - w_{i}^{j *}

(13)

Using the ideal constant weights and according to [20], it follows that an HONN exists, which approximates the virtual controls and practical control with a minimum error, defined as

\begin{array}{l} α_{i}^{j} (k) = w_{i}^{j *^{T}} S_{i}^{j} (z_{i}^{j} (k)), i = 1, \dots, N \\ u_{i} (k) = w_{i}^{r *^{T}} S_{i}^{r} (z_{i}^{r} (k)) + ε_{z_{i}^{j}}, j = 1, \dots, r - 1 \end{array}

(14)

Then, the corresponding weights updating laws using an EKF are defined as

w_{i}^{j} (k + 1) = w_{i}^{j} (k) + η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k)

(15)

with

\begin{array}{l} K_{i}^{j} (k) = P_{i}^{j} (k) H_{i}^{j} (k) M_{i}^{j} (k) \\ M_{i}^{j} (k) = [R_{i}^{j} (k) + H_{i}^{j T} (k) P_{i}^{j} (k) H_{i}^{j} (k)] - 1 \\ P_{i}^{j} (k + 1) = P_{i}^{j} (k) - K_{i}^{j} (k) H_{i}^{j T} (k) P_{i}^{j} (k) + Q_{i}^{j} (k) \end{array}

(16)

H_{i}^{j} (k) = [\frac{\partial {\overset{⌢}{υ}}_{i}^{j} (k)}{\partial w_{i}^{j} (k)}]

(17)

and

e_{i}^{j} (k) = υ_{i}^{j} (k) - {\hat{υ}}_{i}^{j} (k)

(18)

where v^j_i(k)∈ℜ^nj is the desired signal and v^j_i(k)∈ℜ^nj is the HONN function approximation defined, respectively as follows:

\begin{matrix} υ_{i}^{1} (k) = x_{i d}^{1} (k) \\ υ_{i}^{2} (k) = x_{i}^{2} (k) \\ ⋮ \\ υ_{i}^{r} (k) = x_{i}^{r} (k) \end{matrix}

(19)

and

\begin{matrix} {\hat{υ}}_{i}^{1} (k) = χ_{i}^{1} (k) \\ {\hat{υ}}_{i}^{2} (k) = α_{i}^{1} (k) \\ ⋮ \\ {\hat{υ}}_{i}^{r} (k) = α_{i}^{r - 1} (k) \end{matrix}

(20)

e_i^j(k) denotes the error at each step as

\begin{array}{l} e_{i}^{1} (k) = x_{i d}^{1} (k) - χ_{i}^{1} (k) \\ e_{i}^{2} (k) = x_{i}^{2} (k) - α_{i}^{1} (k) \\ ⋮ \\ e_{i}^{r} (k) = x_{i}^{r} (k) - α_{i}^{r - 1} (k) \end{array}

(21)

The whole proposed decentralized neural backstepping control scheme is shown in Figure 1.

Figure 1.

Decentralized neural backstepping control scheme

5. Two DOF robot manipulator application

This section presents the real-time application for a two DOF robot manipulator.

5.1 Robot manipulator description

In order to evaluate, via real-time implementation, the performance of the proposed control algorithm, we use a two DOF robot manipulator moving in the vertical plane as presented in Fig. 2. High-torque brushless direct-drive servos are used to drive the joints without gear reduction. The advantages of these types of direct-drive actuators include freedom from backlash and significantly lower joint friction compared to actuators with gear drives. The motors used in the experimental arm are models DM1200-A and DM1015-B from the Parker Compumotor company, for the shoulder and elbow joints, respectively.

Figure 2.

Robot manipulator

For this application, the servos are operated in “torque mode”, so the motors act as torque sources and accept an analogue voltage as a reference of torque signal. In this configuration, the motor DM1200-A is capable of delivering a maximum torque of 150 Nm and the motor DM1015-B only 15 Nm. Angular position information is obtained from incremental encoders located on the motors, which have a resolution of 1,024,000 pulses/rev for the first motor and 655,300 pulses/rev for the second (accuracy 0.0069 for both motors), and angular velocity information is obtained by numerical differentiation of the position signal. A motion control board DS1103, based on the TMS320C31 32-bit floating-point microprocessor from Texas Instruments, was used to run the control algorithms. This board is mounted in an IBM-compatible 486 66-MHz host computer, which provides an environment for program generation, compilation, loading data for plotting purposes and downloading programs for real-time execution. The control program is written in C programming language for the TMS320 compiler and executed in the control board at a 2.5 ms sampling rate. Moreover, a control interface with Matlab/Simulink and dSPACE Control Desk 2.3 was used in order to display all required signals. A detailed tutorial about operating and programming with Matlab/Simulink and Control Desk can be found in [21].

5.2 Control objective

Define the following states:

\begin{array}{l} x^{1} (k) = [\begin{matrix} χ_{1}^{1} (k) \\ χ_{2}^{1} (k) \end{matrix}]; x^{2} (k) = [\begin{matrix} χ_{1}^{2} (k) \\ χ_{2}^{2} (k) \end{matrix}]; u (k) = [\begin{matrix} u_{1} (k) \\ u_{2} (k) \end{matrix}]; \\ x_{d} (k) = [\begin{matrix} x_{1 d}^{1} (k) \\ x_{2 d}^{1} (k) \end{matrix}]; χ (k) = x^{1} (k) \end{array}

(22)

where χ¹₁(k) and χ¹₂(k) are the angular positions, χ²₁(k) and χ²₂(k) are the angular velocities, x₁¹ _d(k) and x₂²_d(k) are the desired trajectory signals, u₁(k) and u₂(k) represent the applied torque to joints 1 and 2, respectively. The control objective is to drive the output χ(k) to track the reference x_d(k). Using (22), the discrete-time two DOF robot manipulator model can be represented in the BSFF, consisting of two blocks (r = 2) for each joint

\begin{array}{l} x_{i}^{1} (k + 1) = f_{i}^{1} (x_{i}^{1} (k)) + g_{i}^{1} (x_{i}^{1} (k)) x_{i}^{2} (k) \\ x_{i}^{2} (k + 1) = f_{i}^{2} ({\bar{x}}_{i}^{2} (k)) + g_{i}^{2} ({\bar{x}}_{i}^{2} (k)) u_{i} (k) \end{array}

(23)

where x̄²_i²(k) = [x¹_i(k) x_i²(k)]^T, i = 1, 2 subsystems, f_i¹(x¹ _i (k)), g¹_i(x¹ _i (k)), f_i²(x̄_i²(k)) and g²_i(x̄_i 2(k)) are assumed to be unknown. To this end, we use an HONN to approximate the desired virtual controls and the ideal practical control described as

\begin{array}{l} α_{i}^{1 *} (k) ≜ x_{i}^{2} (k) = φ_{i}^{1} (x_{i}^{1} (k), x_{i d}^{1} (k + 2)) \\ u_{i}^{*} (k) = φ_{i}^{1} (x_{i}^{1} (k), x_{i}^{2} (k), α_{i}^{1 *} (k)) \\ χ_{i} (k) = x_{i}^{1} (k) . \end{array}

(24)

The HONN proposed for this application is defined as follows:

\begin{array}{l} α_{i}^{1 *} (k) = w_{i}^{1 T} S^{1} (z_{i}^{1} (k)) \\ u_{i} (k) = w_{i}^{2 T} S^{2} (z_{i}^{2} (k)) \end{array}

(25)

with

\begin{array}{l} z_{i}^{1} (k) = [x_{i}^{1} (k), x_{i d}^{1} (k + 2)] \\ z_{i}^{2} (k) = [x_{i}^{1} (k), x_{i}^{2} (k), α_{i}^{1} (k)] . \end{array}

(26)

The weights are updated using the EKF (15) - (21) with i = 1, 2 and

\begin{array}{l} e_{i}^{1} (k) = x_{i d}^{1} (k) - x_{i}^{1} (k) \\ e_{i}^{2} (k) = x_{i}^{2} (k) - α_{i}^{1} (k) . \end{array}

(27)

The training is performed online using a series-parallel configuration. All the NN states are initialized in a random way. The covariance matrices are initialized as diagonal and the nonzero elements are: P_i¹ = P_i² = 100, Q_i¹ = Q_i² = 1000 and R_i¹ = R²_i = 1 × 10¹², respectively. These values are heuristically selected.

In fact, the system model must be expressed in the block strict feedback form (BSFF) [13] before starting the designing. The electromechanical system considered in this paper is already in this form.

According to the actuator's manufacturer, the direct-drive motors are able to supply torques within the following bounds

\begin{array}{l} | u_{1} | \leq τ_{1}^{\max} = 150 [Nm] \\ | u_{2} | \leq τ_{2}^{\max} = 15 [Nm] . \end{array}

5.3 Real-time results

For the experiments we select the following discrete-time trajectories

\begin{array}{l} x_{1 d}^{1} (k) = b_{1} (1 - e^{d_{1} k T^{3}}) + c_{1} (1 - e^{d_{1} k T^{3}}) \sin (ω_{1} k T) [rad] \\ x_{2 d}^{1} (k) = b_{2} (1 - e^{d_{2} k T^{3}}) + c_{2} (1 - e^{d_{2} k T^{3}}) \sin (ω_{2} k T) [rad] \end{array}

where b₁=π/4, c₁=Π/18, d₁=−2.0 and ω₁=5 [rad/s] are parameters of the desired position trajectory for the first joint, whereas b₂=π/3, c₂=25π/36, d₂=−1.8 and ω₂=1.0 [rad/s] are parameters of the desired position trajectory for the second joint. The sampling time is selected as T = 2.5 milliseconds.

These trajectories incorporate a sinusoidal term to evaluate the performance for relatively fast periodic signals, where the nonlinearities of the robot dynamics are really important; additionally, they include a term which smoothly grows for maintaining the robot in an operation state without saturating actuators.

The trajectory tracking results for decentralized neural backstepping (DNBS) control scheme are presented in Figs. 3 and 4. For the real system, the initial conditions are the same as those of the reference system, both restricted to be equal to zero according to the experimental prototype architecture, therefore, transient errors do not appear. The tracking error performance can be verified for joints 1 and 2 in Fig. 5. The applied torques to each joint are shown in Fig. 6.

Figure 3.

Trajectory tracking for joint 1 x₁¹_d (k) (solid line) and χ¹₁(k) (dashed line)

Figure 4.

Trajectory tracking for joint 2 x¹_2d (k) (solid line) and 2 x¹_2d(k) (dashed line)

Figure 5.

Tracking errors for joints 1 and 2

Figure 6.

Applied torques to joints 1 and 2

It is easy to see that both control signals are always inside the prescribed limits given by the actuator's manufacturer; that is, their absolute values are smaller than the bounds τ₁^max and τ₂^max, respectively. Time evolution of the position error e¹_i reflects that the control system performance is very good. The performance criterion considered in this paper is the mean square error (MSE) value of the position error calculated as

MSE [e_{i}^{1}] = \sqrt{\frac{1}{t} \sum_{k = 0}^{n} {‖ e_{i}^{1} ‖}^{2} T}

(28)

where T is the sampling time and t = 20 seconds.

The respective mean square errors for the proposed scheme are included in Table 1. According to the mean square errors presented above, the proposed scheme based on the backstepping technique presents a good performance for trajectory tracking and reduced computational complexity.

Table 1.

Mean square error for the real joint positions

Control algorithm	MSE[e ₁¹ (k)]	MSE[e ₂¹(k)]
DNBS	4.1611e-5	0.0013

6. Stability analysis

Before proceeding to demonstrate the stability analysis to prove that the tracking error (21) is SGUUB, we need to establish the following lemmas.

Lemma 1 [20] The dynamics of the tracking errors (18) can be formulated as

e_{i}^{j} (k + 1) = e_{i}^{j} (k) + Δ e_{i}^{j} (k), (1 \leq j \leq r)

(29)

with $Δ e_{i}^{j} (k) \leq - γ_{i}^{j} e_{i}^{j} (k)$ and $γ_{i}^{j} = \max ‖ H_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) ‖$ .

Proof Using (18) and considering that v(k) do not depend on the HONN parameters, we obtain

\frac{\partial e_{i}^{j} (k)}{\partial ω_{i}^{j} (k)} = \frac{\partial {\hat{υ}}_{i}^{j} (k)}{\partial ω_{i}^{j} (k)}

(30)

Let us approximate (30) by

Δ e_{i}^{j} (k) = {[\frac{\partial e_{i}^{j} (k)}{\partial ω_{i}^{j} (k)}]}^{T} Δ ω_{i}^{j} (k)

(31)

Substituting (17) and (30) in (31)

Δ e_{i}^{j} (k) = - H_{i}^{T} (k) η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k)

(32)

Let us define

γ_{i}^{j} = \max ‖ H_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) ‖

(33)

then we have

Δ e_{i}^{j} (k) \leq - γ_{i}^{j} e_{i}^{j} (k)

(34)

Lemma 2 [20] The HONN weights updated with (15), based on the EKF algorithm (16), are bounded.

Proof From (13) and (15) it is possible to write the dynamics of the weight estimation error as

{\tilde{w}}_{i}^{j} (k + 1) = {\tilde{w}}_{i}^{j} (k) + η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k)

(35)

Using (12), (14) and (17) system (35) can be written as

\begin{array}{l} {\tilde{w}}_{i}^{j} (k + 1) = {\tilde{w}}_{i}^{j} (k) - η_{i}^{j} K_{i}^{j} (k) S^{j T} (k) (z_{i}^{j} (k)) {\tilde{w}}_{i}^{j} (k) + η_{i}^{j} K_{i}^{j} (k) ε_{z_{i}^{j}} \\ = A_{i}^{j} (k) {\tilde{w}}_{i}^{j} (k) + B_{i}^{j} (k) υ_{z_{i}^{j}} (k) \end{array}

with j = 1,…,r and

\begin{array}{l} A_{i}^{j} (k) = & [I - η_{i}^{j} K_{i}^{j} (k) S^{j T} (k) (z_{i}^{j} (k))] \\ B_{i}^{j} (k) = & η_{i}^{j} \\ υ_{z_{i}^{j}} (k) = & K_{i}^{j} (k) ε_{z_{i}^{j}} \end{array}

(36)

It remains that the EKF algorithm is used only to train the neural network weights which become the states to be estimated by the EKF and the neural network approximation error vector ε_zij is bounded (this is a well-known NN property [22]). Moreover, consider the boundedness of ε_jz and S(z_i^j(k)), then, by selecting η^j_i appropriately, A^j_i(k) satisfies |φ(k(1),k(0))|(k). By applying Lemma 1, w˜^j_i(k) is bounded.

Theorem 1 For the i-th subsystem of (1) in the absence of interconnections, the i-th subsystem of HONN (12) trained with the EKF-based algorithm (16) to approximate i-th control law (11), ensures that the tracking error (21) is semiglobally uniformly ultimately bounded (SGUUB); moreover, the HONN weights remain bounded.

Proof For the following j-th (j = 1,…,r–1) equations of i-th subsystem in (1), with the virtual control α^j*_i(k) approximated by the i-th subsystem of HONN α ^j _i (k) = w^jT_i S^jT(k)(z_i^j(k)) and e¹_i(k) defined as in (19), consider the Lyapunov function candidate

V_{i}^{j} (k) = e_{i}^{j T} (k) e_{i}^{j} (k) + {\tilde{w}}_{i}^{j T} (k) {\tilde{w}}_{i}^{j} (k)

(37)

whose first difference is

\begin{array}{l} Δ V_{i}^{j} (k) = V_{i}^{j} (k + 1) - V_{i}^{j} (k) \\ = e_{i}^{j T} (k + 1) e_{i}^{j} (k + 1) + {\tilde{w}}_{i}^{j T} (k + 1) {\tilde{w}}_{i}^{j} (k + 1) \\ - e_{i}^{j T} (k) e_{i}^{j} (k) + {\tilde{w}}_{i}^{j T} (k) {\tilde{w}}_{i}^{j} (k) \end{array}

(38)

From (13) and (15)

{\tilde{w}}_{i}^{j} (k + 1) = {\tilde{w}}_{i}^{j} (k) + η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k)

(39)

Let us define

\begin{array}{l} {[{\tilde{w}}_{i}^{j} (k) + η_{i}^{j} K_{i}^{j} (k) e_{i}^{1} (k)]}^{T} [{\tilde{w}}_{i}^{j} (k) + η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k)] = \\ {\tilde{w}}_{i}^{j T} (k) {\tilde{w}}_{i}^{j} (k) + 2 {\tilde{w}}_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k) \\ + {(η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k))}^{T} (η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k)) \end{array}

(40)

From (21), then

\begin{array}{l} e_{i}^{j} (k + 1) = e_{i}^{j} (k + 1) + Δ e_{i}^{j} (k) \\ e_{i}^{j T} (k + 1) e_{i}^{j} (k + 1) = e_{i}^{j T} (k) e_{i}^{j} (k) \\ + e_{i}^{j T} (k) Δ e_{i}^{j} (k) \\ + Δ e_{i}^{j T} (k) e_{i}^{j} (k) \\ + Δ e_{i}^{j T} (k) Δ e_{i}^{j} (k) \\ e_{i}^{j T} (k + 1) e_{i}^{j} (k + 1) - e_{i}^{j T} (k) e_{i}^{j} (k) = e_{i}^{j T} (k) Δ e_{i}^{j} (k) \\ + Δ e_{i}^{j T} (k) e_{i}^{j} (k) \\ + Δ e_{i}^{j T} (k) Δ e_{i}^{j} (k) \end{array}

where Δe^j_i(k) is the error difference. Substituting (39) and (40) in (38) results in

\begin{array}{l} Δ V_{i}^{j} (k) = e_{i}^{j T} (k) Δ e_{i}^{j} (k) + Δ e_{i}^{j T} (k) e_{i}^{j} (k) \\ + Δ e_{i}^{j T} (k) Δ e_{i}^{j} (k) \\ + Δ e_{i}^{j T} (k) Δ e_{i}^{j} (k) \\ + 2 {\tilde{w}}_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k) \\ + {(η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k))}^{T} (η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k)) \end{array}

(41)

From Lemma 1, substituting (34), we obtain

\begin{array}{l} Δ V_{i}^{j} (k) \leq - 2 γ_{i}^{j} e_{i}^{j T} (k) e_{i}^{j} (k) + γ_{i}^{j^{2}} e_{i}^{j T} (k) e_{i}^{j} (k) \\ + 2 {\tilde{w}}_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k) \\ + {(η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k))}^{T} (η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k)) \\ \leq - 2 γ_{i}^{j} {‖ e_{i}^{j} (k) ‖}^{2} + γ_{i}^{j^{2}} {‖ e_{i}^{j} (k) ‖}^{2} \\ + 2 ‖ {\tilde{w}}_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) ‖ ‖ e_{i}^{j} (k) ‖ \\ + {‖ η_{i}^{j} K_{i}^{j} (k) ‖}^{2} {‖ e_{i}^{j} (k) ‖}^{2} \end{array}

(42)

where γ^j_i = max‖H_i^jT (k)η_i^jK_i^j (k)‖. From Lemma 2, it follows that w̄^j_i(k) is bounded; then, there is η^j_i>0 such that

Δ V_{i}^{j T} (k) \leq 0, once ‖ e_{i}^{j} (k) ‖ > κ_{i}^{j}

(43)

with κ^j_i defined as

κ_{i}^{j} = \frac{2 η_{i}^{j} {\bar{w}}_{i}^{j} {\bar{K}}_{i}^{j}}{2 γ_{i}^{j} - γ_{i}^{j^{2}} - η_{i}^{j^{2}} {\bar{K}}_{i}^{j^{2}}}

where w̄^j_i and K̄^j_i are the upper bound of w̄^j_i (k) and K_i^j(k), respectively [19]. From (43), it follows the boundedness of ΔV^j_i for k≥k_T, that leads to the SGUUB of e_i^j(k).

Theorem 2 For the i-th subsystem of (1) in the presence of interconnections, the i-th subsystem of HONN (12) with i=1,…,N;j = 1,…,r_i trained with the EKF-based algorithm (16) to approximate the i-th control law (11), ensures that the tracking error (21) is semiglobally uniformly ultimately bounded (SGUUB); moreover, the HONN weights remain bounded.

Proof Let $V (k) = \sum_{i = 1}^{N} \sum_{j = 1}^{r_{i}} V_{i}^{j} (k)$ , then

\begin{array}{l} Δ V (k) = \sum_{i = 1}^{N} \sum_{j = 1}^{r_{i}} [e_{i}^{j T} (k) Δ e_{i}^{j} (k) + Δ e_{i}^{j T} (k) e_{i}^{j} (k) \\ + Δ e_{i}^{j T} (k) Δ e_{i}^{j} (k) \\ + 2 {\tilde{w}}_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k) \\ {(η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k))}^{T} (η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k))] \end{array}

substituting Δe_i^j(k) ≤ -γ^j_ie_i^j (k), we obtain

\begin{array}{l} Δ V (k) \leq \sum_{i = 1}^{N} \sum_{j = 1}^{r_{i}} [- 2 γ_{i}^{j} e_{i}^{j T} (k) e_{i}^{j} (k) + γ_{i}^{j^{2}} e_{i}^{j T} (k) e_{i}^{j} (k) \\ + 2 {\tilde{w}}_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k) \\ + {(η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k))}^{T} (η_{i}^{j} K_{i}^{j} (k) e_{i}^{j} (k))] \\ \leq \sum_{i = 1}^{N} \sum_{j = 1}^{r_{i}} [- 2 γ_{i}^{j} {‖ e_{i}^{j} (k) ‖}^{2} + γ_{i}^{j^{2}} {‖ e_{i}^{j} (k) ‖}^{2} \\ + 2 ‖ {\tilde{w}}_{i}^{j T} (k) η_{i}^{j} K_{i}^{j} (k) ‖ ‖ e_{i}^{j} (k) ‖ \\ + {‖ η_{i}^{j} K_{i}^{j} (k) ‖}^{2} {‖ e_{i}^{j} (k) ‖}^{2}] \end{array}

where γ^j_i = max‖H_i^jT (k)η_i^jK_i^j (k)‖. From Lemma 2, it follows that w̄^j_i is bounded; then, there is η^j_i such that

Δ V_{i}^{j T} (k) \leq 0, once ‖ e_{i}^{j} (k) ‖ > κ_{i}^{j}

(44)

with κ^j_i defined as

κ_{i}^{j} = \frac{2 η_{i}^{j} {\bar{w}}_{i}^{j} {\bar{K}}_{i}^{j}}{2 γ_{i}^{j} - γ_{i}^{j^{2}} - η_{i}^{j^{2}} {\bar{K}}_{i}^{j^{2}}}

where w̄^j_i and K̄^j_i are the upper bound of w̄^j_i(k) and K_i^j(k), respe ctiv ely [1 9]. From (44), i t follo ws the bounde dness of V(k) fo r k≥k_T, that leads to the SGUUB of e_i^j(k) ∀i = 1,…,N;j = 1,…,r_i.

7. Conclusions

This paper presents a decentralized neural control scheme based on the backstepping technique. The control law for each joint is approximated by a high order neural network. The training of each neural network is performed online using an extended Kalman filter. The stability analysis proves that the tracking error is semiglobally uniformly ultimately bounded (SGUUB). Real-time results confirm the effectiveness of the proposed scheme for trajectory tracking when applied to a two DOF robot manipulator.

Footnotes

8. Acknowledgments

The authors thank to Universidad Autonoma del Carmen (UNACAR), Mexico. The first author thanks to Programa de Mejoramiento del Profesorado (PROMEP), Mexico, for supporting this research and Instituto Tecnológico de la Laguna (ITL), Mexico, for allowing us to use the two DOF robot manipulator and carry out the corresponding realtime application.

References

Sanchez

E. N.

and Ricalde

L. J.

(2003), Trajectory tracking via adaptive recurrent neural control with input saturation, Proc. of International Joint Conference on Neural Networks, pp. 359–364, Portland, Oregon, USA.

Santibañez

Kelly

and Llama

M. A.

(2005), A novel global asymptotic stable set-point fuzzy controller with bounded torques for robot manipulators, IEEE Transactions on Fuzzy Systems, vol. 13, no. 3, pp. 362–372.

Gourdeau

(1997), Object-oriented programming for robotic manipulator simulation, IEEE Robotics and Automation, vol. 4, no. 3, pp. 21–29.

Najim

Ikonen

and Gomez-Ramirez

, (2008), Trajectory Tracking Control Based on a Genealogical Decision Tree Controller for Robot Manipulators, International Journal of Innovative Computing, Information and Control, vol. 4, no. 1, pp. 53–62.

Fateh

M. M.

and Soltanpour

M. R.

(2009), Robust Task-space Control of Robot Manipulators under Imperfect Transformation of Control Space, International Journal of Innovative Computing, Information and Control, vol. 5, no. 11(A), pp. 3949–3960.

Jiang

Z. P.

(1999), New results in decentralized adaptive nonlinear control with output feedback, Proc. of the 38th IEEE Conference on Decision and Control, pp. 4772–4777, Phoenix, Arizona, USA.

Huang

Tan

K. K.

and Lee

T. H.

(2003), Decentralized control design for large-scale systems with strong interconnections using neural networks, IEEE Transactions on Automatic Control, vol. 48, no. 5, pp. 805–810.

Liu

(1999), Decentralized control of robot manipulators: Nonlinear and adaptive approaches, IEEE Transactions on Automatic Control, vol. 44, no. 2, pp. 357–363.

M. L.

and Er

M. J.

(2000), Decentralized control of robot manipulators with coupling and uncertainties, Proc. of the American Control Conference, pp. 3326–3330, Chicago, Illinois, USA.

10.

Safaric

and Rodic

(2000), Decentralized neural-network sliding-mode robot controller, Proc. of 26th Annual Conference on the IEEE Industrial Electronics Society, pp.906–911, Nagoya, Aichi, Japan.

11.

Sanchez

E. N.

Gaytan

and Saad

(2006), Decentralized neural identification and control for robotics manipulators, Proc. of the IEEE International Symposium on Intelligent Control, pp. 1614–1619, Munich, Germany.

12.

L. C.

(1992), Robust adaptive decentralized control robot manipulators, IEEE Transactions on Automatic Control, vol. 37, no. 1, pp. 106–110.

13.

Kristic

Kanellakopoulos

and Kokotovic

(1995), Nonlinear and Adaptive Control Design, John Wiley & Sons Inc, New York, USA.

14.

S. S.

Zhang

and Lee

T. H.

(2004), Adaptive neural network control for a class of MIMO nonlinear systems with disturbances in discrete-time, IEEE Transactions on Systems, Man, and Cybernetics Part B, vol. 34, no. 4, pp. 1630–1645.

15.

Alanis

A. Y.

Sanchez

E. N.

and Loukianov

A. G.

, Discrete-time adaptive backstepping nonlinear control via high-order neural networks, IEEE Transactions on Neural Networks, vol. 18, no. 4, pp. 1185–1195, 2007.

16.

Rovithakis

G. A.

and Christodoulou

M. A.

(2000), Adaptive Control with Recurrent High-Order Neural Networks, Springer, London, U.K.

17.

Haykin

(2001), Kalman Filtering and Neural Networks, John Wiley & Sons Inc, New York, USA.

18.

Song

and Grizzle

J. W.

(1995), The extended Kalman filter as local asymptotic observer for discrete-time nonlinear systems, Journal of Mathematical Systems, Estimation and Control, vol. 5, no. 1, pp. 59–78.

19.

Chen

and Khalil

(1995), Adaptive control of a class of nonlinear discrete-time systems using neural networks, IEEE Transactions on Automatic Control, vol. 40, no. 5, pp. 791–801.

20.

Sanchez

E. N.

Alanis

A. Y.

and Loukianov

A. G.

(2008), Discrete-Time High Order Neural Control Trained with Kalman Filtering, Springer-Verlag, Berlin, Germany.

21.

Cruz

J. R.

(2003), Development of a control interface for a manipulator with two degrees of freedom, Master of Sciences Thesis (in Spanish), Instituto Tecnologico de la Laguna, Mexico.

22.

Cybenko

(1989), Approximation by superpositions of a sigmoidal function, Mathematics of Control, Signals, and Systems (MCSS), vol. 2, no. 4, pp. 304–314.