An observer-based adaptive iterative learning control scheme using a filtered fuzzy neural network is proposed for repetitive tracking control of robotic systems. A state tracking error observer is introduced so that the iterative learning controller can be designed using only the measurement of joint position. We first derive an observation error model based on the state tracking error observer. Then, by introducing some auxiliary signals, the iterative learning controller is constructed based on the use of an averaging filter. The main control force consists of a filtered fuzzy neural network used to approximate the unknown system nonlinearity, a robust learning term used to compensate for uncertainty, and a stabilization term used to guarantee the boundedness of internal signals. Adaptive laws combining time-domain and iteration-domain adaptation are presented to ensure the convergence of the learning error. We show that all the adjustable parameters as well as internal signals remain bounded for all iterations, and that the norm of the output tracking error asymptotically converges to a tunable residual set as the iteration number goes to infinity.
1. Introduction
Due to the repeatability of operation of robotic systems, iterative learning control (ILC) is well suited to repetitive tracking control tasks. ILC is basically a non-model-based learning approach which is very effective in dealing with repetitive control tasks [1–5]. Initially, PID-type ILC algorithms, which required certain a priori knowledge of the robot dynamics, were developed for robot manipulators based on contraction mapping theory [6–9]. In D-type, PD-type, or PID-type ILC, the acceleration errors of the joint variables are required to construct the learning controller. However, acceleration measurement is unfortunately not realizable in practice, which is the major disadvantage of D-type ILC. On the other hand, P-type ILC uses only the velocity errors of the joint variables for the design of the updated learning function. Although the requirement of acceleration measurement is removed, stricter conditions on the robot manipulators are needed for the technical analysis. In general, it is hard to apply traditional PID-type ILC to repetitive tracking control of robot manipulators using only joint position measurements.
Recently, adaptive iterative learning control (AILC) has been widely studied in the ILC research field. One of the most attractive advantages of the AILC scheme is its capability to deal with large initial resetting errors, large input disturbances, and iteration-varying desired trajectories. In the past decade, AILC schemes have been utilized for repeated tracking control of robotic systems [10–14] or of classes of nonlinear systems [15, 16]. In order to relax the restrictive Lipschitz condition on the plant's nonlinearity, a Lyapunov-like approach instead of contraction mapping theory is applied in AILC to analyze stability and convergence. If the system nonlinearities are unknown, fuzzy systems or neural networks are often introduced to approximate the nonlinearities and provide the basis functions for the design of the AILC [17–19]. However, both PID-type ILCs and AILCs require at least the joint velocities to develop the learning algorithms. If only the measurement of joint positions is available, an observer is one of the possible choices for the design of the AILC. In [20], an observer-based ILC for a time-varying nonlinear system was proposed to overcome the unmeasurable states. Although the observer-based ILC system can guarantee that the tracking error converges to zero, the initial resetting errors at each iteration were not considered. In [21], an observer-based ILC scheme was proposed for a class of nonlinear systems with unknown parametric uncertainties. A Lyapunov-Krasovskii-like composite energy function was applied to analyze the closed-loop stability and learning performance. Nevertheless, the plant nonlinearities must be linearly parameterizable. In [22], a framework for ILC using an observer to estimate the controlled variable was presented. However, it was necessary to assume that the ILC input converges to a bounded signal.
In [23], an observer-based ILC with an evolutionary programming algorithm was proposed for MIMO nonlinear systems. Evolutionary programming was applied to search for the optimal and feasible learning gain to speed up the convergence of the ILC, and the tracking error converges to zero via successive learning. However, that scheme was developed for MIMO nonlinear plants whose nonlinearities satisfy a Lipschitz continuity condition. In [24], an observer-based AILC was developed for a class of nonlinear systems with unknown time-varying parameters and unknown time-varying delays. The linear matrix inequality (LMI) approach was used to design the nonlinear state observer. By constructing a Lyapunov-Krasovskii-like composite energy function, the boundedness of the internal signals and the convergence of the tracking error can be proved. However, it was assumed that the plant nonlinearities satisfy a Lipschitz continuity condition and that the unknown system parameters are linear with respect to some known nonlinear vector-valued functions. In [25], a velocity-observer-based ILC for trajectory tracking of rigid robot manipulators with external disturbances, without using velocity measurements, was proposed. However, the inertia matrix of the robotic system needed to satisfy a Lipschitz continuity condition and the initial resetting errors at each iteration were assumed to be zero. Besides, the disturbances were assumed to be repetitive and the velocities were assumed to be bounded.
In order to remove the requirement of joint velocity measurement, the Lipschitz condition on the unknown robot nonlinearity, and the zero initial resetting errors at each iteration, a new observer-based adaptive iterative learning controller using a new filtered fuzzy neural network (filtered-FNN) is proposed for repetitive tracking control of robotic systems in this paper. A state tracking error observer is first presented to deal with the problem that only joint positions are available. Based on this observer, a state tracking error model including a filtered-FNN approximation term can be derived by using an s-domain transfer function technique which is usually utilized in traditional model reference adaptive control [26]. An averaging filter is then proposed to solve the implementation problem of the controller. Under the derived error model, a fuzzy neural learning component is designed to approximate the unknown nonlinearities by a filtered-FNN using the state estimation vector as the network input, a robust learning component is constructed to compensate for the uncertainties from the approximation error and the state estimation error, and a stabilization component is used to guarantee the boundedness of internal signals. The main features of this iterative learning controller and its contributions relative to the related works are summarized as follows.
Compared with most existing adaptive iterative learning control schemes for robotic or nonlinear systems, this paper designs a realizable adaptive iterative learning controller using only joint position measurements. In other words, it is not necessary to measure the full state for the controller design.
A new design approach is introduced to derive the error model so that a filtered-FNN can be applied for compensation of the unknown system nonlinearities. The filtered-FNN can be treated as a dynamic version of traditional FNN which plays an important role in this proposed adaptive iterative learning controller.
Compared with fuzzy-system- or neural-network-based adaptive iterative learning controllers using state measurement [17–19], the stability analysis becomes more difficult since the boundedness of the output tracking error cannot guarantee the boundedness of the input signal. A new analysis to prove the regularity of internal signals is derived in this paper so that we can ensure the boundedness of all the adjustable parameters and internal signals during the learning process. Furthermore, we guarantee that the norm of the output tracking error asymptotically converges to a tunable residual set, which depends on the design parameters, as the iteration number becomes large enough.
This paper is organized as follows. In Section 2, the problem formulation is given. The error model between the system output and the desired output is derived and the filtered-FNN-based AILC using the observer design is constructed in Section 3. The analysis of closed-loop stability and learning convergence is given in Section 4. A simulation example is provided in Section 5, and conclusions are drawn in Section 6. A detailed description of the proposed filtered-FNN is given in the Appendix.
In the subsequent discussions, the following notations will be used.
|·| denotes the absolute value of a scalar or the Euclidean or any other consistent norm of a vector or matrix.
Lpe[0,T] denotes the set of Lebesgue measurable (or piecewise continuous) real-valued (vector) functions with finite truncated Lp norm on [0,T].
‖·‖∞ denotes the truncated L∞ norm of the argument function or vector [26].
‖G(s)‖∞ denotes the H∞ norm of the transfer function G(s).
G(s)[u(t)] denotes the filtered version of u(t) with any proper or strictly proper transfer function G(s).
2. Problem Formulation
In this paper, we consider an uncertain robotic system with n rigid bodies which can perform a given task repeatedly over a finite time interval [0,T] as follows:

D(qj(t))q̈j(t) + C(qj(t), q̇j(t))q̇j(t) + g(qj(t), q̇j(t)) = uj(t),

where j∈Z+ denotes the index of the iteration number and t∈[0,T] denotes the time index. The signals qj(t), q̇j(t), q̈j(t) are, respectively, the generalized joint position, joint velocity, and joint acceleration vectors. D(qj(t))∈ℛn × n is the inertia matrix, C(qj(t), q̇j(t)) is the centripetal plus Coriolis force matrix, g(qj(t), q̇j(t)) collects the gravitational plus frictional forces, and uj ∈ ℛn × 1 is the joint torque vector. It is noted that the inertia matrix D(qj(t)) is assumed to be positive definite and bounded for all t∈[0,T] and iterations j≥1 as

m1In × n ≤ D(qj(t)) ≤ m2In × n,
where m1,m2>0 and In × n is an n × n identity matrix. Since the inverse of the inertia matrix exists for all joint variables, the dynamic formulation of the robotic system can be written as follows:

q̈j(t) = D−1(qj(t))[uj(t) − C(qj(t), q̇j(t))q̇j(t) − g(qj(t), q̇j(t))].
Let h(Xj(t)) = −D−1(qj(t))[C(qj(t), q̇j(t))q̇j(t) + g(qj(t), q̇j(t))] and b(qj(t)) = D−1(qj(t))∈ℛn × n, and choose the output variable as yj(t) = qj(t)∈ℛn × 1 and the state variable as Xj(t) = [qj(t)⊤, q̇j(t)⊤]⊤; then we have

q̈j(t) = h(Xj(t)) + b(qj(t))uj(t),
or equivalently in the following state-space form:

Ẋj(t) = AXj(t) + B[h(Xj(t)) + b(qj(t))uj(t)], yj(t) = C⊤Xj(t),

where (with In × n the n × n identity matrix and 0 the n × n zero matrix)

A = [0 In × n; 0 0] ∈ ℛ2n × 2n, B = [0; In × n] ∈ ℛ2n × n, C = [In × n; 0] ∈ ℛ2n × n.
Here, 0<b(Xj(t)) ≤ m1−1In × n. In this paper, we assume that only the joint position yj(t) = qj(t) is measurable for controller design. Let the signals qd(t), q̇d(t), q̈d(t) be, respectively, the desired generalized joint position, joint velocity, and joint acceleration vectors; the desired state trajectory can then be defined as Xd(t) = [qd(t)⊤, q̇d(t)⊤]⊤. Now, given a specified desired output trajectory yd(t), t∈[0,T], with initial desired output yd(0)≠yj(0) for all j≥1, the control objective for the robotic system executing a repeatable task is to force the output yj(t) to follow yd(t) as closely as possible.
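Since the explicit state-space blocks are elided above, the following minimal sketch builds A, B, and C for an n-joint arm under the standard companion-form assumption X = [q; q̇]; the layout is an assumption consistent with Kc ∈ ℛ2n × n and Ac = A − BKc⊤ being Hurwitz, not the paper's verbatim equation.

```python
# Sketch of the state-space matrices for an n-joint arm, assuming the standard
# companion form X = [q; qdot], Xdot = A X + B [h(X) + b(q) u], y = C^T X.
# The explicit blocks are elided in the paper, so this layout is an assumption.

def zeros(rows, cols):
    return [[0.0] * cols for _ in range(rows)]

def build_state_space(n):
    """Return A (2n x 2n), B (2n x n), C (2n x n) block matrices."""
    A = zeros(2 * n, 2 * n)
    B = zeros(2 * n, n)
    C = zeros(2 * n, n)
    for i in range(n):
        A[i][n + i] = 1.0   # upper-right identity: position integrates velocity
        B[n + i][i] = 1.0   # input enters the acceleration (lower) channel
        C[i][i] = 1.0       # output y = q, the upper half of the state
    return A, B, C

A, B, C = build_state_space(2)   # two-link case used in the simulation section
```

With this layout, Ac = A − BKc⊤ has the per-joint characteristic polynomial s² + k1c s + k2c, matching the Hurwitz requirement stated below.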
3. Derivations of Error Model and Controller
3.1. Derive the Error Model
Define the output tracking error as ej(t) = yd(t)-yj(t) and the state tracking errors as e1j(t) = yd(t)-yj(t), e2j(t) = ẏd(t)-ẏj(t). The initial output tracking error vector at each iteration is not necessarily zero, small, or fixed but, since the joint position vector yj(t) is measurable, it is assumed to satisfy |ej(0)| ≤ ɛj for a known positive constant ɛj. Let the state tracking error vector be defined as
Then we can derive the state tracking error dynamics as follows:
where Kc = [k2cI,k1cI]⊤ ∈ ℛ2n × n is a feedback gain matrix such that the characteristic polynomial of Ac = A-BKc⊤ is Hurwitz. Therefore, the tracking error dynamics will satisfy
where . Note that the state tracking error vector Ej(t) in the tracking error dynamics (10) is not assumed to be available for measurement. Hence, it is necessary to construct a state estimation vector for estimation of the state vector. In order to construct a state estimation vector, we first define an output tracking error estimation vector as . Then the state tracking error estimation vector can be defined as
The state tracking error observer is designed as
where Ko = [k1oI,k2oI]⊤ ∈ ℛ2n × n is the observer gain matrix such that the characteristic polynomial of Ao = Ac-KoC⊤ is Hurwitz. Define an output observation error vector as . Then the state observation error vector can be defined as
By using (10) and (12), we have the following observation error dynamics:
Note that . Based on the universal approximation theorem, the nonlinear function h(Xj(t)) can be approximated by a traditional FNN [19] as Wj(t)⊤O(3)(Xj(t)). Here O(3)(Xj(t))∈ℛM × 1 is the basis function vector with M being the number of rule nodes, and Wj(t)∈ℛM × n is the weight matrix of the output layer. Accordingly, there exists an optimal weight matrix W* such that h(Xj(t)) = W*⊤O(3)(Xj(t)) + ϵ(Xj(t)), where ϵ(Xj(t)) is the approximation error satisfying |ϵ(Xj(t))| ≤ ϵ* on a certain compact set. This implies that (14) can be rewritten as
where is the lumped uncertainty term, which includes the difference between the FNN outputs with the state Xj(t) and the estimated state as inputs, together with the network approximation error. It can easily be shown that the lumped uncertainty term satisfies |δj(t)| ≤ δ*. However, it is noted that the value of the unknown constant δ* might be large.
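To make the observer structure concrete, here is a minimal single-joint sketch assuming the Luenberger form Ê̇j = AcÊj + Ko(ej − êj) suggested by (12); the gains match the simulation values in Section 5, but the Euler discretization and the test scenario are illustrative only, not the paper's implementation.

```python
# Minimal single-joint sketch of the state tracking error observer, assuming
# the Luenberger form Ehat_dot = Ac*Ehat + Ko*(e - ehat) suggested by (12).
# Gains k1c = k2c = 4, k1o = 16, k2o = 9 follow the simulation section; the
# Euler discretization and the driving scenario are illustrative assumptions.

def observer_step(ehat, dehat, e, dt, k1c=4.0, k2c=4.0, k1o=16.0, k2o=9.0):
    """One Euler step; only the measured output error e is used."""
    innov = e - ehat                                  # output observation error
    ehat_new = ehat + dt * (dehat + k1o * innov)
    dehat_new = dehat + dt * (-k2c * ehat - k1c * dehat + k2o * innov)
    return ehat_new, dehat_new

# true error dynamics Edot = Ac E (unforced), observer running alongside
e, de = 1.0, 0.0        # true error state; de is NOT available to the observer
ehat, dehat = 0.0, 0.0
dt = 0.001
for _ in range(10000):  # 10 s of simulated time
    e_next = e + dt * de
    de_next = de + dt * (-4.0 * e - 4.0 * de)
    ehat, dehat = observer_step(ehat, dehat, e, dt)
    e, de = e_next, de_next
```

The observation error then obeys ε̇ = Aoε with Ao = Ac − KoC⊤ (here with characteristic polynomial s² + 20s + 77), so e − ê decays to zero even though the error velocity is never measured.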
To see how to design the iterative learning controller uj(t) to achieve the control objective, we now adopt a mixed use of time-domain signals and Laplace transfer functions to obtain an explicit filtered expression of in (15) in the time domain as follows:
where . The observer gain vector Ko = [k1oIn × n,k2oIn × n]⊤ is chosen such that with ℓ(s) = s + λ and L(s) being any Hurwitz polynomial of order 1. Then (16) can be rewritten as
where .
3.2. Construct Some Useful Signals
According to the filtered version of the output tracking error model (17), we define an augmented signal in filtered form as
where vj(t) is an auxiliary input to be designed later. Then, design an auxiliary error signal as
The initial condition eaj(0) will satisfy . Substituting (17) and (18) into (19), we can find that
It is noted that we apply the Laplace-domain operation to derive the tracking error model (20) for the technical analysis later, as is done in traditional model reference adaptive control [26]. The equivalent time-domain state-space representation of (20) is
In order to overcome the uncertainty from initial output tracking error, a dead-zone signal eϕj(t) is introduced as follows:
where and
and ϕj(t) is the width of the boundary layer. Note that ϕj(t) is designed to decrease along the time axis with the initial condition chosen as ϕj(0) = ɛj for the jth iteration, so that 0<ɛje– λT ≤ ϕj(t) ≤ ɛj, ∀t∈[0,T], j≥1. According to (22), it is easy to show that eϕj(0) = 0, ∀j≥1. Now let us differentiate as follows:
where is the typical signum function vector. In the following, we will show that the uncertainty term δLj(t) in (17) can be bounded in a linearly parameterized form if the following normalization signal mj(t) [27] is utilized
where δ1,δ2>0 and δ1<δ1*. Here δ1* is the least positive constant such that 1/L(s – δ1*) is a stable system. In practice, δ1 can be chosen as small as possible. Note that 1-b(Xj(t)) and δj(t) are bounded and is a strictly proper stable transfer function. Hence, according to the definition of δLj(t) and by using Lemma 3.1 in [27], we can prove that
for some positive constants θ1*,θ2*.
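The dead-zone construction above can be sketched as follows; the saturation form eϕ = ea − ϕ·sat(ea/ϕ), which vanishes inside the boundary layer, is a common choice assumed here since (22) is not reproduced in the text.

```python
import math

# Sketch of the dead-zone signal with exponentially decaying boundary layer
# phi_j(t) = eps_j * e^(-lambda*t). Assumption: (22) has the common saturation
# form e_phi = e_a - phi * sat(e_a / phi), which vanishes inside the layer.

def boundary_layer(t, eps_j, lam=10.0):
    """phi_j(t) = eps_j * exp(-lam * t), so phi_j(0) = eps_j."""
    return eps_j * math.exp(-lam * t)

def dead_zone(ea, phi):
    """Return 0 inside the boundary layer, the excess beyond it otherwise."""
    if abs(ea) <= phi:
        return 0.0
    return ea - math.copysign(phi, ea)

# with |e_a(0)| <= eps_j, the dead-zone signal starts at zero every iteration
eps = 0.05
assert dead_zone(0.04, boundary_layer(0.0, eps)) == 0.0
```

Because ϕj(0) = ɛj bounds the initial auxiliary error, eϕj(0) = 0 holds at every iteration, which is exactly the property the boundary layer is designed to deliver.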
3.3. Design the Filtered-FNN-Based Iterative Learning Controller
Based on the aforementioned derived error model and the useful signals, we now design uj(t) and vj(t) as follows:
Here, is a filtered-FNN (see the Appendix) used as an approximator to compensate for h(Xj(t)), is a robust learning term used to overcome the uncertainties due to the function approximation error and the error induced by using the state estimate, and eϕj(t)Yj(t)⊤Yj(t) is a stabilization term used to guarantee the boundedness of all the closed-loop signals. In this controller, Wj(t) is the weight matrix of the filtered-FNN and θj(t) is the control parameter vector, which are used to compensate for the unknown W* and θ*, respectively. Furthermore, F(τs) = (τs + 1)² with τ>0 being a small constant. In the literature, 1/F(τs) is referred to as an averaging filter; it is a low-pass filter whose bandwidth can be arbitrarily enlarged as τ approaches 0. If we define the parameter errors as W̃j(t) = Wj(t) − W* and θ̃j(t) = θj(t) − θ* and substitute (28) into (24), we have the following error dynamics for the technical analysis later:
A set of stable adaptive laws is necessary to tune the control parameters. The adaptive laws combining time domain and iteration domain adaptation without knowledge of known bounds on optimal parameters or dead zone mechanism are proposed as follows:
with Wj(0) = Wj–1(T), θj(0) = θj–1(T) for j≥1, and 0<γ1,γ2<1, β1,β2>0. In these adaptive laws, γ1,γ2 and β1,β2 are the weighting gains and adaptation gains, respectively. For the first iteration, we set W0(t) = W0 and θ0(t) = θ0 to be arbitrary constant matrices or vectors for all t∈[0,T]. Equations (30) and (31) become pure time-domain adaptive laws if γ1 = γ2 = 0, or pure iteration-domain adaptive laws if γ1 = γ2 = 1.
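Since (30) and (31) are not reproduced above, the discretized update below is only a generic illustration of how a time-domain gradient drive and an iteration-domain pull toward the previous iteration's parameter trajectory can be blended with a weighting gain γ, together with the stated terminal alignment Wj(0) = Wj−1(T); the specific drive terms are assumptions, not the paper's laws.

```python
# Generic illustration of a hybrid time/iteration-domain parameter update.
# Grounded in the text: W_j(0) = W_{j-1}(T), weighting gain gamma in (0,1),
# adaptation gain beta. The drive terms (gradient-type e_phi * y and a pull
# toward the previous iteration's trajectory) are stand-ins for the elided
# laws (30)-(31), not the paper's equations.

def run_iteration(w_prev_traj, ephi_traj, y_traj, dt, beta=500.0, gamma=0.5):
    """w_prev_traj holds W_{j-1}(t) sampled every dt over [0, T]."""
    w = w_prev_traj[-1]            # iteration-domain alignment W_j(0) = W_{j-1}(T)
    traj = [w]
    for k in range(1, len(ephi_traj)):
        time_term = ephi_traj[k] * y_traj[k]   # time-domain gradient drive
        iter_term = w_prev_traj[k] - w         # iteration-domain correction
        w = w + dt * beta * ((1.0 - gamma) * time_term + gamma * iter_term)
        traj.append(w)
    return traj
```

With gamma = 0 the update reduces to a purely time-domain gradient law, and with gamma = 1 it only tracks the previous iteration's trajectory, loosely mimicking the two limiting cases mentioned above.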
4. Analysis of Stability and Convergence
To prove the stability and convergence of the proposed learning system, we first give the following three lemmas.
Lemma 1. Consider the robotic system (2) performing a repetitive control task. If one applies the observer-based adaptive filtered-FNN iterative learning controller (18), (19), (22), (25), (27), and (28) with adaptation laws (30) and (31), then one guarantees that , and are bounded.
Proof. Choose a Lyapunov-like positive function as
and compute its derivative with respect to time t along (29), (30), and (31); then we have
Since and , Vaj(t) in (33) can be simplified by using (30) and (31) as
where we use the property of . Since and are bounded for all t∈[0,T] so that if j = 1, (34) can be rewritten as
Note that the initial value Va1(0) is bounded since eϕ1(0) = 0, and . Together with the result of (35), this readily implies Va1(t), eϕ1(t), . Since the filtered basis function vector is bounded for all j≥1, we conclude that ea1(t)∈L∞e[0,T] (by (22)).
Lemma 2. Consider the problem setup in Lemma 1. The proposed observer-based adaptive filtered-FNN iterative learning controller guarantees that , and are bounded, for all j≥1 as well as and .
Proof. Define a positive function Vj(T) as
Using the technique of integration by parts, we have
The difference between Vj(T) and Vj–1(T) can be derived by the facts of and as follows:
where we use the property of . Substituting (39) into (38), it yields
Since V1(T) is bounded by Lemma 1 and Vj(T) is positive and monotonically decreasing, we conclude by the result of (40) that Vj(T) is bounded for all j≥1 and will converge as j approaches infinity to some limit value V(T) which is independent of j. The boundedness of Vj(T) also ensures the boundedness of and for all j≥1. On the other hand, (40) also implies
It follows that eϕj(T)⊤eϕj(T) is bounded for all j≥1. Furthermore, and .
Using the boundedness of and (or, equivalently, the boundedness of and ) for all j≥1 as shown in Lemma 2, the boundedness of all internal signals for all j≥1 is now established in the following lemma.
Lemma 3. Consider the problem setup in Lemma 1. The proposed observer-based adaptive filtered-FNN iterative learning controller ensures that all the internal signals are bounded; that is, eϕj(t), eaj(t), , yaj(t), , Ej(t), , Wj(t), θj(t), uj(t), , ∈L∞e[0,T].
Proof. Since Vj(T), defined in (36), is bounded for all j≥1 according to Lemma 2, we conclude that is bounded for all j≥1. Furthermore, the initial value Vaj(0) is also bounded for all j≥1 due to Lemma 2. This readily implies from (44) that Vaj(t) and, hence, eϕj(t), . Using the same argument as in Lemma 1, it can easily be shown that eaj(t)∈L∞e[0,T] for all j≥1.
However, the boundedness of Vaj(t), eϕj(t), , , eaj(t), and cannot guarantee the boundedness of Yj(t) (or, equivalently, mj(t)) and of the input uj(t). In order to show the boundedness of Yj(t) for all t∈[0,T], we first note that are established in (41). In addition, since . However, the boundedness of eϕj(t) and only ensures that Yj(t) is bounded everywhere except on a set of measure zero. We now adopt some techniques given in Chapter 2 of [26]. First, rewrite uj(t) in (27) as follows:
Since , and are bounded for t∈[0,T], and L(s)/F(τs), sL(s)/F(τs) are proper or strictly proper stable transfer functions, (45) implies that uj(t) will satisfy
for some k1,k2>0 by Lemma 2.6 in [26] (the output of a stable linear system is bounded by the truncated L∞ norm of its input). Now we construct an extended dynamic equation by using (15) and (25) as follows:
Let Xaj(t) be the state vector of the extended dynamic equation (47). Taking norms on (47) will yield
for some k3,k4>0. This implies that Xaj(t) is regular [26], so that Xaj(t), and hence , can grow at most exponentially fast with no finite-time escape on the finite time interval [0,T]. This condition guarantees a certain degree of smoothness of the signal mj(t), and therefore Yj(t) is a signal with a certain degree of smoothness. Together with the result of , we can now conclude that Yj(t)∈L∞e[0,T].
Due to , this also implies that vj(t)∈L∞e[0,T] (by (28)). Since eϕj(t), Wj(t), θj(t), , Yj(t)∈L∞e[0,T], we conclude that (by (30)) and (by (31)) ∈L∞e[0,T]. Since vj(t)∈L∞e[0,T] and L(s)/F(τs) is a proper stable transfer function, uj(t)∈L∞e[0,T] (by (27)). Since vj(t),uj(t)∈L∞e[0,T] and 1/L(s), 1/ℓ(s) are strictly proper stable transfer functions, yaj(t)∈L∞e[0,T] (by (18)). As noted above, eaj(t),yaj(t)∈L∞e[0,T], so we have (by (19)). Moreover, because Ac is a Hurwitz matrix and , this implies that (by (12)). Finally, since Ao is a Hurwitz matrix and , we have (by (14)). Since , it follows that Ej(t)∈L∞e[0,T] (by (13)). This completes the proof.
Based on Lemmas 1, 2, and 3, we now state the main result in the following theorem.
Theorem 4. Consider the system setup in Lemma 1. The proposed observer-based adaptive filtered-FNN iterative learning controller guarantees the tracking performance and system stability as follows:
, for all t∈[0,T].
, for all t∈[0,T].
, for all t∈[0,T] and for a constant k5>0.
Let δ and k6 be the positive constants such that the transition matrix Φ(t) of Ac satisfies |Φ(t)| ≤ k6e−δt. Then there exists a positive constant k7 such that , for all t∈[0,T].
, for all t∈[0,T].
Proof. (T1) Since both eϕj(t)⊤eϕj(t) and its time derivative belong to L∞e[0,T] for all j≥1, eϕj(t)⊤eϕj(t) is uniformly continuous over [0,T] for all j≥1. On the other hand, eϕj(t)⊤eϕj(t) satisfies due to the result of Lemma 2. We can now conclude, by an argument similar to that of Barbalat's lemma (e.g., Lemma 3.2.6 in [28]), that for all t∈[0,T].
(T2) According to the definition of eϕj(t) in (22), we can derive the bound of
for all t∈[0,T].
(T3) Substituting (27) into (18), we can find that eaj(t) actually satisfies
Since vj(t) is bounded and the H∞ norms of and are finite, we can conclude that
(T5) Finally, we investigate the tracking performance in the final iteration when (T1), (T2), (T3), and (T4) of this theorem are achieved. Since , we have
As iteration goes to infinity,
for t∈[0,T]. This completes the proof.
Remark 5. In Theorem 4, we show that the output learning error ej(t) converges to a residual set as the number of iterations approaches infinity. In general, the size of the residual set can be tuned by two design parameters. The first, λ, is the decay rate of the boundary layer ϕj(t) = ɛje−λt, which should be chosen as large as possible. The second, τ, is the filter parameter of the averaging filter 1/F(τs) = 1/(τs + 1)², which should be chosen as small as possible. To guarantee satisfactory learning performance, we therefore usually choose a larger λ and a smaller τ. Furthermore, if there is no initial output error, that is, ɛj = 0, the output learning error converges to a residual set whose level of magnitude depends on τ only.
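The role of τ can be seen in a quick discretized sketch of the averaging filter 1/F(τs) = 1/(τs + 1)², implemented here (by assumption) as two cascaded first-order lags with an implicit-Euler step; the values of τ, the step size, and the test input are illustrative.

```python
# Sketch of the averaging filter 1/F(tau*s) = 1/(tau*s + 1)^2 as a cascade of
# two identical first-order lags. The implicit-Euler discretization and the
# step-input experiment are illustrative assumptions.

def averaging_filter(signal, dt, tau):
    """Apply 1/(tau*s+1)^2 by chaining two first-order lags of time constant tau."""
    alpha = dt / (tau + dt)      # implicit-Euler lag coefficient (stable for any dt)
    x1 = x2 = 0.0
    out = []
    for u in signal:
        x1 += alpha * (u - x1)   # first lag
        x2 += alpha * (x1 - x2)  # second lag
        out.append(x2)
    return out

# DC gain is 1: a constant input passes through unchanged in steady state,
# while the bandwidth (~1/tau) grows as tau -> 0
step = [1.0] * 2000
y = averaging_filter(step, dt=0.001, tau=0.01)
```

A smaller τ widens the passband so the filter barely distorts the control signal, while still smoothing out high-frequency components, which is why τ is pushed toward zero in the design.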
5. Simulation Example
In this section, a computer simulation is conducted to demonstrate the learning performance of the proposed observer-based adaptive filtered-FNN iterative learning controller. Here we consider a two-link planar robotic system [29] with the dynamic equation
where , , D22 = m2lc2² + I2, and h = m2l1lc2 sin(q2j(t)). Here, mi, Ii, and li represent the mass, inertia, and length of link i, respectively, and lci is the distance from the previous joint to the center of mass of link i.
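The model can be sketched as follows, using the parameter values of the simulation below. D22 and h are as given in the text; D11 and D12 are not stated there and are filled in from the standard two-link model in [29], so treat them as reconstructions; gravity is omitted under a horizontal-plane assumption.

```python
import math

# Sketch of the two-link planar arm model. D22 and h follow the text; D11 and
# D12 are reconstructed from the standard two-link model in [29] (assumption).
# Gravity is omitted (horizontal-plane assumption).

M1, M2 = 10.0, 5.0               # link masses [kg]
L1, LC1, LC2 = 1.0, 0.5, 0.25    # link length / centers of mass [m]
I1, I2 = 0.83, 0.3               # link inertias [kg m^2]

def inertia(q2):
    d22 = M2 * LC2**2 + I2                               # given in the text
    d12 = M2 * (LC2**2 + L1 * LC2 * math.cos(q2)) + I2   # reconstructed
    d11 = (M1 * LC1**2 + I1 + I2
           + M2 * (L1**2 + LC2**2 + 2.0 * L1 * LC2 * math.cos(q2)))  # reconstructed
    return [[d11, d12], [d12, d22]]

def coriolis(q2, dq1, dq2):
    h = M2 * L1 * LC2 * math.sin(q2)                     # given in the text
    return [[-h * dq2, -h * (dq1 + dq2)], [h * dq1, 0.0]]

D = inertia(0.0)
```

Note that the sketched inertia matrix is symmetric and positive definite, consistent with the bound m1In×n ≤ D(qj(t)) ≤ m2In×n assumed in Section 2.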
In this simulation, we set m1 = 10 kg, m2 = 5 kg, l1 = 1 m, l2 = 0.5 m, lc1 = 0.5 m, lc2 = 0.25 m, I1 = 0.83 kg·m², and I2 = 0.3 kg·m². The control objective is to let qj(t) = [q1j(t),q2j(t)]⊤ track the desired trajectory as closely as possible over the finite time interval [0,15] when only the angular position qj(t) is measurable. The design steps are summarized as follows.
Specify the observer and feedback gain vectors Ko = [k1oI2×2,k2oI2×2]⊤ = [16I2×2, 9I2×2]⊤ ∈ ℛ4×2, Kc = [k2cI2×2,k1cI2×2]⊤ = [4I2×2, 4I2×2]⊤ ∈ ℛ4×2, respectively, such that the matrices Ac = A-BKc⊤ and Ao = Ac-KoC⊤ are Hurwitz.
Design the tracking error observer as in (12) to obtain , and the state estimation vector .
Select with ℓ(s) = s + λ = s + 10 and L(s) = s + 10. Then, define an augmented signal with filtered version as , an auxiliary error signal as , and a dead-zone signal with ϕj(t) = ɛje−λt = ɛje−10t. The normalization signal is given as with δ1 = δ2 = 0.01.
Construct the membership functions for . Then, solve the filtered basis function vector . The filtered-FNN with input is given as . Since the working domain of the desired trajectory is within the interval [−1, 1], we choose the centers as m = [m1,m2,m3,m4] with mi = [mi1,mi2,mi3,mi4,mi5] = [−1, −0.5, 0, 0.5, 1], i = 1, 2, 3, 4, and the variances as σ = [σ1,σ2,σ3,σ4] with σi = [σi1,σi2,σi3,σi4,σi5] = [0.25, 0.25, 0.25, 0.25, 0.25], i = 1, 2, 3, 4, to cover this interval. In addition, we set the control parameter θ0(t) = θ0 = 0.1 at the first iteration for all t∈[0, 15]. It is noted that the initial values of the consequent parameters W0(t) could be roughly estimated if the nonlinear function h(Xj(t)) of the robotic system were partially known; in practice, however, these initial parameters are often chosen arbitrarily.
Design the observer-based adaptive filtered-FNN iterative learning controller as uj(t) = (L(s)/F(τs))[vj(t)] where . The averaging filter is designed with τ = 0.0001.
Finally, the adaptation algorithms (30) and (31) are adopted to update the filtered-FNN parameters and control parameters with γ1 = γ2 = 0.5, β1 = β2 = 500.
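The Gaussian membership functions of the design steps above can be sketched as follows; the exact parameterization of the Gaussian and the way the rule-node basis O(3) combines the grades follow the FNN in the Appendix and are assumptions here.

```python
import math

# Sketch of the Gaussian membership functions used in the simulation: five
# centers per input at [-1, -0.5, 0, 0.5, 1] with sigma = 0.25, covering the
# working domain [-1, 1]. How the rule-node basis O^(3) combines the grades
# (e.g., product vs. normalized product) is left to the Appendix (assumption).

CENTERS = [-1.0, -0.5, 0.0, 0.5, 1.0]
SIGMA = 0.25

def membership(x):
    """Gaussian membership grades of a scalar input over the five fuzzy sets."""
    return [math.exp(-((x - c) ** 2) / (2.0 * SIGMA ** 2)) for c in CENTERS]

mu = membership(0.5)   # input sitting exactly on the fourth center
```

An input at a center fires that set with grade 1 while the neighbors decay rapidly, so the five sets tile the interval [−1, 1] with moderate overlap.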
In order to show the robustness to varying initial output tracking errors, we first let the initial joint position vector of the two-link planar robotic system take the following arbitrary values for the first five iterations: qj(0) = [q1j(0),q2j(0)]⊤ = [0.05, 0.3]⊤,[0.25, 0.8]⊤,[0.14, 0.6]⊤,[0.09, 0.4]⊤,[−0.1, 0.5]⊤. At the beginning of each iteration, the initial value of the boundary layer ϕj(t) is then chosen according to . For example, ϕ1(0) = 0.05, ϕ2(0) = 0.25, ϕ3(0) = 0.14, ϕ4(0) = 0.09, and ϕ5(0) = 0.1. To study the learning performance, and versus iteration j are shown in Figures 1(a) and 1(b), respectively. It is clear that the asymptotic convergence proved in (T1) of Theorem 4 is achieved. Since the learning process is almost complete at the 5th iteration, we demonstrate the auxiliary errors ea, 15(t) and ea, 25(t) in Figures 2(a) and 2(b), respectively. The trajectories of ea, 15(t) and ea, 25(t) satisfy −0.5e−10t ≤ ea, 15(t) ≤ 0.5e−10t and −0.5e−10t ≤ ea, 25(t) ≤ 0.5e−10t, respectively, which clearly verifies (T2) of Theorem 4. The output tracking errors e15(t) and e25(t) are shown in Figures 3(a) and 3(b); the performance is satisfactory even though the initial output errors vary from iteration to iteration. The good tracking performance at the 5th iteration between the joint position vector qj(t) = [q1j(t),q2j(t)]⊤ and the desired joint position vector qd(t) = [qd, 1(t),qd, 2(t)]⊤ is presented in Figures 4(a) and 4(b), respectively. Finally, the bounded learned control forces u15(t) and u25(t) are plotted in Figures 5(a) and 5(b), respectively.
(a) versus iteration j; (b) versus iteration j.
(a) ea, 15(t) (solid line) and ± ϕ5(t) (dotted lines) versus time t; (b) ea, 25(t) (solid line) and ± ϕ5(t) (dotted lines) versus time t.
(a) e15(t) versus time t; (b) e25(t) versus time t.
(a) q15(t) (solid line) and qd, 1(t) (dotted line) versus time t; (b) q25(t) (solid line) and qd, 2(t) (dotted line) versus time t.
(a) u15(t) versus time t; (b) u25(t) versus time t.
6. Conclusion
An observer-based adaptive filtered-FNN iterative learning controller for repeated tracking control of uncertain robotic systems is proposed in this paper. A tracking error observer is designed to estimate the unknown joint variables since only the joint positions are assumed to be measurable. An observation error dynamics based on the tracking error observer is derived for the design of the iterative learning controller. The main control force is designed by using an averaging filter technique and some auxiliary signals. In this main control force, a fuzzy neural learning component based on a filtered-FNN is used to approximate the unknown nonlinear function, a robust learning component is designed to compensate for the other uncertainties, and a stabilization term is applied to guarantee the boundedness of internal signals. Adaptive laws combining time-domain and iteration-domain adaptation for the network parameters and control parameters are proposed to ensure the stability and convergence of the learning system. A Lyapunov-like analysis has been developed to establish both the boundedness of the internal signals inside the closed-loop system and the asymptotic convergence of the learning error. It is shown that the output tracking error asymptotically converges to a tunable residual set whose level of magnitude can be tuned by the design parameters as the iteration number increases.
Appendix
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgment
This work was supported by the National Science Council under Grants NSC102-2221-E-211-003 and NSC102-2221-E-211-011.
References
1. K. L. Moore and J.-X. Xu, "Special issue on iterative learning control," International Journal of Control, vol. 73, no. 10, pp. 819–823, 2000.
2. D. A. Bristow, M. Tharayil, and A. G. Alleyne, "A survey of iterative learning control: a learning-based method for high-performance tracking control," IEEE Control Systems Magazine, vol. 26, no. 3, pp. 96–114, 2006.
3. H.-S. Ahn, Y. Q. Chen, and K. L. Moore, "Iterative learning control: brief survey and categorization," IEEE Transactions on Systems, Man and Cybernetics C, vol. 37, no. 6, pp. 1099–1121, 2007.
4. Y. Wang, F. Gao, and F. J. Doyle III, "Survey on iterative learning control, repetitive control, and run-to-run control," Journal of Process Control, vol. 19, no. 10, pp. 1589–1600, 2009.
5. J.-X. Xu, "A survey on iterative learning control for nonlinear systems," International Journal of Control, vol. 84, no. 7, pp. 1275–1294, 2011.
6. R. Horowitz, "Learning control of robot manipulators," Journal of Dynamic Systems, Measurement and Control, Transactions of the ASME, vol. 115, no. 2B, pp. 402–411, 1993.
7. D. Wang, Y. C. Soh, and C. C. Cheah, "Robust motion and force control of constrained manipulators by learning," Automatica, vol. 31, no. 2, pp. 257–262, 1995.
8. C.-J. Chien and J.-S. Liu, "A P-type iterative learning controller for robust output tracking of nonlinear time-varying systems," International Journal of Control, vol. 64, no. 2, pp. 319–334, 1996.
9. F. Bouakrif, "D-type iterative learning control without resetting condition for robot manipulators," Robotica, vol. 29, no. 7, pp. 975–980, 2011.
10. A. Tayebi, "Adaptive iterative learning control for robot manipulators," Automatica, vol. 40, no. 7, pp. 1195–1203, 2004.
11. C.-J. Chien and A. Tayebi, "Further results on adaptive iterative learning control of robot manipulators," Automatica, vol. 44, no. 3, pp. 830–837, 2008.
12. X.-G. Jia and Z.-Y. Yuan, "Adaptive iterative learning control for robot manipulators," in Proceedings of the IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS '10), pp. 139–142, Xiamen, China, October 2010.
13. J.-M. Wei and Y.-A. Hu, "Adaptive iterative learning control for robot manipulators with initial resetting errors," Applied Mechanics and Materials, vol. 130–134, pp. 265–269, 2012.
14. T. Ngo, Y. Wang, T. L. Mai, J. Ge, M. H. Nguyen, and S. N. Wei, "An adaptive iterative learning control for robot manipulator in task space," International Journal of Computers, Communications and Control, vol. 7, no. 3, pp. 510–521, 2012.
15. A. Tayebi and C.-J. Chien, "A unified adaptive iterative learning control framework for uncertain nonlinear systems," IEEE Transactions on Automatic Control, vol. 52, no. 10, pp. 1907–1913, 2007.
16. W. Chen, J. M. Li, and J. Li, "Practical adaptive iterative learning control framework based on robust adaptive approach," Asian Journal of Control, vol. 13, no. 1, pp. 85–93, 2011.
17. C.-J. Chien, "A combined adaptive law for fuzzy iterative learning control of nonlinear systems with varying control tasks," IEEE Transactions on Fuzzy Systems, vol. 16, no. 1, pp. 40–51, 2008.
18. Y.-C. Wang and C.-J. Chien, "Decentralized adaptive fuzzy neural iterative learning control for nonaffine nonlinear interconnected systems," Asian Journal of Control, vol. 13, no. 1, pp. 94–106, 2011.
19. Y.-C. Wang and C.-J. Chien, "Repetitive tracking control of nonlinear systems using reinforcement fuzzy-neural adaptive iterative learning controller," Applied Mathematics and Information Sciences, vol. 6, no. 3, pp. 473–481, 2012.
20. A. Tayebi and J.-X. Xu, "Observer-based iterative learning control for a class of time-varying nonlinear systems," IEEE Transactions on Circuits and Systems I, vol. 50, no. 3, pp. 452–455, 2003.
21. J.-X. Xu and J. Xu, "Observer based learning control for a class of nonlinear systems with time-varying parametric uncertainties," IEEE Transactions on Automatic Control, vol. 49, no. 2, pp. 275–281, 2004.
22. J. Wallén, M. Norrlöf, and S. Gunnarsson, "A framework for analysis of observer-based ILC," Asian Journal of Control, vol. 13, no. 1, pp. 3–14, 2011.
23. Y.-Y. Du, J. S.-H. Tsai, S.-M. Guo, T.-J. Su, and C.-W. Chen, "Observer-based iterative learning control with evolutionary programming algorithm for MIMO nonlinear systems," International Journal of Innovative Computing, Information and Control, vol. 7, no. 3, pp. 1357–1374, 2011.
24. W.-S. Chen, R.-H. Li, and J. Li, "Observer-based adaptive iterative learning control for nonlinear systems with time-varying delays," International Journal of Automation and Computing, vol. 7, no. 4, pp. 438–446, 2010.
25. F. Bouakrif, D. Boukhetala, and F. Boudjema, "Velocity observer-based iterative learning control for robot manipulators," International Journal of Systems Science, vol. 44, no. 2, pp. 214–222, 2013.
26. K. S. Narendra and A. M. Annaswamy, Stable Adaptive Systems, Prentice-Hall, 1988.
27. P. A. Ioannou and K. S. Tsakalis, "A robust direct adaptive control," IEEE Transactions on Automatic Control, vol. 31, no. 11, pp. 1033–1043, 1986.
28. P. A. Ioannou and J. Sun, Robust Adaptive Control, Prentice Hall, Englewood Cliffs, NJ, USA, 1996.
29. J.-J. E. Slotine and W. Li, Applied Nonlinear Control, Prentice-Hall, Englewood Cliffs, NJ, USA, 1991.