Nonlinear Receding-Horizon Control of Rigid Link Robot Manipulators

Abstract

The approximate nonlinear receding-horizon control law is used to treat the trajectory tracking control problem of rigid link robot manipulators. The derived nonlinear predictive law uses a quadratic performance index of the predicted tracking error and the predicted control effort. A key feature of this control law is that, for their implementation, there is no need to perform an online optimization, and asymptotic tracking of smooth reference trajectories is guaranteed. It is shown that this controller achieves the positions tracking objectives via link position measurements. The stability convergence of the output tracking error to the origin is proved. To enhance the robustness of the closed loop system with respect to payload uncertainties and viscous friction, an integral action is introduced in the loop. A nonlinear observer is used to estimate velocity. Simulation results for a two-link rigid robot are performed to validate the performance of the proposed controller.

Keywords

receding-horizon control nonlinear observer robot manipulators integral action robustness

1. Introduction

During recent years much emphasis has been placed on flexible manufacturing processes where the most important factors are quality, costs and time. Both fast motion in unconstrained space and mechanical interaction with the environment are required in most manufacturing systems. Industrial robots are often used to meet this demand and to perform various tasks such as material assembling, painting or welding. To accomplish these tasks efficiently and accurately, several control approaches have been proposed in the literature. Among these, a simple PD-control scheme that achieves satisfactory performance (Spong M. W. & Vidyasagar M. 1989) in the absence of gravity. However, robot manipulator is highly nonlinear system with coupling between joints and the gravity effects. The computed torque control or feedback linearization control has been also used to achieve best tracking performance. The implantation of the computed torque controller requires exact knowledge of the robot dynamics. Unfortunately, model uncertainties are frequently encountered in robotics due to unknown or changing payload and friction. These model uncertainties may decrease significantly the performance of this method in terms of tracking accuracy. Therefore, to achieve acceptable performance, even when all kinds of uncertainties are encountered, numerous robust control algorithms have been used like the variable structure approach (Slotine J. J. E. & Sastry S. S. 1983), robust adaptive approach (Ortega R. & Spong M. W. 1989), (Lee K. W. & Khalil H. K. 1997), (Canudas C. W. & Fixot N. 1992), (Spong M. W. 1992) and nonlinear H_∞ approach (Chen B. S. et al 1994). A first survey of early results in robust control has been compiled in (Abdullah C. et al; 1991) and the second survey of recent results has been given in (Sage H. G. et al, 1999). Finally the robotic applications require effective control laws that achieve accurate tracking of fast motion despite the variations of inertia and gravitational load of the manipulator during operation.

Model predictive control of linear systems has received considerable attention in the last decade due to its robustness with respect to model uncertainties. However, many systems are inherently nonlinear. Since linear models are often inadequate to describe accurately the process dynamics, then nonlinear models should be used. Much effort has been made to extend linear predictive control to nonlinear systems (Michalska H & Mayne D. Q. 1993). The disadvantages of the proposed approach is the heavy online computation burden that causes two important problems in implementation of the nonlinear predictive control. One is the computation delay that cannot be ignored and the other is the global solution that cannot be guaranteed in each optimization problem. The application of these nonlinear control laws to nonlinear systems characterized by fast dynamics (such robotics) sound like unusual proposal. To overcome the computation burden, several nonlinear predictive laws have been developed in (Ping L. 1995), (Singh S. M. 1995), (Souroukh M. & Kravaris C. 1996), (Chen W. H. et al, 2003), where the one step ahead predictive output error is obtained by expanding the output signal and the is used to derive the offline control laws. In this paper, the nonlinear receding-horizon controller proposed in (Ping L. 1998) is applied to robot manipulator to achieve position angular tracking objectives. To derive the control law, the predictive tracking error and the predicted control effort are minimized over a fixed time horizon. This approximate nonlinear controller is given in a closed form and thus no online optimization is required. Moreover, to increase the robustness of the control algorithm with regard to model uncertainties, we propose to introduce an integral action in the loop. The well-known Lyapunov based theory is used to show the asymptotic stability of the closed loop system in matched or mismatched case.

The major drawback of the proposed schemes is the requirement of measurement of motor speed. Speed measurements increase cost and impose constraints on the achievable bandwidth. Thus, to overcome this problem a nonlinear observer is used to estimate position and velocity angular of robot manipulator.

The outline of this paper proceeds as follows. In the next section, a dynamic model of robot manipulator is presented. In section 3, the approximate receding-horizon control scheme is developed to allow position angular tracking of a desired references trajectory. Stability analysis and robustness are treated in section 4. The high gain observer used to estimate unmeasured output elements (velocity angular) is presented in section 5. Simulation results are given in section 6. In the last, we conclude with some remarks.

2. Dynamic model of rigid link robot manipulators

The Euler–Lagrange equations are a tool from analytical mechanics that can be used to derive the equations of motion for a mechanical system. In this approach the joint q(t) are considered as generalized coordinates. The kinetic energy of a robot manipulator with n degrees of freedom can be calculated as:

Γ (q, \dot{q}) = \frac{1}{2} \dot{q} {(t)}^{T} D (q) \dot{q} (t),

where D(q) is the inertia matrix. Let U(q) : ℜⁿ → ℜ be a continuously differentiable function, called the potential energy. The Lagrangian function is defined (Spong M. W. & Vidyasagar M. 1989) by:

L (q, \dot{q}) = Γ (q, \dot{q}) - U (q) .

The dynamics of the manipulator are described by Lagrange's equations:

\frac{d}{d t} \frac{\partial L (q, \dot{q})}{\partial {\dot{q}}_{k}} - \frac{\partial L (q, \dot{q})}{\partial q_{k}} = u_{k}, k = 1, \dots \dots, n,

reference signal in a r_i^th order Taylor series, ri is the relative degree of the i^th element of the output. Then, the continuous minimization of the predictive tracking errors

where u₁, u₂,……u_n represent generalized input torques. Inserting the kinetic energy and the potential energy for the Lagrange L(q, q̇) above leads to the matrix description:

D (q) \overset{..}{q} + C (q, \dot{q}) \dot{q} + G (q) + f_{r} = u_{r},

where q(t) ε ℜⁿ is the vector of the generalized coordinates representing the angular joint positions and controlled with the driving torques u _r ε ℜⁿ, D(q) ε ℜ^nxn, D(q)=D(q)^T>0, is the link inertia matrix, C(q, q̇) q̇ ε ℜⁿ is the vector of the coriolis and centripetal torques, G(q) ε ℜⁿ is the vector of gravitational torques and f _r represents friction torques acting on the joints. This is described in [9], when only the mechanical parts of actuators dynamics are included. The dynamic model of a rigid robot manipulator becomes:

M (q) \overset{..}{q} + C (q, \dot{q}) \dot{q} + G (q) + f = u,

(1)

with:

\begin{array}{l} u = N u_{m}, & M (q) = D (q) + N^{2} J_{m} = D (q) + J \end{array}

and f = f _r + N f _m ,

where:

N is the diagonal matrix of the gear ratios.

u _m is the vector of torque supplied by the actuators.

f _m is the vector friction torque acting on the motors.

J _m is the diagonal matrix containing the effective motors' inertia.

It is assumed that the position q(t) is available for measurement.

Control Objective: The desired reference trajectory for the control object to follow is assumed to be available as bounded functions of time in terms of generalized position q_ref (t). That is, there exist three positive constants r_i, i=0,1,2 such that the following inequalities hold:

‖ q_{r e f} (t) ‖ \leq r_{0}, ‖ {\dot{q}}_{r e f} (t) ‖ \leq r_{1} and ‖ {\overset{..}{q}}_{r e f} (t) ‖ \leq r_{2}

(2)

State space representation: The dynamic equation of n link robot manipulator (1) can be written in the state space representation as:

{\begin{array}{l} {\dot{x}}_{1} (t) = x_{2}, \\ {\dot{x}}_{2} (t) = f (x_{1}, x_{2}) + P (x_{1}) u (t), \\ y (t) = x_{1}, \end{array}

(3)

where x(t) = [x ₁ x ₂ ] ^T = [q q̇]^T ε ℜ²ⁿ is the state vector. u(t) ε ℜⁿ represents the control torque vector and y(t) ε ℜⁿ is the output vector (position angular). f(x ₁ , x₂) = –M(q)⁻¹ (C(q, q̇) q̇ + G(q))) ε ℜⁿ and P(x₁) = M(x₁)⁻¹ ε ℜ^nxn.

Properties (Spong M. W. & Vidyasagar M. 1989):

P₁. The matrix M(x₁) is symmetric definite positive, then there exist two positive constants: M and M̄ such that: M ≤ ||M(x₁)|| ≤ M̄.

P₂. ∃ μ > 0 such that ||C(q, x)||≤ μ ||x||, ∀ x ε ℜ ⁿ .

P₃ The vector function f(x₁,x₂) is Lipschitz with respect to x₂. Thus there exists κ > 0 such that:

\begin{array}{l} ‖ f (x_{1}, x_{2}) - f (x_{1}, {\dot{q}}_{r e f}) ‖ \leq κ ‖ x_{2} - {\dot{q}}_{r e f} ‖ = κ ‖ e_{2} ‖, \\ \forall (x_{1}, x_{2}) \in ℜ^{n} \times ℜ^{n} . \end{array}

3. Receding-horizon control law

In the receding-horizon control strategy, the following control problem is solved at each t>0 and x(t):

\begin{array}{l} \underset{u (t)}{M i n} J (x (t), t, u (t)) = \\ \underset{u (t)}{M i n} \frac{1}{2} \int_{t}^{t + h} [x {(τ)}^{T} Q x (τ) + u {(τ)}^{T} R u (τ)] d τ \end{array}

(4)

subject to the equation (3) and x(t + h) = 0 for some h>0, where Q is positive definite and R positive semi-definite. Denote the optimal control to the above problem by u*(τ), τ ε [t, t + h]. The currently applied control is u(t) set equal to u*(t). This process is repeated for every next t for stabilization of the system at the origin. However, to solve a nonlinear dynamic optimization problem with equality constraints is highly computationally intensive, and in many cases it is impossible to be performed within a reasonable time limit. Furthermore, the global optimization solution cannot be guaranteed in each optimization procedure since, in general, it is a non-convex, constrained nonlinear optimization problem.

In order to find the current control that improves tracking error along a fixed interval, the output tracking error e(τ) = q(τ) – q_ref(τ) is used instead of the state vector x(τ) in the above receding control problem:

J (e, u, t) = \frac{1}{2} \int_{t}^{t + T} (e^{T} (τ) Q e (τ) + u {(τ)}^{T} R u (τ)) d τ

(5)

where Qεℜ^nxn is positive definite, Rεℜ^nxn positive semi-definite, T is the predicted tracking horizon.

To avoid the computational burden, we shall approximate the above receding- horizon control problem by Simpson's rule (Atkinson K. E. 1978):

\begin{array}{l} J = \frac{1}{2} \int_{t}^{t + T} L (τ) d τ \\ = \frac{T}{6} [L (t) + 4 L (t + \frac{T}{2}) + L (t + T)] \\ = \frac{h}{3} [L (t) + 4 L (t + h) + L (t + 2 h)] \end{array}

with T=2h is the prediction horizon and

L (τ) = e^{T} (τ) Q e (τ) + u^{T} (τ) R u (τ) .

A simple and effective way of predicting the cost function L(.) is to expand the predicted tracking error in a first order Taylor series, in the following way: q(t + h) = q(t) + h q̇(t) and the reference trajectory is predicted as follow: q _ref (t + h) = q _ref (t) + h q̇ _ref .

The predicted tracking error is then given by:

e (t + h) = e (t) + h \dot{e} (t) .

Predict e (t+2h) by another first-order Taylor series expansion at e (t+h) to have:

\begin{array}{l} e (t + 2 h) = q (t + 2 h) - q_{r e f} (t + 2 h) \\ = e (t) + 2 h \dot{e} (t) + h^{2} (f - {\overset{..}{q}}_{r e f}) + h^{2} P u (t) \end{array}

(6)

where

q (t + 2 h) = q (t) + 2 h \dot{q} (t) + h^{2} f + h^{2} P u (t)

and

q_{r e f} (t + 2 h) = q_{r e f} (t) + 2 h {\dot{q}}_{r e f} (t) + h^{2} {\overset{..}{q}}_{r e f} (t) .

Thus, the performance index (5) can be approximated as:

\begin{array}{l} J = \frac{h}{3} [e^{T} (t) Q e (t) + u^{T} (t) R u (t) \\ + 4 e^{T} (t + h) Q e (t + h) + 4 u^{T} (t + h) R u (t + h) \\ + e^{T} (t + 2 h) Q e (t + 2 h) + u^{T} (t + 2 h) R u (t + 2 h)] \end{array}

(7)

We can rewrite the performance index (7) in the conventional quadratic form by using the predicted tracking error given above, as:

\bar{J} = \frac{3}{2 h} J = \frac{1}{2} U^{T} θ (x) U + G^{T} (x) U + m (e, \dot{e})

where

θ (x) = | \begin{array}{l} R + h^{4} P^{T} (x_{1}) Q P (x_{1}) & 0 & 0 \\ 0 & 4 R & 0 \\ 0 & 0 & R \end{array} |

is positive definite matrix, m(e, ė): terms that are independent of U(t) where U(t)^T = [u(t)^T u(t+h)^T u(t+2h)^T],

G^{T} (x) = | h^{2} {(e + 2 h \dot{e} + h^{2} (f - {\overset{..}{q}}_{r e f})}^{T} Q P (x) 0 0 | .

The receding-horizon control problem that minimizes the cost function J̄ is: U(t) = –θ(x)⁻¹ G(x). The applied control signal to nonlinear system at time t is given by:

\begin{array}{l} u (t) = - h^{2} M (x_{1}) {(h^{4} Q + M (x_{1}) R M (x_{1}))}^{- 1} \times \\ Q (e + 2 h \dot{e} + h^{2} (f (x_{1}, x_{2}) - {\overset{..}{q}}_{r e f})) \end{array}

(9)

Note that with R=0, the above nonlinear predictive control law leads to the well known computed torque controller.

4- Stability analysis and robustness issues

In this section, we will investigate the stability and the robustness of the closed loop system with respect to model uncertainties.

1- Stability Analysis

Let Q = qI _n and R = rI _n (we give the same penalty to all joints), the tracking error of the nonlinear system (3) closed by the nonlinear feedback (9) is given by:

{\begin{array}{l} {\dot{e}}_{1} = e_{2} \\ {\dot{e}}_{2} = - q h^{2} {\bar{P}}^{- 1} e_{1} - 2 q h^{3} {\bar{P}}^{- 1} e_{2} + r {\bar{P}}^{- 1} M {(x_{1})}^{2} \times \\ (f (x_{1}, x_{2}) - {\overset{..}{q}}_{r e f}) \end{array}

(10)

Where $e_{1} = q (t) - q_{r e f} (t); e_{2} = \dot{q} (t) - {\dot{q}}_{r e f} (t)$ and $\bar{P} = q h^{4} I_{n} + r M {(x_{1})}^{2} .$ .

This equation can be written in compact form as:

\dot{e} = A (h, x_{1}) e + B η

(11)

Where $A (h, x_{1}) = | \begin{array}{l} 0 & I_{n} \\ - q h^{2} {\bar{P}}^{- 1} & - 2 q h^{3} {\bar{P}}^{- 1} \end{array} |$ ; $B = | \begin{array}{l} 0 \\ I_{n} \end{array} |$ ; $η = r {\bar{P}}^{- 1} M {(x_{1})}^{2} (f (x_{1}, x_{2}) - {\overset{..}{q}}_{r e f}) .$ .

Lemma-1: The matrix A(h,x₁) is Hurwitz.

Proof: Both matrix P̄ and its inverse are symmetric positives definite. Let x̄ ε ℜ ⁿ and $\bar{λ} \in ℜ$ are the eigenvector and the correspondent eigenvalue of the inverse of the above matrix. Thus, we have the equality:

A (h, x_{1}) | \begin{array}{l} \bar{x} \\ λ \bar{x} \end{array} | = | \begin{array}{l} λ \bar{x} \\ - q h^{2} \bar{λ} \bar{x} - 2 q h^{3} λ \bar{λ} \bar{x} \end{array} | = | \begin{array}{l} λ \bar{x} \\ λ^{2} \bar{x} \end{array} | = λ | \begin{array}{l} \bar{x} \\ λ \bar{x} \end{array} | N

ote that λ is the solution of equation:

λ^{2} + q h^{2} \bar{λ} + 2 q h^{3} \bar{λ} λ = 0

(12)

Therefore, λ is the eigenvalue of the matrix A(h, x₁) and $| \begin{array}{l} \bar{x} \\ λ \bar{x} \end{array} |$ the correspondent eigenvector. Set λ₁ and λ₂ the solution of the equation (12), we have the relations:

\begin{array}{l} λ_{1} + λ_{2} = - 2 q h^{3} \bar{λ} \\ λ_{1} λ_{2} = q h^{2} \bar{λ} \end{array}

Since the eigenvalue $\bar{λ}$ is positive, then λ₁ and λ₂ have a negative real part (end of the proof).

Since the matrix A(h,x₁) is a Hurwitz matrix, then for any symmetric positive definite matrix Q_A(h, x₁), there exists a symmetric positive definite matrix P_A(h, x₁) solution of the lyapunov equation:

\begin{array}{l} {\dot{P}}_{A} (h, x_{1}) + A {(h, x_{1})}^{T} P_{A} (h, x_{1}) + P_{A} (h, x_{1}) A (h, x_{1}) \\ = - Q_{A} (h, z_{1}) \end{array}

(13)

From the property P₃, the function f(x ₁ , x₂) is Lipschitz with regards to x₂, we can always find a bounded continuous function σ(e₂, t) and positive scalar μ satisfying the inequality:

‖ f (x_{1}, x_{2}) - {\overset{..}{q}}_{r e f} ‖ \leq σ (e_{2}, t) \leq u ‖ e ‖ .

(14)

Now, we can state the following theorem.

Theorem 1: The equilibrium point of the nonlinear system (3) in closed loop with the feedback control (9) is asymptotically stable if the following inequality hold:

μ < \frac{λ_{m i n} (Q_{A}) λ_{max} (\bar{P})}{2 r {\bar{M}}^{2} λ_{m a x} (P_{A})} .

Moreover, if r = 0, then the origin is asymptotically stable equilibrium point.

Proof:

Let V(e) = e ^T P_A e be a Lyapunov function candidate, the time derivative of this function along the trajectories (11) is:

\begin{array}{l} \dot{V} = e^{T} A {(h, x_{1})}^{T} P_{A} e + e^{T} P_{A} A (h, x_{1}) e + e^{T} {\dot{P}}_{A} e \\ + η^{T} B^{T} P_{A} e + e^{T} P_{A} B η \end{array}

By using the equality (13), we obtain:

\dot{V} = - e^{T} Q_{A} e + η^{T} B^{T} P_{A} e + e^{T} P_{A} B η

\dot{V} \leq - λ_{m i n} (Q_{A}) {‖ e ‖}^{2} + 2 ‖ B ‖ ‖ P_{A} ‖ ‖ e ‖

From the inequality (14), we can write:

\dot{V} \leq λ_{m i n} (Q_{A}) {‖ e ‖}^{2} + 2 μ r {\bar{M}}^{2} λ_{m a x} ({\bar{P}}^{- 1}) λ_{m a x} (P_{A}) {‖ e ‖}^{2}

Thus, V̇ ≤ –κ||e|| ² is negative definite if κ > 0, where $κ = λ_{m i n} (Q_{A}) - 2 μ {\bar{M}}^{2} r \frac{λ_{m a x} (P_{A})}{λ_{m a x} (\bar{P})}$ . This ensures the asymptotic stability of the equilibrium point.

Note that a short steady state error will be observed in tracking position error when r ≠ 0. However, if r = 0 the time derivative of the Lyapunov function becomes: V̇ = –λ_min(Q_A)||e|| ² which is negative definite. Thus, we can conclude that the origin becomes the equilibrium point of the system (12) and is asymptotically stable, i.e:

\underset{t \to \infty}{L i m} e (t) = \underset{t \to \infty}{L i m} {| q - q_{r e f} \dot{q} - {\dot{q}}_{r e f} |}^{T} = {| \begin{matrix} 0 & 0 \end{matrix} |}^{T} .

2- Robustness

In order to incorporate modeling uncertainties into the model of the rigid robot (1), the matrices M(q), C(q, q̇) and the vector G(q) are split up into a nominal part (indicated by the subscript zero) and an uncertain part as:

\begin{array}{l} (M_{0} (q) + Δ M) \overset{..}{q} + (C_{0} (q, \dot{q}) + Δ C) \dot{q} + G_{0} (q) + Δ G + f \\ = u (t) \end{array}

(15)

The friction torque f is included in the uncertain part given the difficulty to model it correctly. Obviously, only the nominal part of the model can be used by the nonlinear predictive control, given by:

\begin{array}{l} u (t) = - \frac{1}{h^{2}} M_{0} (q) (e + 2 h \dot{e}) \\ + (C_{0} (q, \dot{q}) \dot{q} + G_{0} (q)) + M_{0} (q) {\overset{..}{q}}_{r e f} \end{array}

(16)

Where R is setting to zero. With the nonlinear control law (16), the closed loop system is:

\begin{array}{l} \overset{..}{e} + \frac{2}{h} M {(q)}^{- 1} M_{0} (q) \dot{e} + \frac{1}{h^{2}} M {(q)}^{- 1} M_{0} (q) e \\ = - M^{- 1} (q) (Δ M {\overset{..}{q}}_{r e f} + Δ C \dot{q} + Δ G + f) = υ (q, \dot{q}, {\overset{..}{q}}_{r e f}, t) \end{array}

(17)

To estimate the worst case bound of the function υ, we make the following assumptions for all q ε ℜ ⁿ :

\begin{array}{l} {\underline{M}}_{0} \leq ‖ M_{0} (q) ‖ \leq {\bar{M}}_{0}; ‖ Δ M ‖ \leq \bar{m} \leq λ_{min} (M (q)); \\ ‖ Δ C ‖ \leq \bar{c}; ‖ Δ G ‖ \leq \bar{g}; ‖ f ‖ \leq \bar{f}; \end{array}

Given these assumptions with the inequalities (2), we can find a bounded continuous vector function ρ(e, ė, t) satisfying the inequality (Spong M. W. & Vidyasagar M.; 1989):

‖ υ ‖ < ρ (e, \dot{e}, t) \leq γ ‖ e ‖ for all q \in ℜ^{n},

where γ is a positive scalar and e = |q q̇| ^T = |e ₁ e ₂ | ^T .

In the state space representation, the system (17) can be transformed to:

\dot{e} = \bar{B} (h, x_{1}) e + B υ

(18)

where

\bar{B} (h, x_{1}) = | \begin{array}{l} 0 & I_{n} \\ - \frac{b (x_{1})}{h^{2}} & - \frac{2 b (x_{1})}{h} \end{array} | .

Since both M⁻¹(x₁) and M₀(x₁) are symmetric definite positives, the matrix b(x₁) = M⁻¹ (x₁) M₀ (x₁) has all its eigenvalues reels and positives (Samson C.; 1983). Thus, from the Lemma-1, we can conclude that the matrix B̄(h, x₁) is Hurwitz, then for any symmetric positive definite matrix Q_B(h,x₁) there exists a positive definite matrix P_B(h,x₁) solution of the given Lyapunov equation:

\begin{array}{l} {\dot{P}}_{B} (h, x_{1}) + \bar{B} {(h, x_{1})}^{T} P_{B} (h, x_{1}) + P_{B} (h, x_{1}) \bar{B} (h, x_{1}) \\ = - Q_{B} (h, x_{1}) \end{array}

Theorem 2: Suppose that the inequality holds $γ < \frac{λ_{m i n} (Q_{B})}{2 λ_{m a x} (P_{B})}$ , then the equilibrium point of the nonlinear system with uncertainties (15) closed by the optimal control (16) is asymptotically stable.

Proof:

Let V = e ^T P _B e a Lyapunov function candidate. The differentiation of V along the trajectories (18) leads to:

\begin{array}{l} \dot{V} = - e^{T} Q_{B} e + υ^{T} B^{T} P_{B} e + e^{T} P_{B} B υ . \\ \dot{V} \leq - λ_{m i n} (Q_{B}) {‖ e ‖}^{2} + 2 γ λ_{m a x} (P_{B}) {‖ e ‖}^{2} \\ \leq - (λ_{m i n} (Q_{B}) - 2 γ λ_{m a x} (P_{B})) {‖ e ‖}^{2} . \end{array}

Which is definite negative if $γ < \frac{λ_{m i n} (Q_{B})}{2 λ_{max} (P_{B})} .$ . Therefore, By LaSalle's invariance theorem, the solution e(t) of (18) tends to the invariance set:

S = {e / e_{1} = h^{2} b {({\bar{x}}_{1})}^{- 1} υ (\bar{x}, \overset{..}{q} r e f), e_{2} = 0} .

We conclude that bounded uncertainties will introduce a steady state error on tracking position angular. Where x̄ is the equilibrium point of the system (3) in closed loop with the control law (16).

3- Integral action

It is known in the literature that the integral action increases the robustness of the closed loop system against the low frequency disturbances as long as the closed loop system is stable. In this part, we shall incorporate an integral action in the loop to eliminate the steady state error and enhance the robustness of the proposed control scheme with respect to model uncertainties and disturbances.

Thus, the cost function to minimize becomes:

\begin{array}{l} J (e_{0}, u, t) = \frac{1}{2} \int_{t}^{t + T} (e_{t}^{T} (τ) Q e_{0} (τ) + u {(τ)}^{T} R u (τ)) d τ \\ = \frac{1}{2} \int_{t}^{t + h} L (τ) d τ \end{array}

(19)

Where ė₀= e₁ = x₁ – q _ref = q – q _ref and L(τ) = e₀ ^T (τ) Q e₀ (τ) + u ^T (τ) R u (τ).

Also in this case, we use the Simpson's rule to approximate the integral in the cost function (19) by:

\begin{array}{l} J (e_{0}, u, t) = \frac{h}{3} (L (t) + 4 L (t + h) + L (t + 2 h)) \\ W i t h : L (t) = e_{0}^{T} (t) Q e_{0} (t) + u^{T} (t) R u (t); \\ L (t + h) = e_{0}^{T} (t + h) Q e_{0} (t + h) + u^{T} (t + h) R u (t + h) \\ L (t + 2 h) = e_{0}^{T} (t + 2 h) Q e_{0} (t + 2 h) . \\ + u (t + 2 h) R u (t + 2 h) \end{array}

Note that in this case, the Taylor approximation of the predicted vector e₀(t + h) is given by:

\begin{array}{l} e_{0} (t + h) = e_{0} (t) + h e_{1} + \frac{h^{2}}{2} e_{2} + \frac{h^{3}}{6} (f (x) - {\overset{..}{q}}_{r e f}) \\ + \frac{h^{3}}{6} P (x_{1}) u (t) \end{array}

Following same steps in paragraph 2, the optimal control vector U(t) that minimizes the new cost function is:

U (t) = - {\bar{H}}^{- 1} \bar{G} (x)

(20)

Where U(t) = |u(t) u(t + h) u(t + 2h)| ^T ; $\bar{H} = d i a g (R + \frac{5}{9} h^{6} P^{T} (x_{1}) Q P (x_{1}), 4 R, R)$ and

\begin{array}{l} \bar{G} (x) = \\ | \begin{array}{l} \frac{2}{3} h^{3} P (x_{1}) Q (2 e_{0} + 3 h e_{1} + 2 h^{2} e_{2} + \frac{5}{6} h^{3} (f (x) - {\overset{..}{q}}_{r e f})) \\ 0 \\ 0 \end{array} | \end{array}

The control signal to be applied to the nonlinear system at time t is:

\begin{array}{l} u (t) = - \frac{2}{3} h^{3} M_{0} (x_{1}) {\bar{\bar{P}}}^{- 1} (h, x_{1}) Q \times \\ (2 e_{0} + 3 h e_{1} + 2 h^{2} e_{2} + \frac{5}{6} h^{3} (f (x) - {\overset{..}{q}}_{r e f})) \end{array}

(21)

Where $\bar{\bar{P}} (h, x_{1}) = \frac{5}{9} h^{6} Q + M_{0} (x_{1}) R M_{0} (x_{1}) .$ .

Let R = 0 in equation (21), the control signal becomes:

\begin{array}{l} u (t) = - \frac{9}{5} M_{0} (x_{1}) \times \\ (\frac{4}{3 h^{3}} e_{0} + \frac{2}{h^{2}} e_{1} + \frac{4}{3 h} e_{2} + \frac{5}{9} (f (x) - {\overset{..}{q}}_{r e f})) \end{array}

The dynamic of the tracking error is given by the equations:

{\begin{array}{l} {\dot{e}}_{0} = e_{1} \\ {\dot{e}}_{1} = e_{2} \\ {\dot{e}}_{2} = - \frac{12}{5 h^{3}} b (x_{1}) e_{0} - \frac{18}{5 h^{2}} b (x_{1}) e_{1} - \frac{12}{5 h} b (x_{1}) e_{2} \\ + υ (x, {\overset{..}{q}}_{r e f}) \end{array}

(22)

Or in compact form:

\dot{e} = \tilde{B} (h, x_{1}) e + B υ (x, {\overset{..}{q}}_{r e f})

(23)

Where:

\tilde{B} (h, x_{1}) = | \begin{array}{l} 0 & I_{n} & 0 \\ 0 & 0 & I_{n} \\ - \frac{12}{5 h^{3}} b (x_{1}) & - \frac{18}{5 h^{2}} b (x_{1}) & - \frac{12}{5 h} b (x_{1}) \end{array} |

Lemma 2: Suppose that λ_max(ε(x₁)) < 2.6, with $ɛ (x_{1}) = M_{0}^{- 1} Δ M$ , then the matrix $\bar{\bar{B}} (h, x_{1})$ is Hurwitz.

Proof: Let x̄ ε ℜ ⁿ and $\bar{λ} \in ℜ^{+}$ represent eigenvector and eigenvalue of the matrix b(x₁) respectively. Set $v = {| \begin{array}{l} \bar{x} & \tilde{λ} \bar{x} & \tilde{λ} \bar{x} \end{array} |}^{T} \in ℜ^{3 n}$ be a vector and $\tilde{λ} \in ℜ$ a scalar. We have:

\begin{array}{l} \tilde{B} (h, x_{1}) v = | \begin{array}{l} \tilde{λ} \bar{x} \\ {\tilde{λ}}^{2} \bar{x} \\ - \frac{12}{5 h^{3}} \bar{λ} \bar{x} - \frac{18}{5 h^{2}} \tilde{λ} \bar{λ} \bar{x} - \frac{12}{5 h} {\tilde{λ}}^{2} \bar{λ} \bar{x} \end{array} | . \\ = | \begin{array}{l} \tilde{λ} \bar{x} \\ {\tilde{λ}}^{2} \bar{x} \\ {\tilde{λ}}^{3} \bar{x} \end{array} | = \tilde{λ} v \end{array}

Then, $\tilde{λ}$ and v are the eigenvalue and eigenvector of the matrix B˜(h, x₁) respectively, where the eigenvalue $\tilde{λ}$ verify the equality:

{\tilde{λ}}^{3} + \frac{12}{5 h} \bar{λ} {\tilde{λ}}^{2} + \frac{18}{5 h^{2}} \bar{λ} \tilde{λ} + \frac{12}{5 h^{3}} \bar{λ} = 0.

By using the Rooth-Hurwitz criterion, the solutions of the above equation lie in left half plane (stable domain) if: $\bar{λ} > \frac{5}{18}$ .

Since M^–1M ₀ = (I _n + ε(x ₁ ))^–1, the condition for stability becomes: $λ_{m a x} (I_{n} + ɛ (x_{1})) < \frac{18}{5}$ or λ_max (ε(x₁)) < 2.6 which can be easily verified if the uncertainty ΔM is small with regards to the nominal value of the matrix M₀(x₁).

We conclude that the matrix B˜(h, x₁) is Hurwitz, then for any symmetric positive definite matrix Q˜(h, x₁) there exists a positive definite matrix P˜(h, x₁) solution of the given Lyapunov equation:

\begin{array}{l} \dot{\tilde{P}} (h, x_{1}) + \tilde{B} {(h, x_{1})}^{T} \tilde{P} (h, x_{1}) + \tilde{P} (h, x_{1}) \tilde{B} (h, x_{1}) \\ = - \tilde{Q} (h, x_{1}) \end{array}

Theorem 3: Suppose the inequality $γ < \frac{λ_{m i n} (\tilde{Q})}{2 λ_{m a x} (\tilde{P})}$ hold, then the equilibrium point of the nonlinear system with uncertainties (23) is asymptotically stable.

The proof may be obtained in the same way as the proof of the theorem 2 and is therefore omitted. Therefore, the equilibrium point of (22) or (23) is asymptotically stable. The tracking error tends towards the set:

S = {e / e_{0} = \frac{5 h^{3}}{12} b {({\bar{x}}_{1})}^{- 1} υ (\bar{x}, {\overset{..}{q}}_{r e f}), e_{1} = 0, e_{2} = 0} W

e conclude that the position and velocity tracking error converge to zero, therefore the integral action eliminates the position steady state error, i.e:

\underset{t \to \infty}{L i m} e_{1} = 0 and \underset{t \to \infty}{L i m} e_{2} = 0.

The price to be paid by introducing an integral action in the loop is that the control signal will not vanish and this will increase the required energy to maintain the tracking performance as in the matched case.

5. Nonlinear observer

A drawback of the previous nonlinear predictive controller is that it requires at least the measurement of velocity on the link side. However, as pointed out in (Nicosia S. & Tomei P. 1990) and (Canudas W. C. et al 1992), in the practical robotic systems all the generalized coordinates can be precisely measured by the encoder for each joint, but the velocity measurements obtained through the tachometers are easily perturbed by noises. Therefore, in order to coincide with these physical constraints, a nonlinear observer proposed in (Bornard G. et al 1993) is used in this paper.

Define the state vector as:

z (t) = T x (t) = [\dots \dots q_{i} (t) {\dot{q}}_{i} (t) \dots \dots] \in ℜ^{2 n}

where q_i(t) and q̇_i(t) are the link position and the velocity of the i^th arm respectively. Tεℜ^2nx2n is the transformation matrix. The system (3) can be transformed to:

{\begin{array}{l} \dot{z} = A z + H f (z) + H P (q) u (t) \\ y = C z \end{array}

(24)

where

\begin{array}{l} A = diag (A_{i}), A_{i} = | \begin{array}{l} 0 & 1 \\ 0 & 0 \end{array} |, C = diag (C_{i}), C_{i} = | \begin{array}{l} 1 & 0 \end{array} |, \\ H = diag (H_{i}), H_{i} = {| \begin{array}{l} 0 & 1 \end{array} |}^{T} for i = 1, \dots \dots \dots, n . \end{array}

With the assumption that the control torque u(t) is uniformly bounded, the high gain observer described in (Bornard G. et al, 1993) can be used to estimate angular positions and angular velocities of the n link rigid robot manipulator (24). The dynamic nonlinear observer is given by:

{\begin{array}{l} \dot{\hat{z}} = A \hat{z} + H f (\hat{z}) + H P (q) u (t) + K (y - \hat{y}), \\ \hat{y} = C \hat{z}, \end{array}

(25)

where K = Γ⁻¹(α) V is the gain of the observer with T (α) = diag(Γ _i (α)),

Γ_{i} (α) = | \begin{array}{l} α & 0 \\ 0 & α^{2} \end{array} |

for any α >0 and due to the observability property of (A,C) the eigenvalues of (A-VC) can be assigned by V.

6. A Simulation example

To illustrate some of the conclusions of this paper, we have simulated the approximate receding-horizon control scheme on a two-link robot arm used in (Lee K. W. & Khalil H. K. 1997), (Spong M. W. 1992) with mechanic equations.

The arm is shown in Figure 1. The dynamic model is described in equation (1) with the following components:

\begin{array}{l} m_{11} (q) = m_{1} l_{c 1}^{2} + m_{1} l_{c 2}^{2} + m_{2} l_{c 1}^{2}; \\ + 2 m_{2} l_{1} l_{c 2} cos (q_{2}) + I_{1} + I_{2} \\ m_{12} (q) = m_{21} (q) = I_{2} + m_{2} l_{c 2}^{2} + m_{2} l_{1} l_{2} cos (q_{2}); \\ m_{22} (q) = I_{2} + m_{2} l_{c 2}^{2}; C_{11} (q, \dot{q}) = - {\dot{q}}_{2} m_{2} l_{1} l_{c 2} sin (q_{2}); \\ C_{12} (q, \dot{q}) = - ({\dot{q}}_{1} + {\dot{q}}_{2}) m_{2} l_{1} l_{c 2} sin (q_{2}) . \\ C_{21} (q, \dot{q}) = {\dot{q}}_{1} m_{2} l_{1} l_{2} sin (q_{2}), C_{22} (q, \dot{q}) = 0. \\ G_{1} (q) = (m_{1} l_{c 1} + m_{2} l_{1}) g cos (q_{1}) + m_{2} l_{c 2} g cos (q_{1} + q_{2}) \\ G_{2} (q) = m_{2} l_{c 2} g cos (q_{1} + q_{2}) . \end{array}

where 1_c1 is the mass center of gravity coordinate of the link 1, and 1_c2 is the mass center of gravity coordinate of the link 2. The values of the manipulator parameters are given in Table 1 (Lee K. W. & Khalil H. K. 1997) and (Spong M. W. 1992).

Fig.1

Two-linkage manipulators

Table 1

Physical parameters of two-link arm.

Link 1	m₁=10kg l₁=1m l_c1=0.5m $I_{1} = \frac{10}{12} {kgm}^{2}$
Link 2	m₂=5kg l₂ = 1m l_c2=0.50m $I_{2} = \frac{5}{12} {kgm}^{2}$

The reference models chosen in continuous time are:

\begin{array}{l} q_{r e f} = | \begin{array}{l} q_{r e f 1} \\ q_{r e f 2} \end{array} | & with & q_{r e f 1} (s) = \frac{ω_{1}^{2}}{s^{2} + 2 ξ ω_{1} s + ω_{1}^{2}} \end{array} r_{1} (s)

and

q_{r e f 2} (s) = \frac{ω_{2}^{2}}{s^{2} + 2 ξ ω_{2} s + ω_{2}^{2}} r_{2} (s) .

The nonlinear predictive controller is used to track these desired trajectories with inputs (Lee K. W. & Khalil H. K. 1997):

r_{1} (t) = r_{2} (t) = 1.5 (1 - e x p (- 5 t) (1 + 5 t)) r a d .

All simulations are carried with the nonlinear observer (25) with α=0.01 and the assigned eigenvalues σ(λ) = {–0.4, −0.8}. The initial displacements and velocities are chosen as:

\begin{array}{l} q_{1} (0) = q_{2} (0) = 0^{\circ}, {\dot{q}}_{1} (0) = {\dot{q}}_{2} (0) = 0, \\ {\hat{q}}_{1} (0) = {\hat{q}}_{2} (0) = 0.01 and {\dot{\hat{q}}}_{1} (0) = {\dot{\hat{q}}}_{2} (0) = 0. \end{array}

The parameter values of two-reference models are chosen as follows: ξ = 1, ω₁ = ω₂ = 10 rad/s.

The nonlinear controller (9), has been tested by simulation and the control parameters: Q=10⁷ I_n, R=10⁻¹⁴ I_n and h is set to 0.001. Simulation results are show in Figure 2. This Figure gives the angular position (q₁(t), q₂(t)) and the position tracking error. Although a very short steady state error is observed in the position tracking error and this was expected in the analysis part, a good tracking performance is achieved by this controller in matched case. Figure 3 illustrates the induced control torque applied to robot manipulator. Note that the control torque lie inside the saturation limits (Lee K. W. & Khalil H. K. 1997).

Fig.2

Position tracking error of joint q₁ and q₂. (matched case).

Fig.3

Induced torque control.

In mismatched case, the frictions are added to the joint of robot manipulator model in equation (1) and are modeled: F _r = f _s q + f_v sign(q̇), with the values f _s = f_v = diag(5,5).

If we regard an unknown load carried by the robot as part of the second link, then the parameters m₂, l_c2 and I₂ will change to: m₂ + Δm₂, l_c2 + Δl_c2 and I₂ + ΔI₂.

Let Δm₂ = 5Kg, Δl_c2 = 0.5m and and $Δ I_{2} = \frac{1}{6} K g m^{2}$ to be the maximum parameters variations of the second link due to unknown load. It is observed from Figure 4 that the output q₂(t) tracks tightly the reference trajectory with a steady state error. This results has been expected in the analysis did in section 4, i.e the uncertainties will introduce a steady state error in tracking error. Furthermore, the induced torque control lie outside the saturation limits. Figure 5 shows the results when the control law (21) is applied to robot, it is seen from this figure the error was eliminated and the torque control signal lie inside the saturation limits. These results prove the robustness of the rigid link manipulator under the approximate receding-horizon controller with integral action to payload uncertainties and viscous friction.

Fig.4

Position tracking performance in mismatched case.

Fig.5

Torque and tracking performance with integral action.

In addition, other simulations have been carried out and the following remarks have been observed:

Decreases in performances are obtained when we increase the step control parameter h.

High dynamics of the reference trajectories results in high increase of the control torque signal. To reduce this control torque amplitude, one should increase the predictive time increment h. It should be pointed that over a threshold value of h noted h_max, the performance decrease and instability mechanism will appear. This is due to the Taylor approximation used to derive the predictive controller, which becomes invalid.

7. Conclusion

In this paper, the approximation of receding-horizon controller of rigid link robot manipulator using output feedback via link position measurements were considered. Minimizing a quadratic function of the predicted tracking error and the predicted input over the fixed horizon, by using the Simpson's rule approximation, derives the control law. One of the main advantages of these control schemes is that it does not need to perform an online optimization and asymptotic tracking of the smooth reference signal is guaranteed.

To enhance the robustness property of the nonlinear predictive developed by Ping Lu, we proposed to incorporate an integral action in the loop. Simulation shown that payload uncertainties and friction have no effect on the robot manipulator closed by the proposed algorithm. Moreover, the obtained torque signal lie in saturations limit. The Lyapunov theory is used to prove the asymptotic stability of equilibrium point of both original and augmented system.

Finally, we expect that the results presented here can be explored and extended to discrete implementation of these continuous-time predictive controllers either through computers or by special purpose chips that can run at a higher speed.

References

Abdullah

Dawsan

Dorato

Jamshidi

(1991), Survey of robust control for rigid robots, IEEE Control Magazine 11, pp.24–30.

Atkinson

K. E.

, (1978) An introduction to numerical analysis, John Wiley, New York.

Bornard

Celle-Couenne

Gilles

, (1993), Observabilite et observateurs, A. J Fossard and D. Normad-Cyrot, Systèmes nonlinéaires: Modélisation et estimation 5, Paris, Masson, pp.177–221.

de Wit

C. Canudas

Fixot

(1992), Adaptive control of robot manipulator via velocity estimated feedback, IEEE Transactions On Automatic Control, vol. 37, pp.1234–1237.

de Wit

C. Canudas

Fixot

Astrom

K. J.

, (1992), Trajectory tracking in robot manipulators via nonlinear estimated output feedback, IEEE Transaction On Robotic and Automation 8, pp.138–144.

Chen

W. H.

Balance

D. J.

Gawthrop

P. J

, (2003), Optimal control of nonlinear systems: A predictive approach, Automatica, vol. 39, pp.633–641.

Chen

B. S.

Lee

T. S.

Feng

J. H.

(1994), A nonlinear H∝ control design in robotic systems under parameter perturbation and external disturbances, International Journal of Control, 59(2), pp.439–461.

Lee

K. W.

Khalil

H.K.

(1997), Adaptive output feedback control of robot manipulators using high gain observer, International Journal of Control, vol.67, No.6, pp.869–886.

Michalska

Mayne

D. Q.

(1993), Robust receding horizon control of constrained nonlinear systems, IEEE Transaction On Automatic Control, 38(11), pp.1623–1633.

10.

Nicosia

Tomei

, (1990), Robot control by using only joint position measurements, IEEE Transaction On Automatic Control. 35(9), pp.1058–1061.

11.

Ortega

Spong

M. W.

(1989), Adaptive motion control of rigid robots: A tutorial, Automatica 25, pp.877–888.

12.

Ping

, (1995) Optimal predictive control of continuous nonlinear systems, International Journal of Control, 62(2), pp. 633–649.

13.

Ping

, (1998), Approximate nonlinear receding-horizon control laws, International Journal of Control, 71(1), pp.19–34.

14.

Sage

H. G.

de Mathelin

M. F.

Ostertag

, (1999), Robust control of robot manipulators: A survey, International Journal of Control, 72(16), 498–522.

15.

Samson

, (1983), Commande non linéaire robuste des robots manipulateurs, Rapport de recherche No.182, INRIA, France.

16.

Singh

S. M.

(1995), Nonlinear predictive control of feedback linearizable systems and flight control system design, Journal of guidance, control and dynamics, 18(5), pp.1023–1028.

17.

Slotine

J.J E.

Sastry

S. S.

, (1983), Tracking control of nonlinear systems using sliding surfaces with application to robot manipulators, International Journal of Control, 38(2), pp.465–492.

18.

Souroukh

Kravaris

, (1996), A continuous-time formulation of nonlinear model predictive control, International Journal of Control, 63(1), pp. 121–146.

19.

Spong

M. W.

Vidyasagar

, (1989), Robot dynamics and control, John Wiley, New York.

20.

Spong

M. W.

(1992) On robust control of robot manipulators, IEEE Transactions on automatic control, vol.37, No.11, pp.1782–1786.