Backstepping-Based Inverse Optimal Attitude Control of Quadrotor

Abstract

Abstract Input saturation must be taken into account for applying rapid reorientation in the large angle manoeuvre of a quadrotor. In this paper, a backstepping-based inverse optimal attitude controller (BIOAC) is derived which has the property of a maximum convergence rate in the sense of a control Lyapunov function (CLF) under input torque limitation. In the controller, a backstepping technique is used for handling the complexity introducing by the unit quaternion representation of the attitude of a quadrotor with four parameters. Moreover, the inverse optimal approach is employed to circumvent the difficulty of solving the Hamilton-Jacobi-Bellman (HJB) equation. The performance of BIOAC is compared with a PD controller in which the input torque limitation is not considered under the same unit quaternion representation using numerical simulation while the results show that BIOAC gains faster convergence with less control effort. Next, BIOAC is realized on a test bed and the effectiveness of the control law is verified by experimental studies.

Keywords

Input Saturation Backstepping Technique Inverse Optimal Approach Maximum Rate Attitude Controller Quadrotor

1. Introduction

Compared with a fixed-wing aircraft, a rotorcraft has special advantages in city and indoor environments due to a capability for VTOL (vertical take-off and landing) and hovering flight. As a new kind of rotorcraft, a quadrotor has certain unique advantages over other traditional rotorcrafts. Firstly, it avoids the complexity of swash plate and mechanical linkages. Secondly, it is much safer for indoor flying because its propellers are smaller and can be wrapped up. In addition, it has a greater thrust-weight ratio and better manoeuvrability performance. Therefore, a great deal of work has been done to research the control of the quadrotor. The projects in [1] and [2] use commercially-available toys to derive attitude controllers and position controllers respectively. Some nonlinear control methods, such as backstepping [3], sliding mode [4] and inverse dynamics [5] are used for better control performance of the quadrotor. The OS4 project [6] and X-4 [7] develop their own prototypes for generating more thrust and better stability. Most of the works presented in the literature use simplified models, where nonlinear effects and the performance of the actuators are ignored. However, when a rapid reorientation problem is considered for a large angle manoeuvre of a quadrotor, input saturation will cause control failure and must be taken into account. This is one of the unavoidable problems in the actual control system [8].

In this paper, the maximum rate attitude control problem under the input saturation is addressed. Moreover, a backstepping-based inverse optimal attitude controller (BIOAC) is derived which has the property of a maximum convergence rate within the meaning of a control Lyapunov function under input torque limitation. For designing BIOAC, the representation of the attitude of the quadrotor must first be determined. In all representations, the unit quaternion is a pervasive one. Distinct from minimal attitude representations, such as Euler angles - which have the disadvantages of singularities and complicated trigonometric expressions - the unit quaternion provides a globally non-singular representation of attitude. Compared with a classical direction cosine matrix, it has only 4 numbers and the multiplication and propagation rule can be calculated more effectively [9–11]. However, unit quaternion representation for the attitude of a quadrotor introduces some difficulties in controller design. For simplifying the design procedure of the controller, a backstepping technique [12,13] is employed in this work.

The optimal control problem for nonlinear systems is usually attributed to the solvability of the Hamilton-Jacobi-Bellman(HJB) equation. Moreover, it is particularly difficult to solve the HJB equation in the optimal control problem while taking into account the attitude kinematics and dynamics at the same time. An alternative method is the so-called inverse optimal control method in designing an optimal feedback controller [14–17]. Based on a control Lyapunov function (CLF), the inverse optimal approach circumvents solving the HJB equation. In addition, a feedback controller for a set of reasonable cost functions can be achieved.

This article is organized as follows. In section two, a mathematical model of a quadrotor on the attitude platform is given based upon the assumptions of a single rigid body and a symmetrical structure. Next, a maximum rate attitude controller is designed under input saturation conditions in section three. The performance of the proposed BIOAC is compared with a PD controller using simulations in section four, which is followed by experimental studies in section five. Finally, the conclusion is outlined in the last section.

2. Attitude Dynamics of the Quadrotor

2.1 Unit quaternion kinematics

A unit quaternion $\overset{⇀}{q}$ is a complex number with four parameters. It has the norm of constant one, and can be seen as a point of the unit sphere S³

\begin{array}{l} \overset{⇀}{q} = q_{0} + q_{1} i + q_{2} j + q_{3} k \\ q_{0}^{2} + q_{1}^{2} + q_{2}^{2} + q_{3}^{2} = 1 \end{array}

(1)

Let q represent its vector part, i.e. q˜ = q₁i + q₂j + q₃k, then the unit quaternion can be written in the following form:

\overset{⇀}{q} = [q_{0}, \tilde{q}]

(2)

Given a unit quaternion, there is a correspondence between the unit quaternion and rotation transformation:

R (q_{0}, \tilde{q}) = (q_{0}^{2} - {\tilde{q}}^{T} \tilde{q}) I + 2 q_{0} S (\tilde{q}) + 2 \tilde{q} {\tilde{q}}^{T}

(3)

where:

S (\tilde{q}) = [\begin{matrix} 0 & - q_{3} & q_{2} \\ q_{3} & 0 & - q_{1} \\ - q_{2} & q_{1} & 0 \end{matrix}]

(4)

Notice R(q₀,q˜) = R(–q₀,–q˜). Thus, the unit sphere S³ represented by the unit quaternion is a double cover of the attitude space SO(3). The phenomenon is clearer using the angle form of the unit quaternion. Euler's theorem states that any rotation in space can be achieved by rotating an angle θ around an eigenaxis $\overset{⇀}{n}$ ; then, an unit quaternion can be written as:

\overset{⇀}{q} = (\cos (θ / 2), \sin (θ / 2) \overset{⇀}{n}) \begin{matrix} ​ & θ \in (- 2 π, 2 π] \end{matrix}

(5)

It is obvious that rotating the angle 0 < θ₀ ≤ 2π around eigenaxis $\overset{⇀}{n}$ is equivalent to a rotation of the 2π – θ₀ angle around an eigenaxis of $- \overset{⇀}{n}$ . In Fig. 1, the X point and O point are different by quaternion representation, but have the same physical meaning. Actually, $\overset{⇀}{q} = (\cos (θ_{0} / 2), \sin (θ_{0} / 2) \overset{⇀}{n})$ and $- \overset{⇀}{q} = (\cos (π + θ_{0} / 2), \sin (π + θ_{0} / 2) \overset{⇀}{n})$ have the same physical meaning.

Figure 1.

Quaternion representation along some fixed axis

The conjugate of a unit quaternion is:

{\overset{⇀}{q}}^{*} = (q_{0}, - \tilde{q})

(6)

Given two unit quaternion ${\overset{⇀}{q}}_{1} = (q_{01}, {\tilde{q}}_{1})$ and ${\overset{⇀}{q}}_{2} = (q_{02}, {\tilde{q}}_{2})$ , the multiplication ⊗ is defined as:

{\overset{⇀}{q}}_{1} \otimes {\overset{⇀}{q}}_{2} = [\begin{matrix} q_{01} q_{02} - {\tilde{q}}_{1}^{T} {\tilde{q}}_{2} \\ q_{01} {\tilde{q}}_{2} + q_{02} {\tilde{q}}_{1} + S ({\tilde{q}}_{1}) {\tilde{q}}_{2} \end{matrix}]

(7)

The unit quaternion multiplication is equivalent to the multiplication of the rotation matrix. If ${\overset{⇀}{q}}_{1}$ and ${\overset{⇀}{q}}_{2}$ correspond to the rotation matrices R₁ and R₂ in SO(3), then there are the following correspondences:

\begin{array}{l} R_{1} R_{2} \Leftrightarrow \pm {\overset{⇀}{q}}_{1} \otimes {\overset{⇀}{q}}_{2} \\ R_{1} \overset{⇀}{v} \Leftrightarrow {\overset{⇀}{q}}_{1} \otimes \bar{v} \otimes {\overset{⇀}{q}}_{1}^{*} \end{array}

(8)

where $\bar{v} = {(0, {\overset{⇀}{v}}^{T})}^{T}$ .

When a rigid body is rotating with an angular velocity vector $\overset{⇀}{w}$ , the time derivative of the unit quaternion can be given by the unit quaternion propagation rule:

\dot{\overset{⇀}{q}} = \frac{1}{2} \overset{⇀}{q} \otimes \bar{w}

(9)

2.2 System model

In order to establish the dynamic equations of the quadrotor attitude system, coordinate frames must be establish first of all. This article uses two coordinate frames: an inertial coordinate frame and a body coordinate frame. As seen in Fig. 2, the origin of the body coordinate frame O_b – X_bY_bZ_b is located in the centre of the cross beam of the quadrotor: axis X pointing towards propeller 1, axis Z is perpendicular to the cross beam plane, and the three axis is to form a right-handed coordinate system.

Figure 2.

Coordinate frames of the quadrotor

The quadrotor dynamics are based upon two assumptions:

The quadrotor is a rigid body.

Any small error in the manufacturing and installation of the vehicle body is not considered, so the quadrotor is assumed to be symmetric.

Based upon assumption one, the following mathematical model can be derived from the Newton-Euler equations and quaternion kinematics:

J \dot{\overset{⇀}{w}} = \overset{⇀}{u} - \overset{⇀}{w} \times (J \overset{⇀}{w})

(10a)

{\begin{matrix} \dot{\tilde{q}} = \frac{1}{2} (q_{0} I + S (\tilde{q})) \vec{w} \\ {\dot{q}}_{0} = - \frac{1}{2} {\tilde{q}}^{T} \vec{w} \end{matrix}

(10b)

where $\overset{⇀}{q} = {[q_{0} {\tilde{q}}^{T}]}^{T}$ represents the attitude quaternion of a body coordinate frame with respect to an inertial coordinate frame, $\overset{⇀}{w}$ is the angular velocity in body coordinate frame, and $\overset{⇀}{u}$ is the torque acting on the quadrotor.

According to the second assumption, the correspondence between four propeller thrusts f_i (i = 1,2,3,4) and the torque u(i = 1,2,3) acting on the quadrotor is given as follows:

[\begin{matrix} f \\ u_{1} \\ u_{2} \\ u_{3} \end{matrix}] = [\begin{matrix} 1 & 1 & 1 & 1 \\ 0 & d & 0 & - d \\ - d & 0 & d & 0 \\ - c_{τ f} & c_{τ f} & - c_{τ f} & c_{τ f} \end{matrix}] [\begin{matrix} f_{1} \\ f_{2} \\ f_{3} \\ f_{4} \end{matrix}]

(11)

where f is the total thrust and d is the distance between the propeller and the centre of the cross beam and c_τf is the torque thrust ratio of the propeller.

Our model does not include the propeller gyroscopic torque and the flapping dynamic term; thus, $\overset{⇀}{u}$ is consists of just three axis control torques u₁,u₂,u₃:

\overset{⇀}{u} = [\begin{matrix} u_{1} \\ u_{2} \\ u_{3} \end{matrix}]

(12)

Considering the low-speed characteristics of the quadrotor, the following simplified equation is used to derive the propeller thrust [1]:

f_{i} = c_{f} Ω_{i}^{2}

(13)

where c_f is a constant coefficient.

3. Controller Design

The objective is to design a time optimal attitude controller under the limitations of input torque. In order to circumvent the difficulty of solving the Hamilton-Jacobi-Bellman (HJB) equation, the inverse optimal control approach is used. However, since the unit quaternion has four elements, it is difficult to implement the inverse optimal theorem directly. In order to simplify the design complexity, the backstepping technique is employed in the controller design procedure.

3.1 Inverse Optimal Theorem

The inverse optimal control approach is originated by Kalman to establish certain gain and phase margins of linear quadratic regulators. However, until Freeman uses the approach in [15] to develop a methodology for designing a robust nonlinear controller, it has been long dormant. In this section, only the main results of the approach are given, more detailed material and proofs can be found in [14–16].

Consider a control affine nonlinear system:

\dot{x} = f (x) + g (x) u

(14)

where u ∈ ℝⁿ is a control input, and f(x) and g(x) are continuous function matrices.

Definition 1: control Lyapunov function(CLF)

Let V be a C¹, proper, positive definite function V: ℝⁿ → ℝ₊, such that:

\inf_{u} [L_{f} V (x) + L_{g} V (x) u] < 0

(15)

for all x ≠ 0. The existence of a CLF for the system (14) is equivalent to the existence of a globally asymptotically stabilizing control law u = k(x).

Theorem 1: inverse optimal theorem

Assume that the static state feedback control law:

u = κ (x) : = - N^{- 1} (x) {(L_{g} V)}^{T}

(16)

stabilizes the system in (14) with respect to a positive definite radially unbounded Lyapunov function V (x), where N: ℝⁿ → ℝⁿ is a positive definite matrix-valued function. Then the control law:

u = κ^{*} (x) : = β κ (x) \begin{matrix} ​ & β \geq 2 \end{matrix}

(17)

is optimal with respect to the cost:

J = \int_{0}^{\infty} {l (x) + u^{T} R (x) u} d t

(18)

where:

\begin{array}{l} l (x) = - 2 β L_{f + g κ} V + … \\ … β (β - 2) L_{g} V \cdot R^{- 1} (x) {(L_{g} V)}^{T} > 0 \end{array}

(19)

3.2 Backstepping-based Inverse Optimal Attitude Controller

The controller design procedures of backstepping can be divided into two phases:

Design a virtual control law to stabilize virtual quadrotor kinematics.

Design the true control law which would be implemented on the system based upon the quadrotor dynamics and the virtual control law designed in phase 1.

3.2.1 Control of the Kinematics Subsystem

The kinematics subsystem in (10b) is controlled only indirectly through the angular velocity vector w, so the virtual system is considered:

{\begin{matrix} \dot{\tilde{q}} = \frac{1}{2} (q_{0} I + S (\tilde{q})) \overset{⇀}{v} \\ {\dot{q}}_{0} = - \frac{1}{2} {\tilde{q}}^{T} \overset{⇀}{v} \end{matrix}

(20)

where $\overset{⇀}{v}$ is the virtual control. Let the attitude error be:

\begin{matrix} {\overset{⇀}{q}}_{e} = {\overset{⇀}{q}}_{d}^{*} \otimes \overset{⇀}{q} \\ = [\begin{matrix} q_{d 0} q_{0} + {\tilde{q}}_{d}^{T} \tilde{q} \\ q_{d 0} \tilde{q} - q_{0} {\tilde{q}}_{d} - S ({\tilde{q}}_{d}) \tilde{q} \end{matrix}] \end{matrix}

(21)

Then the derivative of attitude error is:

\begin{array}{l} {\dot{\overset{⇀}{q}}}_{e} = [\begin{matrix} q_{d 0} {\dot{\tilde{q}}}_{0} + {\tilde{q}}_{d}^{T} \dot{\tilde{q}} \\ q_{d 0} \dot{\tilde{q}} - {\dot{q}}_{0} {\tilde{q}}_{d} - S ({\tilde{q}}_{d}) \dot{\tilde{q}} \end{matrix}] \\ = \frac{1}{2} [\begin{matrix} - q_{d 0} {\tilde{q}}^{T} + q_{0} {\tilde{q}}_{d}^{T} + {(S ({\tilde{q}}_{d}) \tilde{q})}^{T} \\ [q_{d 0} I - S ({\tilde{q}}_{d})] [q_{0} I + S (\tilde{q})] + {\tilde{q}}_{d} {\tilde{q}}^{T} \end{matrix}] \overset{⇀}{v} \\ = \frac{1}{2} [\begin{matrix} - {\tilde{q}}_{e}^{T} \\ q_{e 0} I + S ({\tilde{q}}_{e}) \end{matrix}] \overset{⇀}{v} \end{array}

(22)

Equation (22) can be rewritten as:

{\begin{matrix} {\dot{\tilde{q}}}_{e} = \frac{1}{2} (q_{e 0} I + S ({\tilde{q}}_{e})) \overset{⇀}{v} \\ {\dot{q}}_{e 0} = - \frac{1}{2} {\tilde{q}}_{e}^{T} \overset{⇀}{v} \end{matrix}

(23)

Consider a Lyapunov function candidate for the virtual system:

V_{1} = {[\begin{matrix} 1 - | q_{e 0} | \\ {\tilde{q}}_{e} \end{matrix}]}^{T} [\begin{matrix} 1 - | q_{e 0} | \\ {\tilde{q}}_{e} \end{matrix}]

(24)

V₁ ≥ 0 and V₁ = 0 if and only if ${\overset{⇀}{q}}_{e} = {[\pm 1 0 0 0]}^{T}$ .

The derivative of V₁ is:

{\dot{V}}_{1} = {[\begin{matrix} 1 - | q_{e 0} | \\ {\tilde{q}}_{e} \end{matrix}]}^{T} [\begin{matrix} sgn (q_{e 0}) {\tilde{q}}_{e}^{T} \\ q_{e 0} I + S ({\tilde{q}}_{e}) \end{matrix}] v

(25)

Here, sgn(·) is not a traditional sign function, and is defined as:

sgn (λ) = {\begin{matrix} 1 λ \geq 0 \\ - 1 λ < 0 \end{matrix}

Let:

G^{T} ({\overset{⇀}{q}}_{e}) = [\begin{matrix} sgn (q_{e 0}) {\tilde{q}}_{e}^{T} \\ q_{e 0} I + S ({\tilde{q}}_{e}) \end{matrix}]

(26)

Then, we have:

\begin{matrix} G ({\overset{⇀}{q}}_{e}) [\begin{matrix} 1 - | q_{e 0} | \\ {\tilde{q}}_{e} \end{matrix}] = {[\begin{matrix} sgn (q_{e 0}) {\tilde{q}}_{e}^{T} \\ q_{e 0} I + S ({\tilde{q}}_{e}) \end{matrix}]}^{T} [\begin{matrix} 1 - | q_{e 0} | \\ {\tilde{q}}_{e} \end{matrix}] \\ = sgn (q_{e 0}) {\tilde{q}}_{e} \end{matrix}

(27)

Design the virtue control input as:

\overset{⇀}{v} = - k_{1} sgn (q_{e 0}) {\tilde{q}}_{e}

(28)

with any k₁ > 0. Then:

\begin{matrix} {\dot{V}}_{1} = {[\begin{matrix} 1 - | q_{e 0} | \\ {\tilde{q}}_{e} \end{matrix}]}^{T} G^{T} ({\overset{⇀}{q}}_{e}) (- k_{1} sgn (q_{e 0}) {\tilde{q}}_{e}) \\ = - k_{1} {[\begin{matrix} 1 - | q_{e 0} | \\ {\tilde{q}}_{e} \end{matrix}]}^{T} G^{T} ({\overset{⇀}{q}}_{e}) G ({\overset{⇀}{q}}_{e}) [\begin{matrix} 1 - | q_{e 0} | \\ {\tilde{q}}_{e} \end{matrix}] \end{matrix}

(29)

It can be seen that V̇₁ < 0 and V̇₁ = 0 if and only if ${\overset{⇀}{q}}_{e} = {[\pm 1 0 0 0]}^{T}$ . According to the Lyapunov Theorem, the virtual system is asymptotically stable.

After introducing the virtual control $\overset{⇀}{v}$ , the virtual system (20) becomes a variable structure system - it switches at θ = ±π along any rotating axis. As shown in Fig.3, ${\overset{⇀}{q}}_{e}$ converges on [±1 0 0 0]^T with a minimal angle θ_min, where:

Figure 3.

${\overset{⇀}{q}}_{e}$ convergence direction

θ_{\min} = {\begin{matrix} \begin{matrix} \min {θ, 2 π - θ} & 0 \leq θ \leq 2 π \end{matrix} \\ \begin{matrix} \min {- θ, 2 π + θ} & - 2 π \leq θ < 0 \end{matrix} \end{matrix}

It is already known that the unit quaternion $\overset{⇀}{q} = (\cos (θ / 2), \sin (θ / 2) \overset{⇀}{n})$ and $- \overset{⇀}{q} = (\cos (π + θ / 2), \sin (π + θ / 2) \overset{⇀}{n})$ represent the same attitude physically; as such, the differential equation of the attitude error can be formulated as (a similar derivation can be seen in [11]):

{\begin{matrix} \dot{\tilde{q}}'_{e} = \frac{1}{2} (q'_{e 0} I + S (\tilde{q}'_{e})) \overset{⇀}{v} \\ \dot{q}'_{e 0} = - \frac{1}{2} {(\tilde{q}'_{e}^{​})}^{T} \overset{⇀}{v} \end{matrix}

(30)

where $\overset{⇀}{q}'_{e} = sgn (q_{e 0}) {\overset{⇀}{q}}_{e}$ , with the virtual control:

\overset{⇀}{v} = - sgn (q_{e 0}) k_{1} {\tilde{q}}_{e} = - k_{1} \tilde{q}'_{e}

(31)

3.2.2 Control of the Full Rigid Body Model

Consider now the angular velocity error:

\overset{⇀}{z} = \overset{⇀}{v} - \overset{⇀}{w} = - (\overset{⇀}{w} + k_{1} \tilde{q}'_{e})

(32)

Then, the differential Equation (30) can be rewritten as:

{\begin{matrix} \dot{\tilde{q}}'_{e} = \frac{1}{2} (q'_{e 0} I + S (\tilde{q}'_{e})) (\overset{⇀}{v} - \overset{⇀}{z}) \\ \dot{q}'_{e 0} = - \frac{1}{2} {(\tilde{q}'_{e}^{​})}^{T} (\overset{⇀}{v} - \overset{⇀}{z}) \end{matrix}

(33)

Moreover, as shown above, it is asymptotically stable for $\overset{⇀}{z} = 0$ . The differential equation for $\overset{⇀}{z}$ is:

\begin{array}{l} \dot{\overset{⇀}{z}} = - \dot{\overset{⇀}{w}} - k_{1} \dot{\tilde{q}}'_{e} \\ = J^{- 1} (S (\overset{⇀}{w}) J \overset{⇀}{w} - u) + \frac{1}{2} k_{1}^{2} (q'_{e 0} I + S (\tilde{q}'_{e})) \tilde{q}'_{e} \\ = f (\tilde{q}'_{e}, \overset{⇀}{z}) - J^{- 1} \overset{⇀}{u} \end{array}

(34)

where:

\begin{array}{l} f (\tilde{q}'_{e}, \overset{⇀}{z}) = - k_{1} J^{- 1} S (- k_{1} \dot{\tilde{q}}'_{e} - \overset{⇀}{z}) J \tilde{q}'_{e} - \\ … J^{- 1} S (- k_{1} \dot{\tilde{q}}'_{e} - \overset{⇀}{z}) J \overset{⇀}{z} - \frac{1}{2} k_{1}^{2} (q'_{e 0} I + S (\tilde{q}'_{e})) \tilde{q}'_{e} \end{array}

(35)

We want to find $u = u (\tilde{q}'_{e}, \overset{⇀}{z})$ , such that the system of (33) and (34) is globally asymptotically stable. Let $x = {[\begin{matrix} \tilde{q}'_{e} & \overset{⇀}{z} \end{matrix}]}^{T}$ , the system model becomes:

\dot{x} = [\begin{matrix} - \frac{1}{2} k_{1} (q'_{e 0} I + S (\tilde{q}'_{e})) \tilde{q}'_{e} \\ f (\tilde{q}'_{e}, \overset{⇀}{z}) \end{matrix}] - (\begin{matrix} 0 & 0 \\ 0 & J^{- 1} \end{matrix}) (\begin{matrix} 0 \\ u \end{matrix})

(36)

Let:

F (x) = [\begin{matrix} - \frac{1}{2} k_{1} (q'_{e 0} I + S (\tilde{q}'_{e})) \tilde{q}'_{e} \\ f (\tilde{q}'_{e}, \overset{⇀}{z}) \end{matrix}]

(37)

and:

G (x) = (\begin{matrix} 0 & 0 \\ 0 & - J^{- 1} \end{matrix})

(38)

The system model can be rewritten as:

\dot{x} = F (x) + G (x) (\begin{matrix} 0 \\ \overset{⇀}{u} \end{matrix})

(39)

Since the derivative of the control Lyapunov function is:

\dot{V} (x) = L_{F} V (x) + L_{G} V (x) \overset{⇀}{u}

(40)

Provided that applying the inverse optimal control theorem:

\overset{⇀}{u} = - β \cdot N^{- 1} (x) {(L_{G} V)}^{T}

(41)

could make V̇(x) negative with a sufficiently high gain, then the maximum rate control law is given by:

\overset{⇀}{u} = - u_{\max} \cdot S G N ({(L_{G} V)}^{T})

(42)

where u_max represents the maximum value of input saturation.

Notice that the SGN function is an extended sign function for a multidimensional situation. Since the sign function can be seen as the direction function of one dimension, i.e.:

sgn (λ) = {\begin{matrix} λ / | λ | λ \neq 0 \\ 0 λ = 0 \end{matrix}

(43)

we extend it to a three dimensional situation, such as:

S G N (X) = {\begin{array}{l} X / ‖ X ‖ X \neq [0 0 0]^{T} \\ [0 0 0]^{T} X = [0 0 0]^{T} \end{array}

(44)

Consider the new Lyapunov function:

V_{2} = x^{T} x

(45)

Then, the maximum rate control law (with respect to the designed Lyapunov function) is:

\begin{matrix} \overset{⇀}{u} = - u_{\max} \cdot S G N (- J (\tilde{q}'_{e} + z)) \\ = - u_{\max} \cdot S G N (J ((k_{1} - 1) \tilde{q}'_{e} + w)) \end{matrix}

(46)

Notice that the backstepping-based inverse optimal controller (BIOAC) that we designed is a sliding mode controller with the sliding surface:

s = J ((k_{1} - 1) \tilde{q}'_{e} + \overset{⇀}{w})

(47)

It has the inherent property of robustness as to matched model uncertainty and explicit disturbances. However, since the sign function SGN(·)will cause a high frequency oscillation in practice, the sliding mode control law will suffer the chattering problem. For removing chattering in sliding mode control, there are a few methods. A more common approach is to replace the sign function by a smooth function with a boundary layer of the form:

η (s) = \frac{s}{ε + ‖ s ‖}

(48)

where ε is a constant vector to represent the boundary layer. However, with a boundary layer the BIOAC sacrifices the robustness in relation model uncertainty and explicit disturbances.

4. Numerical Simulation

In this section, the performance of the designed BIOAC is compared with a PD controller using numerical simulation in the MATLAB environment. The system parameters used in the simulation are:

\begin{array}{l} I_{1} = 0.0079 k g \cdot m^{2}, I_{2} = 0.0079 k g \cdot m^{2}, I_{3} = 0.0093 k g \cdot m^{2} \\ d = 0.2 m, c_{f} = 0.00003, c_{τ f} = 0.01 \end{array}

The initial conditions are:

\begin{array}{l} \overset{⇀}{q} (0) = (0.159, 0.57, 0.57, 0.57)^{T} \\ \overset{⇀}{w} (0) = {(0, 0, 0)}^{T} \end{array}

And the desired condition are:

\begin{array}{l} {\overset{⇀}{q}}_{d} = {(1, 0, 0, 0)}^{T} \\ {\overset{⇀}{w}}_{d} = {(0, 0, 0)}^{T} \end{array}

This corresponds to the initial attitude errors of 161.7 degrees around axis (1,1,1)^T.

For a more obvious illustration of control performance, the PD controller which provides a near-eigenaxis rotation [18] is used as a comparison. The PD algorithm is described as:

{\overset{⇀}{u}}_{p d} = S (\overset{⇀}{w}) J \overset{⇀}{w} - k J {\tilde{q}}_{e} - d J \overset{⇀}{w}

(49)

In the simulation process of the PD controller, the control effort limitation is never settled, and in a maximum rate control simulation the maximum control effort is (0.05N · m, 0.05N · m, 0.05N · m).

Appearing in the attitude quaternion curves in Fig. 4 and the output control torque curve in Fig. 5, the BIOAC is faster than the PD with a control effort under the maximum limitations. Moreover, the PD controller seeks a greater control torque when the error of orientation is large, and will suffer the problem of control failure in practice.

Figure 4.

Time histories of the quaternion

Figure 5.

Time histories of the control torques

5. Experiments

Given a good performance in the MATLAB simulation, BIOAC is realized on a test bench to show its effectiveness in practice. The physical device being used in the work discussed here is shown in Fig. 6. The experimental system consists of a posture platform, an aircraft body, an onboard controller, an attitude and heading reference system (AHRS), a wireless serial communication interface, a remote controller and receiver, and a computer.

Figure 6.

Physical devices for the experimental system

The aircraft body selected is the XAircraft X-650 quadrotor, which is made of carbon fibre, equipped with four brushless DC motors and a motor speed control device. It has an empty weight of about 800g and the maximum single propeller thrust is 4N. The Attitude and Heading Reference System is the Xsens company MTi inertial measurement unit (IMU), which can provide 3 axes of driftless orientation and 3 axes of calibrated angular velocity. Its maximum update rate is 100Hz, with a 0.5 degree attitude measurement accuracy, a 1 degree heading measure accuracy, and a 300 degree per second maximum measurable angular velocity. The remote controller and receiver is a WFLY FT06-A with a 5-way continuous pulse channel and a switching channel. TI's TMS320F2809 chip with a 100MHz core frequency is the core of the onboard controller.

The connection diagram of experimental system signals is shown in Fig 7. In every control cycle (10ms), the F2809 chip is interrupted to receive orientation and angular velocity information from the IMU and then calculate the control pulse width and transfer it to the motor speed control devices. Outside the control loop, the F2809 is interrupted to turn the parameters and sends a set of state data to the computer in every 100ms. The work of receiving the remote control instruction is done in the same interrupter program.

Figure 7.

Signal connection diagram

In the experiment, the attitude of the quadrotor switch is between:

\begin{array}{l} {\overset{⇀}{q}}_{1} = (0. 9848, 0.1003, 0. 1003, 0.1003)^{T} \\ {\overset{⇀}{w}}_{1} = {(0, 0, 0)}^{T} \end{array}

and:

\begin{array}{l} {\overset{⇀}{q}}_{2} = {(1, 0, 0, 0)}^{T} \\ {\overset{⇀}{w}}_{2} = {(0, 0, 0)}^{T} \end{array}

i.e., the quadrotor rotates 20 degrees around the axis of (1,1,1)^T

In the experiment, J and c_τf is unknown. However, since BIOAC is a sliding mode controller, it can work effectively with the settings of J = I and c_τf = 1.

The attitude curves in Fig. 8 show that the controlled system is stable and, after the switch instruction, the attitude of the quadrotor converges to the desired attitude fast. There are small static errors in Fig. 8 and Fig. 9. The reasons for this are the incomplete compensation and the existence of the boundary layer. The results would be improved with a more accurate estimation of J and c_τf.

Figure 8.

Time histories of the quaternion

Figure 9.

Time histories of the input instructions

6. Conclusions

The maximum rate attitude control problem of the quadrotor is studied under the constraint of input saturation. Due to the difficulty in obtaining closed-form solutions to the HJB equation, the inverse optimal approach is used. Following the backstepping phases, a global stable controller is derived. With respect to a classic PD controller, the numerical simulation results show that BIOAC could achieve faster convergence with control effort limitation. The realization of the BIOAC is gained in an experimental system and shows the practical effectiveness of being free from computational issues, which is highly desirable given the limited resources onboard.

7. Acknowledgements

The authors would like to thank Dr. HE Yunze in NUDT for providing advice and Dr. SHEN Tao and SONG Baoquan in NUDT for their help in constructing the test bench.

References

Tayebi

McGilvray

(2006) Attitude Stabilization of a VTOL Quadrotor Aircraft. IEEE Transactions on Control Systems Technology, Vol. 14, No. 3, pp. 562–571.

Lara

Sanchez

Lozano

Castillo

(2006) Real-Time Embedded Control System for VTOL Aircrafts: Application to stabilize a quadrotor helicopter. Proceedings of the 2006 IEEE International Conference on Control Applications, Munich, Germany, October 4–6.

Madani

Benallegue

(2006) Control of a Quadrotor Mini-Helicopter via Full State Backstepping Technique. Proceedings of the 45th IEEE Conference on Decision & Control, San Diego, CA, USA, December 13–15.

Lee

Kim

H. J.

Sastry

(2009) Feedback Linearization vs. Adaptive Sliding Mode Control for a Quadrotor Helicopter. International Journal of Control, Automation, and Systems, Vol. 7, No. 3, pp. 419–428.

Das

Subbarao

Lewis

F. L.

(2009) Dynamic Inversion with Zero-dynamics Stabilisation for Quadrotor Control. IET Control Theory Application, Vol. 3, No. 3, pp. 303–314.

Bouabdallah

Siegwart

(2005) Backstepping and Sliding Mode Techniques Applied to an Indoor Micro Quadrotor. Proceedings of the 2005 IEEE International Conference on Robotics and Automation, Barcelona, Spain, April.

Pounds

Mahony

Gresham

Corke

Roberts

(2004) Towards Dynamically-Favourable Quad-Rotor Aerial Robots. Proceedings of Australasian Conference on Robotics and Automation, Canberra, Australia.

Johnson

E. N.

(2000) Limited Authority Adaptive Flight Control. Ph.D Thesis of Georgia Institute of Technology.

Wen

J.T.-Y.

Kreutz-Delgado

(1991) The Attitude Control Problem. IEEE Transactions on Automatic Control, Vol. 36, No. 10, October.

10.

Fragopoulos

Innocenti

(2004) Stability Considerations in Quaternion Attitude Control Using Discontinuous Lyapunov Functions. IEE Proceedings on Control Theory Applications, Vol. 151, No. 3.

11.

Dapeng

Qing

Zexiang

(2007) Attitude Control Based on the Lie-group Structure of Unit Quaternions. Proceedings of the 26th Chinese Control Conference, July 26–31, Zhangjiajie, Hunan

12.

Madani

Benallegue

(2006) Backstepping Control for a Quadrotor Helicopter. Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robos and Systems, October 9–15, Beijing, China.

13.

Kristiansen

Nicklasson

P. J.

Gravdahl

J. T.

(2009) Satellite Attitude Control by Quaternion-Based Backstepping. IEEE Transactions on Control Systems Technology, Vol. 17, No. 1, January.

14.

Freeman

R. A.

Primbs

J. A.

(1996) Control Lyapunov Functions: New Ideas From an Old Source. Proceedings of the 35th Conference on Decision and Control, Kobe, Japan, December.

15.

Freeman

R. A.

Kokotovic

P. V.

(1996) Robust Nonlinear Control Design: State-Space and Lyapunov Techniques. Boston, MA: Birkauser.

16.

Freeman

R. A.

Kokotovic

P. V.

(1996) Inverse Optimality in Robust Stabilization. SJAM Journal of Control and Optimization, Vol. 34, No. 4, pp. 1365–1391.

17.

Krstic

Tsiotras

(1999) Inverse Optimal Stabilization of a Rigid Spacecraft. IEEE Transactions on Automatic Control, Vol. 44, No. 5, pp. 1042–1048.

18.

Horri

N. M.

Palmer

P. L.

Roberts

M. R.

(2009) Optimal Satellite Attitude Control: A Geometric Approach. IEEE Aerospace Conference, Guildford, March.

19.

Wie

Weiss

Arapostathis

(1989) Quaternion Feedback Regulator for Spacecraft eigenaxis Rotations. Journal of Guidance, Vol. 12, No. 3, May-June.