Tracking of periodic oscillations in an underactuated system via adaptive neural networks

Abstract

In this paper, the tracking control of periodic oscillations in an underactuated mechanical system is discussed. The proposed scheme is derived from the feedback linearization control technique and adaptive neural networks are used to estimate the unknown dynamics and to compensate uncertainties. The proposed neural network-based controller is applied to the Furuta pendulum, which is a nonlinear and nonminimum phase underactuated mechanical system with two degrees of freedom. The new neural network-based controller is experimentally compared with respect to its model-based version. Results indicated that the proposed neural algorithm performs better than the model-based controller, showing that the real-time adaptation of the neural network weights successfully estimates the unknown dynamics and compensates uncertainties in the experimental platform.

Keywords

Tracking control periodic oscillation Furuta pendulum adaptive neural network real-time experiments

Introduction

The study of oscillatory behavior as well as its control in many physical systems has been an active research topic in recent years. For example, Parmananda et al.¹ reported numerical and experimental results indicating successful stabilization of unstable steady states and periodic orbits in an electrochemical system. Semenov et al.² investigated the stabilization of the inverted pendulum with vertically oscillating suspension under hysteretic control. In Olm et al.,³ an analysis of digital controllers under time-varying sampling period for the tracking of periodic oscillations was given. More recently, tracking control of a class of nonlinear systems using oscillation was considered in Wang and Guo.⁴ Specifically, based on the oscillation functions associated with accessible vibrating components of the system, oscillatory control was designed to track a desired trajectory.

There is a large class of underactuated mechanical systems which arises from problems in robotics and mechatronic systems. The Furuta pendulum is an underactuated electromechanical system of two degrees of freedom (DOF) and has only one actuator.⁵ It is an inverted pendulum, which is classified into a nonlinear, nonminimum phase and underactuated system.⁶ One of the most important applications of Furuta and inverted pendulums is to test linear and nonlinear control techniques. Some control techniques developed are bang-bang control,^5,7 PID control,^8,9 energy-based control,^10–12 feedforward control,^13,14 sliding mode control,^15,16 fuzzy logic,^17–19 singular perturbation-based control,²⁰ as well as hybrid control²¹ and predictive control.²² However, we will focus our attention to neural network-based control.^23–25

Artificial neural networks are useful to compensate unknown dynamic systems. There are some works where neural networks were used with this objective as Nelson and Kraft²⁶ where a full-state feedback optimal controller was developed to control an inverted pendulum in the upright position. In Sazonov et al.,²⁷ a hybrid system controller, incorporating a neural network plus an optimal linear controller applied to inverted pendulum in regulation mode was presented. The work in Shaheed²⁸ introduced an open-loop control strategy with neural networks for vibration suppression in a flexible manipulator system. In Jung and Cho,²⁹ a decoupled reference compensation technique with neural networks applied to the control of a two DOF inverted pendulum mounted on a x–y table was presented. In Noh et al.,³⁰ the implementation of position control applied to a mobile inverted pendulum using radial basis function neural networks was presented.

On the other hand, feedback linearization is a known control technique used in nonlinear systems.^31,32 The basic idea of this technique consists in defining an output function (usually linear in terms of the system state), which is derived r times with respect to time until the control input appears. Thus, the controller can be computed so that the solutions of resulting closed-loop system in terms of the output converge to zero. Depending on the class of the underactuated mechanical system under consideration and the proposed output function, the closed-loop system may be composed of an internal dynamics which must be stable in order to accomplish the control task.³³ The previous literature review shows that only a few works have addressed the problem of tracking control of periodic oscillatory trajectories of the Furuta pendulum.

In this paper, the problem of inducing periodic oscillatory behavior in the Furuta pendulum is addressed by proposing a new adaptive neural network-based controller for the tracking of periodic trajectories. The proposed control algorithm has a structure inspired by a model-based controller, which is designed using the feedback linearization technique with a new output function. The advantage of using adaptive neural networks is that the parameters of the dynamic model are not needed for the design of the control algorithm. The convergence analysis of the output trajectories is shown and conditions for the solutions of the internal dynamics to be bounded are given. An extensive experimental evaluation is performed in a test bed, where the new adaptive neural network controller is compared with respect to its model-based version. The two controllers are tested using two different periodic trajectories, showing that better tracking performance is obtained with new scheme.

Furuta pendulum dynamics and control goal

As mentioned before, the Furuta pendulum is an underactuated system of two DOF. Figure 1 shows a description of the joint positions and applied torque. The dynamic model of the Furuta pendulum in Euler–Lagrange form^6,34–36 is written as

M (q) ¨ q + C (q, ˙ q) ˙ q + g_{m} (q) + f (˙ q) = u

(1)

with

\begin{matrix} q = [\begin{matrix} q_{1} \\ q_{2} \end{matrix}], u = [\begin{matrix} τ \\ 0 \end{matrix}], \\ M (q) = [\begin{matrix} θ_{1} + θ_{2} \sin^{2} (q_{2}) & θ_{3} \cos (q_{2}) \\ θ_{3} \cos (q_{2}) & θ_{4} \end{matrix}], \\ C (q, ˙ q) = [\begin{matrix} \frac{1}{2} θ_{2} ˙ q_{2} \sin (2 q_{2}) & - θ_{3} ˙ q_{2} \sin (q_{2}) + \frac{1}{2} θ_{2} ˙ q_{1} \sin (2 q_{2}) \\ - \frac{1}{2} θ_{2} ˙ q_{1} \sin (2 q_{2}) & 0 \end{matrix}], \\ g_{m} (q) = [\begin{matrix} 0 \\ - θ_{5} \sin (q_{2}) \end{matrix}], f (˙ q) = [\begin{matrix} θ_{6} ˙ q_{1} + θ_{8} sig n (˙ q_{1}) \\ θ_{7} ˙ q_{2} + θ_{9} sig n (˙ q_{2}) \end{matrix}] \end{matrix}

where

q \in R^{2}

is the vector of joint positions,

u \in R^{2}

u∈R2 is the vector of input torques, being

τ \in R

the torque applied to the arm,

M (q) \in R^{2 \times 2}

is the symmetric positive definite inertia matrix,

C (q, ˙ q) ˙ q \in R^{2}

is the vector of centripetal and Coriolis torques,

g_{m} (q) \in R^{2}

is the vector of gravitational torques,

f (˙ q) \in R^{2}

is the vector of friction torques, and the model parameters θ_i are positive constants related to the physical properties of the system. Finally,

sign(x) =^{1}

, for

x >^{0}

sign(x) =^{-1}

, for

x <^{0}

, and

sign(x) =^{0}

, for

x =^{0}

Figure 1.

Three-dimensional model of the Furuta pendulum with the relative measurement of the joint positions and applied torque.

It is assumed that the desired trajectory $q_{d} (t)$ is a bounded, continuous, and twice-differentiable signal, which satisfies

| | q_{d} (t) | |, | | {\dot{q}}_{d} (t) | |, | | {\ddot{q}}_{d} (t) | | \leq μ

(2)

where μ is a positive constant. Besides, in this study we consider that the desired trajectory for the arm position is a periodic oscillatory signal, that is

q_{d} (t) = q_{d} (t + T_{s})

with

T_{s} > 0

being the period of the signal. However, the proposed methodology can be applied for the tracking of any continuous and twice-differentiable signal bounded in the sense of equation (2).

The motivation of introducing oscillatory behavior in pendulum-like systems has been well explained in Canudas-de-Wit et al.³⁷ and Shiriaev et al.³⁸ where the authors established that a pendulum is the simplest model of a walking robot leg and introducing periodic behavior in the closed-loop system for underactuated mechanical systems is an approach to solve the motion control problem in walking robots.

The joint position tracking error vector is defined as

e = [\begin{matrix} e_{1} \\ e_{2} \end{matrix}] = [\begin{matrix} q_{d} - q_{1} \\ - q_{2} \end{matrix}] \in R^{2}

(3)

Then, the control problem consists in designing a controller $τ (t) \in R$ , which does not require the exact knowledge of the Furuta pendulum model, such that the error signal $e (t)$ is uniformly ultimately bounded.

Adaptive neural network controller

The Furuta pendulum model in equation (1) can be rewritten as

¨ q = - M {(q)}^{- 1} [C (q, ˙ q) + g_{m} (q) + f (˙ q)] + M {(q)}^{- 1} u = [\begin{matrix} f_{1} (q, ˙ q) \\ f_{2} (q, ˙ q) \end{matrix}] + [\begin{matrix} g_{1} (q) \\ g_{2} (q) \end{matrix}] τ

which at the same time can be expressed in terms of the tracking error in equation (3) in the following form

\frac{d}{d t} e_{1} = {\dot{e}}_{1}

(4)

\frac{d}{d t} {\dot{e}}_{1} = {\ddot{q}}_{d} - f_{1} - g_{1} τ

(5)

\frac{d}{d t} e_{2} = {\dot{e}}_{2}

(6)

\frac{d}{d t} {\dot{e}}_{2} = - f_{2} - g_{2} τ

(7)

The control goal is to satisfy with only one input τ(t) the tracking of a desired trajectory for the arm position while the pendulum remains regulated at the upward unstable equilibrium. In other words, both the trajectory tracking error e₁(t) and the regulation error e₂(t) defined in equation (3) must remain bounded in a region around the origin, in spite of the fact that only one actuator is available to control the system. Thus, the designed control action $τ (t) \in R$ is given on the basis of an output function in terms of the signals e₁(t) and e₂(t). Inspired by passivity-based control schemes, where output signals with the structure $s = \dot{e} + Δ e$ are commonly used, we propose the output function $y (t) \in R$ given by

y = Δ_{1} e_{1} + {\dot{e}}_{1} + Δ_{2} e_{2} + {\dot{e}}_{2}

(8)

where

Δ_{1}

and

Δ_{2}

are positive constants.

The time derivative of the output in equation (8) is

\dot{y} = {\ddot{q}}_{d} - F - G τ + Δ_{1} {\dot{e}}_{1} + Δ_{2} {\dot{e}}_{2}

(9)

where

F = f_{1} + f_{2}

(10)

G = g_{1} + g_{2}

(11)

A feedback linearization controller is proposed for the cancellation of nonlinearities through a feedback control law, transforming the nonlinear system in equation (9) into an equivalent linear system whose trajectories converge asymptotically to the origin. The proposed control law is given by

τ = \frac{- F + {\ddot{q}}_{d} + Δ_{1} {\dot{e}}_{1} + Δ_{2} {\dot{e}}_{2} + k y + δ sign (y)}{G}

(12)

where the proportional term ky, with the positive constant k, ensures the asymptotic convergence of the output trajectories; the nonlinear term $δ sign (y)$ , with the positive constant δ, is a term aimed to introduce robustness in the compensation of external disturbances and unmodeled error dynamics; the nonlinear functions F and G are defined in equations (10) and (11), respectively. Then, by substituting equation (12) into equation (9), we obtain

\dot{y} = - k y - δ sign (y)

(13)

It is possible to show that y(t) converges to zero. A way to prove this claim is computing the time derivative of the following positive definite function

U_{1} = \frac{1}{2} y^{2}

(14)

which is given by

{\dot{U}}_{1} = y [- k y - δ sign (y)] = - k y^{2} - δ | y |

(15)

As can be seen, the time derivative of U₁ given in equation (15) is a negative definite function, proving that the trajectories of y(t) converge to the origin with exponential rate of convergence as time t goes to infinity.

A smooth function $h : R^{n} \to R$ can be approximated by a neural network. Then, given a compact set $S \in R^{n}$ , there exists a two-layer neural network such that^23,39

h (x) = W^{⊤} σ (V^{⊤} \bar{x}) + ϵ

(16)

where

ϵ

is a constant expressing the approximation error such that

ϵ_{N} > | ϵ | > 0

\bar{x} = {[x^{⊤} 1]}^{⊤} \in R^{n + 1}

is the vector of input signals containing a unitary element for threshold weights, with the state vector

x \in R^{n}, V \in R^{(n + 1) \times L}

and

W \in R^{L}

are matrices containing values of input and output weights of the neural network, respectively, and

σ \in R^{L}

is a vector of membership functions. In the study presented in this paper, n = 4 since

x =^{[q_{1} q_{2} ˙ q_{1} ˙ q_{2}]} \in R^{4}

Then, taking into account the model-based controller in equation (12), the following adaptive neural network controller is proposed

τ = \frac{- \hat{F} + {\ddot{q}}_{d} + Δ_{1} {\dot{e}}_{1} + Δ_{2} {\dot{e}}_{2} + k y + δ sign (y)}{\hat{G}}

(17)

where k and δ are positive constants,

sign (y)

is defined as the sign of y, and

\hat{F}

and

\hat{G}

are neural network-based estimations of the function F and G, respectively, given by

\hat{F} = {\hat{W}}_{f}^{⊤} σ (V_{f}^{⊤} \bar{x}), \hat{G} = {\hat{W}}_{g}^{⊤} σ (V_{g}^{⊤} \bar{x})

(18)

where the vectors

{\hat{W}}_{f} \in R^{L}

and

{\hat{W}}_{g} \in R^{L}

are the estimated output weights of the neural network,

V_{f} \in R^{5 \times L}

and

V_{g} \in R^{5 \times L}

are the input weights, in which the threshold values have been taken into account. The values of the matrices V_f and V_g are constant and random,⁴⁰

σ (p) = {[\tanh (p_{1}) \dots \tanh (p_{L})]}^{⊤} \in R^{L}

is a vector of membership functions. Besides, L is the quantity of neurons, n = 4 is the dimension of the state vector of the mechanical system. Finally, the estimated output weights of the neural networks can be obtained with the following adaptation laws

{\dot{\hat{W}}}_{f} = - M σ (V_{f}^{⊤} \bar{x}) y - κ_{f} M | y | {\hat{W}}_{f}

(19)

{\dot{\hat{W}}}_{g} = - N σ (V_{g}^{⊤} \bar{x}) y τ - κ_{g} N | y τ | {\hat{W}}_{g}

(20)

where

κ_{f} > 0, κ_{g} > 0

and

M, N \in R^{L \times L}

are diagonal positive definite matrices. See the Figure 2 for a description of the block diagram implementation of the adaptive neural network controller in equation (17).

Figure 2.

Block diagram implementation of the adaptive neural network controller in equation (17).

The proposed controller in equation (17) has a nonlinear quotient structure with an adaptive neural network function in the numerator and another one in the denominator. Besides, the controller includes linear PD terms in the tracking error $e_{1} (t)$ and $e_{2} (t)$ , which results from the output function y(t). The new controller has a robustifying term in order to compensate the modeling errors that remain from the neural network compensation. This controller allows carrying out the tracking of a reference trajectory for the arm while the pendulum in the second joint is regulated at the upright position. The weights of the neural network are updated in real time by means of properly designed adaptation laws.

By using the universal function approximation property of the neural networks in equation (16), which implies that

F = W_{f} σ (V_{f}^{⊤} \bar{x}) + ϵ_{f}, and G = W_{g} σ (V_{g}^{⊤} \bar{x}) + ϵ_{g}

with

W_{f} \in R^{L}

and

W_{g} \in R^{L}

being the constant ideal output weights, â_f,

ϵ_{g} \in R

being the approximation errors, and with the weight estimation error definitions

{\tilde{W}}_{f} = W_{f} - {\hat{W}}_{f}, {\tilde{W}}_{g} = W_{g} - {\hat{W}}_{g}

we have that

\dot{y} = - k y - δ sign (y) - {\tilde{W}}_{f}^{⊤} σ (V_{f}^{⊤} \bar{x}) - {\tilde{W}}_{g}^{⊤} σ (V_{g}^{⊤} \bar{x}) τ - ϵ_{f} - ϵ_{g} τ

(21)

Taking into account that the ideal weights W_f and W_g are constants, equations (19) and (20) can be written in terms of the estimation errors ${\tilde{W}}_{f}$ and ${\tilde{W}}_{g}$ , as follows

{\dot{\tilde{W}}}_{f} = M σ (V_{f}^{⊤} \bar{x}) y + κ_{f} M | y | [W_{f} - {\tilde{W}}_{f}]

(22)

{\dot{\tilde{W}}}_{g} = N σ (V_{g}^{⊤} \bar{x}) y τ + κ_{g} N | y τ | [W_{g} - {\tilde{W}}_{g}]

(23)

In order to prove that the solution y(t) coming from closed-loop system in equations (21) to (23) converges to zero as time t increases and the output weight estimation errors ${\tilde{W}}_{f} (t)$ and ${\tilde{W}}_{g} (t)$ are bounded for all time, the following positive definite function is proposed

U_{2} = \frac{1}{2} y^{2} + \frac{1}{2} Tr ({\tilde{W}}_{f}^{⊤} M^{- 1} {\tilde{W}}_{f}) + \frac{1}{2} Tr ({\tilde{W}}_{g}^{⊤} N^{- 1} {\tilde{W}}_{g})

(24)

where

Tr (A)

means the trace of a square matrix

A \in R^{n \times n}

, i.e.

Tr (A) = \sum_{i = 1}^{n} a_{i i}

. The time derivative of U₂ in equation (24) is given by

{\dot{U}}_{2} = y \dot{y} + Tr ({\tilde{W}}_{f}^{⊤} M^{- 1} {\dot{\tilde{W}}}_{f}) + Tr ({\tilde{W}}_{g}^{⊤} N^{- 1} {\dot{\tilde{W}}}_{g})

(25)

By replacing equations (21) to (23) into equation (25), we obtain

\begin{matrix} {\dot{U}}_{2} = - k y^{2} - δ | y | - ϵ_{f} y - ϵ_{g} y τ + κ_{f} | y | Tr ({\tilde{W}}_{f}^{⊤} [W_{f} - {\tilde{W}}_{f}]) \\ + κ_{g} | y τ | Tr ({\tilde{W}}_{g}^{⊤} [W_{g} - {\tilde{W}}_{g}]) + | y | ψ_{0} \end{matrix}

(26)

= - k y^{2} - δ | y | - ϵ_{f} y - ϵ_{g} y τ - κ_{f} | y | Tr ({\tilde{W}}_{f}^{⊤} {\tilde{W}}_{f}) - κ_{g} | y τ | Tr ({\tilde{W}}_{g}^{⊤} {\tilde{W}}_{g}) + | y | ψ_{0}

(27)

where

ψ_{0} = - ϵ_{f} sign (y) - τ ϵ_{g} sign (y) + κ_{f} Tr ({\tilde{W}}_{f}^{⊤} W_{f}) + κ_{g} | τ | Tr ({\tilde{W}}_{g}^{⊤} W_{g})

(28)

It will be useful to recall Frobenius norm of a matrix

A \in R^{m \times n}

, which is given by

| | A | |_{F} = \sqrt{T r (A^{⊤} A)} = \sqrt{\sum_{i, j} a_{i j}^{2}}

(29)

Besides, a useful property for matrices

A \in R^{m \times n}

and

B \in R^{m \times n}

| T r (A^{⊤} B) | \leq | | A | |_{F} | | B | |_{F}

(30)

Thus, using the definition in equation (29) and the property in equation (30), the signal ψ₀ in equation (28) satisfies

| | ψ_{0} | | \leq c_{1} | | {\tilde{W}}_{f} | |_{F} + c_{2} + [c_{3} | | {\tilde{W}}_{g} | |_{F} + c_{4}] | τ |

(31)

where

\begin{matrix} c_{1} = κ_{f} | | W_{f} | |_{F}, \\ c_{2} = | ϵ_{f} |, \\ c_{3} = κ_{g} | | W_{g} | |_{F}, \\ c_{4} = | ϵ_{g} | \end{matrix}

The definition in equation (30), the fact $| y τ | = | y | | τ |$ , and the inequality in equation (31) can be used to write the following upper bound on ${\dot{U}}_{2}$ as follows

\begin{matrix} {\dot{U}}_{2} \leq - k y^{2} - δ | y | - κ_{f} | y | | | {\tilde{W}}_{f} | |_{F}^{2} - κ_{g} | y | | τ | | | {\tilde{W}}_{g} | |_{F}^{2} \\ + [c_{1} | | {\tilde{W}}_{f} | |_{F} + c_{2}] | y | + [c_{3} | | {\tilde{W}}_{g} | |_{F} + c_{4}] | y | | τ | \end{matrix}

(32)

After some algebra and by completing the square for two set of terms associated to

| | {\tilde{W}}_{f} | |_{F}

and

| | {\tilde{W}}_{g} | |_{F}

, we are able to write

\begin{matrix} {\dot{U}}_{2} \leq - k y^{2} - [δ - c_{2} - κ_{f} \frac{1}{4} \frac{c_{1}^{2}}{κ_{f}^{2}} - κ_{g} \frac{1}{4} \frac{c_{3}^{2}}{κ_{g}^{2}} | τ |] | y | \\ - κ_{f} | y | {[| | W_{f} | |_{F}^{2} - \frac{1}{2} \frac{c_{1}}{κ_{f}}]}^{2} - κ_{g} | y | | τ | {[| | W_{g} | |_{F}^{2} - \frac{1}{2} \frac{c_{3}}{κ_{g}}]}^{2} \end{matrix}

(33)

Since the torque τ(t) in equation (17) is bounded for some compact set of the state space $[q^{⊤} {\dot{q}}^{⊤} {\hat{W}}_{f}^{⊤} {\hat{W}}_{g}^{⊤}] \in R^{4 + 2 L}$ and under the assumption that the desired arm trajectory $q_{d} (t)$ satisfies equation (2), the condition

δ > c_{2} + κ_{f} \frac{1}{4} \frac{c_{1}^{2}}{κ_{f}^{2}} + κ_{g} \frac{1}{4} \frac{c_{3}^{2}}{κ_{g}^{2}} | τ |_{max}, | τ (t) | \leq | τ |_{max}

guarantees

{\dot{U}}_{2} \leq - k y^{2} = - χ (y)

(34)

As ${\dot{U}}_{2} (t) \leq 0$ , which implies that y(t), ${\tilde{W}}_{f} (t)$ , and ${\tilde{W}}_{g} (t)$ are bounded. By integrating both sides of equality (34) it can be shown that $\int_{0}^{t} χ (y (ς)) d ς \leq - \int_{0}^{t} {\dot{U}}_{2} (ς) d ς \leq U_{2} (0) - U_{2} (t) \leq U_{2} (0)$ ; therefore, $\lim_{t \to \infty} \int_{0}^{t} χ (y (ς)) d ς$ exists, and it is finite. Moreover, since $\dot{χ} (t)$ is bounded, $χ (t)$ is uniformly continuous and by Barbalat’s lemma,³¹ $\lim_{t \to \infty} χ (t) = 0$ implying also that $y (t) \to 0$ as $t \to \infty$ .

By using the transformation

z = [\begin{matrix} η_{1} \\ η_{2} \\ η_{3} \\ y \end{matrix}] = H x = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & - \frac{g_{2}}{g_{1}} & 1 \\ Δ_{1} & Δ_{2} & 1 & 1 \end{matrix}] [\begin{matrix} e_{1} \\ e_{2} \\ {\dot{e}}_{1} \\ {\dot{e}}_{2} \end{matrix}]

(35)

which is invertible, the internal dynamics can be computed as follows

\frac{d}{d t} [\begin{matrix} η_{1} \\ η_{2} \\ η_{3} \end{matrix}] = [\begin{matrix} - Δ_{1} G_{1} η_{1} - Δ_{2} G_{1} η_{2} - G_{1} η_{3} + G_{1} y \\ - Δ_{1} G_{2} η_{1} - Δ_{2} G_{2} η_{2} + G_{1} η_{3} + G_{2} y \\ \frac{θ_{3}}{θ_{4}} \cos (η_{2}) {\ddot{q}}_{d 1} - \frac{θ_{3}}{θ_{4}} \cos (η_{2}) f_{1} - \frac{θ_{3}}{θ_{4}} \sin (η_{2}) {\dot{η}}_{1} {\dot{η}}_{2} - f_{2} \end{matrix}]

(36)

where

G_{1} = \frac{g_{1}}{g_{1} + g_{2}}

and

G_{2} = \frac{g_{2}}{g_{1} + g_{2}}

A local analysis of the system (36) showed that necessary and sufficient conditions to ensure the uniformly ultimately boundedness of the trajectories ${[η_{1} (t) η_{2} (t) η_{3} (t)]}^{⊤}$ are the following

θ_{3} > θ_{4}

(37)

0 < Δ_{1} < \frac{θ_{5}}{θ_{7}}

(38)

Δ_{2} > \frac{Δ_{1} θ_{4} + θ_{7}}{θ_{3}} + \frac{Δ_{1} θ_{5} (θ_{3} - θ_{4})}{θ_{3} (θ_{5} - Δ_{1} θ_{7})}

(39)

where

Δ_{1}

and

Δ_{2}

are control gains and the Furuta pendulum parameters θ_i are constants. This implies that the error trajectories

e (t) = {[e_{1} (t) e_{2} (t)]}^{⊤} \in R^{2}

are also uniformly ultimately bounded. In fact, the inequalities (38) and (39) provide an explicit tuning guideline for the control gains

Δ_{1}

and

Δ_{2}

. Further details about the local analysis of the system (36) can be found in Moreno-Valenzuela et al.^41,42

Experimental results

In this section, two controllers are experimentally tested: the model-based feedback linearization controller in equation (12), and the new adaptive neural network control law in equation (17). The experimental tests have been carried out in a Furuta pendulum built at the Instituto Politécnico Nacional—CITEDI, as shown in Figure 3. The experimental platform consists of a PC running Windows XP. The controllers are programmed using Matlab, Simulink, and Real-Time Windows Target. The experimental setup is also equipped with a 30A20AC servo amplifier of Advanced Motion Controls, a servomotor which is used as actuator, and a Sensoray 626 data acquisition board. The Furuta pendulum is provided with optical incremental encoders in order to sense the relative joint position. The numerical values of the parameters θ_i of the dynamic model in equation (1) were experimentally identified and are given in Table 1.

Figure 3.

Furuta pendulum built at the Instituto Politécnico Nacional—CITEDI.

Table 1.

Identified parameters of the Furuta pendulum dynamic model in equation (1).

Symbol	Value	Unit
θ ₁	0.06196	kg m² rad
θ ₂	0.01492	kg m² rad
θ ₃	0.01855	kg m² rad
θ ₄	0.01318	kg m² rad
θ ₅	0.50767	kg m² rad
θ ₆	0.00837	$N m \frac{rad}{s}$
θ ₇	0.00071	$N m \frac{rad}{s}$
θ ₈	0.01886	$N m \frac{rad}{s}$
θ ₉	0.00871	$N m \frac{rad}{s}$

The joint velocities are estimated using the following algorithm

\dot{q} (k T) = \frac{q (k T) - q ((k - 1) T)}{T}

(40)

where T = 0.001

[s]

is the sample time and

k = 0, 1, 2, \dots

is the time index. Initials conditions of the system were set to

{[q_{1} (0) q_{2} (0)]}^{⊤} = {[0 0]}^{⊤} (rad)

and

{[{\dot{q}}_{1} (0) {\dot{q}}_{2} (0)]}^{⊤} = {[0 0]}^{⊤} (rad/s) .

For the model-based controller in equation (12) and adaptive neural network controller in equation (17), the gains

Δ_{1} = 0.7, Δ_{2} = 3.5, k = 2.1, and δ =0 .07

(41)

were used.

For the adaptive neural network controller in equation (17), the number of neurons was L = 10, the matrix $M = 0.7 I_{10} \in R^{10 \times 10}$ , the matrix $N = 1.75 I_{10} \in R^{10 \times 10}, I_{10} \in R^{10 \times 10}$ is an identity matrix, $κ_{f} = 0.035$ and $κ_{g} = 0.175$ .

Periodic oscillation I

A set of experiments by using the desired periodic oscillation

q_{d} (t) = 1.0 \sin (t) (rad)

(42)

was done under the above-written conditions.

The results of the real-time implementation of the controller in equation (12) are given in Figure 4, which shows the performance of joint positions $q_{1} (t)$ and q₂(t) and the applied torque τ(t).

Figure 4.

Periodic oscillation I, model-based controller in equation (12). Left-hand side plot: time evolution of (a) q₁(t) and (b) q₂(t). Right-hand side plot: (c) applied torque τ(t).

The new adaptive neural network controller proposed in equation (17), with signals $\hat{F} (t)$ and $\hat{G} (t)$ in equation (18), is also implemented in real time. Figure 5 shows the time evolution of q₁(t) and $q_{2} (t)$ , and the applied control effort τ(t). Finally, the estimated weights ${\hat{W}}_{f} (t)$ and ${\hat{W}}_{g} (t)$ of the neural network are depicted in Figure 6, which remain bounded during the time that the controller is working, as predicted by theory.

Figure 5.

Periodic oscillation I, adaptive neural network controller in equation (17). Left-hand side plot: time evolution of (a) q₁(t) and (b) q₂(t). Right-hand side plot: (c) applied torque τ(t).

Figure 6.

Periodic oscillation I, adaptive neural network controller in equation (17). Time evolution of the estimated output weights (a) ${\hat{W}}_{f} (t)$ and (b) ${\hat{W}}_{g} (t)$ .

Periodic oscillation II

In this section, in order to show that the proposed controller is able to track a variety of periodic trajectories, a more complex desired trajectory was selected. By using the control gains in equation (41) in the model-based controller in equation (12) and the adaptive neural network controller in equation (17), we have carried out real-time experiments using the desired periodic oscillation

q_{d} (t) = 1.0 \sin (t) + 0.2 \cos (4 t) (rad)

(43)

which exhibits a more complex behavior with respect to the desired periodic oscillation in equation (42), allowing us to test the performance of the new controller.

Results for the model-based controller in equation (12) are given in Figure 7, where the time evolution of joint trajectories q₁(t) and q₂(t), and the applied torque τ(t) are appreciated.

Figure 7.

Periodic oscillation II, model-based controller in equation (12). Left-hand side plot: time evolution of (a) q₁(t) and (b) q₂(t). Right-hand side plot: (c) applied torque τ(t).

The results of the implementation of the adaptive neural network controller in equation (17) are given in Figure 8, which depicts the actual joint position trajectories q₁(t) and q₂(t), and the applied control action τ(t). Figure 9 describes the estimated weights ${\hat{W}}_{f} (t)$ and ${\hat{W}}_{g} (t)$ obtained with update laws in equations (22) and (23), respectively. Similarly, the output weights remain bounded for all time.

Figure 8.

Periodic oscillation II, adaptive neural network controller in equation (17). Left-hand side plot: time evolution of (a) q₁(t) and (b) q₂(t). Right-hand side plot: (c) applied torque τ(t).

Figure 9.

Periodic oscillation II, adaptive neural network controller in equation (17). Time evolution of the estimated output weights (a) ${\hat{W}}_{f} (t)$ and (b) ${\hat{W}}_{g} (t)$ .

Note that in all experiments the applied torque τ(t) in Figures 4, 5, 7, and 8 exhibits high frequency components. The reasons of these high frequency components are the quantization noise introduced by the encoders and the discrete estimation of the joint velocity given by equation (40).

Performance comparison

In order to compare the performance of the implemented controllers, Figure 10 depicts the time evolution of errors $e_{1} (t)$ and $e_{2} (t)$ for both controllers. In the left-hand side plot, the results by using the periodic oscillation $q_{d} (t)$ in equation (42) are given, and in the right-hand side plot the comparison of the errors by using the periodic oscillation $q_{d} (t)$ in equation (43) is illustrated. Visual examination of the plots in Figure 10 suggests that the best performance is obtained with the new adaptive neural network-based controller (17).

Figure 10.

Comparison of the tracking errors (a), (c) e₁(t) and (b), (d) e₂(t) for the implementation of the model-based controller in equation (12) and the adaptive neural network controller in equation (17). Left-hand side plot: comparison of the errors for the periodic oscillation I. Right-hand side plot: comparison of the errors for the periodic oscillation II.

In order to assess the tracking performance of both controllers, the RMS values of the tracking error ${[e_{1} (t) e_{2} (t)]}^{⊤}$ signal is computed. The results are given in Tables 2 and 3 for the desired periodic oscillation in equations (42) and (43), respectively. The column improvement means the percentage of variation of the new adaptive neural network controller in equation (17) with respect to the model-based controller in equation (12). In order to compare the performance during different time intervals, the RMS value is computed for different time intervals.

Table 2.

Periodic oscillation I: RMS values of the tracking error signals ${[e_{1} (t) e_{2} (t)]}^{⊤}$ .

Interval (s)	Model based	Adaptive NN	Improvement (%)
20–25	0.2591	0.1720	+33.62
15–25	0.2392	0.1696	+29.10
5–25	0.2594	0.1697	+34.58

NN: neural network; RMS: root mean square.

Table 3.

Periodic oscillation II: RMS values of tracking error signals ${[e_{1} (t) e_{2} (t)]}^{⊤}$ .

Interval (s)	Model based	Adaptive NN	Improvement (%)
20–25	0.3369	0.2990	+11.25
15–25	0.3303	0.2873	+13.02
5–25	0.3219	0.2788	+13.39

NN: neural network; RMS: root mean square.

The results observed in Figure 10, which shows the time evolution of $e_{1} (t)$ and $e_{2} (t)$ for both implementations and the obtained RMS values of the tracking errors in Tables 2 and 3, suggest that the best performance is obtained with the adaptive neural network controller in equation (17).

The improvement can be only explained due to the contribution of the adaptive neural network in the controller, which compensates other types of nonmodeled dynamics in equation (1), such as static friction, switching of the PWM servo amplifier, and vibrations in the mechanical structure of the mechanism.

Finally, in order to verify that the neural network contribution to the total applied torque is significant, Figure 11 shows the torque contributions for the model-based controller in equation (12) and the adaptive neural network controller in equation (17), which were separated as follows

τ = τ_{1} + τ_{2}

where

τ_{1} = \frac{- F}{G}, τ_{2} = \frac{{\ddot{q}}_{d} + Δ_{1} {\dot{e}}_{1} + Δ_{2} {\dot{e}}_{2} + k y + δ sign (y)}{G}

for the model-based controller in equation (12), and

\begin{matrix} τ_{1} = \frac{- \hat{F}}{\hat{G}}, \\ τ_{2} = \frac{{\ddot{q}}_{d} + Δ_{1} {\dot{e}}_{1} + Δ_{2} {\dot{e}}_{2} + k y + δ sign (y)}{\hat{G}} \end{matrix}

for the adaptive neural network controller in equation (17).

Figure 11.

Torque contributions for the (a) model-based controller in equation (12) and (b) the adaptive neural network-based controller in equation (17).

As can be seen in Figure 11, the right-hand side plot shows that the neural network contribution $τ_{1} (t)$ is comparable in magnitude with $τ_{2} (t)$ , which contains the proportional part plus the sign term in y(t). It can also be seen that the signal $τ_{2} (t) = - \hat{F} (t) / \hat{G} (t)$ from adaptive neural-network scheme in the right-hand side of Figure 11 remains bounded and is comparable in magnitude to the signal $τ_{2} (t) = - F (t) / G (t)$ from the model-based controller in the left-hand side of Figure 11.

Conclusion

The main purpose of this document has been the introduction of a new controller in order to induce periodic oscillations in an underactuated system. The new scheme is based on adaptive neural networks, which has the advantage that the system model is not required to be known.

The new scheme is inspired from the feedback linearization technique and a novel output function y(t). The output function y(t) was chosen as a linear combination of the position errors and velocity errors.

An extensive real-time experimental study has also been presented, where a model-based controller is compared with the new adaptive neural network scheme. Two types of periodic oscillation were used in order to assess the adaptation capabilities of the new scheme. Better results were obtained with the new scheme since this is able to compensate disturbances and unmodeled dynamics.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by SIP-IPN, TecNM, and CONACYT Projects 176587 and 134534, Mexico.

References

Parmananda

Madrigal

Rivera

et al . Stabilization of unstable steady states and periodic orbits in an electrochemical system using delayed-feedback control. Phys Rev E 1999; 59: 5266–5271.

Semenov

Shevlyakova

and Meleshenko

PA.

Inverted pendulum under hysteretic control: stability zones and periodic solutions. Nonlinear Dyn 2014; 75: 247–256.

Olm

Ramos

and Costa-Castello

Stability analysis of digital repetitive control systems under time–varying sampling period. IET Control Theory Appl 2011; 5: 29–37.

Wang

and Guo

Oscillatory tracking control of a class of nonlinear systems. J Dyn Sys Meas Control 2012; 134: 1–6.

Furuta

Yamakita

and Kobayashi

Swing-up control of inverted pendulum using pseudo-state feedback. Proc IMechE, Part I: J Systems and Control Engineering 1992; 206: 263–269.

Fantoni

and Lozano

Non-linear control for underactuated mechanical systems. London: Springer-Verlag, 2002.

Mori

Nishihara

and Furuta

Control of unstable mechanical system control of pendulum. Int J Control 1976; 23: 673–692.

Chang

Hwang

and Hsieh

JG.

A self-tuning PID control for a class of nonlinear systems based on the Lyapunov approach. J Process Control 2002; 12: 233–242.

Ghosh

Krishnan

and Subudhi

Robust proportional–integral–derivative compensation of an inverted cart–pendulum system: an experimental study. IET Control Theory Appl 2012; 6: 1145–1152.

10.

Åstrom¨

and Furuta

Swinging up a pendulum by energy control. Automatica 2000; 36: 287–295.

11.

Lozano

Fantoni

and Block

DJ.

Stabilization of the inverted pendulum around its homoclinic orbit. Syst Control Lett 2000; 40: 197–204.

12.

Siuka

and Schöberl

Applications of energy based control methods for the inverted pendulum on a cart. Robot Autonom Syst 2009; 57: 1012–1017.

13.

Mazenc

and Praly

Asymptotic tracking of a reference state for systems with a feedforward structure. Automatica 2000; 36: 179–187.

14.

Mazenc

and Bowong

Tracking trajectories of the cart-pendulum system. Automatica 2003; 39: 677–684.

15.

Santiesteban

Floquet

Orlov

et al . Second-order sliding mode control of underactuated mechanical systems ii: orbital stabilization of an inverted pendulum with application to swing up/balancing. Int J Robust Nonlinear Control 2008; 18: 544–556.

16.

Park

and Chwa

Swing-up and stabilization control of inverted-pendulum systems via coupled sliding-mode control method. IEEE Trans Ind Electron 2009; 56: 3541–3555.

17.

Yubazaki

and Hirota

Upswing and stabilization control of inverted pendulum system based on the SIRMs dynamically connected fuzzy inference model. Fuzzy Sets Syst 2001; 122: 139–152.

18.

Wang

Tanaka

and Griffin

MF.

An approach to fuzzy control of nonlinear systems: stability and design issues. IEEE Trans Fuzzy Syst 1996; 4: 14–23.

19.

Moghanni-Bavil-Olyaei

and Ghanbari

Design of an adaptive fuzzy sliding mode control using supervisory fuzzy control for micro-electro-mechanical systems (MEMS) z-axis gyroscope. J Low Freq Noise Vib Active Control 2014; 33: 163–168.

20.

Fatehi

Eghtesad

and Amjadifard

Using singular perturbation method for controlling a crane system with a flexible cable and large swing angle. J Low Freq Noise Vib Active Control 2015; 34: 661–384.

21.

Zhao

and Spong

MW.

Hybrid control for global stabilization of the cart–pendulum system. Automatica 2001; 37: 1941–1951.

22.

Mills

Wills

and Ninness

Nonlinear model predictive control of an inverted pendulum. In: Proceedings of the American control conference, St. Louis, USA, 10–12 June 2009, pp.2335–2340. NJ: IEEE.

23.

Lewis

Jagannathan

and Yesildirek

Neural network control of robot manipulators and nonlinear systems. London, UK: Taylor and Francis, 1999.

24.

Lee

and Harris

CJ.

Adaptive neural network control of robotic manipulators. Singapore: World Scientific, 1998.

25.

Hunt

Sbarbaro

Zbikowski

et al . Neural networks for control systems: a survey. Automatica 1992; 28: 1083–1112.

26.

Nelson

and Kraft

LG.

Real-time control of an inverted pendulum system using complementary neural network and optimal techniques. In: Proceedings of the American control conference, Baltimore, USA, 29 June–1 July 1994, pp.2553–2554. NJ: IEEE.

27.

Sazonov

Klinkhachorn

and Klein

RL.

Hybrid LQG-neural controller for inverted pendulum system. In: Proceedings of the 35th southeastern symposium on system theory, Morgantown, USA, 16–18 March 2003, pp.206–210. NJ: IEEE.

28.

Shaheed

Modelling and open-loop control of a single-link flexible manipulator with neural networks. J Low Freq Noise Vib Active Control 2001; 20: 105–131.

29.

Jung

and Cho

HT.

Decoupled neural network reference compensation technique for a PD controlled two degrees-of-freedom inverted pendulum. Int J Control Autom Syst 2004; 2: 92–99.

30.

Noh

Lee

and Jung

Position control of a mobile inverted pendulum system using radial basis function network. Int J Control Autom Syst 2010; 8: 157–162.

31.

Khalil

HK.

Nonlinear systems. Upper Saddle River, NJ: Prentice Hall, 2002.

32.

Slotine

J-JE

and Li

Applied nonlinear control. Englewood Cliffs, NJ: Prentice Hall, 1996.

33.

Maalouf

Moog

Aoustin

et al . Classification of two-degree-of-freedom underactuated mechanical systems. IET Control Theory Appl 2015; 9: 1501–1510.

34.

Murray

and Sastry

SS.

A mathematical introduction to robotic manipulation. Boca Raton, FL: CRC Press, 1994.

35.

Sciavicco

and Siciliano

Modelling and control of robot manipulators. London: Springer-Verlag, 2000.

36.

Cazzolato

and Prime

On the dynamics of the Furuta pendulum. J Control Sci Eng 2011; 2011: 1–8.

37.

Canudas-de-Wit

Espiau

and Urrea

Orbital stabilization of underactuated mechanical systems. In: Proceedings of the 15th IFAC World Congress, Barcelona, Spain, 21–26 July 2002, pp.1–6. Elsevier IFAC Publications.

38.

Shiriaev

Perram

and Canudas-de-Wit

Constructive tool for orbital stabilization of underactuated nonlinear systems: virtual constraints approach. IEEE Trans Autom Control 2005; 50: 1164–1176.

39.

Haykin

Neural networks: a comprehensive foundation. Upper Saddle River, NJ: Prentice Hall, 1999.

40.

Igelnik

and Pao

YH.

Stochastic choice of basis functions in adaptive function approximation and the functional-link net. IEEE Trans Neural Netw 1995; 6: 1320–1329.

41.

Moreno-Valenzuela

Aguilar-Avelar

Puga-Guzmán

et al . Adaptive neural network control for the trajectory tracking of the Furuta pendulum. IEEE Trans Cybernet 2016; 46(12): 3439–3452.

42.

Moreno-Valenzuela J and Aguilar-Avelar C. Motion Control of Underactuated Mechanical Systems. Cham: Springer International Publishing AG, 2018.