Inner-Learning Mechanism Based Control Scheme for Manipulator with Multitasking and Changing Load

Abstract

With the rapid development of robot technology and its application, manipulators may face complex tasks and dynamic environments in the coming future, which leads to two challenges of control: multitasking and changing load. In this paper, a novel multicontroller strategy is presented to meet such challenges. The presented controller is composed of three parts: subcontrollers, inner-learning mechanism, and switching rules. Each subcontroller is designed with self-learning skills to fit the changing load under a special task. When a new task comes, switching rule reselects the most suitable subcontroller as the working controller to handle current task instead of the older one. Inner-learning mechanism makes the subcontrollers learn from the working controller when load changes so that the switching action causes smaller tracking error than the traditional switch controller. The results of the simulation experiments on two-degree manipulator show the proposed method effect.

1. Introduction

Manipulators, as one of the most popular robots in the world, are often used in industrial factories instead of millwright, welder, carrier, and other workers in product line to reduce costs and enhance quality. This pattern of manipulator utilization works well in the past several decades but is meeting and will meet great challenges [1] from the fast changing manufacturing orders because of the growing number of people who want their daily products different from others'. Besides industrial factories, more and more manipulators are used in the outdoor [2 –5] or indoor [6, 7] environments individually or fixed on moving platforms. To fit these situations, the complexity of task and environment should be considered when we design the controller of manipulator.

Many researchers have focused on this issue. In [8, 9], a human collaborator in the manipulator's working space was considered and the integration of humans and robots behaviors was discussed. Paper [10] presented a novel control strategy for manipulator in unknown environments. Obstacle avoidance during manipulation tasks is studied in [11]. Paper [12] discussed the control problem of manipulator in random vibration. Paper [13] tried to resolve the problem of multitask allocation. The variable structure control (VSC) with sliding mode has been used to solve the load disturbance and other uncertainties [14]. Iterative learning controller of manipulator can help to finish repetitive tasks [15, 16]. Intelligent learning control method is becoming an effective strategy for manipulator with complex task and environment. In [17], neural networks are applied to control the manipulators to deal with the model uncertainties and the varying workload problems. Paper [18 –20] used the fuzzy control and got good performances in certain circumstances. Obviously, it is very hard to use only one controller to reach the control goal for the manipulators may encounter different circumstances. Multicontroller approach is the typical case which has been widely used in complex systems [21]. The multicontroller approach has attracted many researchers in the robot control field during the last few decades due to its tolerance to system failure [22, 23], flexibility in linear parameter-varying applications [24], and success in kinetic modeling of logic-based systems [20]. Paper [25] used the multicontroller in Acrobot robot. Paper [26] used the architecture to control an ocean ship successfully. Sliding mode control is a typical method of multicontrollers and has been widely used in manipulator control [14, 27].

Although we can find many papers about this issue, we need to get down to some basic problems about manipulator control. First, what kind of factors should be considered at current time? Before we design complex controllers, the manipulator without any external sensors, so called the first generation manipulator that has only joint angle sensors, should be considered first. For this kind of manipulator, environment complexity often means changing loads. Obstacles, trajectories, and other factors should be considered later. The second problem is what kind of skills should be integrated into the controller. Because of the load changing, learning skill is necessary. And multicontroller strategy allows us to design controllers individually for each predefined situation. Based on such considerations, a multicontroller structure with multiple learning mechanisms is presented in this paper to deal with the control problem of manipulator with multitasking and changing load.

The remainder of this paper is organized as follows: the two main problems of controlling the manipulator are presented in Section 2. The control architecture based on the inner-learning mechanism is proposed in Section 3 and the stability is proved in Section 4. Numerical simulation studies are carried out on a two-link manipulator to verify the effectiveness of the controller in Section 5, followed by the conclusion in Section 6.

2. Problem Formulation

We consider a robot manipulator whose dynamic model is expressed in Cartesian space as follows:

D (q) \ddot{q} + C (q, \dot{q}) \dot{q} + G (q) = τ,

(1)

where q, $\dot{q}$ , and $\ddot{q}$ , respectively, represent the angle, angular velocity, and angular acceleration, $D (q) \in R^{n \times n}$ is a bounded and positive definite inertia matrix, $C (q, q) \in R^{n \times n}$ is the centrifugal force and Coriolis force term, and $\dot{D} (q) - 2 C (q, \dot{q}) \dot{q}$ is a skew symmetric matrix. G(q) is the gravity term and τ is the control moment. In this paper, the robot manipulators should have the following common robot properties [16].

D(q) is symmetric, bounded, and positive definite matrix.

$\dot{D} (q) - 2 C (q, \dot{q})$ is a skewing symmetric matrix; hence, $x^{T} (\dot{D} (q) - 2 C (q, \dot{q})) x = 0$ , for all x ∈ Rⁿ.

G(q) is uniformly bounded; hence, ∥G(q)∥ ≤ c, where c is an unknown positive parameter.

For the above dynamic system, the problems of controlling it today include two aspects: (a) the manipulators should deal with multitasks when working; that is to say, they have to handle different works after being installed on workstations or other platforms. And (b) the manipulators should undertake different workloads when working; the workload may change under different situations. We will discuss the two problems in detail in the following.

2.1. Deal with Multitasks

The robot manipulators are assigned to various tasks for the growing number of people wants their daily life to be easier and more convenient than the traditional life. The manipulators may need to do some irregular works such as carve some different shapes or curves on the surface of some products; they may also need to do some repeated works like stowing or transporting things in the factory. Above all, manipulators have to finish different tasks to ease people's daily life today. These tasks are generally abstracted to different trajectories on theoretical analysis such as periodical curves, aperiodic curves, and other special trajectories. To simplify the analysis, we use the periodical and aperiodic trajectories as the research object in this paper. The trajectory of the manipulator is decided by the angle which is given by the tasks. So we can use Q_d(t) to represent the tasks and describe the multitasking model by the following piecewise function:

Q_{d} (t) = {\begin{cases} F_{1} (t) & t \in (t_{1}, t_{2}] \\ F_{2} (t) & t \in (t_{2}, t_{3}] \\ ⋮ \\ F_{n} (t) & t \in ({t_{n}}_{- 1}, t_{n}], \end{cases}

(2)

where F(t) is a two-dimensional function that sets the target angle for the joints. Equation (2) is a piecewise functions, and we can set different angles by defining different F(t). As in Figure 1, the manipulator may track aperiodic trajectory to finish task 1 before time t₁ and track periodic trajectory to finish task 2 after time t₁.

Figure 1:

The trajectories of two different tasks.

2.2. Undertake Changing Load

To achieve different targets, the manipulators have to undertake various workloads; they may face the following situations: (a) the workload is a constant in one task but changes to another constant when doing another task; (b) the workload is a constant but changes with different operation objects in one task; (c) the workload may change over time; (d) the workload may face the mixture of the above various situations. The changing pattern of the workload also has two cases: time-driven and event-driven. Obviously, it is a great challenge for us to design a controller to handle all the above situations. So we considered the situations (a), (b), and the time-driven cases in this paper to simplify the theoretical analysis; the physical model can be described by the following piecewise function:

p_{l} = {\begin{cases} c_{1} & t \in (t_{1}, t_{2}], \\ ⋮ \\ c_{n} & t \in ({t_{n}}_{- 1}, t_{n}], \end{cases}

(3)

where p_l is the workload of the robot manipulator, c_i (i = 1, 2,…, n) is a positive constant, and t is the duration time of c_i.

As in Figure 2, we can see that the load will make a change when it comes to the changing time and lasts for a period until the next change time comes. The time of the load change may be the time of the task change like situation (a) or in the process of performing one task like situation (b). The changes may happen simultaneously or one after another, so the controller may lead to large errors if it is unable to adapt to the changes quickly.

Figure 2:

Schematic of the changing load. l₁, l₂, l₃…l_n is the load constant. t₁, t₂, t₃ is the change point when the load changes from one constant to another.

3. Control Scheme Based on the Inner-Learning Mechanism

This paper will solve the aforementioned two problems which may occur during the working of the manipulator. We can use multicontroller architecture to solve the problems, but, as mentioned earlier, merely using switching control cannot guarantee the good effectiveness of the control system because the subcontrollers are unable to acquire the task or workload change information from each other when the control system switches from one controller to another. But it could be solved if the subcontrollers have the ability to learn the information of environmental changes and learn the information of the working controller when switching. So, we introduce the mechanism which we call the inner-learning mechanism; it can make the controllers learn the system change information from each other and adjust their parameters by the knowledge they learned. Based on this mechanism, we propose a control scheme which addresses the following issues. (1) It can reselect the most suitable subcontroller as working controller to handle current situation instead of the older one; (2) the controllers can learn the system information such as load change from the working controller and cause smaller tracking error when switching under the guidance of the inner-learning mechanism. Considering these, we can get an overview of the architecture from Figure 3.

Figure 3:

Multicontroller architecture based on inner-learning mechanism, where C_n is the subcontroller and G is the control plant.

As in Figure 3, the core part of the control structure is surrounded by the dashed line; it is similar to a polygon rotating structure and contains three main aspects: (a) subcontroller; (b) switching rules; and (c) inner-learning mechanism. They are to be introduced in the following sections.

3.1. Subcontrollers

Our goal is to design a multicontroller C to stabilize the manipulator. The multicontroller C consists of a family of subcontrollers {C_m}_{m = 1}ⁿ and is associated with a switching signal, where each subcontroller C_n belongs to the constrained set and the switching signal indicates the next working controller within the family {C_m}_{m = 1}ⁿ [28].

The precondition of the existence of the inner-learning mechanism is that each subcontroller has the learning skill. That is to say, when we choose subcontrollers, we should make sure they can learn the information of the changes from the working control and adjust their parameters to the best. Under the guidance of these principles and for simplicity, we choose the PID controller and iterative learning controller (ILC) as the two subcontrollers.

PID controller has been widely used in industrial robotics for the feature of simple structure and easy realization. Consider the feature of nonlinear and varying load of the robotic system; we use neural network adaptive PID controller as method 1. Since the robots often need to track the periodic curves and the iterative learning control has a good effect for the cyclical action, so we choose the ILC controller as method 2.

When we design controllers, we always try to make the controller as simple as possible under the premise that does not affect system performance. PD and PID controllers are used very frequently in industrial robot control. Practice has proved that PD control is more effective than PID control and it has a simpler structure. Actually, PD controller is one kind of PID controller whose parameter of the integral term is zero, so the controllers are all chosen PD-type in this paper. To simplify the description, we use C₁ and C₂ and subsystem 1 and subsystem 2 to represent method 1, method 2, and the corresponding two subsystems, respectively. The RBF neural network has a good approximation ability; we can use it to adjust the parameters; reference [29] has described this method in detail. The adaptive PID controller can be written as the following equation:

u_{0} = G (q) + k_{p 0} * e + k_{d 0} * \dot{e},

(4)

where k_p0, k_d0 will be adjusted by the neural network.

Controller C₂ uses the open-loop learning algorithm; the algorithm can be defined as

u_{k + 1} (t) = u_{k} (t) + k_{p} e_{k} (t) + k_{d} \frac{d e_{k} (t)}{d t},

(5)

where the subscript k = (0, 1,…, n) is the index of iterations, u_k(t) is the former control, and k_pe_k(t) + k_d(de_k(t)/dt) is a correction term of the former output error. The convergence of the algorithm should be considered when we use the ILC controller.

3.2. Switching Rules

The switching rules should guide the switching of the subcontrollers with the change of tasks and make a decision about which controller to switch to. The switching rules can be defined as

σ (t) = S ([t_{0}, t^{-}), σ ([t_{0}, t^{-})), x ([t_{0}, t^{-})), y ([t_{0}, t^{-}))), (t^{-} \geq t_{0}),

(6)

where t₀ denotes the initial time of the system and x, y represent the states of input and output, respectively. We assume that m ∈ {1, 2,…, n}, and the candidate controller will switch to C_m when σ(t) = m. The rules can be designed as time-driven, state-driven, or event-driven patterns; we should design the switching rules according to the actual situation. For the robot manipulator, it will spend a fixed time to complete the specific task and switch from one to another when the task and load change. So we can design a time-driven switching law to organize the switching of the subcontrollers. The rules can be defined as follows:

σ (t^{+}) = S (t, σ (t^{-})),

(7)

where the switching signal σ only depends on the time and its past values. That is to say, the switching will happen when a task is completed.

3.3. Inner-Learning Mechanism

The innovation of the new architecture is the introduction of inner-learning mechanism. In Figure 3, the subcontrollers are interconnected by the learning mechanism; it can guide the mutual learning of the subcontrollers so that the current controller can get the information from the working controller and adapt to the environment quickly. The design of the mechanism is various from different subcontrollers for its different self-learning mechanism, so we should design an appropriate learning mechanism to realize the inner-learning between the subcontrollers. Consider the two subcontrollers we chose: controller C₁ can adjust the parameters to the best if the initial value failed to meet the requirements of the control action; controller C₂ can achieve the best control effect through continuous learning. They all have the ability of self-learning, so we designed the following learning mechanism.

Before switching, C₁ adjusts the value of k_p and k_d to a group of optimal parameters k_p^d and k_d^d which will be transferred to the iterative learning controller. Then, the control will be switched to

u_{k + 1} (t) = u_{k} (t) + k_{p}^{d} e_{k + 1} (t) + k_{d}^{d} \frac{d e_{k + 1} (t)}{d t} .

(8)

After switching, (8) will be the active controller. The new candidate can learn information from the two parameters k_p^d and k_d^d which are carrying the information of subsystem 1; this would help the candidate controller adapt to the changes of the load of task, reduce the overshoot, and exert effective control.

By periodic revision of the control signals, controller C₂ can achieve the best control effect after running some times. Before the candidate switched to C₁, the system should store the last control input u^d(t) which is also the optimal control of C₂. The initial control effect of C₁ controller is determined by the parameters ${k_{p}}_{0}$ and ${k_{d}}_{0}$ ; the PID controller can exert effective control only when it could learn the information of subsystem 2. So we could find a method to transmit the information from u^d(t) to ${k_{p}}_{0}$ and ${k_{d}}_{0}$ . We use the least square method to fit the two parameters and then take them into the following equation:

u_{d} (t) = [\begin{bmatrix} {k_{p}}_{0} & {k_{d}}_{0} \end{bmatrix}] [\begin{bmatrix} e \\ \dot{e} \end{bmatrix}],

(9)

where e and $\dot{e}$ are already known. Then, (4) will be the effective control signal of the system; it will cause smaller tracking error and adapt to the changes quickly. Then, the system will switch between the subcontrollers under the instruction of this inner-learning mechanism.

4. The Stability of the System

Before discussing the stability, we need to make the following reasonable assumptions and definition [29].

Assumption 1. The trajectories and their first and second order derivatives are bounded.

Assumption 2. The states of the manipulator are measureable and observable.

Assumption 3. For each iteration, the initial conditions are satisfied; q_d(0) = q^j(0), ${\dot{q}}_{d} (0) = {\dot{q}}^{j} (0)$ , for all j ∈ N, where j denotes the operation or iteration number.

Remark 4. For a real robotic system, its joint angle, angular velocity, and angular acceleration must be bounded, and these states can be easily measured by the sensors, so we can ensure the rationality of Assumptions 1 and 2. Assumption 3 is typical in the literature of iterative learning control, and it can be ensured in a real system. So we can see that the 3 assumptions are reasonable.

Definition 5. We called the positive value τ_a the average dwell time under the switching rule σ if it satisfies the following conditions:

N_{σ} (t, T) \leq N_{0} + \frac{T - t}{τ_{a}},

(10)

where T > t ≥ 0 represent the running time of the system, N_σ(t, T) is the switching times on interval (t, T), and N₀ is a positive number.

In order to describe the stability of the system, we defined V as the potential energy of the manipulator and get the following theorem.

Theorem 6. If $(\partial / \partial q) [{(\partial V / \partial q)}^{T}]$ are bounced, for all $q \in R^{n}$ , the time to finish each task is long enough for the manipulator, and the initial conditions of iteration are satisfied, that is, q_d(0) = q(0), ${\dot{q}}_{d} (0) = \dot{q} (0)$ , then the system is globally stable.

Proof (a) The Stability of the Subsystems. For subsystem 1, [30] has given a detailed proof of its stability. It used Hamilton function as the Lyapunov function and got the conclusion that if the control gain K_p satisfies (11), then the system is stable. Consider

\frac{\partial}{\partial q} [{(\frac{\partial V}{\partial q})}^{T}] + K_{p} > 0,

(11)

where K_p is the error gain matrix of the PD algorithm in [30].

From the properties of the robot dynamics, we have

{(\frac{\partial V}{\partial q})}^{T} = G^{T} (q) .

(12)

In our method, k_p0 corresponds to K_p in (11); we can see that if k_p0 in subsystem 1 satisfied (11), then we can prove the stability of subsystem 1. So we substitute k_p0 into (11) and get (∂/∂q)[G^T(q)] + k_p0, where G(q) is the gravity term in (1) and is obtained by (18) in our robotic system and k_p0 is obtained by (9), for u^d(t) has been able to stabilize the system, so the least squares method can make the parameter k_p0 meet the following inequality:

\frac{\partial}{\partial q} [G^{T} (q)] + k_{p 0} = \frac{\partial}{\partial q} [m_{4} * g * \cos (q_{1}) + m_{5} * g * \cos (q_{1} + q_{2}), m_{5} * g * \cos (q_{1} + q_{2})] + k_{p 0} > 0;

(13)

the detailed calculation process will be omitted here. For a real robotic system, it is easy to verify the establishment of (13) and from [30] we can get the result that subsystem 1 is stable.

For subsystem 2, we will simplify some symbols before the stability proof. Consider

\begin{matrix} D = C (q_{d}), C = C (q_{d}, {\dot{q}}_{d}), G = G (q_{d}, {\dot{q}}_{d}), \\ C_{1} = {\frac{\partial C}{\partial \dot{q}} |}_{(q_{d}, {\dot{q}}_{d})} {\dot{q}}_{d} + {\frac{\partial G}{\partial \dot{q}} |}_{(q_{d}, {\dot{q}}_{d})}, \\ F = {\frac{\partial D}{\partial q} |}_{(q_{d}, {\dot{q}}_{d})} {\ddot{q}}_{d} + {\frac{\partial C}{\partial q} |}_{(q_{d}, {\dot{q}}_{d})} {\dot{q}}_{d} + {\frac{\partial G}{\partial q} |}_{(q_{d}, {\dot{q}}_{d})}, \end{matrix}

(14)

where λ_min(·) represents the minimum eigenvalue of the matrix and ${∥ \cdot ∥}_{\max}$ represents the maximum Euclidean norm of the matrix. Then, we obtained the following lemma.

Lemma 7. If the control gain satisfies the following equation:

\begin{matrix} l_{p} = λ_{\min} (K_{d}^{0} + 2 C_{1} - 2 Λ D) > 0, \\ l_{r} = λ_{\min} (K_{d}^{0} + 2 C + \frac{2 F}{Λ} - \frac{2 {\dot{C}}_{1}}{Λ}) > 0, \\ l_{p} l_{r} = {∥ \frac{F}{Λ} - (C + C_{1} - Λ D) ∥}_{\max}^{2}, \end{matrix}

(15)

where K_d⁰ is the initial control gain and Λ is a diagonal positive definite matrix, then the system subsystem 2 is stable, for t ∈ [t_s, t_f] (where t_s, t_f are the start time and finish time of the iteration, respectively).

Reference [16] has proved that Lemma 1 is established for the variable gain. K_d⁰ in (15) is corresponding to K_d^d of subsystem 2. So subsystem 2 is stable if K_d^d satisfied (15). The inner-learning mechanism can make the optimal control of subsystem 1 the initial control of subsystem 2. So (15) is established when the system switches to subsystem 2 and the stability of subsystem 2 is obtained.

(b) The Stability of the Switching System. The control architecture we proposed is a multicontroller switching system; we should note that the stability of subsystems is not equal to the stability of the switching system [31]. So we should demonstrate the stability of the switching system.

Paper [32] proposed a stability analysis method based on the average dwell time which is defined in Definition 5. Indeed, if the switching system stays on each subsystem long enough to ensure that the energy reduction is equal or more than the energy increase caused by the switching, then the stability is guaranteed. It has been proved that if the dwell time τ_a is sufficiently large, then the proposed switching system is stable [33]. Its physical meaning is that the system is stable if there is sufficient time to offset the energy increasing trend caused by the switching. So, if the time to finish each task is long enough for the manipulator, then the system is stable.

5. Simulation Results

In order to verify the effectiveness of the controller, we designed the following simulation.

The manipulator used for simulation is a two-revolute joint robot, as shown in Figure 4.

Figure 4:

A two-revolute joint robot.

Due to the limitation input method, we rephrase it here, that is: The simulation experiments use the model which is described in (1). D(q), $C (q, \dot{q})$ , and G(q) in (1) are, respectively, given by the following expressions:

D (q) = [\begin{bmatrix} m_{1} + m_{2} + 2 * m_{3} * \cos (q_{2}) & m_{2} + m_{3} * \cos (q_{2}) \\ m_{2} + m_{3} * \cos (q_{2}) & m_{2} \end{bmatrix}],

(16)

C (q, \dot{q}) = [\begin{bmatrix} - m_{3} * ({\dot{q}}_{2}) * \sin (q_{2}) & - m_{3} * ({\dot{q}}_{1} + {\dot{q}}_{2}) * \sin (q_{2}) \\ m_{3} * {\dot{q}}_{1} * \sin (q_{2}) & 0 \end{bmatrix}],

(17)

G (q) = [\begin{bmatrix} m_{4} * g * \cos (q_{1}) + m_{5} * g * \cos (q_{1} + q_{2}) \\ m_{5} * g * \cos (q_{1} + q_{2}) \end{bmatrix}],

(18)

where m_i in (2) is the element of the vector M which is given by

M = P + p_{l} L,

(19)

where p_l denotes the workload, P is the parameter vector of the robot which can be described as $P = {[p_{1} p_{2} p_{3} p_{4} p_{5}]}^{T}$ , $L = {[l_{1}^{2} l_{2}^{2} l_{1} l_{2} l_{1} l_{2}]}^{T}$ , where l₁ = l₂ = 1 are the length of the two joints, g = 9.8, and $P = {[1.66 0.42 0.63 3.75 1.25]}^{T}$ . The robot will track two different groups of target curves to verify the effectiveness of the control structure.

Group a. Consider

\begin{matrix} q_{d 1} (t) = 0.01 t^{2} \\ q_{d 2} (t) = 0.01 t^{2}, \end{matrix} t \in (0, t_{1}] .

(20)

Group b. Consider

\begin{matrix} q_{d 1} (t) = 0.1 * \sin (t) \\ q_{d 2} (t) = 0.1 * \sin (t), \end{matrix} t \in [t_{1}, t) .

(21)

The changing mission trajectory can be defined as the changing curve at time t₁. Also, the load change can be defined as

p_{l} = {\begin{cases} 0 & t \in (0, t_{1}^{'}] \\ 1 & t \in (t_{1}^{'}, t] . \end{cases}

(22)

It means the load will change at time t₁′.

Target trajectory and task load changes can be divided into the following cases.

Case 1. t₁ = t₁′; namely, the changes occur at the same time.

Case 2. t₁ < t₁′; namely, the load changes after the change of target trajectory.

Case 3. t₁ > t₁′; namely, the target trajectory will change after the load.

In the experiment, we set t₁ = t₁′ = 10 in Case 1; t₁ = 10, t₁′ = 15 in Case 2; t₁ = 10, t₁′ = 5 for Case 3. Then, we use adaptive PID controller and switch controller based on PID controller, iterative learning controller, and the proposed controller to control the double-joint robot, respectively. The simulation results are shown in Figures 5, 6, 7, 8, 9, and 10.

Figure 5:

Control effects of Case 1. (a), (b), and (c) show the effects of switching control without inner-learning mechanism, adaptive PID control, and inner-learning switching control, respectively.

Figure 6:

Tracking errors of Case 1. (a), (b), and (c) show the trajectory tracking error of the switching control without inner-learning mechanism, adaptive PID control, and inner-learning switching control, respectively.

Figure 7:

Control effects of Case 2. (a), (b), and (c) show the effects of switching control without inner-learning mechanism, adaptive PID control, and inner-learning switching control, respectively.

Figure 8:

Tracking errors of Case 2. (a), (b), and (c) show the trajectory tracking error of the switching control without inner-learning mechanism, adaptive PID control, and inner-learning switching control, respectively.

Figure 9:

Control effects of Case 3. (a), (b), and (c) show the effects of switching control without inner-learning mechanism, adaptive PID control, and inner-learning switching control, respectively.

Figure 10:

Tracking errors of Case 3. (a), (b), and (c) show the trajectory tracking error of the switching control without inner-learning mechanism, adaptive PID control, and inner-learning switching control, respectively.

From the above simulation result, we can see that the control architecture we proposed is more suitable for the complex robotic system; it can make the robot more adaptable to the complex environment and solve the problems we proposed in the beginning of the paper effectively. To see the effectiveness of the architecture more clearly, we calculated the summation of the trajectory tracking error mean of the three different control structures in three cases and made Table 1.

Table 1:

The error mean comparison of the three controllers (where $\bar{e_{1}} + \bar{e_{2}}$ is the summation of the error mean).

Error mean ( $\bar{e_{1}} + \bar{e_{2}}$ )	Case
Error mean ( $\bar{e_{1}} + \bar{e_{2}}$ )	Case 1 $t_{1} = {t_{1}}^{'}$	Case 2 $t_{1} < {t_{1}}^{'}$	Case 3 $t_{1} > {t_{1}}^{'}$

Controller
Learning control	0.0811	0.0809	0.1224
Adaptive PID	0.1758	0.1339	0.1575
Switch control	0.2039	0.2069	0.2872

From Table 1, we can see that the control architecture we proposed has a better performance than the two common methods.

6. Conclusion

In this paper, we discuss the control problems of the manipulators with changing load and multitasks and propose a novel kind of multicontroller architecture for the robot manipulator system and demonstrate the stability of the architecture. The new architecture can establish an internal learning mechanism for the multicontroller, guarantee the continuity of the system, and avoid the impact of the transient response caused by the switching action. The result of the experiment shows the effectiveness of the proposed architecture. Our future work will focus on the applications of this novel method in multielectromechanical and embedded systems.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Footnotes

Acknowledgments

This work was supported by the Chinese Fundamental Research Funds for the Central Universities (nos. CDJZR11170006 and CDJZR12170018), the National Natural Science Foundation of China (no. 60905053), and the Chongqing Natural Science Foundation (no. 2011BB0081).

References

Madani

Daachi

, and Benallegue

, “Adaptive variable structure controller of redundant robots with mobile/fixed obstacles avoidance,” Robotics and Autonomous Systems, vol. 61, no. 6, pp. 555–564, 2013.

S.-C.

Yuh

, and Kim

, “Armless underwater manipulation using a small deployable agent vehicle connected by a smart cable,” Ocean Engineering, vol. 70, pp. 149–159, 2013.

Mazzini

Kettler

Guerrero

, and Dubowsky

, “Tactile robotic mapping of unknown surfaces, with application to oil wells,” IEEE Transactions on Instrumentation and Measurement, vol. 60, no. 2, pp. 420–429, 2011.

Paul

Webb

Liu

, and Dissanayake

, “Autonomous robot manipulator-based exploration and mapping system for bridge maintenance,” Robotics and Autonomous Systems, vol. 59, no. 7–8, pp. 543–554, 2011.

Sabatini

Monti

Gasbarri

, and Palmerini

G. B.

, “Adaptive and robust algorithms and tests for visual-based navigation of a space robotic manipulator,” Acta Astronautica, vol. 83, pp. 65–84, 2013.

Kim

K.-Y.

Song

H.-S.

Suh

J.-W.

, and Lee

J.-J.

, “A novel surgical manipulator with workspace-conversion ability for telesurgery,” IEEE/ASME Transactions on Mechatronics, vol. 18, no. 1, pp. 200–211, 2013.

Stoianovici

Kim

Schäfer

, “Endocavity ultrasound probe manipulators,” IEEE/ASME Transactions on Mechatronics, vol. 18, no. 3, pp. 914–921, 2013.

Zanchettin

A. M.

Bascetta

, and Rocco

, “Acceptability of robotic manipulators in shared working environments through human-like redundancy resolution,” Applied Ergonomics, vol. 44, no. 6, pp. 982–989, 2013.

Zeng

and Bone

G. M.

, “Design of elastomeric foam-covered robotic manipulators to enhance human safety,” Mechanism and Machine Theory, vol. 60, pp. 1–27, 2013.

10.

Capisani

L. M.

and Ferrara

, “Trajectory planning and second-order sliding mode motion/interaction control for robot manipulators in unknown environments,” IEEE Transactions on Industrial Electronics, vol. 59, no. 8, pp. 3189–3198, 2012.

11.

Duguleana

Barbuceanu

F. G.

Teirelbar

, and Mogan

, “Obstacle avoidance of redundant manipulators using neural networks based reinforcement learning,” Robotics and Computer-Integrated Manufacturing, vol. 28, no. 2, pp. 132–146, 2012.

12.

Cui

M.-Y.

Xie

X.-J.

, and Wu

Z.-J.

, “Dynamics modeling and tracking control of robot manipulators in random vibration environment,” IEEE Transactions on Automatic Control, vol. 58, no. 6, pp. 1540–1545, 2013.

13.

Żabińska

Sośnicki

Cetnarowicz

, and Turek

, “Robot task allocation using signal propagation model,” Procedia Computer Science, vol. 18, pp. 1505–1514, 2013.

14.

Feng

, and Man

, “Non-singular terminal sliding mode control of rigid manipulators,” Automatica, vol. 38, no. 12, pp. 2159–2167, 2002.

15.

Tayebi

Abdul

Zaremba

M. B.

, and Ye

, “Robust iterative learning control design: application to a robot manipulator,” IEEE/ASME Transactions on Mechatronics, vol. 13, no. 5, pp. 608–613, 2008.

16.

Ouyang

P. R.

Zhang

W. J.

, and Gupta

M. M.

, “An adaptive switching learning control method for trajectory tracking of robot manipulators,” Mechatronics, vol. 16, no. 1, pp. 51–61, 2006.

17.

S. S.

Hang

C. C.

, and Woon

L. C.

, “Adaptive neural network control of robot manipulators in task space,” IEEE Transactions on Industrial Electronics, vol. 44, no. 6, pp. 746–752, 1997.

18.

Sahraoui

Salem

, and Khelfi

M. F.

, “Online fuzzy based intelligent control of robot manipulator by SAFIS approach,” in Proceedings of the 1st International Conference on Complex Systems (ICCS '12), pp. 1–6, Agadir, Morocco, November 2012.

19.

Wai

and Muthusamy

, “Fuzzy-neural-network control for robot manipulator via sliding-mode design,” in Proceedings of the 9th Asian Control Conference (ASCC '13), Istanbul, Turkey, 2013.

20.

and Woo

P.-Y.

, “Fuzzy supervisory sliding-mode and neural-network control for robotic manipulators,” IEEE Transactions on Industrial Electronics, vol. 53, no. 3, pp. 929–940, 2006.

21.

Liberzon

, Switching in Systems and Control, Birkhauser, Boston, Mass, USA, 2003.

22.

Siljak

D. D.

, “Reliable control using multiple control systems,” International Journal of Control, vol. 31, no. 2, pp. 303–329, 1980.

23.

Bonilla

Mendoza

González-Galván

E. J.

Chávez-Olivares

Loredo-Flores

, and Reyes

, “Path-tracking maneuvers with industrial robot manipulators using uncalibrated vision and impedance control,” IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews, vol. 42, no. 6, pp. 1716–1729, 2012.

24.

Paul

and Safonov

M. G.

, “Model reference adaptive control using multiple controllers and switching,” in Proceedings of the 42nd IEEE Conference on Decision and Control, pp. 3256–3261, December 2003.

25.

Xue

Hou

, and Li

, “State switching optimization and global stability control strategy for underactuated two-link manipulator,” Chinese Journal of Scientific Instrument, vol. 33, no. 5, pp. 1035–1040, 2012.

26.

Saari

and Djemai

, “Ship motion control using multi-controller structure,” Ocean Engineering, vol. 55, pp. 184–190, 2012.

27.

Fallaha

C. J.

Saad

Kanaan

H. Y.

, and Al-Haddad

, “Sliding-mode robot control with exponential reaching law,” IEEE Transactions on Industrial Electronics, vol. 58, no. 2, pp. 600–610, 2011.

28.

Jiang

and Hespanha

J. P.

, “Multi-controller design under uncontrolled and controlled switching,” in Proceedings of the American Control Conference (ACC '09), pp. 1760–1765, St. Louis, Mo, USA, June 2009.

29.

Liu

, Advanced PID Control of Matlab Simulation, PHEI, Beijing, China, 2011.

30.

Takegaki

and Arimoto

, “New feedback method for dynamic control of manipulators,” Journal of Dynamic Systems, Measurement and Control, Transactions of the ASME, vol. 103, no. 2, pp. 119–125, 1981.

31.

Peleties

and DeCarlo

, “Asymptotic stability of m-switched systems using Lyapunov functions,” in Proceedings of the 31st IEEE Conference on Decision and Control, Tucson, Ariz, USA, 1992.

32.

Hespanha

J. P.

and Morse

A. S.

, “Stability of switched systems with average dwell-time,” in Proceedings of the 38th IEEE Conference on Decision and Control (CDC '99), pp. 2655–2660, Phoenix, Ariz, USA, December 1999.

33.

Morse

A. S.

, “Supervisory control of families of linear set-point controllers—part 1: exact matching,” IEEE Transactions on Automatic Control, vol. 41, no. 10, pp. 1413–1431, 1996.