Recursive Differential Evolution Algorithm for Inertia Parameter Identification of Space Manipulator

Abstract

This paper proposes a recursive differential evolution (RDE) algorithm to identify the inertia parameters of an unknown target and simultaneously revise the friction parameters of space manipulator joints. The inertia parameters of a space manipulator, which govern the dynamic behaviour of the entire system to a significant extent, can change for many reasons during on-orbit operations; consequently, it is essential to track these changes within the control system to ensure the stability and accuracy of the entire system. RDE is inspired by the recursive least squares algorithm and uses approximate gradient information to guide the mutation operation of standard DE. A series of comparative simulations is employed to confirm the feasibility of the RDE algorithm. The simulation results show that identification with the RDE algorithm is more precise than with a genetic algorithm (GA) or a least squares (LS) algorithm, and that it has an appropriate convergence rate. The RDE identification method is suitable for linear, nonlinear and combined systems, and can follow system dynamics exactly.

Keywords

Recursive Differential Evolution, Parameter Identification, Friction, Space Manipulator

1. Introduction

Space robotics is considered one of the most promising approaches for on-orbit servicing (OOS) missions such as docking, berthing, refuelling, repairing, upgrading, transporting, rescuing, orbital debris removal, etc. Many enabling techniques have been developed in the past two decades and several technology demonstration missions have been completed [1, 2]. A number of on-orbit servicing missions have been successfully accomplished. Engineering Test Satellite VII (ETS-VII), an unmanned spacecraft equipped with a 2m long, six-degree-of-freedom manipulator arm, with the objective of verifying technologies for autonomous rendezvous and docking (AR&D), as well as robotic servicing in space [3], was developed and launched by the National Space Development Agency of Japan (NASDA). ETS-VII has successfully carried out a variety of on-board experiments with its manipulator arm, such as model-based space robot teleoperation from the ground with a time-delay [4] and robotic servicing tasks such as orbital replacement units (ORU) exchange, deployment of a space structure, and capture and berthing of a target satellite [5]. These key technologies are essential for an orbital free-flying robot. The Defence Advanced Research Projects Agency (DARPA), in conjunction with Boeing, successfully launched and accomplished the Orbital Express mission in 2007. As an advanced OOS technology demonstration mission, it demonstrated short-range and long-range autonomous rendezvous, capture and berthing, on-orbit electronics upgrades, on-orbit refuelling, as well as an autonomous fly-around visual inspection using a demonstration client satellite. During the mission, a robotic arm autonomously transferred a supplemental battery and backup computer to a target spacecraft designed to be serviced [6].

During on-orbit operations, the inertia parameters of the space robot can change for several reasons, e.g., fuel consumption, payload deployment, capturing of a target or docking with a spacecraft. The control system should track the changes of these parameters to ensure the stability and accuracy of the entire system. For example, after an unknown target has been captured, the inertia parameters of the combination of manipulator arm and target are not the same as those of the manipulator alone. Some operations of the manipulator cannot be completed optimally if the inertia parameters of the new compound system are not well known. In other cases, advanced control methods that consider the arm/base coupling, such as “reactionless manipulation” [7], resolved motion control based on a “generalized Jacobian matrix” [8] and “coordinated control” [9], need precise knowledge of the inertia parameters of each body [10]. As such, identification of the inertia parameters of the target and the space manipulator may become essential.

Generally, knowledge of the 10 inertia parameters of a mechanical system (mass, centre of mass location and the inertia tensor, i.e., moments and products of inertia) is of great interest whenever dynamic behaviour is significantly governed by these parameters [11]. Such parameters are often contained in the equations describing the system. There are several methods for identifying the inertia parameters of a robot, most of which are based on Newton-Euler dynamic equations [12–21]. Generally, one of two assumptions is adopted in order to correct and identify inertia properties from these equations.

The first assumption is that the linear momentum and angular momentum of the entire system are conserved [12–16]. A procedure has been developed for calculating the mass properties of a manipulator, which is mounted on a six-degree-of-freedom force sensor, by measuring the reaction forces and moments at the base. The mass properties identified by this procedure are not sufficiently complete for computed torque and other dynamic control techniques, but do provide compensation for the gravitational load on the links of the manipulator [12]. To avoid rank deficiency of the identification matrix, a group of inertia parameters that have a dominant effect on the angular motion dynamics was selected and identified [13]. This method enables inertia parameter identification using standard flight telemetry data of an existing space robot through the simple observation of a typical manipulator operation, and does not require a special measurement apparatus or operation procedure. However, not all of the unknown parameters are identified. By eliminating the linear and angular momenta from the momentum equations, the momentum increment equations are obtained and the unknown inertial parameters of the grasped object are determined [14]. The advantage of this method is that the inertial parameters can be accurately identified in both cases of zero and nonzero momenta, and the problem of singular solutions can also be avoided. Two approaches, the least squares method based on parameter decoupling and nonlinear system optimization based on PSO, were proposed to identify the inertia parameters of the spacecraft [15]. These approaches do not assume that the spacecraft moves slowly and thereby overcome the shortcomings of previous methods that forcibly break the coupling relationship between the parameters.
A robotics-based method for on-orbit identification of the inertia properties of a spacecraft makes use of an on-board robotic arm to change the inertia distribution of the spacecraft system [16]. The velocity of the spacecraft system changes according to the inertia redistribution. Since the velocity change is measurable and the inertia redistribution of the robotic arm itself is precisely computable, the inertia parameters of the spacecraft body become the only unknown elements in the momentum equations and hence can be identified using the momentum equations of the spacecraft system. This method requires accurate kinematic and dynamic parameters of the arm; it is insensitive to sensor noise when identifying the mass and mass-centre location, but extremely sensitive to sensor noise when identifying the inertia tensor.

The second assumption is that the space manipulator arm is driven by input torque, which has little influence on the base [17–21]. A method was validated by the ETS-VII satellite experiments using accelerometers [17]. This method consisted of an iteration of three operations: firstly, the planning of optimal manoeuvres with a simulation model and with guessed (or updated) parameters; secondly, execution of the optimal manoeuvres on the real system with data acquisition; finally, identification with a simulation model by updating the parameters. An adaptive control method that does not use acceleration information in the identification model was proposed [18]; it consisted of a PD feedback part and a full dynamics feedforward compensation part, with the unknown manipulator and payload parameters being estimated online. A neural network was applied to realize online identification, with the sampled real-time data being input to the network in order to obtain the real-time parameter values [19]. The identified parameters were regarded as the weights of the network, and the weights approach the actual values of the identified parameters through training. A genetic algorithm, which was able to identify inertial parameters directly and effectively, was applied to inertial parameter identification in the underactuated manipulator system after transformations [20]. An interactive stochastic gradient (ISG) algorithm was proposed to estimate the inertial parameters of the space manipulators and the target in the scenario of multiple space robots manipulating the target [21]. This method utilizes the estimated results of adjacent nodes to modify locally identified parameters, and converges faster and with better robustness and stability than the distributed least squares method.

In this paper, a method is proposed for identifying an unknown target's inertia parameters and simultaneously correcting the joint friction parameters under the second assumption. In order to estimate the linear and nonlinear dynamic parameters of a space robot in an unknown environment in a timely manner, the new recursive differential evolution (RDE) method is introduced. Compared with traditional algorithms, such as the least squares (LS) algorithm, the genetic algorithm (GA) and particle swarm optimization (PSO), the RDE algorithm differs in its use of historical information. The current identification results of the traditional algorithms do not have any relationship with the historical results, while the RDE algorithm can utilize the historical results to renew the estimated results. At the same time, the RDE algorithm retains the advantages of the differential evolution (DE) algorithm, such as robustness and fast convergence. In order to use RDE to solve our problem, at least three improvements/adaptations were made, as follows: 1) considering the practical application, we simplified the problem model: the manipulator system was reduced to a two-body system with rigid bodies connected by a rotational joint, and the entire system was divided into three sub-body systems, i.e., the unknown body subsystem, the base body subsystem and the two-joint arm body subsystem; 2) in order to make the manipulator system model equations more concise, we transformed the form in which the unknown parameters are expressed; 3) drawing on the idea of the DE/rand-to-best/1/bin mutation strategy and the recursive least squares (RLS) algorithm, the approximate gradient vector of the maximum, multiplied by a regulating factor, was used as part of a difference vector, which replaced the best vector of DE/rand-to-best/1/bin. This improvement can guide the entire population toward the global optimum.
Current studies primarily focus on linear system identification and off-line identification of nonlinear systems; however, the RDE algorithm can estimate the dynamic parameters of both nonlinear and linear terms by using the recursive difference vector to update the system's parameters in a timely manner. The simulation results demonstrate the good convergence of the RDE algorithm.

The rest of this paper is organized as follows. In section 2, the dynamic model and its control functions are formulated in consideration of joint friction torque vector. Then, the identified parameters are deduced with the base of dynamic function. In section 3, differential evolution and its improved method are introduced. In section 4, the inspiration for and deduced procedures of RDE are presented. In section 5, RDE is applied to parameter identification for the single-arm space manipulator, before the inertia properties of payload and joint friction coefficient are determined. The simulation and its analysis are also presented in this section. Conclusions and the scope for further research are presented in section 6.

2. Models and Identification Problem

2.1. Models of space manipulator

The space robot system in this paper is considered as a space manipulator with N (or n) degrees of freedom ( $N \geq 2$ ), which is a serial rigid link system connected by rotary hinges attached to a satellite base. Figure 1 shows the manipulator arm firmly grasping an unknown target and the combination of the end effector of the manipulator and the target can be viewed as a new link at the end of the manipulator.

Figure 1.

Model of space manipulator

In Figure 1, b represents the model of the space manipulator; i (i = 1, 2, …, n) is the index of the body from the base; $Σ_{I}$ is the inertial reference frame; $Σ_{b}$ is the satellite base fixed frame; $Σ_{E}$ is the end effector frame; $J_{i}$ is the centre of joint i; $C_{b}$ is the centroid of the base; $C_{i}$ is the centroid of link i; $B_{i}$ is the body of link i.

2.2. Dynamic model of space manipulator

When joint friction is considered, the dynamic equations of the space manipulator controlled by the input torque are:

M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + G (q) + τ_{f} = τ + τ_{F}

(1)

where $q \in R^{n \times 1}$ is the joint angle position vector; $\dot{q} \in R^{n \times 1}$ is the joint angular velocity vector; $\ddot{q} \in R^{n \times 1}$ is the joint angular acceleration vector; $M (q) \in R^{n \times n}$ is the symmetric positive-definite inertia matrix; $C (q, \dot{q}) \in R^{n \times n}$ is the centrifugal and Coriolis force matrix; $G (q) \in R^{n \times 1}$ is the gravity vector, which depends on the position of the manipulator arm; $τ_{f} \in R^{n \times 1}$ is the joint friction torque; $τ \in R^{n \times 1}$ is the real input control torque; $τ_{F} \in R^{n \times 1}$ is the external torque.

When the end effector of the manipulator arm and the target are viewed as a new link of the manipulator, we obtain $τ_{F} = 0$ . Considering that the joint friction torque τ_f is the main source of the difference between the input torque τ and the ideal input torque $τ_{l} \in R^{n \times 1}$ , we obtain $τ = τ_{l} + τ_{f}$ , where τ_l can be calculated from $q, \dot{q}$ and $\ddot{q}$ . Neglecting the influence of gravity, the dynamic equations can be written as:

M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + τ_{f} = τ_{l} + τ_{f} = τ

(2)

2.3. Model simplification and parameter definition

The configuration of the serial manipulator arm is determined by the joint angles, which can be precisely measured by the joint position sensors. Thus, the exact configuration of the manipulator arm is known at all times. In general, the dynamic parameters of the links and joints, which consist of mass, the centroid position and moments of inertia, will not change with movement of the arm. Therefore, dynamic parameters can be determined in advance and can in practice be considered as constant parameters. The brake device is usually installed on the joints of the manipulator arm for safety. In this way, any of the joints can be individually locked by the brake. For simplicity, we propose the following assumptions: 1) only the first joint and the joint connecting the last two links of the manipulator arm are active and the other joints are locked during the identification process; 2) the exact configuration and the parameters of the links and joints, except the last link, are known.

Figure 2 shows that the entire system is divided into three sub-body systems. The combination of the unknown target and the manipulator arm's final link is called the unknown body subsystem. The satellite base body is called the base body subsystem and the remainder of the manipulator arm is called the two-joint arm body subsystem. The parameters of the base body subsystem and the two-joint arm body subsystem are known, and these two subsystems are connected by the first joint. Here, the parameters to be identified are divided into those of the τ_l term and those of the τ_f term.

Figure 2.

Simplified model of the space manipulator

Considering the $τ_{l}$ item, two-joint manipulator model equations are presented as follows [22]:

[\begin{matrix} α + 2 ε \cos (q_{2}) + 2 η \sin (q_{2}) & β + ε \cos (q_{2}) + η \sin (q_{2}) \\ β + ε \cos (q_{2}) + η \sin (q_{2}) & β \end{matrix}] [\begin{matrix} {\ddot{q}}_{1} \\ {\ddot{q}}_{2} \end{matrix}] + [\begin{matrix} ε Y_{1} + η Y_{2} \\ ε Y_{3} + η Y_{4} \end{matrix}] = [\begin{matrix} τ_{l 1} \\ τ_{l 2} \end{matrix}]

(3)

where $τ_{l i} (i = 1, 2)$ is the expected input torque of joint i, $q_{i}, {\dot{q}}_{i}, {\ddot{q}}_{i} (i = 1, 2)$ are the angle position, angle velocity and angle acceleration of joint i, respectively, and the expressions of $Y_{j} (j = 1, 2, 3, 4)$ are:

\begin{cases} Y_{1} = - 2 \sin (q_{2}) {\dot{q}}_{1} {\dot{q}}_{2} - \sin (q_{2}) {\dot{q}}_{2}^{2} + e_{2} \cos (q_{1} + q_{2}) \\ Y_{2} = - 2 \cos (q_{2}) {\dot{q}}_{1} {\dot{q}}_{2} + \cos (q_{2}) {\dot{q}}_{2}^{2} + e_{2} \sin (q_{1} + q_{2}) \\ Y_{3} = - 2 \sin (q_{2}) {\dot{q}}_{2}^{2} + e_{2} \cos (q_{1} + q_{2}) \\ Y_{4} = - 2 \cos (q_{2}) {\dot{q}}_{2}^{2} + e_{2} \sin (q_{1} + q_{2}) \end{cases}

(4)
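As a minimal sketch of how Eq.(3) and Eq.(4) can be evaluated for one sample, the following Python function computes the ideal torques from the lumped parameters and the joint states. The function name `ideal_torque`, its argument layout and the treatment of $e_{2}$ as a known constant (defaulting to zero) are assumptions made for illustration; the terms are transcribed as printed in Eq.(4).

```python
import math

def ideal_torque(alpha, beta, eps, eta, q, dq, ddq, e2=0.0):
    """Evaluate Eq.(3) with Y_1..Y_4 from Eq.(4) for a single sample.

    q, dq, ddq are (joint 1, joint 2) tuples; e2 is assumed known.
    """
    q1, q2 = q
    dq1, dq2 = dq
    ddq1, ddq2 = ddq
    s2, c2 = math.sin(q2), math.cos(q2)
    s12, c12 = math.sin(q1 + q2), math.cos(q1 + q2)

    # Velocity-dependent terms, transcribed from Eq.(4)
    y1 = -2.0 * s2 * dq1 * dq2 - s2 * dq2 ** 2 + e2 * c12
    y2 = -2.0 * c2 * dq1 * dq2 + c2 * dq2 ** 2 + e2 * s12
    y3 = -2.0 * s2 * dq2 ** 2 + e2 * c12
    y4 = -2.0 * c2 * dq2 ** 2 + e2 * s12

    # Entries of the symmetric 2x2 matrix in Eq.(3)
    m11 = alpha + 2.0 * eps * c2 + 2.0 * eta * s2
    m12 = beta + eps * c2 + eta * s2
    m22 = beta

    tau_l1 = m11 * ddq1 + m12 * ddq2 + eps * y1 + eta * y2
    tau_l2 = m12 * ddq1 + m22 * ddq2 + eps * y3 + eta * y4
    return tau_l1, tau_l2
```

Because the right-hand side is linear in the four lumped parameters, the same expressions could also be arranged as a regressor matrix for a least squares fit.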

The variables $[α, β, ε, η]$ are functions of the parameters $[m_{e}, I_{e}, l_{c e}, δ_{e}]$ , all of which are identification parameters for the manipulator arm grasping the unknown target, where m_e is the mass, I_e is the moment of inertia, $l_{c e}$ is the distance from the origin to the mass centre and δ_e is the angle between the position vector from the origin to the mass centre and the manipulator arm link. Expressions for $[α, β, ε, η]$ are given as:

\begin{cases} α = I_{1} + m_{1} l_{c 1}^{2} + I_{e} + m_{e} l_{c e}^{2} + m_{e} l_{1}^{2} \\ β = I_{e} + m_{e} l_{c e}^{2} \\ ε = m_{e} l_{1} l_{c e} \cos (δ_{e}) \\ η = m_{e} l_{1} l_{c e} \sin (δ_{e}) \end{cases}

(5)

There is a one-to-one correspondence between $[α, β, ε, η]$ and $[m_{e}, I_{e}, l_{c e}, δ_{e}]$ :

\begin{cases} m_{e} = \frac{α - β - I_{1} - m_{1} l_{c 1}^{2}}{l_{1}^{2}} \\ I_{e} = β - \frac{ε^{2} + η^{2}}{α - β - I_{1} - m_{1} l_{c 1}^{2}} \\ l_{c e} = \frac{{(ε^{2} + η^{2})}^{1 / 2} l_{1}}{α - β - I_{1} - m_{1} l_{c 1}^{2}} \\ δ_{e} = \arctan (\frac{η}{ε}) \end{cases}

(6)

Therefore, we can evaluate the parameters $[α, β, ε, η]$ instead of $[m_{e}, I_{e}, l_{c e}, δ_{e}]$ .
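The mapping in Eq.(5) and its inverse in Eq.(6) can be sketched as two small Python functions. The names `to_lumped` and `to_physical` are hypothetical, and `atan2` is used in place of the plain arctangent of Eq.(6) as an assumption to keep the angle well defined when ε is zero or negative.

```python
import math

def to_lumped(m_e, I_e, l_ce, delta_e, m1, l1, l_c1, I1):
    """Eq.(5): physical parameters -> (alpha, beta, eps, eta)."""
    beta = I_e + m_e * l_ce ** 2
    alpha = I1 + m1 * l_c1 ** 2 + beta + m_e * l1 ** 2
    eps = m_e * l1 * l_ce * math.cos(delta_e)
    eta = m_e * l1 * l_ce * math.sin(delta_e)
    return alpha, beta, eps, eta

def to_physical(alpha, beta, eps, eta, m1, l1, l_c1, I1):
    """Eq.(6): (alpha, beta, eps, eta) -> (m_e, I_e, l_ce, delta_e)."""
    k = alpha - beta - I1 - m1 * l_c1 ** 2  # equals m_e * l1**2
    m_e = k / l1 ** 2
    I_e = beta - (eps ** 2 + eta ** 2) / k
    l_ce = math.sqrt(eps ** 2 + eta ** 2) * l1 / k
    # Assumption: atan2 replaces atan(eta / eps) to avoid division by zero
    delta_e = math.atan2(eta, eps)
    return m_e, I_e, l_ce, delta_e
```

With the values of Table 1 ($m_{1} = 1$, $l_{1} = 1$, $l_{c 1} = 1/2$, $I_{1} = 1/12$, $m_{e} = 3$, $l_{c e} = 1$, $I_{e} = 2/5$, $δ_{e} = 0$), `to_lumped` gives α ≈ 6.7333, β = 3.4, ε = 3 and η = 0, which agrees with the real values listed in the results tables.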

For the τ_f term, the Stribeck model is adopted to describe the friction torque of joint i [22]:

τ_{f i} = [f_{c i} + (f_{s i} - f_{c i}) \exp (- {| {\dot{q}}_{i} / {\dot{q}}_{s i} |}^{2})] \mathrm{sgn} ({\dot{q}}_{i}) + f_{v i} {\dot{q}}_{i}

(7)

For joint i, $f_{c i}$ is the Coulomb friction torque, $f_{s i}$ is the static friction torque, $f_{v i}$ is the viscous friction coefficient, ${\dot{q}}_{s i}$ is the Stribeck velocity, ${\dot{q}}_{i}$ is the angular velocity and $sgn ({\dot{q}}_{i})$ is the sign function, which switches at zero velocity. In principle, therefore, the parameters $[f_{c i}^{+}, f_{s i}^{+}, {\dot{q}}_{s i}^{+}, f_{v i}^{+}, f_{c i}^{-}, f_{s i}^{-}, {\dot{q}}_{s i}^{-}, f_{v i}^{-}]$ for the positive and negative velocity directions should be identified separately. As the difference between corresponding parameters is small, only the four parameters $[f_{c i}, f_{s i}, {\dot{q}}_{s i}, f_{v i}]$ will be estimated for each joint. In conclusion, only 12 parameters $[α, β, ε, η, f_{c i}, f_{s i}, {\dot{q}}_{s i}, f_{v i}] (i = 1, 2)$ need to be evaluated.
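A minimal Python sketch of the friction model of Eq.(7) is given here. It assumes the squared-velocity form of the Stribeck exponent, exp(−(q̇/q̇_s)²), which is the form commonly used for this curve; the function name `stribeck_torque` and its arguments are chosen for illustration only.

```python
import math

def stribeck_torque(dq, f_c, f_s, dq_s, f_v):
    """Friction torque of one joint, following Eq.(7).

    Assumption: the exponent is -(dq / dq_s)**2.
    """
    sign = (dq > 0) - (dq < 0)  # sgn(dq), equal to 0 at standstill
    stribeck = f_c + (f_s - f_c) * math.exp(-((dq / dq_s) ** 2))
    return stribeck * sign + f_v * dq
```

Near zero velocity the returned magnitude approaches the static torque $f_{s}$, while at speeds well above the Stribeck velocity it approaches the Coulomb torque plus the viscous term.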

3. DE Algorithm

3.1. Basic differential evolution

The DE (differential evolution) algorithm is a stochastic, population-based optimization algorithm that was introduced by Storn and Price in 1995 [23]. This algorithm has similar steps to the GA (genetic algorithm), both of which are able to solve practical non-differentiable or non-linear problems, the main difference between them being the mutation strategy. The DE algorithm can adequately employ a group distribution feature to enhance the ability of finding approximate solutions [24]. The procedures of the DE algorithm are as follows:

Step 1. Initialization. Define upper and lower bounds for each parameter:

\{ x_{i} (0) \mid x_{j, i}^{L} \leq x_{j, i} (0) \leq x_{j, i}^{U}, j = 1, 2, …, D \} \quad \text{for } i = 1, 2, …, NP

(8)

The initial parameter values $x_{j, i} (0)$ are selected uniformly at random from the interval $[x_{j, i}^{L}, x_{j, i}^{U}]$ :

x_{j, i} (0) = x_{j, i}^{L} + \mathrm{rand}_{j, i} (x_{j, i}^{U} - x_{j, i}^{L})

(9)

where $NP$ is the size of the population, D is the number of real parameters and $\mathrm{rand}_{j, i} \in U [0, 1]$ . The parameter vectors have the form $x_{i} (g) = [x_{1, i} (g), x_{2, i} (g), …, x_{D, i} (g)] (i = 1, 2, …, NP)$ , where g is the generation number.

Step 2. Mutation. Randomly select three vectors $x_{r 1} (g)$ , $x_{r 2} (g)$ and $x_{r 3} (g)$ with mutually distinct indices r1, r2 and r3. Then add the weighted difference of two of the vectors to the third:

v_{i} (g + 1) = x_{r 1} (g) + F (x_{r 2} (g) - x_{r 3} (g))

(10)

The scale factor $F \in [0, 2]$ is a constant and $v_{i} (g + 1)$ is called the donor vector.

Step 3. Recombination. The trial vector $u_{j, i} (g + 1)$ is built from the elements of the target vector $x_{j, i} (g)$ and the donor vector $v_{j, i} (g + 1)$ ; elements of the donor vector enter the trial vector with probability $C_{R}$ .

u_{j, i} (g + 1) = \begin{cases} v_{j, i} (g + 1), & \mathrm{rand}_{j, i} \leq C_{R} \text{ or } j = j_{rand} \\ x_{j, i} (g), & \mathrm{rand}_{j, i} > C_{R} \text{ and } j \neq j_{rand} \end{cases}

(11)

where $\mathrm{rand}_{j, i} \in U [0, 1]$ and $j_{rand}$ is a random integer taken from $\{1, 2, …, D\}$ , which ensures that the trial vector $u_{i} (g + 1)$ takes at least one element from the donor vector $v_{i} (g + 1)$ .

Step 4. Selection. Compare the target vector $x_{i} (g)$ with the trial vector $u_{i} (g + 1)$ ; the vector with the lower function value is admitted to the next generation:

x_{i} (g + 1) = \begin{cases} u_{i} (g + 1), & f (u_{i} (g + 1)) \leq f (x_{i} (g)) \\ x_{i} (g), & \text{otherwise} \end{cases} \quad (i = 1, 2, …, NP)

(12)

Step 5. Mutation, recombination and selection continue until some stopping criterion is reached.
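The five steps above can be sketched in plain Python as follows. This is an illustrative implementation rather than the code used in the simulations: the function name `de_rand_1_bin`, the default settings and the fixed number of generations as the stopping criterion are assumptions, as is the common convention of drawing r1, r2 and r3 different from the target index i. No bound handling is applied after mutation.

```python
import random

def de_rand_1_bin(f, lower, upper, NP=20, F=0.8, CR=0.9,
                  generations=200, seed=0):
    """Minimise f over a box with DE/rand/1/bin (Steps 1 to 5)."""
    rng = random.Random(seed)
    D = len(lower)
    # Step 1: uniform random initialisation inside the bounds, Eq.(9)
    pop = [[lower[j] + rng.random() * (upper[j] - lower[j])
            for j in range(D)] for _ in range(NP)]
    cost = [f(x) for x in pop]
    # Step 5 (assumed stopping rule): a fixed number of generations
    for _ in range(generations):
        new_pop, new_cost = [], []
        for i in range(NP):
            # Step 2: mutation with three distinct indices, Eq.(10)
            # (assumption: the indices are also different from i)
            r1, r2, r3 = rng.sample([k for k in range(NP) if k != i], 3)
            donor = [pop[r1][j] + F * (pop[r2][j] - pop[r3][j])
                     for j in range(D)]
            # Step 3: binomial recombination, Eq.(11)
            j_rand = rng.randrange(D)
            trial = [donor[j] if (rng.random() <= CR or j == j_rand)
                     else pop[i][j] for j in range(D)]
            # Step 4: greedy selection, Eq.(12)
            c = f(trial)
            if c <= cost[i]:
                new_pop.append(trial)
                new_cost.append(c)
            else:
                new_pop.append(pop[i])
                new_cost.append(cost[i])
        pop, cost = new_pop, new_cost
    best = min(range(NP), key=cost.__getitem__)
    return pop[best], cost[best]
```

For example, `de_rand_1_bin(lambda x: sum(v * v for v in x), [-5.0] * 3, [5.0] * 3)` searches for the minimum of a three-dimensional sphere function inside the box [−5, 5]³.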

3.2. Advanced DE algorithm

The basic DE algorithm mutation strategy is called DE/rand/1/bin [25]. One of the advanced DE algorithms uses the mutation strategy DE/rand-to-best/1/bin:

v_{i} (g + 1) = x_{r 1} (g) + λ (x_{b e s t} (g) - x_{r 1} (g)) + F (x_{r 2} (g) - x_{r 3} (g))

(13)

where $λ \in [0, 1]$ is the combination factor and $x_{b e s t} (g)$ is the best vector in the population at generation g. Medens and Mohais [25] indicate that when F and $C_{R}$ take larger values, the population converges to a better result and at a faster rate. Drawing on the ideas of DE/rand/1/bin and DE/rand-to-best/1/bin, the RDE algorithm with a new mutation strategy is presented in this paper.
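The mutation of Eq.(13) differs from Eq.(10) only by the extra term that pulls the base vector toward the current best vector. A minimal sketch follows; the function name `rand_to_best_donor` is hypothetical.

```python
def rand_to_best_donor(x_r1, x_r2, x_r3, x_best, lam, F):
    """Donor vector of DE/rand-to-best/1, following Eq.(13)."""
    return [a + lam * (b - a) + F * (c - d)
            for a, b, c, d in zip(x_r1, x_best, x_r2, x_r3)]
```

Setting `lam = 0` recovers the DE/rand/1 donor of Eq.(10), while `lam = 1` with `F = 0` returns the best vector itself.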

4. Recursive DE Algorithm

4.1. Inspiration

Let the system model be:

A (z^{- 1}) y (t) = B (z^{- 1}) u (t)

(14)

where:

\begin{cases} A (z^{- 1}) = 1 + a_{1} z^{- 1} + a_{2} z^{- 2} + … + a_{n a} z^{- n a} \\ B (z^{- 1}) = b_{0} + b_{1} z^{- 1} + b_{2} z^{- 2} + … + b_{n b} z^{- n b} \end{cases}

(15)

where $z^{- 1}$ denotes the backward shift operator, and $u (t)$ and $y (t)$ are the system input and output, respectively. Parameters $a_{i}$ and $b_{i}$ are real coefficients, while $n a$ and $n b$ are the orders of the polynomials $A (z^{- 1})$ and $B (z^{- 1})$ . When the unknown parameters θ are assumed to appear linearly, the system (14) can be expressed as follows:

y (t) = ψ^{T} (t) θ (t)

(16)

where $θ (t)$ is a vector of unknown parameters,

θ (t) = {[- a_{1}, …, - a_{n a}, b_{0}, …, b_{n b}]}^{T}

(17)

and $ψ^{T} (t)$ is a vector that consists of the measured input and output values,

ψ^{T} (t) = [y (t - 1), …, y (t - n a), u (t), u (t - 1), …, u (t - n b)]

(18)

The RLS (recursive least squares) algorithm equations are as follows:

P_{N + 1} = P_{N} - P_{N} ψ_{N + 1} {(1 + ψ_{N + 1}^{T} P_{N} ψ_{N + 1})}^{- 1} ψ_{N + 1}^{T} P_{N}

(19)

{\hat{θ}}_{N + 1} = {\hat{θ}}_{N} + K_{N + 1} (y_{N + 1} - ψ_{N + 1}^{T} {\hat{θ}}_{N})

(20)

K_{N + 1} = P_{N} ψ_{N + 1} {(1 + ψ_{N + 1}^{T} P_{N} ψ_{N + 1})}^{- 1}

(21)

where ${\hat{θ}}_{N}$ is the estimate of θ_N; $K_{N + 1}$ is the Kalman gain vector; P_N is the inverse of the input correlation matrix, $P_{N} = {(ψ_{N}^{T} ψ_{N})}^{- 1}$ , in which ψ_N denotes the matrix of regressors collected up to moment N.

The term $K_{N + 1} (y_{N + 1} - ψ_{N + 1}^{T} {\hat{θ}}_{N})$ in Eq.(20) can be seen as a gradient vector for ${\hat{θ}}_{N}$ and is easy to calculate for a linear system. For a nonlinear system, the data should first be pretreated with a recursive maximum likelihood (RML) algorithm, which is a complicated process [26].
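To make the recursion concrete, one RLS step following Eqs.(19) to (21) can be sketched in plain Python. The function name `rls_update` and the list-based matrix layout are illustrative assumptions, and the sketch relies on P_N being symmetric, as it is when initialised with a scaled identity matrix.

```python
def rls_update(theta, P, psi, y):
    """One recursive least squares step, Eqs.(19) to (21).

    theta, psi: lists of length n; P: symmetric n x n list of lists.
    """
    n = len(theta)
    # P_N psi_{N+1}
    P_psi = [sum(P[r][c] * psi[c] for c in range(n)) for r in range(n)]
    # Scalar 1 + psi^T P_N psi
    denom = 1.0 + sum(psi[r] * P_psi[r] for r in range(n))
    # Gain vector, Eq.(21)
    K = [v / denom for v in P_psi]
    # Prediction error y_{N+1} - psi^T theta_N
    err = y - sum(psi[r] * theta[r] for r in range(n))
    # Parameter update, Eq.(20)
    theta_new = [theta[r] + K[r] * err for r in range(n)]
    # Covariance update, Eq.(19); psi^T P_N = (P_N psi)^T by symmetry
    P_new = [[P[r][c] - K[r] * P_psi[c] for c in range(n)]
             for r in range(n)]
    return theta_new, P_new
```

A typical use starts from `theta = [0.0] * n` and `P` equal to a large multiple of the identity matrix, then calls `rls_update` once per new sample.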

Following the idea of the DE/rand-to-best/1/bin mutation strategy, the approximate gradient vector of the maximum, multiplied by a regulating factor, can be used as part of a difference vector, which will replace the best vector of DE/rand-to-best/1/bin. This method can guide the entire population $x_{j, i} (t) (i = 1, 2, …, NP; j = 1, 2, …, D)$ toward the global optimum.

4.2. Deducing process

We define the system model as follows:

y_{N} = g (u_{N}, θ_{N})

(22)

where $g (u_{N}, θ_{N})$ is a nonlinear function. Supposing $J_{N + 1}$ is the objective function at time $N + 1$ :

J_{N + 1} = \frac{1}{2} \sum_{i = 1}^{N + 1} {(y_{i} - \hat{g} (u_{i}, θ_{i}))}^{2} = \frac{1}{2} \sum_{i = 1}^{N + 1} {({\hat{e}}_{i})}^{2}

(23)

where $\hat{g} (u_{i}, θ_{i})$ is the estimated value of $g (u_{i}, θ_{i})$ and ${\hat{e}}_{i}$ is the modelling error at moment i. At an extremum with respect to θ_N, the derivative of $J_{N + 1}$ with respect to θ_N is zero:

\frac{\partial J_{N + 1}}{\partial θ_{N}} = \frac{\partial J (θ_{N} + Δ θ_{N})}{\partial θ_{N}} = \frac{\partial J (θ_{N})}{\partial θ_{N}} + \frac{\partial^{2} J (θ_{N})}{\partial θ_{N}^{2}} Δ θ_{N} + O (Δ θ_{N}^{2}) = 0

(24)

Neglecting the higher-order term $O (Δ θ_{N}^{2})$ ,

Δ θ_{N} = - \frac{\partial J (θ_{N})}{\partial θ_{N}} {(\frac{\partial^{2} J (θ_{N})}{\partial θ_{N}^{2}})}^{- 1}

(25)

Using Eq.(23), calculate $\frac{\partial J (θ_{N})}{\partial θ_{N}}$ and $\frac{\partial^{2} J (θ_{N})}{\partial θ_{N}^{2}}$ :

\frac{\partial J (θ_{N})}{\partial θ_{N}} = \sum_{i = 1}^{N} e_{i} \frac{\partial e_{i}}{\partial θ_{N}}

(26)

\frac{\partial^{2} J (θ_{N})}{\partial θ_{N}^{2}} = \sum_{i = 1}^{N} [{(\frac{\partial e_{i}}{\partial θ_{N}})}^{2} + e_{i} \frac{\partial^{2} e_{i}}{\partial θ_{N}^{2}}]

(27)

When ${\hat{θ}}_{N}$ is close to θ_N, the value of e_i tends towards 0, so the term containing the second-order derivative $\frac{\partial^{2} e_{i}}{\partial θ_{N}^{2}}$ in Eq.(27) can be neglected; substituting Eq.(26) and Eq.(27) into Eq.(25) then gives:

Δ θ_{N} = - \sum_{i = 1}^{N} e_{i} \frac{\partial e_{i}}{\partial θ_{N}} / \sum_{i = 1}^{N} {(\frac{\partial e_{i}}{\partial θ_{N}})}^{2}

(28)

Then, Eq.(23) can be updated by:

J_{N + 1} = \frac{1}{2} \sum_{i = 1}^{N} {(e_{i})}^{2} + \frac{1}{2} {(e_{N + 1})}^{2} = J_{N} + \frac{1}{2} {(e_{N + 1})}^{2}

(29)

Then we update Eq.(26) as:

\frac{\partial J (θ_{N})}{\partial θ_{N}} = \frac{\partial J (θ_{N - 1})}{\partial θ_{N}} + e_{N} \frac{\partial e_{N}}{\partial θ_{N}}

(30)

At moment $N - 1$ , the minimization gives $\frac{\partial J (θ_{N - 1})}{\partial θ_{N - 1}} = 0$ . In general, the value of θ_i changes only slightly from moment $N - 1$ to N, so $\frac{\partial J (θ_{N - 1})}{\partial θ_{N}} \approx 0$ and only the newest sample contributes to the gradient. We can then obtain the approximate result for $Δ θ_{N}$ :

Δ θ_{N} = - e_{N} \frac{\partial e_{N}}{\partial θ_{N}} / {(\frac{\partial e_{N}}{\partial θ_{N}})}^{2} = - e_{N} \frac{\partial θ_{N}}{\partial e_{N}}

(31)

As θ_N is unknown, we use ${\hat{θ}}_{N - 1}$ to replace it. Then, ${\tilde{e}}_{N}$ and $Δ θ_{N}$ are updated as:

{\tilde{e}}_{N} = y_{N} - g (u_{N}, θ_{N - 1})

(32)

Δ θ_{N} = - {\tilde{e}}_{N} \frac{\partial θ_{N - 1}}{\partial {\tilde{e}}_{N}}

(33)

Approximating the derivative in Eq.(33) by a finite difference of successive estimates, with $\partial θ_{N - 1} \approx {\hat{θ}}_{N - 1} - {\hat{θ}}_{N - 2}$ , gives:

\begin{cases} {\hat{θ}}_{N} = {\hat{θ}}_{N - 1} - K_{N} (y_{N} - g (u_{N}, {\hat{θ}}_{N - 1})) \\ K_{N} = \frac{{\hat{θ}}_{N - 1} - {\hat{θ}}_{N - 2}}{y_{N} - y_{N - 1} + g (u_{N}, {\hat{θ}}_{N - 2}) - g (u_{N}, {\hat{θ}}_{N - 1})} \end{cases}

(34)

Clearly, Eq.(34) has a similar form to Eq.(10). This similarity inspired us to use an approximate gradient vector to modify the difference vectors. As the term $K_{N} (y_{N} - g (u_{N}, {\hat{θ}}_{N - 1}))$ can be regarded as the current optimal gradient vector, the mutation strategy DE/best/1/bin is the natural strategy to employ. Including the combination factor λ and the difference vector $F ({\hat{θ}}_{N - 1}^{r 2} - {\hat{θ}}_{N - 1}^{r 3})$ , the mutation strategy of the RDE algorithm can then be presented as follows:

\begin{cases} {\hat{θ}}_{N} = {\hat{θ}}_{N - 1}^{Best} - λ K_{N} (y_{N} - g (u_{N}, {\hat{θ}}_{N - 1}^{Best})) + F ({\hat{θ}}_{N - 1}^{r 2} - {\hat{θ}}_{N - 1}^{r 3}) \\ K_{N} = \frac{{\hat{θ}}_{N - 1}^{Best} - {\hat{θ}}_{N - 2}^{Best}}{y_{N} - y_{N - 1} + g (u_{N}, {\hat{θ}}_{N - 2}^{Best}) - g (u_{N}, {\hat{θ}}_{N - 1}^{Best})} \end{cases}

(35)

Experimental results show that larger values of $C_{R}$ and F help the population to converge. The value of λ should not be large; otherwise, the results may converge prematurely to a local optimum.
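The RDE mutation of Eq.(35) can be sketched for a scalar output as follows. The function name `rde_donor`, the argument names and the guard against a vanishing denominator (`tol`) are assumptions introduced for illustration; the source does not state how a zero denominator in $K_{N}$ is handled.

```python
def rde_donor(best_prev, best_prev2, theta_r2, theta_r3,
              y_now, y_prev, g, u_now, lam, F, tol=1e-12):
    """Donor vector of the RDE mutation, following Eq.(35).

    best_prev, best_prev2: best estimates at moments N-1 and N-2.
    g(u, theta): model output for input u and parameter list theta.
    """
    g_prev = g(u_now, best_prev)
    g_prev2 = g(u_now, best_prev2)
    denom = y_now - y_prev + g_prev2 - g_prev
    if abs(denom) < tol:
        # Assumption: drop the gradient-like term when K_N is undefined
        K = [0.0] * len(best_prev)
    else:
        K = [(a - b) / denom for a, b in zip(best_prev, best_prev2)]
    err = y_now - g_prev
    return [b - lam * k * err + F * (p - q)
            for b, k, p, q in zip(best_prev, K, theta_r2, theta_r3)]
```

For a scalar linear model g(u, θ) = θu with true parameter 3, previous best estimates 1 and 2, constant output and `lam = 1`, `F = 0`, the function returns 3, which illustrates how the K_N term acts as a secant-like correction.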

4.3. Comparing DE, ADE & RDE

The primary difference among DE, the advanced DE of Section 3.2 (ADE) and RDE is the mutation strategy. The RDE algorithm can renew the identified parameters whenever the database is expanded with new data, which makes it suitable for the online problem. The procedures of the three algorithms are shown in Figure 3.

Figure 3.

Comparison of DE, ADE and RDE

For the parameter identification problem, as Figure 3 shows, DE and ADE are based only on historical databases, while RDE can employ new data to repeatedly update existing results. Like DE and ADE, RDE uses off-line identification to calculate the initial parameters before new data are added to the database.

5. Simulations, Results and Analysis

5.1. Simulation condition

An appropriate driving signal is important for increasing the accuracy of the identification results. Since the input torque drives the manipulator, the expected joint angle, angular velocity and angular acceleration can be calculated from the physical parameters of the mechanical arm (Table 1) and the driving signal τ_l. In the experiment, $τ_{l 1} = 5 \sin (2 π k / 55)$ and $τ_{l 2} = 5 \cos (2 π k / 55), k = 1, 2, …, 55$ .

Table 1.

Physical parameters of the mechanical arm

Known parameters
m₁	l₁	l_c1	I₁	e₁
1 kg	1 m	1/2 m	1/12 kg·m²	−7/12

Identified parameters
m_e	l_ce	I_e	δ_e
3 kg	1 m	2/5 kg·m²	0

The results of two joints are shown in Figure 4 through Figure 6. Considering the influence of friction torque τ_f, real torque τ is not the same as the desired values τ_l. To simplify the simulation process, the undetermined two-joint parameters of τ_f take the same values that are listed in Table 2.

Table 2.

Undetermined parameters of τ_f

f_c	f_s	q̇_s	f_v
2.8	3.4	0.1	0.2

Figure 4.

Joint angle

Figure 5.

Joint angular velocity

Figure 6.

Joint angular acceleration

Some steps of the simulation experiment are carried out in the opposite order to that of the practical procedure. In practice, the values of the angle position, the angular velocity and the true input torque τ are measured, sampled and filtered in advance; the expected input torque τ_l and the friction torque τ_f can then be determined.

Let the parameters be $NP = 20$ , $D = 8$ , $F = 1.2$ and $C_{R} = 0.9$ ; let the number of iterations for each of the two off-line runs be 500, the number of iterations for each on-line run be 200 and the number of on-line recursions be 34, so that the total number of recursions is 36. The initial values of each recursion are the results of the preceding recursion. The initial parameter intervals are given in Table 3.

Table 3.

Initial parameter intervals

Parameters	α	β	ε	η	f_c	f_s	q̇_s	f_v
Lower bound	0	0	0	0	0	0	0	0
Upper bound	10	5	5	5	5	5	1	1

In order to minimize the estimated difference between real torque values and the calculated torque, we set the objective function as:

J_{N + 1} = \frac{1}{2} \sum_{j = 1}^{2} \sum_{i = 1}^{N + 1} {(τ_{i, j} - \hat{g} (u_{i, j}, θ_{i, j}))}^{2} = \frac{1}{2} \sum_{j = 1}^{2} \sum_{i = 1}^{N + 1} {({\hat{e}}_{i, j})}^{2}

(36)
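
Equation (36) is half the sum of squared torque residuals over all samples and both joints. A minimal sketch of its evaluation is given below; it assumes the model torques ĝ(u, θ) have already been computed for each sample, and the function and argument names are illustrative.

```python
def objective_j(tau_measured, tau_predicted):
    """Half the sum of squared torque residuals, as in equation (36).

    tau_measured[i][j]  : real torque of joint j at sample i
    tau_predicted[i][j] : model torque g_hat(u, theta) for the same entry
    """
    total = 0.0
    for row_measured, row_predicted in zip(tau_measured, tau_predicted):
        for tau, tau_hat in zip(row_measured, row_predicted):
            total += (tau - tau_hat) ** 2
    return 0.5 * total
```

For example, a single sample with residuals of 1 and 2 on the two joints gives J = 0.5 · (1 + 4) = 2.5.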

5.2. Results and analysis

The simulation results are presented in Table 4, and the variation of the estimation error with the number of recursions is shown in Figure 7.

Table 4.

Physical parameters' identification results

Parameters	α	β	ε	f_c	f_s	q̇_s	f_v
Real values	6.7333	3.4	3.0	2.8	3.4	0.1	0.2
Average identified value	6.4706	3.2562	2.8694	2.9580	3.3221	0.1974	0.1957
Best identified value	6.7255	3.3889	2.9918	2.8384	3.1698	0.0719	0.2053

Figure 7.

Objective function value versus the number of recursions

Initially, the estimation error is large, because the identified parameters are based only on historical data and the estimate is not yet adequate. As recent online data collected from the manipulator sensors are added to the database, RDE refines the estimate progressively; the value of the objective function, i.e., the estimation error, decreases with the number of recursions, as shown in Figure 7.

The comparison between the real input torque and the torque calculated from the identified parameters is shown in Figure 8 and Figure 9. These figures show that the early torque estimates of the two joints deviate slightly from the real torque values because of the parameter estimation error. As the parameters are estimated further, the estimation error decreases and the estimated torque tracks the updated real input torque of each joint.

Figure 8.

Real input torque and the identified torque of joint 1

Figure 9.

Real input torque and the identified torque of joint 2

Furthermore, a series of comparative experiments was conducted [22]; the parameter identification results and the average convergence times of each algorithm are compared in Table 5.

Table 5.

Comparison of identification results for τ_l

Parameters	α	β	ε	η	Average convergence times
Real values	6.7333	3.4	3.0	0	-
RDE	6.7255	3.3889	2.9918	0	5
LS	6.7370	3.4202	3.0092	0.4917	1
GA	6.08	3.0677	2.7152	0.0001	63
PSO	6.7335	3.4001	3.0001	0	8

Table 5 reveals that, for the linear part of the system, the estimation error of RDE is clearly smaller than that of LS and GA and is close to that of PSO. The entire calculation takes roughly 40 s, and each recursive generation requires approximately 1 s. The advantage of RDE can be judged from the average convergence times: about five calculation loops (each including mutation, recombination and selection) are needed for the RDE algorithm to converge to its final value in each recursive identification step. LS converges in a single step, but its estimates are less accurate than those of RDE, most visibly for η (0.4917 against a real value of 0), while the GA requires far more steps than the RDE algorithm. Although PSO requires a similar number of steps, each of its calculations is independent, which means that the current identified result has no relationship with previous results. Therefore, the PSO algorithm has two disadvantages: 1) the identification results fluctuate around the real value within upper and lower bounds, and sometimes even converge to a local optimum; 2) as the amount of measured data increases, the calculation time also grows considerably, leading to a significant computational burden.
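
The accuracy comparison can be checked directly from the numbers in Table 5. The sketch below computes the largest absolute error of each algorithm over α, β, ε and η; absolute rather than relative errors are used because the real value of η is 0. The choice of this metric and the variable names are illustrative, not taken from the paper.

```python
# Real values and identified values copied from Table 5.
REAL = {"alpha": 6.7333, "beta": 3.4, "eps": 3.0, "eta": 0.0}
ESTIMATES = {
    "RDE": {"alpha": 6.7255, "beta": 3.3889, "eps": 2.9918, "eta": 0.0},
    "LS": {"alpha": 6.7370, "beta": 3.4202, "eps": 3.0092, "eta": 0.4917},
    "GA": {"alpha": 6.08, "beta": 3.0677, "eps": 2.7152, "eta": 0.0001},
    "PSO": {"alpha": 6.7335, "beta": 3.4001, "eps": 3.0001, "eta": 0.0},
}


def max_abs_error(estimate, real=REAL):
    """Largest absolute deviation from the real value over all parameters."""
    return max(abs(estimate[name] - real[name]) for name in real)


errors = {algo: max_abs_error(est) for algo, est in ESTIMATES.items()}
ranking = sorted(errors, key=errors.get)  # most accurate first
```

By this measure the worst-case errors are about 0.0002 for PSO, 0.0111 for RDE, 0.4917 for LS and 0.6533 for GA, so RDE is close to PSO and well ahead of LS and GA.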

6. Conclusions

When a space manipulator captures an unknown target, some linear and nonlinear parameters of the manipulator change, and the target's inertia parameters must be determined as this happens. Traditional intelligent algorithms can only handle identification problems in nonlinear systems offline, while the calculation process of the recursive maximum likelihood (RML) method is extremely complex. To solve this problem, RDE, an improved DE algorithm, is presented in this paper. The algorithm is inspired by the RLS and DE algorithms and uses the global optimal vector to guide the DE algorithm towards the real value of the parameters to be identified. It is intended primarily for mixed systems that include both linear and nonlinear terms. The simulation presented in this paper demonstrates the feasibility of the RDE algorithm. The simulation results also show that RDE identifies the linear part more precisely than both the GA and the least squares algorithm, and that the method can follow the system's dynamics closely.

7. Acknowledgements

This work is sponsored by the National Natural Science Foundation of China (11272256, 61005062, 60805034) and supported by the Fundamental Research Funds for the Central Universities (3102015BJ006). The authors gratefully acknowledge the reviewers for their valuable suggestions for improving the paper.

References

1. Flores-Abad, Pham, A review of space robotics technologies for on-orbit servicing, 2014, 68: p.1–26.

2. Pedersen, Kortenkamp, Wettergreen, A survey of space robotics, Proceedings of the 7th International Symposium on Artificial Intelligence, Robotics and Automation in Space, 2003, p.19–23.

3. Oda, Kibe, Yamagata, ETS-VII, space robot in-orbit experiment satellite, Proceedings of IEEE International Conference on Robotics and Automation, 1996, 1: p.739–44.

4. Yoon W K, Goshozono, Kawabe, Model-based space robot teleoperation of ETS-VII manipulator, IEEE Transactions on Robotics and Automation, 2004, 20(3): p.602–612.

5. Yoshida, Engineering test satellite VII flight experiments for space robot dynamics and control: Theories on laboratory test beds ten years ago, now in orbit, The International Journal of Robotics Research, 2003, 22(5): p.321–335.

6. Friend R B, Orbital express program summary and mission overview, Proc. SPIE 6958, Sensors and Systems for Space Applications II, 2008.

7. Yoshida, Hashizume, Abiko, Zero reaction maneuver: Flight validation with ETS-VII space robot and extension to kinematically redundant arm, Proc. of the IEEE International Conference on Robotics and Automation, 2011, p.441–446.

8. Umetani, Yoshida, Resolved motion rate control of space manipulators with generalized Jacobian matrix, IEEE Transactions on Robotics and Automation, 1989, 5(3): p.303–314.

9. Oda, Ohkami, Coordination control of spacecraft attitude and space manipulators, Control Engineering Practice, 1997, 5(1): p.11–21.

10. Liang, Survey of modeling, planning, and ground verification of space robotic systems, Acta Astronautica, 2011, 68(11): p.1629–1649.

11. Schedlinski, Link, A survey of current inertia parameter identification methods, Mechanical Systems and Signal Processing, 2001, 15(1): p.189–211.

12. West, Papadopoulos, Dubowsky, A method for estimating the mass properties of a manipulator by measuring the reaction moments at its base, Proc. of the IEEE International Conference on Robotics and Automation, 1989, p.1510–1516.

13. Yoshida, Abiko, Inertia parameter identification for a free-flying space robot, AIAA Guidance, Navigation, and Control Conference, 2002.

14. Lei, Jin, Shi-Jie, Inertial parameter identification of unknown object captured by a space robot, Journal of Astronautics, 2012, 33(11): p.1570–1576.

15. Wang, On orbit identification of mass characteristic parameters for spacecraft, Journal of Astronautics, 2010, 31(8): p.1906–1914.

16. Dang, Pham, On-orbit identification of inertia properties of spacecraft using a robotic arm, Journal of Guidance, Control, and Dynamics, 2008, 31(6): p.1761–1771.

17. Lampariello, Hirzinger, Modeling and experimental design for the on-orbit inertial parameter identification of free-flying space robots, ASME 2005 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, 2005, p.881–890.

18. Slotine J J E, On the adaptive control of robot manipulators, The International Journal of Robotics Research, 1987, 6(3): p.49–59.

19. Narendra K S, Parthasarathy, Identification and control of dynamical systems using neural networks, IEEE Transactions on Neural Networks, 1990, 1(1): p.5–27.

20. Liu, Identification of underactuated manipulator based on genetic algorithm, 10th IEEE International Conference on Industrial Informatics (INDIN), 2012, p.653–656.

21. Huang, The interactive parameters estimation of multiple space robot manipulators, International Conference on Multisensor Fusion and Information Integration for Intelligent Systems (MFI), IEEE, 2014.

22. Liu Jinkun, Shen Xiaorong, Zhao Long, System identification theory and Matlab simulation, Publishing House of Electronics Industry, 2013.

23. Storn, Price, Differential evolution - a simple and efficient adaptive scheme for global optimization over continuous spaces, Berkeley, 1995.

24. Vesterstrom, Thomsen R, A comparative study of differential evolution, particle swarm optimization, and evolutionary algorithms on numerical benchmark problems, Proceedings of the IEEE Congress on Evolutionary Computation, 2004, 2: p.1980–1987.

25. Mendes, Mohais A S, DynDE: A differential evolution for dynamic optimization problems, Proceedings of the IEEE Congress on Evolutionary Computation, 2005, 3: p.2808–2815.

26. Schon T B, Wills, Ninness, Maximum likelihood nonlinear system estimation, The 14th International Federation of Automatic Control (IFAC) Symposium on System Identification, 2006, 14(1): p.1003–1008.