Sage Journals: Discover world-class research

Abstract

To overcome the problem of dynamics coupling between a space robot and a target satellite, this study introduces a new coordinated motion control approach with an adaptive filtering algorithm for a dual-arm free-floating space robot. Based on the reaction null space control scheme, one arm is used to complete the capture task and the other to counteract disturbance to the space base. However, when space robot captures a noncooperative target, the system may experience abrupt changes in dynamic parameters and output measurement noise, which can cause traditional control methods to achieve poor results in practical applications. Thus, an adaptive filtering algorithm with a variable forgetting factor is proposed to improve the tracking capabilities and robustness of the system. The convergence analysis is performed based on a Lyapunov function. The simulation results demonstrate the effectiveness of the proposed algorithm.

Keywords

Coordinated motion control dual-arm space robot variable forgetting factor recursive least square

Introduction

With the development of astronautic technology, space robots have been playing an important role in space exploration. Their main missions include capturing and repairing noncooperative space objects or debris and supporting astronauts in replacing or assembling components on space stations. Therefore, many countries have paid significant attention to the development of space robotic technologies. The SUMO/FREND project and the Phoenix Program¹ exemplify typical orbital applications of space robots. The main characteristics of the two projects are that the space robots have more than one manipulator, and the inertial parameters of the target spacecraft are much larger than those of the robot.

A number of investigations on the capture of satellites and space debris have been conducted. To describe the coupling relationship between a satellite base and its mounted manipulators, researchers in this field generally tend to separate the on-orbit capture missions into four phases.² The first is the observing and approaching phase where a space manipulator is controlled and moved toward the grasping location by gradually following the motion of the target. The second phase is the capture (physical contact) phase in which the end-effector of the space manipulator physically captures the target. In the third phase, the space manipulator firmly captures the target satellite and applies the control strategy to deal with the tumbling motion and dynamic uncertainties. The fourth is the compound stabilization phase in which the space robot dampens the motion of the target. In this study, we address the problems that arise in the post-capture and compound stabilization phase. The main topic presented in this study is the minimization of the disturbance to the base after capturing a large noncooperative target satellite. This task is necessary since the antennas of the servicing base must be pointed toward the Earth³; therefore, the base attitude must be maintained.

To resolve the dynamic interaction problems of free-floating robots, a well-known concept of reaction null space (RNS) control law has been widely employed. The RNS control law was originally proposed by Nenchev et al.⁴ to achieve the attitude control of a free-floating space robot. Yoshida et al.⁵ applied RNS control to stabilize the base attitude in the ETS-VII project, which proved useful. In Dimitrov and Yoshida,² a distributed momentum control strategy was proposed for capturing a tumbling satellite. The RNS motion control was employed to control the joint motion and spacecraft attitude. Recently, based on RNS control, Huang et al.⁶ planned zero-disturbance end-effector paths for a dual-arm space robot using a dynamic balance control algorithm. However, most of the approaches mentioned above have relied on the accurate dynamic parameters of the target, such as mass and moment of inertia.

In the presence of parameter uncertainties, a wide range of adaptive controllers have been developed for space robots. After capturing an unknown target, adaptive techniques were proposed in the literature^7,8 to avoid the effect of parameter uncertainties on the base attitude and achieve trajectory tracking of the end-effector. Nguyen-Huynh and Sharf^9,10 presented an adaptive reaction null space (ARNS) control algorithm to satisfy the objective of maintaining a minimum disturbance to the base without knowledge of the target dynamics. In the proposed adaptive approach, the recursive least squares (RLS) algorithm was employed to update the reactionless joint rates for parameter adaptation in an online manner. An adaptive filter was used to update the estimated parameters at each time sample. In the classical RLS algorithm, the forgetting factor is constant with values between 0 and 1. However, it is unsuitable for tracking time-varying parameters since the algorithm gain converges to zero, which leads to an exponential growth of the filter gain matrix¹¹ To resolve the conflicts, numerous variable forgetting factor RLS (VFF-RLS) algorithms have been developed.^12–14 In this study, we improve a variable forgetting factor weighted recursive least square (VFF-WRLS) algorithm for system identification and apply it to the coordinated motion control for a dual-arm space robotic system. This algorithm avoids the covariance explosion problem arising in the RLS algorithm with a constant forgetting factor. In this case, past data are gradually discarded on the assumption that more recent data are more informative.

The main contribution of the proposed algorithm can be stated as follows:

The conventional RNS motion control scheme is implemented in a dual-arm space robot in which both arms execute ARNS motion. This adaptive control scheme is developed to stabilize a noncooperative target with an unknown momentum without the use of attitude control system (ACS) devices such as thrusters or reaction wheels.

In the presence of parameter uncertainties, an improved adaptive filtering algorithm with a VFF that improves the tracking capabilities and robustness of the system is implemented.

The VFF is defined based on the prediction errors, and a basic convergence analysis is performed to make this approach more practical to the robotic system.

This article is organized as follows. In the second section, the kinematic model of a dual-arm space robot is built and the coordinated motion equation is obtained. Then, the ARNS algorithm for the dual-arm space robot is developed with a VFF-WRLS algorithm, and a convergence analysis of the algorithm is conducted. In the fifth section, a set of simulations verify the proposed methods. The conclusions are summarized in the last section.

Dual-arm space robot system

Basic assumptions

During the operation, the space robot is in a free-floating mode. We assume that:

The system is composed of rigid bodies only, and the origin of the inertial frame $Σ_{I}$ is located at the center of mass of the entire system to reduce the computational complexity.

After the target is grasped by the space robot, it is fixed to the end-effector. Thus, there is no relative motion between the end-effector and the target.

There is no initial angular momentum on the space robot. In the absence of any external forces or torques on the system, the total momentum is conserved. We do not consider any momentum exchange devices in this study such as reaction wheels or thrusters.

Kinematic modeling of a dual-arm space robot

As shown in Figures 1 and 2,¹⁵ a dual-arm space robotic system for a capture task typically consists of three major parts:¹⁶ a space base or servicing satellite, two arms mounted on the space base, and the satellite or debris to be captured. Figure 1 shows the space base and the two arms that comprise the servicing system. In this scenario, we use one arm to complete the capture mission and the other to counteract the disturbance to the base.

Figure 1.

Dual-arm space robot system.

Figure 2.

Part of a target captured by the space robot.

The principal difference between a space robot and a ground-fixed robot is that the base of a space manipulator is allowed to be uncontrolled (free-floating mode operation) in the orbital environment. Special attention must be paid to the dynamic coupling between the space base and its manipulators. The kinematic and dynamic model of a dual-arm space robot has been described in Jiao et al.^17,18 For clarification purposes, we recall the process of building the kinematic model here. The body 0 in Figure 1 represents the space base of the space robot, which is connected to two manipulators, each with three links. Manipulator joints are revolute and have a single degree of freedom. The inertial position of the link $r_{i}^{k}$ and end-effector $p_{e}^{k}$ in arm-k (k = a, b) can be written, respectively, as

\begin{array}{l} r_{i}^{k} = r_{0} + b_{0}^{k} + \sum_{j = 1}^{i - 1} (a_{j}^{k} + b_{j}^{k}) + a_{i}^{k} \end{array}

\begin{array}{l} p_{e}^{k} = r_{0} + b_{0}^{k} + \sum_{j = 1}^{n_{k}} (a_{j}^{k} + b_{j}^{k}) \end{array}

where $r_{0}$ is the position vector of the center of mass of the base satellite and $a_{i}^{k}$ and $b_{i}^{k}$ are the position vectors of the links. Then, the linear and angular velocity equations can be obtained according to equations (1) and (2)

\begin{array}{l} v_{i}^{k} = v_{0} + ω_{0} \times (r_{i}^{k} - r_{0}) + \sum_{j = 1}^{i} [z_{i}^{k} \times (r_{i}^{k} - p_{j}^{k})] {\dot{θ}}_{j}^{k} \end{array}

\begin{array}{l} v_{e}^{k} = v_{0} + ω_{0} \times (p_{e}^{k} - r_{0}) + \sum_{j = 1}^{i} [z_{i}^{k} \times (p_{e}^{k} - p_{j}^{k})] {\dot{θ}}_{j}^{k} \end{array}

\begin{array}{l} ω_{i}^{k} = ω_{0} + \sum_{j = 1}^{i} (z_{j}^{k} {\dot{θ}}_{j}^{k}) \end{array}

\begin{array}{l} ω_{e}^{k} = ω_{0} + \sum_{j = 1}^{n_{k}} (z_{j}^{k} {\dot{θ}}_{j}^{k}) \end{array}

where $v_{0}$ and $ω_{0}$ are the linear and angular velocity of the spacecraft, respectively; $z_{i}^{k}$ is the unit vector for the rotation direction of joint i; and ${\dot{θ}}_{j}^{k}$ is the joint rates of joint j in arm-k.

To decrease the fuel consumption, we eliminated the use of ACS devices such as thrusters or reaction wheels. Under Assumption (3), there are no external forces or torques acting on the space robot system; therefore, the linear and angular momentum of the system are conserved, which means $(P, L)$ remains constant. With Assumption (3), the initial momentum is zero, and we can obtain the following equation, the elements of the matrix are explicitly described in the study by Jiao et al.¹⁸

\begin{array}{l} P = m_{0} v_{0} + \sum_{i = 1}^{n_{a}} m_{i}^{a} {\dot{r}}_{i}^{a} + \sum_{i = 1}^{n_{b}} m_{i}^{b} {\dot{r}}_{i}^{b} = 0 \end{array}

\begin{array}{l} L = & I_{0} ω_{0} + r_{0} \times m_{0} {\dot{r}}_{0} + \sum_{i = 1}^{n_{a}} (I_{i}^{a} ω_{i}^{a} + r_{i}^{a} \times m_{i}^{a} {\dot{r}}_{i}^{a}) \\ ​ & + \sum_{i = 1}^{n_{b}} (I_{i}^{b} ω_{i}^{b} + r_{i}^{b} \times m_{i}^{b} {\dot{r}}_{i}^{b}) = 0 \end{array}

where $ω_{i}^{k}$ is the angular velocity of the links and $I_{0}, I_{i}^{k}$ are the inertia matrices of the spacecraft and link $B_{i}^{k}$ , respectively. Defining $L_{0}$ as the angular momentum of the system in the frame of the mass center of the base, we then can obtain

\begin{array}{l} L = L_{0} + r_{0} \times P \end{array}

According to equations (7) to (9), we have

\begin{array}{l} L_{0} = 0 \end{array}

Reformulating equations (7) and (10) in a matrix form, we obtain

[\begin{matrix} P \\ L_{0} \end{matrix}] = (\begin{matrix} M E & M {\tilde{r}}_{0 g}^{T} \\ 0 & H_{ω} \end{matrix}) [\begin{matrix} v_{0} \\ ω_{0} \end{matrix}] + [\begin{matrix} J_{T ω}^{a} \\ I_{ϕ}^{a} \end{matrix}] {\dot{Θ}}^{a} + [\begin{matrix} J_{T ω}^{b} \\ I_{ϕ}^{b} \end{matrix}] {\dot{Θ}}^{b} = 0

where ${\tilde{r}}_{0 g}^{T}$ is the position of the mass center of the entire system and $J_{T ω}^{k}$ is the Jacobian matrix of arm-k. Then, equation (11) can be represented as

\begin{array}{l} [\begin{matrix} P \\ L_{0} \end{matrix}] = H_{b} {\dot{x}}_{b} + H_{b m}^{a} {\dot{Θ}}^{a} + H_{b m}^{b} {\dot{Θ}}^{b} = 0 \end{array}

where $H_{b}$ is the global inertia matrix of the spacecraft and $H_{b m}^{k}$ represents the coupled inertia matrix of the manipulator

H_{b} = (\begin{matrix} M E & M {\tilde{r}}_{0 g} \\ 0 & H_{ω} \end{matrix}) \in R^{3 \times 3}

H_{b m}^{k} = {[\begin{matrix} J_{T ω}^{k} & I_{ω ϕ}^{k} \end{matrix}]}^{T} \in R^{3 \times n_{k}}

Then, the angular momentum conservation equation is achieved by removing the linear velocity of the space base in equation (12)

\begin{array}{l} H_{ω} ω_{0} + H_{Θ}^{a} {\dot{Θ}}^{a} + H_{Θ}^{b} {\dot{Θ}}^{b} = 0 \end{array}

Assuming that there is no attitude disturbance to the base $(ω_{0} = 0)$

\begin{array}{l} H_{Θ}^{a} {\dot{Θ}}^{a} + H_{Θ}^{b} {\dot{Θ}}^{b} = 0 \end{array}

In this task, arm-a is used to accomplish the capture task and arm-b is designed as the balance arm mainly used to compensate for the attitude disturbance owing to the motion of arm-a; thus, the mapping relationship between the two arms can be formulated as

\begin{array}{l} {\dot{Θ}}_{d}^{b} = - {(H_{Θ}^{b})}^{+} H_{Θ}^{a} {\dot{Θ}}_{d}^{a} \end{array}

where ${(H_{Θ}^{k})}^{+} = {(H_{Θ}^{k})}^{T} {(H_{Θ}^{k} {(H_{Θ}^{k})}^{T})}^{- 1}$ denotes the Moore–Penrose inverse of the inertia coupling matrix $H_{Θ}^{k}$ . Arm-a is used as the mission arm and was planned based on the requirement of the mission. Then from equations (14) and (15), the RNS joint rates of the balanced arm can be obtained as

\begin{array}{l} {\dot{Θ}}_{RNS}^{b} = - {(H_{Θ}^{b})}^{+} H_{Θ}^{a} {\dot{Θ}}_{RNS}^{a} \end{array}

The precondition of utilizing equations (13) to (16) is that the initial angular momentum is zero. For the general case when the precondition is not satisfied and the tumbling target carries an initial angular momentum $L_{t}$ , equation (14) can be rewritten as

\begin{array}{l} H_{Θ}^{a} {\dot{Θ}}_{RNS}^{a} + H_{Θ}^{b} {\dot{Θ}}_{RNS}^{b} = L_{m} + L_{t} = L \end{array}

where $L_{t}$ is the initial angular momentum of the target before capture. After the target is physically attached to the end-effector, the matrix $H_{Θ}^{a}$ is suddenly changed and starts to include the inertia term of the target. In this context, the desired RNS motion of the two arms in equation (16) can be rewritten as

{\dot{Θ}}_{RNS}^{b} = {(H_{Θ}^{b})}^{+} (L - H_{Θ}^{a} {\dot{Θ}}_{RNS}^{a})

After capture, the angular momentum of the entire system becomes $L = L_{m} + L_{t}$ , and ${\dot{Θ}}_{RNS}^{k}$ denotes the desired reactionless joint velocities of arm-k. However, because of the unknown properties of the target, we do not have accurate knowledge of matrix $H_{Θ}^{a}$ . Thus, the joint motion will be governed by equation (17) with a nonzero $ω_{0}$ ; the general solution for the joint rates can be written as

\begin{array}{l} {\dot{Θ}}^{b} = {(H_{Θ}^{b})}^{+} (L - H_{ω} ω_{0} - H_{Θ}^{a} {\dot{Θ}}^{a}) \end{array}

Substituting for ${\dot{Θ}}_{RNS}^{b}$ from equation (18) into equation (19) yields

H_{Θ}^{a} ({\dot{Θ}}_{RNS}^{a} - {\dot{Θ}}^{a}) + H_{Θ}^{b} ({\dot{Θ}}_{RNS}^{b} - {\dot{Θ}}^{b}) - H_{ω} ω_{0} = 0

It can also be described as

\begin{array}{l} [\begin{matrix} H_{Θ}^{a} & H_{Θ}^{b} \end{matrix}] ([\begin{matrix} {\dot{Θ}}_{RNS}^{a} \\ {\dot{Θ}}_{RNS}^{b} \end{matrix}] - [\begin{matrix} {\dot{Θ}}^{a} \\ {\dot{Θ}}^{b} \end{matrix}]) = H_{ω} ω_{0} \end{array}

Apparently, the expressions of ${\dot{Θ}}_{RNS}^{a}$ and ${\dot{Θ}}_{RNS}^{b}$ share the same form; thus, they can be rewritten in a more compact form as

{\dot{Θ}}_{RNS} - \dot{Θ} = H_{Θ}^{+} H_{ω} ω_{0}

where

{\dot{Θ}}_{RNS} = [\begin{matrix} {\dot{Θ}}_{RNS}^{a} \\ {\dot{Θ}}_{RNS}^{b} \end{matrix}], \dot{Θ} = [\begin{matrix} {\dot{Θ}}^{a} \\ {\dot{Θ}}^{b} \end{matrix}], H_{Θ} = [\begin{matrix} H_{Θ}^{a} \\ H_{Θ}^{b} \end{matrix}]

or, more succinctly,

{\dot{Θ}}_{RNS} - \dot{Θ} = K ω_{0}

where $K = H_{Θ}^{+} H_{ω} \in R^{6 \times 1}$ .

Thus, the regression form in equation (23) is the foundation of the ARNS control scheme for a dual-arm space robot system. From this regression form of equation (23), it can be viewed that if the joint rates $\dot{Θ}$ closely follow the desired RNS joint rates ${\dot{Θ}}_{RNS}$ , $ω_{0}$ may converge to zero, which means that zero attitude disturbance to the base is produced. In the ARNS control scheme, equation (23) is coupled with the well-known recursive VFF-WRLS algorithm for parameter adaption.

Weighted recursive adaptation algorithm with VFFs

If perfect knowledge of the system properties is available, equation (23) can be considered as an alternative to compute the RNS motion. However, to capture a noncooperative target, the mission will involve an unpredictable change in the inertia properties as well as the total momentum of the space robotic system. When the system has parameter uncertainties, one way to cope with such an issue is to develop an adaptive control algorithm to reduce the uncertainties. In the system identification context, the RLS algorithm with exponential data weighting was employed to adaptively update the joint velocities online.

The time-varying system commonly can be represented by a linear regression equation, that is, equation (23), which is the fundamental scheme of the ARNS motion for a dual-arm space robot system. In our conception of ARNS algorithm, we assumed that it was not necessary for the manipulators to follow any specific trajectory because the main objective was to minimize the disturbance to the base immediately after the capture of the tumbling target. Thus, the desired joint velocity ${\dot{Θ}}_{RNS}$ in equation (23) can be redefined as

\begin{array}{l} {\hat{\dot{Θ}}}_{RNS} - \dot{Θ} = \hat{K} ω_{0} \end{array}

where $\dot{Θ}$ and $ω_{0}$ are the exact measured joint velocities and base angular velocity from the sensors, respectively, and ${\hat{^{•}}}$ denotes the current estimates of the unknown variables from the noncooperative target.

According to the relation equations (23) and (24), one can obtain the data generating mechanism

\begin{array}{l} y (n) = K (n - 1) ω_{0} (n) \end{array}

where the time index n is introduced to describe the discrete nature of the process in a practical control system, assuming a sampling rate of $Δ t$ and $t = n Δ t$ . $y (n)$ is the reference output error vector, which is a sequence of independent random variables with zero mean values (white noise).

Then, it is natural to define the a priori estimation error as

\begin{array}{l} e (n) = y (n) - \hat{K} (n - 1) ω_{0} (n) \end{array}

$e (n)$ is the sequence of independent Gaussian variables with $E [e (n)] = 0$ . $\hat{K} (n - 1)$ is the estimation of the regression coefficient $K (n - 1)$ in equation (25). The prediction of the reference output vector $y (n)$ , based on the input measurement vector $ω_{0} (n)$ and the previous estimation of the filter parameter $\hat{K} (n - 1)$ , is given by

\begin{array}{l} \hat{y} (n) = \hat{K} (n - 1) ω_{0} (n) \end{array}

Weighted RLS algorithm with exponential forgetting factor

It is more common in the adaptive filtering scenario expressed as a system parameter identification problem in Figure 3 to employ a weighted regularized least-squares cost function. Here $ω (n)$ is the input signal that passes through the system $K (n)$ and resulting in the sensor output signal $y (n)$ . The noise $δ (n)$ is also picked up by the sensors. $d (n)$ is the desired filter response and $e (n)$ is the prediction error or residual. The uncertain system parameter $K (n)$ is estimated at each time step by minimizing the criterion function J. More specifically, greater weighting was selectively applied to those data points deemed to be more informative based on some criteria. Here we consider the case where the most recent data were assumed to be more informative than past data, and hence we exponentially discarded old data, leading to the exponentially weighted cost function

min J (n) = \frac{1}{n} \sum_{i = 0}^{n} β (n, i) e^{2} (i)

where $β (n, i)$ is the profile of the forgetting variable with typical increase in i for a given n. The prediction error $e (n)$ is defined by equation (26).

Figure 3.

System identification problem.

The corresponding mean of the criterion function J is readily seen to be a mathematical expectation, that is, the criterion function equation (26) reduces to

J (n) = E [β (n, i) e^{2} (i)]

A recursive form for the weight factor $β (n, i)$ was proposed in the study by Kovacevic et al.¹⁹ as

β (n, i) = λ (n) β (n - 1, i)

According to equation (30), one further concludes that

\begin{array}{l} β (n, i) = [\prod_{j = i + 1}^{n} λ (j)] α (i), α (i) = β (i, i) \end{array}

The typical value of $λ (j)$ is bounded by $0 < λ (j) \leq 1$ , and in particular, if $λ (j)$ is constant, that is, $λ (j) = λ$ , one obtains

\begin{array}{l} β (n, i) = λ^{n - i} α (i) \end{array}

Basically, the cost function in equation (28) represents the mean of squares of the prediction errors $e (n)$ . The purpose of this algorithm is to emphasize artificially the effect of current data by exponentially weighting past data values, because while a space robot is grasping a target, a huge system impact may occur that will cause large deviations in motion and measurement errors in the sensors. Since the sequence $λ (j)$ has the effect of attaching a smaller weight to the terms in which $y (n)$ is expected to have larger errors, the algorithm is frequently used in time-varying systems. Another use of the sequence $λ (j)$ is to discard initial data in nonlinear estimation problems. The erroneous initial data may deteriorate the performance of the algorithm and must be discarded once the algorithm begins.

According to equations (26) and (27), $J (n)$ is a quadratic function of the argument $\hat{K} (n)$ . Note that according to the prediction error of equation (26), the parameters are slowly changing, that is, $\hat{K} (n) \approx \hat{K} (n - 1)$ . Its minimum is obtained by solving the equation

\begin{array}{l} \frac{\partial J (n)}{\partial \hat{K} (n)} & = 2 n^{- 1} \sum_{i = 1}^{n} β (n, i) [y (i) - ω_{0} {(i)}^{T} \hat{K} (i)] \frac{\partial ω_{0} {(i)}^{T} \hat{K} (i)}{\partial \hat{K} (i)} \\ ​ & = 0 \end{array}

Solving the obtained algebraic equation over $\hat{K} (n)$ , one obtains

\hat{K} (n) = \frac{\sum_{i = 1}^{n} β (n, i) ω_{0} {(i)}^{T} y (i)}{\sum_{i = 1}^{n} β (n, i) ω_{0} {(i)}^{T} ω_{0} (i)}

Equation (34) is the non-recursive form of the WRLS algorithm. According to the relation equation (30), a possible recursive version can be obtained as

\begin{array}{l} R (n) = \sum_{i = 1}^{n} β (n, i) ω_{0} {(i)}^{T} ω_{0} (i) \\ = \sum_{i = 1}^{n - 1} β (n - 1, i) ω_{0} {(i)}^{T} ω_{0} (i) + β (n, n) ω_{0} {(n)}^{T} ω_{0} (n) \\ = λ (n) R (n - 1) + α (n) ω_{0} {(n)}^{T} ω_{0} (n) \end{array}

The matrix $R (n)$ represents the correlation matrix of the input signal $ω_{0} (n)$ . $R (n)$ also represents the estimation of the Hessian $\nabla^{2} J (n)$ at the moment n.

After applying the lemma on matrix inversion²⁰ in equation (35), one obtains

\begin{array}{l} P (n) = R^{- 1} (n) \\ = \frac{1}{λ (n)} [P (n - 1) - \frac{P (n - 1) ω_{0} (n) ω_{0} {(n)}^{T} P (n - 1)}{λ (n) / α (n) + ω_{0} {(n)}^{T} P (n - 1) ω_{0} (n)}] \end{array}

$P (n)$ , initialized by $P (- 1) = σ^{2} E$ for $σ^{2} ≫ 1$ , is the inverse of the correlation matrix $R (n)$ . In this manner, the solution $\hat{K} (n)$ of the exponentially weighted RLS problem (equation (28)) can be computed recursively as

\begin{array}{l} \hat{K} (n) = \hat{K} (n - 1) + G (n) e (n) \end{array}

G (n) = \frac{P (n - 1) ω_{0} (n)}{λ (n) / α (n) + ω_{0} {(n)}^{T} P (n - 1) ω_{0} (n)}

where $G (n)$ is the Kalman filtering gain vector. The matrix $P (n)$ has the meaning of an error covariance matrix of the estimated parameters^.19 The variable $α (n)$ is the convergence factor that controls the convergence speed of the algorithm. Thus, the role of matrix P is to accelerate the convergence of the adaptive algorithm, and the price to pay is an increase of computational complexity. If the value $P (n - 1)$ in equation (38) is replaced by a unit matrix E, one can obtain the well-known least mean square algorithm, which has worse convergence properties but a lower complexity.¹⁹

Choice of forgetting factor on prediction error

The forgetting factor $λ (n)$ suggests that it represents the measure by considering the previous measurements in the estimation process. In other words, the choice of the values for the forgetting factor determines how quickly one neglects the influence of the previous measurements, which is one form of data windowing, whereby the effective length of the window is $\approx 1 / (1 - λ)$ samples. Typical values for λ are between 0.9 and 0.99, which corresponds to the effective length of a window between 10 and 100 samples, respectively.

In the articles,^9,10 a constant forgetting factor λ was employed, but problems can occur in an adaptive control situation. Since the matrix $\hat{K} (n)$ changes with time, matrix $P (n)$ may become excessively large or approach zero. $P (n)$ needs to be reset to its initial value whenever it surpasses the preset thresholds; otherwise, the parameter estimator will become unstable. When the algorithm gain $P (n)$ approaches zero, the corresponding data point will not be included in the parameter update and the estimator switches off. Then, the sequential least-squares algorithm might become unsuitable for time-varying problems.

To overcome this problem, the fixed exponential forgetting factor $β (n, i) = λ^{n - i}$ should be replaced by some weight function (forgetting factor variable in time) that represents an increasing function of the argument i $(1 \leq i \leq n)$ for a given moment of time t.

The idea of exponential data weighting with VFF was employed. It has been shown in the study by Goodwin and Sin²¹ that a good choice for $λ (n)$ in such cases is

\begin{array}{l} λ (n) = 1 - λ_{0} \frac{e {(n)}^{2}}{{\bar{e}}^{2}} \end{array}

where $e {(n)}^{2}$ is the current prediction error, ${\bar{e}}^{2}$ is the mean value of $e {(n)}^{2}$ over a certain period, and $λ_{0}$ is a small constant to satisfy the desired estimation quality in the stationary operation mode. Based on the knowledge of Kalman filtering theory,²¹ it can be shown that ${\bar{e}}^{2}$ is proportional to $1 + ω_{0} {(n)}^{T} P (n - 1) ω_{0} (n)$ , and hence equation (39) can be rewritten as

\begin{array}{l} λ (n) = 1 - λ_{0} \frac{e {(n)}^{2}}{1 + ω_{0} {(n)}^{T} P (n - 1) ω_{0} (n)} \end{array}

Unlike traditional VFF schemes, the improved VFF-WRLS algorithm is built on prediction errors. Equation (40) represents a normalized error because the term $1 + ω_{0} {(n)}^{T} P (n - 1) ω_{0} (n)$ represents an estimation of the error variance $e (n)$ . The effect of the algorithm (equations (39) and (40)) can be explained as follows. When the space robot captures the target satellite, a sudden impact in the control system occurs and $e {(n)}^{2}$ increases; this reduces $λ (n)$ temporarily but increases $P (n)$ quickly, and rapid adaptation occurs. After adaptation, $e {(n)}^{2}$ decreases and $λ (n)$ returns to a value near 1 and the cycle repeats.

In the typical RLS algorithm with a fixed exponential forgetting factor, $α (n)$ is a constant scalar variable, that is, $α (n) = 1$ . Since $α (n)$ controls the convergence speed of the gradient method in such a case, we also redefine the series of $α (n)$ as

α (n) = 1 - λ (n) = λ_{0} \frac{e {(n)}^{2}}{1 + ω_{0} {(n)}^{T} P (n - 1) ω_{0} (n)}

Hence, the new VFF-WRLS algorithm can be obtained by equations (24) to (40). The new VFF-WRLS has a higher numerical stability and faster convergence than the conventional RLS algorithm in the study by Nguyen-Huynh and Sharf.^9,10

Once $\hat{K} (n)$ is calculated, the desired ARNS motion of the arms can be obtained as

\begin{array}{l} {\hat{\dot{Θ}}}_{RNS} (n) = \dot{Θ} (n) + \hat{K} (n) ω_{0} (n) \end{array}

The proposed ARNS scheme with VFF requires measurements of the base angular velocity $ω_{0} (n)$ and the current joint rates $\dot{Θ} (n)$ . To compute the VFF in equation (40), the current prediction error $e (n)$ is also required. In this way, the parameter matrix $\hat{K} (n)$ and the ARNS joint rates ${\dot{Θ}}_{RNS} (n)$ are updated.

It is emphasized that the principal objective of the proposed control algorithm is to maintain the attitude of the space base, that is, $ω_{0} (n) \approx 0$ after capturing a noncooperative target. Since the total prediction error is minimized using the VFF-WRLS approach and the VFF plays a significant role during the process of converging, this control objective can be satisfied even though there is a mismatch between the estimated matrix $\hat{K} (n)$ and its true value $K (n)$ .

The process of the complete algorithm is shown in Figure 4, and the following primary steps are obtained below, assuming a sampling rate of $Δ t$ .

Step 1. Initialize the system based on pre-capture parameters, ${\dot{Θ}}_{RNS} (0) = H_{Θ}^{+} L_{m}, n = 0$ .

Step 2. Measure $Θ (n)$ , $\dot{Θ} (n)$ , and $ω_{0} (n)$ .

Step 3. Compute $λ (n)$ from equation (40) and $\hat{K} (n)$ from equations (36) to (38).

Step 4. Using $\hat{K} (n)$ , update the desired motion from equation (42)

Step 5. $n = n + 1$ , return to step 2 until finished.

Figure 4.

ARNS control scheme with VFF-WRLS algorithm. ARNS: adaptive reaction null space; VFF-WRLS: variable forgetting factor weighted recursive least square.

As illustrated in Figure 4, the control architecture lies on two loops: the inner loop with the proportional–derivative (PD) joint velocity-based controller and the outer loop with the improved VFF-WRLS algorithm. The inner loop controller can drive the joint velocity error $y = {\hat{\dot{Θ}}}_{RNS} - \dot{Θ}$ to zero where the joint torques are computed as per the PD control law with constant gains; the outer loop can update the ARNS motion for the two arms with the online parameter adaptation algorithm (VFF-WRLS).

Convergence analysis

Following the proposed approach to deal with a time-varying system, the convergence properties are discussed in this section. A nonnegative Lyapunov function $V (n)$ is defined as

\begin{array}{l} V (n) = \frac{\tilde{K} {(n)}^{T} P {(n)}^{- 1} \tilde{K} (n)}{\prod_{i = 1}^{n} λ (i)} \end{array}

Using equations (25), (26), (37), and (38), we have

\begin{array}{l} \tilde{K} (n) & = \hat{K} (n) - K (n) \\ ​ & = (E - G (n) ω_{0} {(n)}^{T}) \tilde{K} (n - 1) \end{array}

From equations (36) and (38), we obtain

\begin{array}{l} P {(n)}^{- 1} \tilde{K} (n) = λ (n) P {(n - 1)}^{- 1} \tilde{K} (n - 1) \end{array}

The difference of $V (n)$ is obtained by

\begin{array}{l} Δ V = V (n) - V (n - 1) \\ = \frac{\tilde{K} {(n)}^{T} P {(n)}^{- 1} \tilde{K} (n)}{\prod_{i = 1}^{n} λ (i)} \\ - \frac{\tilde{K} {(n - 1)}^{T} P {(n - 1)}^{- 1} \tilde{K} (n - 1)}{\prod_{i = 1}^{n - 1} λ (i)} \end{array}

Inserting equations (44) and (45) into equation (46) gives

\begin{array}{l} Δ V = V (n) - V (n - 1) \\ = \frac{\tilde{K} {(n)}^{T} P {(n)}^{- 1} \tilde{K} (n)}{\prod_{i = 1}^{T} λ (i)} \\ - \frac{\tilde{K} {(n - 1)}^{T} P {(n - 1)}^{- 1} \tilde{K} (n - 1)}{\prod_{i = 1}^{n - 1} λ (i)} \\ = \frac{{[\tilde{K} (n) - (\tilde{K} (n - 1))]}^{T} P {(n - 1)}^{- 1} \tilde{K} (n - 1)}{\prod_{i = 1}^{n - 1} λ (i)} \\ = - \frac{1}{\prod_{i = 1}^{n - 1} λ (i)} [\frac{\tilde{K} {(n - 1)}^{T} ω_{0} (n) ω_{0} {(n)}^{T} \tilde{K} (n - 1)}{λ (n) + ω_{0} {(n)}^{T} P (n - 1) ω_{0} (n)}] \\ = - \frac{e {(n)}^{2}}{\prod_{i = 1}^{n} λ (i) + \prod_{i = 1}^{n - 1} λ (i) ω_{0} {(n)}^{T} P (n - 1) ω_{0} (n)} \\ \leq - \frac{e {(n)}^{2}}{\prod_{i = 1}^{n} λ (i) + \prod_{i = 1}^{n - 1} λ (i) σ_{max} (P (n)) ω_{0} {(n)}^{T} ω_{0} (n)} \\ \leq 0 \end{array}

Recall that $0 < λ (n) \leq 1$ and $P (- 1) = σ^{2} E$ for $σ^{2} ≫ 1$ is a positive definite matrix and $σ_{max} (P (n))$ is the maximum eigenvalue of the matrix $P (n)$ which is bounded, that is

0 < σ_{max} (P (n)) \leq B

It is clear that $V (n)$ is a nonnegative, nonincreasing function, and hence it converges. Thus, we have

lim_{t \to \infty} \frac{e {(n)}^{2}}{\prod_{i = 1}^{n} λ (i) + \prod_{i = 1}^{n - 1} λ (i) σ_{max} (P (n)) ω_{0} {(n)}^{T} ω_{0} (n)} = 0

It is clear from the equations above that $\tilde{K} (n)$ will converge to zero. This is a desirable property and tends to improve the robustness of the algorithm.

Simulation study

In this section, the proposed control algorithms are evaluated and compared using a planar dual-arm space robot. The free-floating space robot includes three components, two of which are three-link manipulators, while the third is the space base. The target is assumed to be firmly held by the end-effector of the mission arm with no relative motion as illustrated in Figure 5.

Figure 5.

Dual-arm space robot with captured target.

The dynamic models of the system were created primarily in MATLAB/SimMechanics with S-functions. The geometric and dynamic parameters of the space robot and the target are presented in Tables 1 and 2. The desired motion generated by ARNS was produced by driving the joints with torques computed using the PD control law with constant gains for the entire motion. The parameters of the PD controller are presented in Table 3.

Table 1.

Parameters for dual-arm space robot.

Symbol	Mass (kg)	Length (m)	Inertia
		$a_{i}^{k}, b_{i}^{k}$	$I_{i}^{k}$
$B_{0}$	44	0, 0.3	44
$B_{1}^{a}$	2.5	0.2, 0.2	0.21
$B_{2}^{a}$	2.5	0.2, 0.2	0.21
$B_{3}^{a}$	2.5	0.2, 0.2	0.21
$B_{1}^{b}$	2.5	0.2, 0.2	0.21
$B_{2}^{b}$	2.5	0.2, 0.2	0.21
$B_{3}^{b}$	2.5	0.2, 0.2	0.21

Table 2.

Initial joint state for dual-arm space robot.

Joint	$J_{1}^{a}$	$J_{2}^{a}$	$J_{3}^{a}$	$J_{1}^{b}$	$J_{2}^{b}$	$J_{3}^{b}$
$Θ_{0}$	45	−10	−20	−20	45	−60

Table 3.

PD controller parameters for dual-arm space robot.

Gains	$J_{1}^{a}$	$J_{2}^{a}$	$J_{3}^{a}$	$J_{1}^{b}$	$J_{2}^{b}$	$J_{3}^{b}$
$K_{p}$	300	200	150	300	200	150
$K_{d}$	20	15	10	20	15	10

PD: proportional–derivative.

To demonstrate the capability of the proposed algorithm, the noncooperative target was much larger than the space robot. The mass and inertia were assumed to be almost three times as large as that of the servicer (refer to Table 4).

Table 4.

Parameters for noncooperative target.

Mass (kg)	Length (m)	Inertia (kg·m²)
$m_{t}$	$b_{t}$	$I_{t}$
120	0.3	120

The space robot initially has no momentum before capture, and the target is tumbling with an initial angular velocity. The initial adaptation gain matrix was chosen as $P (- 1) = 500$ . The initial forgetting factor for the RLS algorithm was defined as $λ (0) = 0.99$ and was reset when $λ (n) \leq 0.1$ or $λ (n) \geq 0.9999$ . The step size was $Δ t = 0.005 s$ and the actual capture occurred at $t = 0 s$ .

The proposed algorithm was tested for two basic cases: $L_{t} = 0$ and $L_{t} \neq 0$ . The first case verified the accuracy of the simulation platform by examining the momentum conservation of the system. This case also demonstrated the adaptation of ARNS algorithm in producing the RNS motion when the inertia parameters of the space manipulator were modified as a result of target capture. For the case $L_{t} \neq 0$ , the simulation of the post-capture of an unknown tumbling target aimed to verify the capability of the proposed scheme to minimize the disturbance to the base.

Case A: Test case with $L_{t} = 0$

Considering the “noise” from inaccurate modeling and computer round-off errors, before the implementation of the algorithm, we first checked the accuracy of the simulation platform by examining the momentum conservation of the system during the ARNS motion. In this case, the initial momentum of the target was $L_{t} = 0$ ; the space robot was initially at rest and the manipulators were commanded with joint motions as per equation (42). The convergence speed factor α was typically set as 1. Figure 6 shows that the angular momentum distribution as the total angular momentum was zero. It is apparent that the total momentum of the system was conserved. Figure 7 shows the attitude change of the base. The attitude disturbance remained on the scale of $10^{- 3}$ degrees after 100 s. A base-10 log scale was used for the x-axis (t = 100 s) to show the rate of change in some figures. In Figure 8, the minimum base disturbance was achieved by the ARNS motion, while one arm held the tumbling target. When the end-effector contacted the target, a significant rate disturbance (as high as 0.017° s⁻¹) was produced by the impact between the manipulator and the target. However, the perturbation was quickly damped out by the ARNS motion. The results of the ARNS joint rates are shown in Figure 9. Figure 10 displays the error profiles of the PD velocity-based controller, from which one can observe that the joint rate error profiles were consistent with the base angular velocity profile in Figure 9. The result in Figure 11 shows that the prediction errors or residual of matrix $\hat{K} (n)$ converges to 0. This is a desired property and tends to improve the robustness of the control system. The obtained results showed the accuracy of the simulation platform and the effectiveness of the proposed ARNS motion algorithm.

Figure 6.
Angular momentum of system.

Figure 7.
Base attitude $L_{t} = 0$ .

Figure 8.
Base angular velocity $L_{t} = 0$ .

Figure 9.
Joint rates $L_{t} = 0$ .

Figure 10.
Joint rate errors $L_{t} = 0$ .

Figure 11.
Prediction errors $e (n)$ , $L_{t} = 0$ .

Case B: Test case with $L_{t} \neq 0$

In this subsection, the improved VFF-WRLS algorithm tested by simulating the capture of a tumbling target is described. As in the previous simulations, we assumed that the target was attached rigidly to the end-effector after capture and its tumbling motion was emulated with an external impulse torque of 240 N·m applied to the target for a duration of 0.005 s. The initial forgetting factor for the RLS algorithm was $λ (0) = 0.99$ . In this simulation, we utilized the factor $α (n) = 1 - λ (n)$ to accelerate the convergence of the system. The actual capture occurred at $t = 0 s$ . The corresponding results for the VFF-WRLS algorithm are shown in Figures 12 to 17.

Figure 12.
Joint rates $L_{t} \neq 0$ .

Figure 13.
Base attitude $L_{t} \neq 0$ .

Figure 14.
Base angular velocity, $L_{t} \neq 0$ .

Figure 15.
Joint rate errors $L_{t} \neq 0$ .

Figure 16.
Prediction error $e (n)$ , $L_{t} \neq 0$ .

Figure 17.
Variable forgetting factor $L_{t} \neq 0$ .

Figure 12 shows the joint rates, from which one can observe that the arms were initialized at the instant of capture with rates ${\dot{Θ}}_{RNS}$ computed from equation (42), and the ARNS adaptively updated the reactionless motion. The base attitude and base angular velocity are shown in Figures 13 and 14, respectively. In Figure 13, the base attitude remained on the scale of $O (10^{- 3})$ . As shown in Figure 14, the capture created an initial angular disturbance on the base. This disturbance, however, was successfully reduced with the ARNS motion to the level of $O (10^{- 5})$ within 2 s (400 iterations) after capture. The joint rate error profiles of the velocity-based PD controller shown in Figure 15 match the base angular velocity profile in Figure 13. Figure 16 shows the prediction errors or residual of matrix $\hat{K} (n)$ . Further details about the convergence properties are presented in the next subsection. Figure 17 shows the forgetting factor curves. When the space robot captures the target, the algorithm obtains a small forgetting factor to discard the estimation because of the large error. These initial data may deteriorate the performance of the algorithm unless they are discarded once the algorithm operation begins. Later, it increases the forgetting factor to attach more recent data to the parameter adaptation problems. Finally, the forgetting factor converges to 1, and the algorithm deteriorates to the well-known standard least-squares algorithm. In this way, the adaptive forgetting factor speeds up the convergence process substantially.

Case C: Convergence analysis

To evaluate the performance of the proposed VFF-WRLS algorithm, we compared the classical RLS algorithm with different constant values of forgetting factor with the improved VFF-WRLS algorithm. The forgetting factors were set with typical values of $λ = 0.9$ and $λ = 0.99$ , respectively. Other initial parameters were the same as those of the previous simulation in case B. The results are presented in Figures 18(a) and 19(b), and the relevant results of the improved VFF-WRLS are shown in Figures 13 and 16. According to these results, all the algorithms were able to achieve the ARNS motion regardless the value of the forgetting factor. However, problems can occur with a constant value of λ. In this situation, this can result in a “burst” phenomenon in parameter estimates that occur at around t = 50 s in Figure 18(a) and t = 80 s in Figure 18(b). The “blips” occurring in these figures reflect the non-robustness of the algorithm. This can be explained as follows.

Figure 18.
Prediction errors $e (n)$ with RLS, constant forgetting factor: (a) $λ = 0.9$ and (b) $λ = 0.99$ . RLS: recursive least squares.

Initially, with poor parameter estimates (within the first 0.1 s), the resulting feedback leads to bad regulation, and hence the data are rich in information. Then, as the estimates converge, the system under feedback tends to settle down, but simultaneously the estimation covariance matrix $P (n)$ begins to grow because of the loss of persistent excitation. After some time, the parameter estimator can become unstable because $P (n)$ appears as a gain in the algorithm. This can give rise to poor estimates, and the resulting feedback controller will begin to perform poorly. From Figures 13, 19(a) and (b), in terms of the disturbance to the base, the proposed VFF-WRLS outperformed the other algorithms. From Figure 19(a) and (b), one can observe that the ARNS motion with a constant forgetting factor produced a significant rate disturbance (as high as 0.021° s⁻¹) on the base, and the time to converge was much longer.

Figure 19.
Disturbance to base with RLS, constant forgetting factor: (a). $λ = 0.9$ and (b) $λ = 0.99$ . RLS: recursive least squares.

The obtained experimental results based on the simulations pointed out that the use of a VFF leads to a better adaptability of the filtering parameter estimation in comparison to the conventional algorithm with a fixed forgetting factor. With the correct choice of forgetting factor, the improved VFF-WRLS algorithm achieved a faster convergence and a greater robustness.

Conclusion

This study presented a practical implementation of a weighted RLS algorithm with a VFF for a dual-arm space robot capturing task. In the course of on-orbit servicing, the tumbling target was assumed to be much larger than the space robot, which meant that the uncertainties of the inertia properties of the target would degrade the control performance and the compound stabilization. To address this problem, the ARNS algorithm was extended to a dual-arm space robot and was enhanced by incorporating the VFF-WRLS technique. The novelty of the proposed VFF-WRLS algorithm lies in the time-varying function of determining the forgetting factor as well as relating the forgetting factor to the prediction error in the estimated parameters. The convergence properties of this algorithm were analyzed. Simulation results revealed the good performance of the proposed algorithm for both maintaining a minimum disturbance to the base and guaranteeing the convergence of tracking errors. We conclude that the proposed methods are applicable to a dual-arm space robot supplying on-orbit services.

Based on the proposed methods, we provide here a brief account of some implementation considerations in adaptive control design to raise some issues relevant to practical applications.
Adaptive control is a useful way of approaching control problems, but in practice, one must keep in mind the practical realities of the problem under study and include as much physical insight as possible. One of the first decisions that must be made in digital control is the sampling speed. This varies greatly depending on the application and hardware (from milliseconds to minutes). Roughly, the sampling period should depend on the computation speed, system time delay, choice of control law, and other factors.

To apply the algorithm into practice, we must investigate the performance of our prototype estimation scheme in the presence of bounded noise, such as measurement noise, inaccurate modeling, and computer round-off errors. The ARNS algorithm was analyzed under somewhat idealized assumptions. However, in practice, one must be aware of the degree of robustness of the algorithms to conditions that do not strictly comply with the assumptions.

The computation cost should be analyzed for the adaptive filtering algorithm because the high number of estimated states leads to a larger number of calculations that increase the time delay of the system and possibly deteriorate the overall performances. In particular, for a space robotic system, the performance of a microcomputer is worse than that of a normal computer on the ground.

To ensure security of the operation, self-collision detection algorithm should be considered for multi-arm robotics. Space robot is a kind of complex multi-body system, which may cause the collision between the links, the spacecraft module and external objections while in the on-orbit tasks. A challenge of real-time self collision detection is its computation cost, since computation resource for a space robot is limited. Servo loop, sensor-based motion generator, and collision checker need to share the resource.

Footnotes

Nomenclature

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (61673239, 61703228) and Science and Technology Project of Shenzhen (JCYJ20160301100921349, JCYJ20170817152701660).

ORCID iD

Chunting Jiao

References

Debus

Dougherty

. Overview and performance of the front-end robotics enabling near-term demonstration (FREND) Robotic Arm. In: AIAA Infotech@Aerospace Conference, Seattle, Washington, 6–9 April 2009.

Dimitrov

Yoshida

. Momentum distribution in a space manipulator for facilitating the post-impact control. In: 2004 IEEE/RSJ international conference on intelligent robots and systems (IROS), Vol. 4, Sendai, Japan, 28 September–2 October 2004, pp. 3345–3350. Washington, DC, USA: IEEE. DOI: 10.1109/IROS.2004.1389933.

Flores-Abad

Pham

, et al. A review of space robotics technologies for on-orbit servicing. Prog Aerosp Sci 2014; 68: 1–26.

Nenchev

Yoshida

Vichitkulsawat

, et al. Reaction null-space control of flexible structure mounted manipulator systems. IEEE Trans Robot Autom 1999; 15(6): 1011–1023.

Yoshida

Hashizume

Abiko

. Zero reaction maneuver: flight validation with ETS-VII space robot and extension to kinematically redundant arm. In: IEEE international conference on robotics and automation, Vol. 1. Seoul, South Korea, 21–26 May 2001, pp. 441–446. Washington, DC, USA: IEEE.

Huang

Liang

. Balance control of multi-arm free-floating space robots during capture operation. Int J Adv Robot Syst 2005; 2(2): 398–403.

Wang

Luo

Yuan

, et al. Coordinated trajectory planning of dual-arm space robot using constrained particle swarm optimization. Acta Astronaut 2018; 146: 259–272. http://www.sciencedirect.com/science/article/pii/S0094576516313923.

Stolfi

Gasbarri

Sabatini

. A combined impedance-PD approach for controlling a dual-arm space manipulator in the capture of a non-cooperative target. Acta Astronaut 2017; 139: 243–253.

Nguyen-Huynh

Sharf

. Adaptive reactionless motion for space manipulator when capturing an unknown tumbling target. In: IEEE international conference on robotics and automation, Shanghai, China, 9–13 May 2011, pp. 4202–4207. Washington, DC, USA: IEEE.

10.

Nguyen-Huynh

Sharf

. Adaptive ReactionLess motion with joint limit avoidance for robotic capture of unknown target in space. In: IEEE/RSJ international conference on intelligent robots and systems, Vilamoura, Portugal, 7–12 October 2012, pp. 1155–1160. Washington, DC, USA: IEEE. https://ieeexplore.ieee.org/document/6385861/.

11.

Lakhdari

Mokhtari

Léluse

, et al. Adaptive predictive control of a class of nonlinear systems: a case study. In: Fifth IFAC symposium on adaptive systems in control and signal processing 1995, Vol. 28, Budapest, Hungary, 14–16 June 1995, pp. 209–214. http://www.sciencedirect.com/science/article/pii/S1474667017453511.

12.

Bhotto

MZA

Antoniou

. New improved recursive least-squares adaptive-filtering algorithms. IEEE Trans Circuits Syst I 2013; 60(6): 1548–1558.

13.

Paleologu

Benesty

Ciochina

. A robust variable forgetting factor recursive least-squares algorithm for system identification. IEEE Signal Process Lett 2008; 15: 597–600.

14.

Paleologu

Benesty

Ciochină

. A practical variable forgetting factor recursive least-squares algorithm. In: 2014 11th international symposium on electronics and telecommunications (ISETC), Timisoara, Romania, 14–15 November 2014, pp. 1–4. DOI: 10.1109/ISETC.2014.7010812.

15.

Zhang

Liang

Wang

, et al. Manipulability measure of dual-arm space robot and its application to design an optimal configuration. Acta Astronaut 2016; 128: 322–329.

16.

Huang

Zhang

. Dynamic modelling and coordinated controller designing for the manoeuvrable tether-net space robot system. Multibody Syst Dyn 2016; 36(2): 115–141.

17.

Jiao

Liang

Wang

. Adaptive reaction null-space control of dual-arm space robot for post-capture of non-cooperative target. In: 29th Chinese control and decision conference (CCDC), Chongqing, China, 28–30 May 2017, pp. 531–537. Washington, DC, USA: IEEE.

18.

Jiao

Liang

Wang

, et al. Coordinated motion control of a dual-arm space robot with joint-limit avoidance and uncertain inertial parameters. Int J Adv Robot Syst 2017; 14(5): 1–14.

19.

Kovacevic

Banjac

Milosavljević

. Adaptive digital filters. Berlin, Heidelberg: Springer, 2014.

20.

Zhang

. Matrix analysis and applications. Cambridge: Cambridge University Press, 2017.

21.

Goodwin

Sin

. Adaptive filtering prediction and control. Mineola: Dover Publications, 2009.

Adaptive coordinated motion control with variable forgetting factor for a dual-arm space robot in post-capture of a noncooperative target

Abstract

Keywords

Introduction

Dual-arm space robot system

Basic assumptions

Kinematic modeling of a dual-arm space robot

Weighted recursive adaptation algorithm with VFFs

Weighted RLS algorithm with exponential forgetting factor

Choice of forgetting factor on prediction error

Convergence analysis

Simulation study

Case A: Test case with L t = 0

Case B: Test case with L t ≠ 0

Case C: Convergence analysis

Conclusion

Footnotes

Nomenclature

Declaration of conflicting interests

Funding

ORCID iD

References

Case A: Test case with $L_{t} = 0$

Case B: Test case with $L_{t} \neq 0$