Abstract
This paper deals with experimental comparison between stable adaptive controllers of robotic manipulators based on Model Based Adaptive, Neural Network and Wavelet -Based control. The above control methods were compared with each other in terms of computational efficiency, need for accurate mathematical model of the manipulator and tracking performances. An original management algorithm of the Wavelet Network control scheme has been designed, with the aim of constructing the net automatically during the trajectory tracking, without the need to tune it to the trajectory itself. Experimental tests, carried out on a planar two link manipulator, show that the Wavelet-Based control scheme, with the new management algorithm, outperforms the conventional Model-Based schemes in the presence of structural uncertainties in the mathematical model of the robot, without pre-training and more efficiently than the Neural Network approach.
1. Introduction
Model Based Adaptive (MBA) control of robotic manipulators has been widely investigated due to its capability in coping with those system modifications which can be modelled as parameter variations, also of high entities, of the nonlinear mathematical model of the manipulators themselves [1–5] such as, for example, phenomena of wear and ageing. The main assumption in the synthesis of MBA controllers is the knowledge of the structure of the mathematical model of the system, whereas the parameters appearing in this model can be unknown.
Unfortunately, the manipulators can rarely be perfectly modelled. For example, manipulators which grasp certain unknown mechanical characteristics, for example, can be modelled with uncertainties both in the structure and in the parameters of the mathematical model. Moreover, it is difficult to model stiction components of friction torques and, consequently, it is not convenient to take them into account explicitly in the model used for the MBA controller design.
Obviously, the presence of structural uncertainties in the mathematical model of the plant can deteriorate the performance of the above MBA controller, depending on their entity, resulting in unsatisfactory results or instability [6–8]. To overcome this drawback, some authors propose the addition of additional loops to the MBA controller (cf. for example [6]), whereas other authors propose non-parametric adaptive control structures, such as Neural Networks (NN) [8]-[19]. The NN is usually designed in order to identify the inverse dynamics of the system to be controlled and then implemented online as a feedforward controller [8]-[12]. An optimal structure of the NN can be obtained using Genetic Algorithms (GA) [12]-[14]. Furthermore, if parameter variations occur during operation, additional features can be introduced consisting of online adaptation sto the weights of the net, the so-called “specialised learning phase”, which allows one to obtain a neural controller whose performance is optimised for the considered reference trajectories [9], [11].
The gradient descent optimisation method, employed for adjustment of the parameters of the net, is effective in practice in the presence of structural modifications to the system [7], [12]; however, currently there is no systematic way to ensure these methods will be successfully employed. Moreover, offline pre-training of the net is often necessary. The analysis of the system becomes further complicated when learning and control are attempted simultaneously.
In two studies [19] and [20], NN controllers for robot manipulators are designed with enhanced performances while retaining the stability and convergence characteristics of MBA controllers. These algorithms treat the inverse dynamics of the manipulator as an unknown function and try to approximate this function. Practical applications of these techniques crucially depend on both the accuracy of the function approximation and efficiency of the computational structure of the approximate function.
The structure of a neural net is the topological representation of a weighted superposition of functions, particularly in the case of the single-hidden-layer perceptron and, consequently, function decomposition over a given functional basis can be represented as a neural net. There are two main topics related to function approximation via NN; the first topic regards the network structure, i.e., the choice of the topology, the family of activation functions of the nonlinear nodes, and the number of these nodes. The second topic concerns the choice of an updating law for the output weights, so that the net output could approximate the unknown function in a context of adaptive nonlinear approximation.
Within the context of the function approximation by means of NNs, the kind of activation function holds great importance. In three studies [21]-[23], fuzzy membership functions have been used. In this context, wavelets are potentially a good choice as activation functions because they have a practically-bounded support both in space and frequency [24], [26]. These characteristics allow the dynamic management of the net, adding and removing nodes from its structure without interfering with the weights of the other nodes.
Another advantage of wavelets over other basis functions, like splines or even sinusoids, is the multiresolution property which allows more efficient representation of local details of the approximated function. The multiresolution capabilities make the wavelets a very efficient tool, since by retaining only the coefficients which are above a certain threshold, it is possible to obtain a “sparse” representation which does not imply a significant loss of details [26].
An interesting control structure is that illustrated in Cannon and Slotine's study [27], which uses a dynamic wavelet network, the structure of which is updated according to the hard-threshold algorithm [26]. In Sanner and Slotine's study [28], the approach used in [27] was extended to the adaptive control of robotic manipulators (WN) and a simple algorithm was proposed to manage the network nodes: for each active neuron the nearest neurons can be activated if the weight of the active neuron is greater than a given threshold. A drawback of this mechanism is that the behaviour of the system is greatly influenced by the choice of the above threshold, which implies that the net can have a number of nodes either too low or too high for the task to be executed.
In Alonge et al.'s study [29], a new activation mechanism is proposed, based on the observation that in order to ensure a good tracking of a given trajectory, the active nodes of the net must cover at least all the points of the net input space, which are actually part of the trajectory itself. To this end, the following procedure is employed: a) during the evolution of the system, a new node at a maximum scale is inserted each time the trajectory of the system reaches an interval of the net input space not already covered by other nodes; b) insertion and removal of nodes with activation function at minor scales is carried out so that local details of the new unknown dynamics can be better approximated.
Step b) is executed according to a routine based on the threshold algorithm previously described, using local thresholds instead of global ones. In particular, for each maximum scale node inserted in the network structure, a couple of thresholds are considered and updated starting from the local tracking error. When the weight of a node exceeds the insertion threshold, the nodes which are adjacent in the wavelet index space are inserted; in the same way, nodes are removed when their weights go below the removal threshold. Since the thresholds are locally updated on the basis of the tracking error, insertion of new nodes is more probable in those areas of the input space in which the dynamics are insufficiently modelled. In the paper, the previously described algorithm for basis function selection is considered. The resulting adaptive algorithm, named WN+, was experimentally compared with the MBA and NN approach. Moreover, the complexity of the resulting network is compared with the complexity of the WN controller illustrated in Sanner and Slotine's study [28].
In Section 2 the MBA approach we consider in the paper is first reviewed then the WN+ approach is introduced with the new network management algorithm. In Section3, experimental results on a planar two-link manipulator are displayed with the aim of comparing the WN approach with the MBA approach in two studies, [2] and [5] and NN based control laws [7], [12] and [28]. Finally, Section 4 deals with some conclusions.
2. The adaptive controllers
2.1 Model-Based Adaptive Controller
A robotic manipulator is a dynamic structure consisting of a number of links joined together by joints typically actuated by motors coupled to the joints themselves. For a manipulator with m joints, a typical mathematical representation of the dynamics of the system is the Euler model given by:
where
Given an assigned desired trajectory in the joint space, specified by means of a couple of position and velocity functions
where
The control law (2) is based on an inverse dynamic model of the system used to compensate the nonlinearities of the model (1), so that the resulting closed-loop system, decoupled and linear, can be driven by the PD action -
If model (1) admits the linear parameterisation:
where
where Γ and
However, some nonlinear systems which can be modelled using (1) cannot be parameterized as in (3) and, consequently, the adaptive approach (4) and (5) cannot be applied even if the structure of the matrices
2.2 Wavelet Network Controllers
Following [25], let us assume that the matrices
where
and
With previous positions, the control law (2) can be written as:
where
A wavelet network is a family of functions which present some useful properties in the field of function approximation, resulting in a powerful tool to perform signal analysis and synthesis. As mentioned previously, the main advantages of the wavelet analysis over other methods of function decomposition, like splines or even Fourier analysis, are the multi-resolution property and the compact support.
A wavelet basis is a family of functions having a double index each function of the set has a translation index, which is a vector if the set is defined over a multidimensional input space, and a scaling index which is often scalar. Multidimensional scale indices are also used in the so-called multiscaling wavelet frame. A wavelet basis is thus defined as an infinite collection of translated and scaled versions of a single motherwavelet. Since the mother wavelet is a function with a compact support, the multiple scales of the wavelets allows representation of the analysed function with various levels of detail, maintaining the localization of the information about the characteristics of the analysed function. The multi-resolution analysis also makes the wavelets a very efficient tool, since retaining only the basis function whose coefficients are above a certain threshold, it is possible to obtain a “sparse” representation of the function which does not involve a significant loss of details.
Within the context of function approximation by dynamic neural networks, wavelets are potentially a good choice as activation functions; in fact, while the compact support allows the management of the network dynamically, adding and removing nodes from the net structure without interfering with the weights of the other nodes, the multi-resolution property allows efficient representation of the local details of the approximated function.
The wavelet expansion used for function approximation can be considered as a neural net with a single layer the nodes of which have wavelets as activation functions.
In order to approximate the function matrix
For an n-dimensional input space, the corresponding wavelet family is obtained as follows:
where j∈Z,
Wavelets of the form (10) satisfy the so-called “frame” properties [13], which ensure that a function f ∈ L2(ℜ n ) having a compact support can be approximated within a given error by a superposition of a finite number of elements in the form (10). The above error depends on the need for truncation of the frame in order to obtain a neural network with a finite number of nodes and, consequently, physically realisable or with acceptable computational cost, so that its implementation on a digital system can be efficient.
In order to affect the frame truncation it is convenient, first of all, to choose a dominion of the space R
n
of
A dominion Iex ⊃ Id and a modulating function m(
A new matrix
The previous considerations suggest the following architecture of the control law:
where
in which
With reference to the structure of the net, it is convenient to observe that
Regarding the choice of the number of neurons of the hidden layer, truncation operations of the frame are obtained by defining limits on the wavelet indices j and ki, i =1,…, n. To this end, it is convenient to observe that the dominion of the vector
The superior limit on j is chosen so that the support of the wavelet has the minimum dimension of Id, whereas the inferior limit is chosen according to the spectral content of the unknown function to approximate. The limits on ki for assigned j are given by ceil(2jxi,min) ≤ ki (j) ≤ floor(2jxi,max), where floor(.) and ceil(.) denote the round-off to the nearest integer number superior or inferior, respectively.
Obviously, the above limits imply a finite number of nodes of the net. Let
The generic element
where the quantities ŵil,
for the nodes considered active, where si and vl are respectively the i-th and the l-th components of
for those nodes which are selected for deletion, the overall system results are stable and converge asymptotically when the filtered tracking error is greater than a given threshold given by:
where kd,min is the minimum value of the elements of
and
where: a) Er, Ed and ET are, respectively, the error due to the gradual updating of the weights of the nodes to be removed, the error due to the dynamical management of the net which implies that some nodes are removed and consequently the wavelet expansion involves a reduced number of wavelets of the chosen frame, and the errors due to the truncation of the frame; b) Nrem and wmax are the number of the nodes to be removed and the maximum absolute value of the weights of the net. The proof is similar to that shown in a study by Gomi and Kawato [10].
Note that (17) implies that the
A great importance is assigned to the algorithm which causes the insertion and removal of the nodes of the net. In a study by Gomi and Kawato [10], for a similar control system, an algorithm based on a simple threshold has been proposed. In this solution, whenever the weights of a node active in the net overcome a certain threshold ϑ, all the nodes which have adjacent wavelet indexes are inserted in the net structure. In a similar way, all the nodes whose weights go under a given threshold μ are removed from the structure, after their weights have been gradually brought to zero. A drawback of this solution is that the behaviour of the system is greatly influenced by the choice of the numerical values of the thresholds, so that the result may be either insensitivity to the weight growth, thus keeping too low the number of nodes of the network, or oversensitivity generating a network with an excessive complexity. Within a context of adaptive control, it seems unrealistic that the algorithm managing the net needs to be tuned to the problem in order to give good results.
Moreover, in order to bring an efficient structure to the net, the algorithm managing the net must correlate the subset of the input space effectively covered by current trajectories with the position of the active neurons and the number of wavelets with minor scale index, with tracking error, which is a measure of the wellness of the local identification. Following the above consideration, in order to ensure a good approximation of the unknown dynamics of the manipulator, the active nodes in the net must, at least, cover all the points of the net input space which are actually part of the trajectory of the system. Therefore, in the proposed algorithm, since each activation function has a bounded support and centres are located in an m-dimensional grid, during the evolution of the system new nodes are inserted each time the vector
Besides this major policy of node insertion, which ensures at least the minimum degree of coverage of the input space, a second routine causes the insertion and removal of nodes with activation function at minor scales, so that local details of the unknown dynamics can be better approximated. This routine is based on the threshold algorithm described previously but uses local thresholds instead of global ones. For each maximum scale node inserted in the network structure, a couple of thresholds are considered, the values of which are updated during the control action on the base of the local tracking error. The approach described before concurs to the insertion of wavelets in the frame, which gives a valuable contribution to the error tracking reduction, taking advantage, in such a way, of space localisation properties of such functions.
The update algorithm of local thresholds is obtained as a discrete time task, the period of which is T. Each time this task is activated, it checks the maximum value of the filtered tracking error
where α < 1, β > 1, a, β ∈ ℝ, making more probable, in such a way, the local insertion of wavelets at different scales.
The nodes which are not interested by the (18) have their threshold brought back to their nominal values, ϑs, μs, by a similar formula
After this threshold adjustment, the algorithm is applied as in the study by Gomi and Kawato [10]; thus, when a weight of a node exceeds the insertion threshold, nodes which are adjacent in the wavelet index space are inserted; in the same way, nodes are removed when their weights go below the reduction threshold. Since the thresholds are locally updated on the base of the tracking error, insertion of new nodes is more probable in those areas where the dynamics are insufficiently modelled, with the presence of nodes at multiple resolutions. The described procedure can alleviate the problem of the ingestible increasing of the number of basis function with the dimensionality of the input space. In fact, as a result of the algorithm, the wavelet allocation is near the actual trajectories on the net input space by making the problem, in such a way, nearly always1-dimensional.
3. Experimental tests
Experimental tests were carried out in order to prove the practical application of the WN control law and compare it with both the NN controller described in [7] and [12] and MBA control described in [5] and [6]. In this section, the results of the above experiments are discussed.
The experimental equipment is illustrated in Figure 1 and consists of:
Functional scheme of the experimental setup. a two link planar robotic manipulator directly driven by variable reluctance motors (NSK Megatorque: stall torque 250Nm for link 1 and 40Nm for link 2), equipped with two resolvers and resolver-to-digital converter boards equivalent to a 19,200ppr incremental encoder; industrial controllers designed for torque control loops, with ±10 volt torque command signals; dSpace DS1103 PPC Controller Board; the Real-Time Interface software from dSpace, which automatically generates real-time code from Simulink models and implements this code on dSPACE real-time hardware.
The manipulator to be controlled is a two-DOF SCARA type whose structure is illustrated in Fig. 2. The mathematical model used for the implementation of MBA control law is given by:
Schematic structure of the 2-DOF SCARA manipulator with asymmetric load.
where matrices and parameters are given in Tables 1–3. The elements of vector p are considered unknown and were updated online with an MBA approach.
Matrices of the 2-DOF SCARA manipulator model.
Mechanical parameters.
Nominal values of both the mechanical parameters and the vector
An application of MBA control law is also considered based on the same model as above, but without load (i.e., Mp = 0, Ip = 0, δ = 0). The corresponding control law, named MBA-, was considered in this paper in order to test the robustness properties of MBA controllers to the manipulator structure uncertainties. Note that the presence of an asymmetric load implies both structure and parameter variations in the mathematical model, because both matrix
Finally, a NN controller was considered in the paper because for its implementation, as for WN, the knowledge of the mathematical model of the manipulator is not necessary. The considered approach is that described by Alonge et al.[12], in which a multi-layer sigmoidal perceptron was used to control the manipulator sketched in Figure 2 but without load. The final structure of the net was obtained from an offline training phase with input-output data acquired from a closed loop scheme with a PD controller, followed by a optimization of the net structure based on Genetic Algorithm, consisting in the determination of an optimal net architecture, number of hidden layers, number of neurons, connectivity percentage for each layer and percentage of connections between layers. The resulting neural network has the following characteristics: 7 input variable (
In all tests, the desired trajectory is:
and the parameters of the PD control component, for all the controllers, are chosen as
In order to show the capabilities of the proposed network management algorithm, two experiments were performed using the same control scheme with two different values of the maximum allowed error. The results displayed in Table 4 show that improving the tracking performances, by decreasing of the maximum allowed error, produces increasing of the complexity of the net.
Final number of nodes of the wavelet controller for different choice of the Error Threshold.
The experimental results are displayed in Figures 3–9. Figures 3–8 show the results obtained applying WN+ control law with the Error Threshold chosen as in the first column of the Table 4. In particular, Figures 3 and 4 show joint tracking errors, Figures 5 and 6 show the applied control torques and Figure 6 shows the shape of the end-point tracking error. Finally, Figure 8 shows the number of nodes at different scales automatically obtained by the node allocation mechanism.

WN+ control law. Joint 1 position tracking error.

WN+ control law. Joint 2 position tracking error.

WN+ control law. Applied control torques to the joint 1.

WN+ control law. Applied control torque to the joint 2.

WN+ control law. End-effector position tracking error.

WN+ control law. Number of nodes automatically chosen by the node adaptation mechanism.

Filtered end-effector tracking errors for the different control algorithm: Wavelet Network (WN+), Neural Network (NN), Model-based Adaptive (MBA) and Model-based Adaptive based on load free model (MBA-).
Figure 9 shows the displacement of the filtered end-point tracking errors for WN, NN, MBA and MBA- control laws, which were obtained from MBA without modelling the grasped load. Filtering was necessary in order to extract the average displacement in a suited temporal window, from end-point tracking errors. The chosen temporal window was 5 seconds. The comparative tests show that MBA is the best control approach if a sufficiently accurate model of the manipulator is known. But if the model does not take into account, for example, the grasped load, the performance deteriorates (MBA-). NN can learn the structural modification as illustrated by the corresponding shape of the filtered tracking error but this approach necessitates a long pre-training phase and optimisation phase. A comparison between WN+ and NN controllers shows that the application of NN controller causes the tracking error to start from lower than the average value because of its pre-trained structure, but application of WN+ outperforms that of NN in a short time. Moreover, the final structure of WN has 14 neurons, whereas NN has 100.
Finally, a third experiment has been performed in order to compare performances of the WB and WB+ algorithms in term of complexity of the net in both cases. The results illustrated in Figure 10 show that WB+ bring in a simpler structure than that using WB, as theoretically expected.

WN control law. Number of nodes automatically chosen by the node adaptation mechanism illustrated in [28].
4. Conclusions
In this paper an experimental comparison between Model Based Adaptive (MBA), Neural Network (NN) and Wavelet Network (WN) control has been considered. The comparative tests show that MBA is the best control approach if a sufficiently accurate model of the manipulator is known. But if the model does not take into account, for example, the grasped load, the performance deteriorates. Moreover, NN control application causes the tracking error to start from lower average value with respect to WN approach because of its pre-trained structure but application of WN outperforms that of NN in a short time. Furthermore, the final structure of WN has 14 neurons whereas NN has 100. Wavelet Networks controllers, with the new network management strategy, can learn the complex dynamics of manipulators without a pre-training phase, more efficiently and more accurately than NN and the MBA- approach.
Footnotes
5. Acknowledgments
This work was supported by the Italian Ministry of Education, University and Research – Research Project of National Interest “ROCOCO” (PRIN 2009).
