Abstract
Piezoelectric actuators are widely used in micro/nanoscale robotic manipulators. Due to its hysteresis and dynamic-related nonlinearity, accurate displacement tracking control of piezoelectric actuator is challenging. Besides, in some low-cost practical systems with low sampling rate, transmission delay causes mismatches between feedback and real displacement, further increasing the challenge in tracking control. In this article, a neural network-based model predictive controller (MPC) is proposed for precise tracking control of piezoelectric actuator’s displacement in situation where feedback is slow and delayed. The prediction model is based on a nonlinear-autoregressive-moving-average-with-exogenous-inputs framework, which outputs entire prediction horizon of future displacement in a single time, and is fulfilled by a multilayer feedforward neural network. An extended Kalman filter-based estimation for displacement is introduced to relieve the influence of feedback delays so as to improve dynamic performance of the controller. Another neural network is trained to provide initial values for MPC to reduce computation costs and improve performance in dynamic tracking. In a series of tracking experiments, the effectiveness of proposed controller is verified.
Introduction
Manipulating objects in nanoscale is a fast growing demand in research of nanotechnology, and robotic nano-manipulators are developed to serve this goal. With excellent performance in precise positioning and capability of generating large forces, piezoelectric actuators (PEAs) have been widely used as motion components of robotic micro/nano-manipulators. 1,2 However, driven by inverse piezoelectric effect, 3 the displacement of a PEA is greatly influenced by its nonlinear characteristics, such as hysteresis and creep. 4,5 Hysteresis is a phenomenon that the displacement of an actuator depends not only on current input signal but also on memories of past signals. Creep is a delayed deformation which slowly changes displacement after input signal is unchanged. In practice, the frequency of motion also greatly influence the voltage–displacement relationship, that is, the dynamic nonlinearity of PEAs. 6
Dealing with the aforementioned nonlinear characteristics is crucial to the control quality of piezoelectric displacement. Traditional feedback strategies like proportional–integral–derivative (PID) can easily deal with creeps but have poor performance handling other nonlinearity, especially in dynamic occasions. 7 So feedforward mechanisms which determine control signals according to prior knowledges are introduced to combine with feedback control for improvement in tracking performance.
The model describing nonlinear characteristics of PEAs is the base of feedforwarding, for which a number of modeling methods have been proposed, generally including physics-based models and phenomenon models. Models in former group try to establish model through inherent relationship of physical variables. Jiles–Atherton model is an example of such kind of models. 8 Physics-based models are difficult to build and lacks generality to be applied in different system. The latter group, namely phenomenon-based models, includes three subgroups: differential equation models, operator models, and others. The first two subgroups include Duhem model, 9 Bouc–Wen model, 10 Preisach hysteresis model, 11 and Prandtl–Ishlinskii (PI) hysteresis model. 12 These models try to extract a mathematical law from phenomenon without involving physical principles. Similar to physics-based models, these mathematical hysteresis models usually have complicated structures and cost much to compute, also have to be appended with additional models of creep and dynamic. 13,14 The last group of models are totally data-driven approaches and have relatively simple forms. Examples include autoregressive-moving-average (ARMA) models, fuzzy models, and neural network-based models. Cao et al. 15 proposed a linear ARMA model for hysteresis. Cheng et al. 14 proposed an adaptive Takagi–Sugeno fuzzy model to describe the nonlinear behaviors of PEAs. Wen and Cheng 16 proposed a recurrent fuzzy model. Liaw et al. 17 designed a radial basis function neural network model. Nonlinear ARMA with exogenous input (NARMAX) form is combined with two neural networks to separately obtain hysteresis submodel and dynamic submodel by Cheng et al. 18 As phenomenon-based models are easier to obtain and have better generality, they’re more popular in actual application.
Based on model of PEAs, many control algorithms are proposed for PEAs positioning. Inversion-based methods are most widely used. Usually, an inverse model of hysteresis is cascaded with control plant to carry out feedforward compensation, then feedback approaches are used to deal with dynamic and other minor nonlinearity. Ge and Jouaneh 19 designed a PID controller combined with a feedforward compensator based on numerical inverse Preisach model. Al Janaideh et al. 20 used an analytical generalized PI model inversion as compensation in micropositioning control. Song at al. 21 proposed an inverse extended unparallel PI hysteresis model along with an inverse dynamic model to alter the PEA into highly linearity, then a nonvector space approach is proposed to control PEA-driven scanning probe microscope. Li et al. 22 designed an adaptive internal model control (IMC) scheme with a fuzzy hysteresis model and its inversion working together. Jian et al. 23 combined iterative learning control with direct-inverse-compensated PID approach to reduce the tracking errors caused by incompletely compensated hysteresis. However, accuracy inverse models are hard to acquire whereas the computation costs are usually high, which is a significant drawback for online tracking. Some inversion-free methods are then proposed. Al Janaideh et al. 24 designed a control scheme, in which a PI model acts in a feedback fashion rather than an inversion as a feedforward compensation, with another equivalent linear model derived as internal model of the compensated plant so as to form feedforward–feedback control in an IMC scheme. However, this kind of controller have complex structure and lack the flexibility for practical applications. Different from aforementioned approaches that use model to compensate nonlinearity, model predictive control (MPC) is a more straightforward method, widely used in practical applications. 25,26 Nonlinear model predicts future displacement based on input of model, and actual control signal applied on PEA is determined according to predicted displacement. 18,27 MPC usually takes consecutive predicted future outputs over a certain prediction horizon for optimization, which demands repeated running of the model, thus increases computational burden. A popular solution is linearizing the prediction model to simplify calculation. 28,27 In addition, the accuracy of model greatly influences the performance of MPC.
Besides the nonlinearity, imperfect feedbacks from sensors can also cause problems, inefficient sampling rate with delays or even data losses exist in practice, especially for low-cost hardware systems. As changing voltage directly acts on PEA’s displacement in extremely short time, delay of sensors can cause large mismatches between feedback and real displacement when PEA deforms rapidly, increasing the challenge in dynamic tracking control. The performances of many typical controllers significantly deteriorate in such situation. The simplest solution for this problem is improving the sensors by updating hardwares to increase the sampling/transmission rate which however can be costly and difficult in some actual applications. Hence, some soft methods are proposed to solve similar problems, usually based on estimation of delay and additional model of corresponding states, combined with multiple controllers under switching rules. 29,30 However, few has been adopted in tracking control of PEA, which has special nonlinear characteristics as aforementioned.
In this article, aiming to improve tracking performance of PEA control systems where low-rate sensor is used, a neural network-based model predictive controller is introduced, which works well when feedback is very inefficient and delayed. The proposed method also requires less computational resources than traditional neural MPC (NMPC), make it easier to be applied in practical systems. First, a NARMAX structure is adopted for nonlinear modeling of PEA, implemented by a multilayer feedforward neural network (MFNN). To ease the computation, the model is designed to output an entire prediction horizon of future displacement altogether when called, avoiding repeated calculation in sequential prediction. Then the control issue is transformed into an optimization problem that iteratively adjusts the control signals to minimize the errors between predicted displacement and reference. Another MFNN is trained as a feedforward controller, outputting an initial control signal for the optimization. In addition, an accumulated-error compensation term is embedded into controller, to reduce steady-state error caused by model mismatches. To relieve the influence of delayed feedbacks, an extended Kalman filter (EKF)-based method is adopted to fuse sensor feedbacks with the predicted displacement, providing controller with better state estimation so as to accordingly modify the control signal. To evaluate the performance of proposed controller, a series of experiments have been conducted versus PID and a typical NMPC, and the result verifies effectiveness of proposed controller.
Neural model of PEAs based on NARMAX
The main nonlinear characteristics to be modeled are hysteresis and rate-dependent dynamic property. As aforementioned, hysteresis is a kind of memory phenomenon, which relates displacement of PEA to states and signals in the past. Meanwhile, frequency can be estimated by extracting differential values from discrete signal sequence, so rate-dependent property can also be viewed as a behavior determined by historical signals and states. Hence, the input–output relationship of PEAs can be concluded as following function
where
To achieve smoother performance of MPC, a series of predicted outputs
where
NARMAX is a framework of input–output representation, whereas obtaining specific model, that is, determining the function
Neural model of PEAs
An MFNN with single hidden layer is used as approximator for NARMAX representation
And the activation function in output layer is set linear.

MFNN approximator for NARMAX-based model of PEAs. MFNN: multilayer feedforward neural network; NARMAX: nonlinear-autoregressive-moving-average-with-exogenous-input; PEA: piezoelectric actuator.
Identification of model
Let
where
Then the problem becomes determining the parameter matrices Wo , Wh , Bh , and Bo . Provided sufficient input–output data pairs sampled in experiment measuring displacement of PEAs, supervised training can be conducted to solve following optimization problem
where
There are plenty of proven optimization approaches for neural network training. In this article, adaptive moment estimation algorithm (ADAM) is chosen to play the role. As a gradient-descent-based optimization, ADAM computes individual adaptive learning rates for different parameters according to estimations of both first and second moments of gradients.
33
As a result, ADAM shows good convergence in MFNN training and can still work well when samples are with heavy noises. In our case, ADAM excels other approaches both in quality and speed of training. Coordinated with ADAM,
where

Neural MPC-based tracking control system of PEA. PEA: piezoelectric actuator.
Tracking control of displacement of PEA
With aforementioned model of piezoelectric displacement, the model predictive controller repeatedly tests virtual control signals on the model and adjusts outputs according to the corresponding predictions. As the MFNN model is differentiable, such adjustment can utilize the gradients of predicted errors with respect to control signals, hence turns the control issue into an optimization problem. Besides, a term of static error compensation, a neural feedforward controller, and a feedback estimation are supplemented to the MPC controller to improve tracking performance with feedback delays. The block diagram of the whole control scheme is shown in Figure 2.
Model predictive control scheme
Most elements in neural model’s inputting vector
The goal is to minimize the error of predicted displacement
where
To reduce excessive changes of control signal, a penalty term is added to (8), then the objective function to be minimized becomes
where
As
Optimization by LM algorithm
LM algorithm is a combination of Gauss–Newton and trust region method, useful in solving optimization problems with quadratic forms. In our case, objective function (10) is in quadratic forms
where
Then the basic update rule of LM algorithm goes as following way
where
If
The complete LM algorithm is summarized in Algorithm 1:
Calculate MPC output by LM algorithm.
Model error compensation
The model can never be totally accurate for real plant. Actually, there always exist mismatches between model predictions and actual PEA displacement. As a result, static errors will be accumulated in control progress. Hence, a compensation term is added to controller outputs, functioning the same way as the integral term in PID a controller
where
Neural feedforward controller
MPC optimizes predictive errors at every operating points but this repeated process is time-costly and the resulted signals will not help to improve control quality when similar situation occurs. Warm-start technique
25
is proposed to lessen the iterative optimization calculation, usually by setting initial value

MFNN of neural controller. MFNN: multilayer feedforward neural network.
The set-point reference
MFNN2 has a single hidden layer, whose activation function
ADAM optimizer is chosen again to train MFNN2. The training can be both conducted off-line and online along with control progress, with small batch of training sets which are updated over time, absorbing newly proven samples and dropping outdated one. In this way, the performance of neural controller will be gradually improved over time before deployment.
With a well-trained neural feedforward controller, the original value of control signal can be close to the optimal one, which decreases iterations in optimizing MPC objective function to only one iteration. Combined with the proposed model which prevents repeating predictions, the computational cost is largely lowered from traditional NMPC. Assuming a traditional NMPC requires nt
iteration, then
Modification against feedback delays
The feedbacks in inefficient rate with delay causes mismatches between feedbacks and real displacement can greatly affect the control accuracy and stability, especially in dynamic tracking. When reference changes rapidly, the controlled displacement should also follows quickly, but delayed feedbacks fail to return correct displacement of PEA in time, making control errors larger than reality, which misleads controller to adjusting output even harder. As a result, large overshoot occurs when reference/displacement changes rapidly. All kinds of feedback controllers are affected by this effect, where MPC is influenced in the prediction process, as NARMAX model requires current displacement as one of inputs. So if current displacement can be better estimated despite of mismatched feedback, the control can be improved based on better prediction. As we have a ready prediction
The states functions of prediction and measurement are respectively written as
where
From Jacobian
P is estimation covariance matrix, I is identity matrix, K is Kalman gain. In this case, however,
Although the Q and R are defined as covariances of errors, they are not directly related to true errors in our case. They act as two adjustable parameters controlling this fusion process. As (26) shows, the estimation
To ensure the coverage and smooth the shifts, the most recent
cQ
,
where a and b are nonnegative constants. To make it function well, cQ
,
Experiments and discussion
Experiment setup
As shown in Figure 4, the experiment system are configured as following: a stack PEA produced by Thorlab is driven by a 0–60 V voltage driver with peak-to-peak ripples less than 10 mV. The output voltage can be regarded as linear to control signals. A self-sensing method-based device is used as displacement sensor for feedbacks. It has two channels. One is serially connected between PEA and negative end of power, measuring the charges on negative electrode of PEA; another is parallel connected with PEA measuring its voltage. With self-sensing principle derived from piezoelectric formula, the device can acquire displacement of PEA indirectly from quantity of electric charges and driving voltage. 35 The control algorithm runs in a PC as controller, communicating with driver and sensor both through COM connections. The sampling rate of the sensor is about 250 Hz whereas the driver has higher maximal output rate. The feedbacks received by controller usually have a delay of one sampling period, which causes mismatches between feedbacks and reality, worsening the control performance in tracking task. In such condition the proposed method is tested and its effectiveness is verified.

Experiment system. Note that one channel of self-sensing sensor is serially connected between the negative electrodes of PEA and driver. The driving loop is marked with arrows. PEA: piezoelectric actuator.

Sinusoidal tracking performance comparison: (a), (c), and (e) show control performance of proposed controller in 1, 2, and 5 Hz sinusoidal tracking, respectively, compared by PID and basic NMPC in (b), (d), and (f) with same reference trajectories. Arrowheads in (e) and (f) indicate the influence of duration changes of feedback delay. Note that MAEs and RMSEs marked in figures are statistics of the entire experiment containing many sinusoidal periods. PID: proportional–integral–derivative; MAE: mean absolute error; RMSE: root mean square error.
Verification of proposed control method
A series of experiments are conducted to evaluate the effectiveness of proposed method, compared with PID and a basic NMPC. 18 For MPC and PID both, there are tradeoffs between performances in tracking of high and low frequencies. Thus, before following experiments, parameters of all controllers are adjusted to achieve balanced performances in sinusoidal tracking of multiple frequencies and in step response. Then all following experiments are conducted under a same set of parameters, which may not make the best performance for specific tasks, but focus on the overall performance.
Firstly, tracking experiments under sinusoidal reference trajectory of different frequencies (
Tracking performance comparison.
PID: proportional–integral–derivative; MAE: mean absolute error; RMSE: root mean square error.

Step response comparison. Circled part indicates the feedback delay where real displacement has been changed but not be sampled yet.
In Figure 6, a 2
Tracking of mixed sinusoidal reference with random frequency (0.5–5 Hz) and random amplitude are also tested, presented by Figure 7, which shows the proposed controller is effective in random reference tracking. And Figure 8 shows its performance in linear tracking, tested by 1 Hz triangular reference trajectory. Though proposed controller suffers some slight oscillation right after the change of motion direction, it still achieves smaller MAE and RMSE than other two controllers. The results shows that proposed method is less affected by low feedback rate which makes pure-feedback controllers produce large lags.

Random sinusoidal tracking by proposed controller.

Linear (triangular) reference tracking. (a) Linear tracking performance of proposed controller. (b) Linear tracking performance of PID and basic NMPC. (c) Error comparison between proposed controller, PID and basic NMPC. PID: proportional–integral–derivative.
As the results of above experiments show, with sparse control points, the basic NMPC losses its advantage over PID. The effectiveness of proposed controller in displacement tracking is verified with generally significant advantages over PID and basic NMPC. And the approaches taken to relieve the feedback delay problem have also been validated as effective.
Conclusion
Hysteresis and dynamic-related nonlinearity are challenging problems in displacement tracking control of PEA. The displacement feedback with delay and low frequency causes mismatch between feedback and real state, increasing the difficulty of precise tracking control for pure feedback approaches. In this article, a neural network-based model predictive controller is introduced for tracking control, combined with neural feedforward controller and EKF-based feedback estimation. The proposed method achieves much better performances than basic NMPC and PID controller in tracking experiments, proving its effectiveness in handling the nonlinearity and feedback delay. To achieve further improvements in tracking performance, especially for high-frequency trajectory, more studies are to be conducted. Potential solutions include combining current NARMAX model with more explicit expression of motion trend, which may improve model’s ability of providing better prediction and gradient estimation. The proposed controller is also to be applied to systems with higher feedback rate so that tracking the trajectory of higher frequency can be possible. Furthermore, with MPC’s ability of multi-goal optimization, current controller has potential to be developed into a space trajectory tracking controller for multi-axis piezoelectric positioning system.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported in part by the National Natural Science Foundation of China under Grant No. 61873268 and No. 62033013, also supported by Strategic Priority Research Program of Chinese Academy of Sciences, Grant No. XDB32050100.
