Abstract
This paper presents a new control strategy for stochastic distribution shape tracking regarding non-Gaussian stochastic non-linear systems. The objective can be summarised as adjusting the probability density function (PDF) of the system output to any given desired distribution. In order to achieve this objective, the system output PDF has first been formulated analytically, which is time-variant. Then, the PDF vectorisation has been implemented to simplify the model description. Using the vector-based representation, the system identification and control design have been performed to achieve the PDF tracking. In practice, the PDF evolution is difficult to implement in real-time, thus a data-driven extension has also been discussed in this paper, where the vector-based model can be obtained using kernel density estimation (KDE) with the real-time data. Furthermore, the stability of the presented control design has been analysed, which is validated by a numerical example. As an extension, the multi-output stochastic systems have also been discussed for joint PDF tracking using the proposed algorithm, and the perspectives of advanced controller have been discussed. The main contribution of this paper is to propose: (1) a new sampling-based PDF transformation to reduce the modelling complexity, (2) a data-driven approach for online implementation without model pre-training, and (3) a feasible framework to integrate the existing control methods.
Keywords
Introduction
The stochastic distribution control was presented for a class of non-Gaussian stochastic systems in the late 1990s (Wang, 1999), where the randomness of the system output can be controlled by adjusting the shape of the output probability density function (PDF). As an important research topic, stochastic distribution control inspires other topics such as the fault diagnosis in non-Gaussian systems (Guo and Wang, 2005; Yao et al., 2012), networked Direct Current (DC) motor control (Ren et al., 2015), probabilistic decoupling (Zhang et al., 2017), performance enhancement (Zhou et al., 2016), data-based identification (Zhang and Sepulveda, 2017), non-Gaussian filtering (Zhang and Yin, 2018; Zhao and Mili, 2017), operational control (Ding et al., 2012; Zhang and Hu, 2018), multi-path estimation (Cheng et al., 2018), industry 4.0 (Trovati et al., 2019), and so forth. In practice, tracking the given desired PDF is required in many process control and manufacturing processes, such as the quality control for paper-making (Wang, 1998).
A key step in PDF control is to establish the dynamic system model. Most of the existing results can be divided into two groups (Ren et al., 2019): (1) establish the PDF of the system output analytically if the stochastic distribution of the system noise is known (Guo and Wang, 2010). Following the analytical formulation, the full-probabilistic design can be achieved (Zhou and Herzallah, 2020) and the non-Gaussian filtering problem was investigated using the same approach (Guo and Wang, 2006). However, the PDF evolution is difficult to obtain considering the complex nature of stochastic systems and that the stochastic distribution of the system noise is normally unknown in practice. (2) represent the PDF as a weighing vector of the neural network, such as B-spline neural networks, where the dynamics of the system output can be reflected by the time-variant weighting vector (Wang, 2012). Although the PDF of the system noise is not necessarily known, the pre-training of the neural network can be applied to approximate the real-time PDF with the weighting-based representation. One problem of this approach is that it leads to a time delay for the control system design.
To overcome the aforementioned problems, it is important to develop a new approach combining the features of the two approaches above, where the new approach can be implemented in simple formulation and without requiring pre-training. This motivates the model development in this paper, where the evolutionary method is used to obtain the time-variant PDF. However, the analytically PDF will not be used for control design due to the computational cost. Alternatively, the sampling-based PDF vectorisation method is introduced, which converts the PDF to a vector with reduced dimension. Thus, the vector-based PDF model can be used in control system design. Note that the pre-training is not required for obtaining the vector, which makes the main difference from the neural network approach. Basically, the desired PDF can be converted into a vector. Then, the PDF tracking problem is re-written as a vector assignment problem. Any existing control methods can be used to minimise or eliminate the error between the reference vector and the time-variant vector along the time horizon. After identifying the model of the PDF dynamics, in this paper, we use Proportional-integral-derivative (PID) as a standard design, which is widely used in industry. Moreover, the stability of the proposed method is analysed to guarantee the tracking performance
In addition, the data-driven approach is also discussed as an extension of the presented strategy. The output PDF can be estimated by a sliding window that is known as the kernel density estimation (KDE) (Odiowei and Cao, 2009; Tang et al. 2020). Then, the data-based estimated PDF can be used to replace the evolutionary PDF. Note that the data-driven vectorisation of PDF can also be attained via Monte Carlo methods (Zhang and Wang, 2020), which can be considered as the PDF discretisation with zero-order hold. In addition, a first-order B-spline method (Wang, 2012) will lead to the equivalent histogram compared with Monte Carlo methods (Zhang and Wang, 2020). However, the modelling accuracy of these methods depends on the selection of the intervals in sample space, for which the pre-specified intervals may not sensitively reflects the dynamics of the PDFs. The hybrid modelling can be further discussed as the future perspective of the PDF tracking research where the model-data fusion is inspired for industrial process control and optimisation.
In practice, the data-driven approach will provide more flexibilities in terms of the implementation as the analytical formula of the stochastic distribution is difficult to obtain. In addition, the data-driven approach will be naturally included into the artificial intelligence design, where many recent design options are available. All these benefits can be considered in the future work.
The remainder of this paper is organised as follows. In the next section, preliminaries and the PDF representation are given. After that, the main result on PID control design is presented. Following the previous section, the stability analysis is given. The data-driven extension of the presented algorithm is proposed in the subsequent section. Then, numerical simulation results are demonstrated to validate the proposed method. In addition, the multi-output systems and advanced controller design are discussed at the end as an extension to the presented framework. Conclusions are given in the final part of the paper.
Preliminaries and PDF evolution
Consider the following general stochastic non-linear system with single input and single output
where
where
where
while
To describe the evolution of the output PDF for stochastic non-linear system (1), the following lemma has been recalled (Yin and Guo, 2012; Zhou et al., 2017).
where
Using the lemma above, the PDF of system output
where
where

PDF representation using vectorisation which can be considered as PDF sampling.
The desired PDF
where
In particular, the selection of minimum set of the base points can be implemented using the inflection points of the desired PDF. For each two inflection points of the PDF, the curve between them is positive and monotonous. As the integral of the PDF over the definition space is equal to one, the inflection points can uniquely represent the shape of the desired PDF. Thus, the minimum dimension of the vector-valued PDF representative state can be obtained. However, the minimum set of vector points cannot reflect the transient part of the PDF dynamics. Additional points can be pre-selected between each two inflection points, which would increase the accuracy of the control performance when additional transient information has been considered into the PDF control.
PID controller design algorithm
Using the PDF vectorisation, a new output PDF model can be established where many existing controller design approaches can be applied such as PID, Linear-quadratic regulator (LQR), and so forth. For instance, a PID controller with the following structure
where
The parameters of the controller can be determined by various methods; in this paper, we apply the time-domain design approach using linear matrix inequality (LMI). Firstly, the model should be built up to describe the dynamics of the PDF based upon the transformed vector-valued PDF representative state. In particular, the following linear format is used
where
where the entries in
As a result, we can further rewrite the PID formula as follows
where
Defining
where
Based on the identified linear model, the parameter matrix
where
where
Based upon the presented Lyapunov function candidate, we have the following equation
We can further consider a decay rate for the Lyapunov function candidate such that the following inequality holds
where
Thus, the condition of system stability can be described as follows
To rewritten the inequality into LMI, we can further introduce
Stability
In this section, the stability of the proposed output PDF control system is analysed. Combining the system model (1) and PID law (6), the following equations are obtained
and
where
Based upon the two assumptions of
Notice that there always exist two real numbers
where
Equation (18) leads to the following result using the norm operation
Since the vector
Calculating the mean-value of equation (19) and substituting equations (21)–(22) to equation (19), we can have
where
as
Using the assumption of function
As a summary of the analysis in this section, a theorem is given to indicate the stability condition of the proposed output PDF control algorithm.
Data-driven extension with KDE
As aforementioned in the Introduction, the KDE can be adopted to estimate the PDF using the data of system output. In this case, the evolutionary PDF can be replaced by the estimated PDF that further reduces the complexity of the computing. Since the collected system output data can be denoted as
where
Note that the estimated vector
Based on the stability analysis, the flowchart has been produced in Figure 2 to demonstrate the procedure of the presented algorithm, where the flow with hollow arrows indicates the data-based extension of the presented framework. In particular, the framework has been shown to include three components: PDF evolution, PDF dynamics modelling and PDF tracking control.

The flowchart of the presented PDF tracking control framework using PID controller.
Simulation
To illustrate the effectiveness of the proposed stochastic distribution control algorithm, a numerical example is demonstrated in this section and the system model is formulated as follows
where
Suppose that the desired PDF is given as Gamma distribution
Figure 3 shows the vectorised PDF

The vector-valued PDF representative state

The PID control input for the investigated closed-loop system.

The 3D mesh for the vector

The pseudo PDF of the system output

The interpolation-based pseudo PDF of the system output

The system output

The error of the probability density states

The mean-square error of the probability density states

The vector-valued PDF representative state of the system output
It has been shown from the numerical study that the presented algorithm is convenient for implementing without pre-training of weights. Using the data-driven approach, the PDF evolution can be replaced by a vector dynamics while the system identification may not be necessary as the PID parameters turning is achievable by trial and error. We can consider the presented framework as a model-free design for PDF tracking problem.
Further discussion
Multi-output systems extension
In this paper, we only investigate the single-input, single-output (SISO) systems where the PDF of the system output can be visualised by the 2D curve at each sampling instant. For complex industrial processes, the multi-input, multi-output (MIMO) systems should be discussed. The multi-variable system output will lead to a multi-dimensional PDF which is a joint PDF. Thus, the PDF tracking problem for multi-output systems can be summarised as a joint PDF tracking problem.
Suppose that the system outputs are
The sets of base points for each system output will be pre-specified as follows
where
The multi-dimensional vector-valued PDF representative state can be obtained by substituting the base points into the joint PDF, then the joint PDF of the system output can be represented by a single vector. Take an example with two system outputs
Vectorising the matrix, the vector-valued PDF representative state for the joint PDF can be formulated as a vector. In particular, we can choose each row of the matrix and connect them as a vector, alternatively, the column approach can be used to form the vector. Both types of vectorisation will give the same dynamics of the joint PDF. For instance, the row-based vectorisation for the matrix above can be converted into the following vector
The multi-output system PDF tracking problem can be described uniformly using the proposed framework. In general, the vector can be denoted as
To implement the presented algorithm via a data-driven framework, the multi-dimensional KDE can be adopted, thus the joint PDF can be approximated using the collected data sets of the system outputs. The data-based estimated vector
Advanced controller design
Another extension of the presented framework can be investigated from the controller design point of view. Currently, we used PID as a standard controller for a linear system. However, the linear model may not represent complex output PDF systems, thus, non-linear models and un-modelled dynamics should be considered. Advanced controller design need to be developed on complex system models. In particular, the small-gain technique can be used to deal with the uncertainty and the robustness requirement (Ma et al., 2020). Fuzzy tracking control design also focuses on the uncertainties and robustness, where non-linear networked system model (Li and Park, 2018), high-order non-linear model (Zhao et al., 2015), event-triggered non-linear model (Li et al., 2019), non-strict feedback non-linear model (Wang et al., 2021) have been presented. All these mentioned methods can be implemented with the proposed data-driven approach.
Conclusions
This paper proposes an output PDF tracking control algorithm for a class of stochastic systems. Different from the existing neural network-based methods and PDF evolution solutions, the presented control algorithm converts the system output PDF to a vector-formed probability density states using vectorisation. The dynamics of the system output PDF is approximated by the time-variant vector-valued PDF representative state. Thus, the PDF tracking problem has been transformed into a vector assignment problem. To describe the relationship between the control input and the PDF representative vector, a vector-based model is established with linear dynamics assumed, where the vector is measurable using the PDF evolution and the PDF vectorisation. To achieve the PDF tracking, a LMI-based PID design is adopted to eliminate the distance between the reference vector and the real-time vector asymptotically. Following the theoretical analysis, the closed-loop stability of the system can be achieved where the PDF tracking error is convergent and the system output is bounded in the mean-norm sense.
To deal with the system noise with unknown distribution, the data-driven approach is further discussed where the vector can be estimated by KDE and the stability is still guaranteed. The numerical simulation results demonstrate the effectiveness of the presented design.
As the main contribution of the paper, a new PDF tracking control algorithm is proposed, which can be used in real-time control without pre-training of weights and potentially it can be implemented as a pure data-driven or even model-free algorithm. The computing complexity has been reduced by the vectorisation of continuous PDF, and existing controllers can be integrated into the framework.
Using the presented framework, the following three aspects can be considered as our future works: (1) data-driven optimisation using machine learning algorithms, (2) advanced controller design considering the un-modelled dynamics of PDF and (3) industrial applications with analysis and randomness attenuation.
Footnotes
Acknowledgements
The authors would like to thank the reviewers for the valuable comments which improve the quality of the paper. The authors would like to thank Professor Tianyou Chai from Northeastern University, China, and Professor Hong Wang from Oak Ridge National Laboratory, USA, for the valuable discussions.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This paper is partly supported by National Science Foundation of China under Grants (61603262 and 62073226), Liaoning Province Natural Science Joint Foundation in Key Areas (2019-KF-03-08), Natural Science Foundation of Liaoning Province (20180550418), Liaoning BaiQianWan Talents Program, i5 Intelligent Manufacturing Institute Fund of Shenyang Institute of Technology (i5201701), Central Government Guides Local Science and Technology Development Funds of Liaoning Province (2021JH6/10500137).
