Sage Journals: Discover world-class research

Abstract

The fault diagnosis approaches based on k-nearest neighbor rule have been widely researched for industrial processes and achieve excellent performance. However, for quality-related fault diagnosis, the approaches using k-nearest neighbor rule have been still not sufficiently studied. To tackle this problem, in this article, we propose a novel quality-related fault diagnosis framework, which is made up of two parts: fault detection and fault isolation. In the fault detection stage, we innovatively propose a novel non-linear quality-related fault detection method called kernel partial least squares-k-nearest neighbor rule, which organically incorporates k-nearest neighbor rule with kernel partial least squares. Specifically, we first employ kernel partial least squares to establish a non-linear regression model between quality variables and process variables. After that, the statistics and thresholds corresponding to process space and predicted quality space are appropriately designed by adopting k-nearest neighbor rule. In the fault isolation stage, in order to match our proposed non-linear quality-related fault detection method kernel partial least squares-k-nearest neighbor seamlessly, we propose a modified variable contributions by k-nearest neighbor (VCkNN) fault isolation method called modified variable contributions by k-nearest neighbor (MVCkNN), which elaborately introduces the idea of the accumulative relative contribution rate into VC k-nearest neighbor, such that the smearing effect caused by the normal distribution hypothesis of VC k-nearest neighbor can be mitigated effectively. Finally, a widely used numerical example and the Tennessee Eastman process are employed to verify the effectiveness of our proposed approach.

Keywords

Fault detection fault diagnosis quality-related non-linear industrial process

Introduction

With the rapid development of industry, modern industrial systems are expanding toward the direction of large scale and complexity. To ensure safety and reliability in industrial production, multivariate statistical process monitoring (MSPM) as a kind of data-driven approach has been extensively studied and successfully applied to actual industrial processes.^1–3 In MSPM, process-related fault detection is a popular research task, of which the mainstream method is principal component analysis (PCA).⁴ Many researchers have improved PCA for better application in fault detection. For example, Xiu et al.⁵ proposed a novel Laplacian regularized robust PCA method that can effectively capture the intrinsic non-linear geometric information. Other process-related fault detection methods have canonical correlation analysis (CCA),⁶ non-negative matrix factorization (NMF),^7,8 and so on. Another particularly important research direction for MSPM is quality-related fault diagnosis,^9–11 in which quality-related fault detection and fault isolation are two key tasks. Quality-related fault detection belongs to a supervised learning task in machine learning area,¹² and it aims to detect whether a fault that affects product quality occurs in the industrial system. When a fault detection algorithm indicates some faults exist in the system, fault isolation attempts to locate the faulty sensors. By the diagnosis of quality-related faults, unnecessary downtime and cost brought by the quality-unrelated faults can be greatly reduced, and the risky faulty sensors can also be located as quickly as possible. Therefore, quality-related fault diagnosis has been a research hotspot recently.^13–15

Compared with process samples in real industry, quality samples usually have a large time lag to collect and a relatively rare quantity. Hence, the direct use of quality samples cannot meet the requirement of real-time online monitoring. To tackle this situation, the commonly adopted idea is to first establish a regression model between process variables and quality variables, and then extract the quality-related features from process variables, which will replace quality variables to realize quality-related online fault detection. Currently, two mainstream frameworks based on this idea are least squares (LS)–based and partial least squares (PLS)–based approaches. Zhou et al.¹⁶ comprehensively analyzed the defect of PLS¹⁷ for quality-related fault detection and proposed a total projection to latent structures (TPLS) model.¹⁸ Yin et al.^19,20 performed singular value decomposition (SVD) method on coefficient matrix of LS and PLS separately, and presented modified partial least squares (MPLS) and improved partial least squares (IPLS). Since all above are linear methods which are unsuitable for non-linear processes, the kernel trick has been widely adopted for non-linear fault detection. Its main idea is to map the original process variables into Reproducing Kernel Hilbert Space (RKHS) through some kernel function, thus making the process variables linearly separable in such kernel space.²¹ By introducing the kernel trick, many linear methods can be transformed into non-linear versions.^14,22,23 However, all of the above methods design statistics without considering the local characteristics among samples.

k-Nearest neighbor (k-NN) rule is a classical machine learning method, which is usually adopted as a classifier.^24,25 Because it is capable of mining local characteristics between near neighbors,²⁶ k-NN rule has been modified to propose a fault detection method based on the k-NN rule, namely, FD-k-NN.²⁷ It provides a promising direction for the solution of the above problems to some extent. The statistics of FD-k-NN are designed by fully considering the Euclidean distance measurement among local neighbor samples. Due to its excellent performance, k-NN-based fault detection methods have been studied extensively.^28–30 Although these methods have been employed into various tasks, such as multirate sampling process³¹ and multimode process,³² quality-related fault detection methods based on k-NN rule have still not been well-established so far.

Fault isolation is a successor task of fault detection, which is utilized to locate fault sensor variables. Many classical fault isolation approaches have been proposed. Contribution plot and reconstruction-based contribution (RBC) are two most commonly used isolation methods, but they have relatively obvious smearing effect. To deal with this problem, many other fault isolation techniques have been addressed in previous works.^33–36 Unfortunately, these methods fail to be used in k-NN-based fault detection. To handle the problem, Zhou et al.³⁷ proposed a novel fault isolation method based on k-NN rule, VCk-NN, which makes it possible to determine the failed sensors after detecting faults using FD-k-NN. Compared with previous methods, VCk-NN suffers from less effect of fault smearing. Nevertheless, VCk-NN assumes that the process samples follow a multivariate normal distribution, which greatly restricts its effect of fault isolation.

In this article, we propose a novel quality-related fault diagnosis framework, which is made up of two parts: non-linear quality-related fault detection and fault isolation. For quality-related fault detection task, the motivation of proposing kernel partial least squares (KPLS) by combining KPLS with k-NN rule is that (1) KPLS can effectively utilize process variables to obtain predictive quality variables, which will replace the actual quality variables that cannot get in real time, and (2) k-NN rule can mine information between a test sample and the nearest template samples for a better detection effect compared with only using a single test sample. For fault isolation task, the motivation to improve VCk-NN to MVCk-NN is that the hypothesis is sometimes not satisfied, and in this case, VCk-NN still has a smearing effect, so MVCk-NN is presented to deal with this problem by mitigating the influence of faulty variables on faultless variables. The main contributions of this article are summarized as follows:

We propose a new quality-related non-linear fault diagnosis framework based on k-NN rule, including a new quality-related non-linear fault detection method KPLS-k-NN and a new fault isolation method modified VCk-NN.

KPLS-k-NN is proposed by combining KPLS with k-NN rule. Quality-related statistics of KPLS-k-NN take into full consideration the local neighbor information among predicted quality samples, which greatly improve the detection rate (DR) for quality-related faulty samples.

Modified VCk-NN is established by introducing the idea of the relative variable contributions of accumulative relative contribution rate (ARCR) into VCk-NN, which does not need the assumption that the process samples obey a multivariate normal distribution and has more precise isolation results in identifying latent fault root cause than VCk-NN.

The rest of this article is arranged as follows: first, give some relevant preliminaries. Afterward, the k-NN scheme for quality-related non-linear fault diagnosis is proposed in a detailed presentation. Then, the simulation results are provided and discussed. Finally, we conclude the article and present our future work.

Preliminaries

Let the non-linear process contain an input data matrix $X$ which records $N$ samples of $m$ process variables and an output data matrix $Y$ including $N$ samples of $l$ quality variables, that is

X = {[x_{1}, x_{2}, \dots, x_{N}]}^{T} \in ℝ^{N \times m}

(1)

Y = {[y_{1}, y_{2}, . . ., y_{N}]}^{T} \in R^{N \times l}

(2)

where $x_{i} \in R^{m}$ and $y_{i} \in R^{l} (i = 1, . . ., N)$ represent the $i th$ sample of $X$ and $Y$ , respectively. All samples are supposed to obey normal distribution.

The implementation of KPLS method is divided into two steps as follows. First, kernel trick²¹ is introduced into KPLS model to effectively deal with the non-linear relationship among variables. Given a kernel function $ϕ$ , it maps the original samples $x_{i} (i = 1, 2, \dots, N)$ into a high-dimension kernel space $F$ , which is defined as $x_{i} \in R^{m} \to ϕ (x_{i}) \in R^{f}$ , where $f$ is the dimension of $F$ . Thus, the process matrix $X$ is transformed into feature matrix $Φ$ as

Φ = {[ϕ (x_{1}), ϕ (x_{2}), \dots, ϕ (x_{N})]}^{T} \in R^{N \times f}

(3)

As a necessary step, $ϕ (x_{i})$ needs to be processed to zero mean vector $\bar{ϕ}$ , that

\bar{ϕ} (x_{i}) = ϕ (x_{i}) - \bar{ϕ}

(4)

\bar{ϕ} = \frac{1}{N} \sum_{i = 1}^{N} ϕ (x_{i}) = \frac{1}{N} Φ^{T} 1_{N}

(5)

where $1_{N} = [1, 1, \dots, 1]^{T} \in R^{N}$ . Hence, the zero mean matrix of $Φ$ can be obtained by

\begin{matrix} \bar{Φ} = {[\bar{ϕ} (x_{1}), \bar{ϕ} (x_{2}), \dots, \bar{ϕ} (x_{N})]}^{T} \\ = (I_{N} - \frac{1}{N} 1_{N} 1_{N}^{T}) Φ \end{matrix}

(6)

where $I_{N}$ is the identity matrix. The concrete form of $Φ$ is unknowable, but its kernel matrix $K = Φ Φ^{T} \in R^{N \times N}$ can be artificial setting. In this article, the radial basis function (RBF) kernel is used to calculate the elements in $K$ , which is

K_{i, j} = \exp (- \frac{{‖ x_{i} - x_{j} ‖}^{2}}{2 σ^{2}}) (i, j = 1, 2, \dots, N)

(7)

where $σ$ is the kernel parameter that needs to be set according to experience. The zero mean of $K$ is calculated as follows

\bar{K} = \bar{Φ} {\bar{Φ}}^{T} = (I_{N} - \frac{1}{N} 1_{N} 1_{N}^{T}) K (I_{N} - \frac{1}{N} 1_{N} 1_{N}^{T})

(8)

Second, PLS model is established between $\bar{Φ}$ and $Y$ in space $F$ . Thus, the KPLS model is constructed as

{\begin{matrix} \bar{Φ} = T P^{T} + {\bar{Φ}}_{r} \\ Y = U Q^{T} + Y_{r} \end{matrix}

(9)

where $T \in R^{N \times a}$ and $P \in R^{f \times a}$ are the score matrix and the loading matrix of $\bar{Φ}$ , respectively, $U \in R^{N \times a}$ and $Q \in R^{l \times a}$ are the score matrix and the loading matrix of $Y$ , respectively, ${\bar{Φ}}_{r}$ and $Y_{r}$ are the residual matrices, and $a$ represents the number of latent variables.

The iterative calculation algorithm of KPLS has been elaborated in Jiao et al.,¹⁴ in which we can get the score matrix $U$ and $R$ that is $R = {\bar{Φ}}^{T} U (T^{T} \bar{K} U)^{- 1}$ . For the score matrix $T$ and the loading matrix $Q$ , the following equations hold

T = \bar{Φ} R

(10)

Q = Y^{T} T

(11)

Obviously, the score vector $t_{new}$ of $\bar{ϕ} (x_{new})$ is

t_{new} = R^{T} \bar{ϕ} (x_{new})

(12)

Methodology

FD-k-NN as a popular fault detection approach has ability in determining whether a fault has occurred in the process, but it cannot estimate whether the fault occurred will have an impact on the production quality.

Therefore, in this section, we propose k-NN-based fault diagnosis scheme. Its fault detection scheme KPLS-k-NN is designed as follows: first, FD-k-NN is employed to monitor the process space. Then, KPLS is adopted to obtain the predicted quality samples $Y_{p}$ of training process samples $X$ in predicted quality space, where quality-related fault detection will be carried out. When KPLS-k-NN indicates that the system has failed, a new fault isolation method MVCk-NN is given for better locating the faulty sensor variables.

The proposed KPLS-k-NN for fault detection

Fault detection in process space

FD-k-NN is designed through following the principle: any normal samples should be close to other normal samples to some extent, while for a faulty sample, it should deviate from normal samples. Usually, the degree of deviation is measured by k-NN distance, which is defined as the average square distance between the test sample and its k-NNs from the training normal samples. When the k-NN distance of a sample exceeds the threshold, it is considered as a faulty sample, otherwise, it is judged as a normal sample. The details of the algorithm are as follows.

Model building

Given the training samples $X$ , the model is built according to the following procedures:

1. Find the k-NNs for each sample $x_{i}$ in $X$ and compute all the Euclidean distance, that

d_{ij} = ‖ x_{i} - x_{j} ‖_{2}, j \in k - NNs (x_{i})

(13)

where $k - NNs (x_{i})$ represents the set of k-NNs of $x_{i}$ .

2. Calculate the k-NN distance of $x_{i}$ . k-NN distance is adopted as the statistics $D_{x}^{2} (x_{i})$ as

D_{x}^{2} (x_{i}) = \frac{1}{k} \sum_{j = 1}^{k} d_{ij}^{2}

(14)

3. Determine the threshold $D_{x, α}^{2}$ :

$D_{x}^{2}$ is rearranged in descending order as $D_{x_arrange}^{2}$ , of which $(1 - α) - empirical quartile$ is chosen as $D_{x, α}^{2}$ in Zhou et al.,³⁷ that is, $D_{x_arrange}^{2} (⌊ N (1 - α) ⌋)$ .

Fault detection

For a new incoming test sample $x_{new}$ , the sample categories are inferred as the following steps:

Find $x_{new}' s$ k-NNs from $X$ .

Compute $x_{new}' s$ k-NN distance $D_{x}^{2} (x_{new})$ .

Compare $D_{x}^{2} (x_{new})$ with the threshold $D_{x, α}^{2}$ :

If $D_{x}^{2} (x_{new}) > D_{x, α}^{2}$ , then $x_{new}$ is considered as a faulty sample. Otherwise, $x_{new}$ is a normal sample.

Fault detection in predicted quality space

Given the training process samples $X$ and training quality samples $Y$ . To obtain the predicted quality samples $Y_{p}$ of training process samples $X$ , KPLS⁴¹ is adopted to obtain $Y_{p}$ (the prediction value of $Y$ ), since it has the ability to utilize $X$ to effectively mine the essential information of $Y$ . We set the coefficient matrix of $X$ and $Y_{p}$ as $M$ , which is

M = U {(T^{T} \bar{K} U)}^{- 1} T^{T} Y

(15)

According to the calculation of equations (10) and (11), the predictive output $Y_{p}$ is calculated by

Y_{p} = T Q^{T} = \bar{Φ} {\bar{Φ}}^{T} U {(T^{T} \bar{K} U)}^{- 1} T^{T} Y = \bar{K} M

(16)

Similar to process space, we call the space where $y_{p} (i = 1, \dots, N) \in Y_{p}$ is located predicted quality space.

At this point, FD-k-NN is employed to perform the detection of quality-related faults in predicted quality space. We need to find the k-NNs for each sample $y_{p}$ in $Y_{p}$ , and obtain the $y_{p}' s$ k-NN distance $D_{y}^{2} (y_{p})$ according to equation (14). Notice that since $X$ and $Y_{p}$ generally do not obey the same distribution, for $x$ and $y_{p}$ in the same point, their corresponding k-NNs might not be the samples in the same points. Hence, the k-NNs for $x$ and $y_{p}$ should be calculated separately. In addition, our KPLS-k-NN method does not need to carry out any variable transformation in the original space, and it directly depends on the Euclidian distance of the variables in the original space as an index to quantify the discrepancy between test samples and normal samples.

Different from the threshold in Zhou et al.,³⁷ in this article, kernel density estimation (KDE)⁴², as a non-parameter probability density estimation method of random variable, is utilized to determine the threshold for two monitoring spaces, which can be referred in detail in Parzen.³⁸ Thus, corresponding to $D_{x}^{2}$ and $D_{y}^{2}$ , we will get their thresholds $D_{x, α}^{2}$ and $D_{y, α}^{2}$ .

For a new incoming test sample $x_{new}$ , its predicted quality $y_{p, new}$ is calculated by equations (12) and (15) as follows

y_{p, new} = Q t_{new} = M^{T} \bar{Φ} \bar{ϕ} (x_{new}) = M^{T} {\bar{k}}_{new}

(17)

To determine whether $y_{p, new}$ is a faulty sample, we find $y_{p, new}' s$ k-NNs from $Y_{p}$ , and compute $y_{p, new}' s$ k-NN distance $D_{y}^{2} (y_{p, new})$ .

Finally, detection logic is performed by comparing $D_{y}^{2} (y_{p, new})$ with the threshold $D_{y, α}^{2}$ : if $D_{y}^{2} (y_{p, new}) > D_{y, α}^{2}$ , then $x_{new}$ is considered as a quality-related faulty sample. Otherwise, $x_{new}$ is a normal sample or a quality-unrelated faulty sample.

The whole scheme of KPLS-k-NN

By combining fault detection in process space as well as in predicted quality space, our proposed the whole KPLS-k-NN non-linear quality-related fault detection scheme is summarized as follows:

Offline modeling

Normalize the training process sample $X$ and training quality samples $Y$ to the zero mean and unit variance.

Obtain the coefficient matrix $M$ of KPLS by equation (15) and get $Y_{p}$ using equation (16).

Calculate $x' s$ k-NN distance $D_{x}^{2} (x)$ and $y_{p}' s$ k-NN distance $D_{y}^{2} (y_{p})$ to get the thresholds $D_{x, α}^{2}$ and $D_{y, α}^{2}$ .

Online detection

For a new incoming test sample $x_{new}$ :

Obtain $y_{p, new}$ using equation (17).

Compute the statistics $D_{x}^{2} (x_{new})$ and $D_{y}^{2} ({y_{p,}}_{new})$ .

Detection logic

If $D_{x_{new}}^{2} \leq D_{x, α}^{2}$ and $D_{y_{new}}^{2} \leq D_{y, α}^{2}$ , the system is fault-free.

If $D_{y_{new}}^{2} > D_{y, α}^{2}$ , the system has quality-related faults.

If $D_{y_{new}}^{2} \leq D_{y, α}^{2}$ and $D_{x_{new}}^{2} > D_{x, α}^{2}$ , the system has some quality-unrelated faults.

Notice that our KPLS-k-NN is essentially a supervised quality-related fault detection method, which is designed by combining KPLS with FD-k-NN. The above seems to be a simple combination, but FD-k-NN as an unsupervised method is successfully applied to complete a supervised fault detection task.

The proposed MVCk-NN for fault isolation

The KPLS-k-NN-based fault detection approach has been presented in the above section, which can effectively judge whether there are some faults in the process and whether the faults are related to product quality. Next, when KPLS-k-NN indicates the system exists some faults, a fault isolation method matched with KPLS-k-NN will be needed to locate faulty variables. Here, we propose a new fault isolation method MVCk-NN in detail.

In general, the sensor fault as a kind of system faults is classified as the additive fault. Hence, a process sample $x \in R^{m}$ can be expressed as

x = x^{*} + f = x^{*} + \sum_{i = 1}^{m} ξ_{i} f_{i}

(18)

where $x^{*} \in R^{m}$ denotes the fault-free component of $x$ and $f \in R^{m}$ denotes the fault component. If $f = 0$ , $x$ is a normal sample, otherwise a faulty sample. $ξ_{i} \in R^{m} (i = 1, \dots m)$ is the fault-direction indicator vector of the ith process variable among $m$ variables, of which the ith element is $1$ and other elements are $0$ .

To locate which sensors cause the statistics $D_{x}^{2}$ to alarm faults, the contributions from all variables of $x$ to $D_{x}^{2}$ need to be calculated. We decompose $D_{x}^{2}$ as follows

D_{x}^{2} = \frac{1}{k} \sum_{j = 1}^{k} ‖ x - x_{j} ‖_{2}^{2} = \sum_{i = 1}^{m} {\frac{1}{k} \sum_{j = 1}^{k} [ξ_{i}^{T} (x - x_{j})]}

(19)

Then, define the contribution from ith variable of $x$ to $D_{x}^{2}$ as

C (x, i) = \frac{1}{k} \sum_{j = 1}^{k} [ξ_{i}^{T} (x - x_{j})]

(20)

Obviously, by equations (19) and (20), $D_{x}^{2}$ is the sum of the contributions of all variables, namely, $D_{x}^{2} = \sum_{i = 1}^{m} C (x, i)$ .

Next, we discuss the influence of the fault magnitude on variable contributions. Set $x$ include at least one fault. Without loss of generality, we assume the ith variable be faulty, and others be normal or faulty, that

x = x^{*} + ξ_{i} f_{i} + \sum_{j = 1, j \neq i}^{m} ξ_{j} f_{j} (f_{i} \neq 0)

(21)

By equations (20) and (21), we have

C (x, i) = \frac{1}{k} \sum_{j = 1}^{k} [ξ_{i}^{T} (x^{*} - x_{j} + ξ_{i} f_{i} + \sum_{j = 1, j \neq i}^{m} ξ_{j} f_{j})]

(22)

According to neighborhood relationship, a reasonable hypothesis is given³⁷ as follows.

Hypothesis 1

Any variable’s fault magnitude is much larger than the Euclidean distance between the variable and its neighbors on the same dimension, that

‖ f_{i} ‖_{2} \geq ‖ {[x^{*}]}_{i} - {[x_{j}]}_{i} ‖_{2}

(23)

where $[\cdot]_{i}$ represents the ith component of vector.

Due to $ξ_{i}^{T} ξ_{i} = 1$ , $ξ_{i}^{T} ξ_{j} = 0, \forall j \neq i$ , and Hypothesis 1, it can be obtained that

C (x, i) \approx \frac{1}{k} \sum_{j = 1}^{k} f_{i}^{2} = f_{i}^{2}

(24)

It indicates that the contribution of each fault variable to the statistics is approximately equal to the square of the fault magnitude, and this method hardly suffers from smearing effect when the Hypothesis 1 is satisfied.

The above are VCk-NN-based variable contributions, but there exists a drawback that Hypothesis 1 sometimes cannot be enough satisfied, which causes that the contribution of fault variables may not be significantly different from that of normal variables, so as to increase the smearing effect.

Therefore, our proposed MVCk-NN introduces the idea of ARCR³⁵ into VCk-NN to obtain the relative variable contributions instead of absolute ones, so that the smearing effect is further eliminated. The detailed steps are illustrated as follows:

1. For a new sample $x_{new} \in R^{m}$ , when $m = 1$ , it means process variable is a single source variable. In this case, once a fault occurs, the faulty variable must be this process variable. However, when $m \geq 2$ , we need to execute the following Steps 2–5 to isolate faulty process variables.

2. Normalize each $C (x_{new}, i)$ obtained by equation (20) to guarantee that all variables contribute more or less the same to $x_{new}$

\hat{C} (x_{new}, i) = \frac{C (x_{new}, i)}{\frac{1}{N} \sum_{j = 1}^{N} C (x_{j}, i)}

(25)

3. To eliminate the smearing effect, $\hat{C} (x_{new}, i)$ is divided by the sum of all the $m$ normalized contributions to obtain relative variable contribution $C_{r} (x_{new}, i)$

C_{r} (x_{new}, i) = \frac{\hat{C} (x_{new}, i)}{\sum_{i = 1}^{m} \hat{C} (x_{new}, i)}

(26)

4. The recommended experience threshold is given in Peng et al.³⁵

θ = \frac{{‖ C_{r} (x_{new}) ‖}_{2} + \frac{1}{m}}{2}

(27)

where $C_{r} (x_{new}) = [C_{r} (x_{new}, 1), \dots, C_{r} (x_{new}, m)]^{T}$ .

5. If $C_{r} (x_{new}, i)$ is more than $θ$ , the ith variable of $x_{new}$ can be considered as a root cause of faults, otherwise not.

Through the above derivation, the flow chart of the proposed k-NN-based quality-related non-linear fault diagnosis scheme is summarized in Figure 1.

Figure 1.

The quality-related non-linear fault diagnosis framework of k-NN.

Case study

This section applies a widely typical numerical and a real industrial Tennessee Eastman (TE) process benchmark to validate the effectiveness of our proposed method. Two fault evaluation indexes are adopted for performance evaluation. In the fault detection stage, our method KPLS-k-NN will be compared with the state-of-the-art approach total kernel projection to latent structures (TKPLS)²² and the most recent SVD-based non-linear method modified kernel least squares (MKLS)²³ to show its superiorities. In the fault isolation stage, our MVCk-NN will be compared with VCk-NN. Besides, in the experiment, the confidence level $α$ is set to 0.99.

Evaluation index

Two evaluation indexes—that is, the fault DR and the false alarm rate (FAR)—are used in our experiments, which are defined as follows

D R = \frac{Number of effective alarms}{Total faulty samples} \times 100 %

(28)

F AR = \frac{Number of false alarms}{Total faulty samples} \times 100 %

(29)

where an effective alarm represents a quality-related faulty sample is detected, while a false alarm represents a quality-unrelated faulty sample is detected. For performance evaluation, when a quality-related fault occurs, DR is adopted as the key indicator, while when a quality-unrelated fault happens, then FAR is used as the key indicator. More details on DR and FAR can be referred to Wang and Jiao.²³

Typical numerical example

The following numerical example introduced in Peng et al.²² is applied

{\begin{matrix} x_{1} ~ N ({1, 0.01}^{2}), x_{2} ~ N ({1, 0.01}^{2}) \\ x_{3} = \sin (x_{1}) + e_{1} \\ x_{4} = x_{1}^{2} - 3 x_{1} + 4 + e_{2} \\ x_{5} = x_{2}^{2} + \cos (x_{2}^{2}) + 1 + e_{3} \\ y = x_{3}^{2} + x_{3} x_{4} + x_{1} + v \end{matrix}

(30)

where $e_{i} ~ N (0, {0.001}^{2}) (i = 1, 2, 3)$ , $v ~ N (0, 0 . 005^{2})$ , and $e_{i}$ and $v$ denote the noises. $x = [x_{1}, x_{2}, x_{3}, x_{4}, x_{5}]^{T}$ is the process variable, and $y$ is the quality variable.

From the above equation, we can see that $y$ can only be affected by $x_{1}$ , $x_{3}$ , and $x_{4}$ , but not by $x_{2}$ and $x_{5}$ . Hence, when a fault occurs in $x_{1}$ , it will have influence on quality variable $y$ . While a fault happens in $x_{2}$ , it will not do anything to $y$ . We generate 400 normal samples as training samples for offline modeling, 400 normal samples as validation samples for selecting the hyper-parameters, and 400 test samples which include 200 normal samples and 200 faulty samples for online detection. The fault scenarios are as follows:

Fault 1: step bias occurs in $x_{1}$ : $x_{1} = x_{1}^{*} + f$ .

Fault 2: ramp change occurs in $x_{1}$ : $x_{1} = x_{1}^{*} + (t - 200) f$ .

Fault 3: step bias occurs in $x_{2}$ : $x_{2} = x_{2}^{*} + f$ .

Fault 4: ramp change occurs in $x_{2}$ : $x_{2} = x_{2}^{*} + (t - 200) f$ , where $x_{1}^{*}$ and $x_{2}^{*}$ are the normal values of $x_{1}$ and $x_{2}$ , respectively, $t$ $(201 \leq t \leq 400)$ is the sequence number of the $t th$ sample, and $f$ is the fault magnitude. It can be seen that Fault 1 and Fault 2 are quality-related faults, while Fault 3 and Fault 4 are quality-unrelated faults.

The model parameters are $N = 400$ , $m = 5$ , and $l = 1$ . For the choice of hyper-parameters, that is, $A$ , $c$ , and $k$ , we utilize cross-validation method to determine them. In detail, in the normal state, several sets of parameters with the minimum of sum of FARs of process variables and quality variables will be selected as the alternative hyper-parameters. In addition, we also need to consider the predictive performance of KPLS meanwhile. Therefore, we select one from the alternative hyper-parameters that has the minimum prediction error as final hyper-parameters. In this experiment, the hyper-parameters are set as $A = 2$ , $c = 2 \times 10^{4}$ , and $k = 5$ . The significance level $α$ is 0.01. To make the experiment more comprehensive, the magnitude of Fault 1 is changed in turn that $f = 0.2, 0.4, 0.6, 0.8$ , while for Fault 2, $f = 0.002, 0.003, 0.004, 0.005$ .

The results of quality-related faults

Fault detection

The detection results for Fault 1 and Fault 2 of KPLS-k-NN, MKLS, and TKPLS are displayed in Table 1. As shown in Table 1, the DRs of three methods to detect Fault 1 are all $100 %$ , and the DRs of three methods for Fault 2 are almost above $90 %$ . So, three methods all give right detection results.

Table 1.

Detection results of KPLS-k-NN, MKLS, and TKPLS for quality-related Fault 1 and Fault 2 (%).

Fault 1								Fault 2
f	KPLS-k-NN		MKLS		TKPLS		f	KPLS-k-NN		MKLS		TKPLS
	$D_{y}^{2}$	$D_{x}^{2}$	$T_{mkls}^{2}$	$Q_{mkls}$	$T_{ky}^{2} & Q_{kr}$	$T_{ko}^{2} & T_{kr}^{2}$		$D_{y}^{2}$	$D_{x}^{2}$	$T_{mkls}^{2}$	$Q_{mkls}$	$T_{ky}^{2} & Q_{kr}$	$T_{ko}^{2} & T_{kr}^{2}$
0.2	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	0.002	${(92)}^{a}$	${(93.5)}^{a}$	${(95)}^{a}$	${(93)}^{a}$	${(94.5)}^{a}$	${(86.5)}^{a}$
0.4	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	0.003	${(96)}^{a}$	${(96.5)}^{a}$	${(97)}^{a}$	${(96.5)}^{a}$	${(96.5)}^{a}$	${(93)}^{a}$
0.6	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	0.004	${(97.5)}^{a}$	${(97.5)}^{a}$	${(98.5)}^{a}$	${(98)}^{a}$	${(98)}^{a}$	${(92.5)}^{a}$
0.8	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	0.005	${(97)}^{a}$	${(97.5)}^{a}$	${(99)}^{a}$	${(97.5)}^{a}$	${(98.5)}^{a}$	${(95.5)}^{a}$

KPLS-k-NN: kernel partial least squares-k-nearest neighbor.

where “a” refers to DRs.

Furthermore, without loss of generally, we set $f = 0.4$ in Fault 1 and observe the statistics values of three methods. From Figure 2, it is obvious that for these three methods, almost of all statistics of normal samples are below the threshold, while those faulty samples are significantly above the threshold. Therefore, three methods all have a good detection ability for quality-related faults.

Figure 2.

Detection results of quality-related Fault 1 with $f = 0.4$ by (a) KPLS-k-NN, (b) MKLS, and (c) TKPLS.

Fault isolation

When $f = 0.4$ , the Fault 1 isolation results of MVCk-NN and VCk-NN are shown in Figure 3. According to the setting of Fault 1, variable $x_{1}$ is the root cause of Fault 1, while $x_{3}$ and $x_{4}$ are also affected by $x_{1}$ , but they are not root causes. In Figure 3(a), VCk-NN regards $x_{1}$ , $x_{3}$ , and $x_{4}$ as the root cause of Fault 1, while from Figure 3(b), MVCk-NN can only identify $x_{1}$ as the root cause of Fault 1. By contrast with VCk-NN, our proposed MVCk-NN has higher accuracy in the fault isolation task.

Figure 3.

Isolation results of quality-related Fault 1 with $f = 0.4$ by (a) MVCk-NN and (b) VCk-NN.

The results of quality-unrelated faults

Fault detection

The results to detect Fault 3 and Fault 4 using three methods are shown in Table 2. Because the quality-unrelated statistics of these three approaches are all over $90 %$ , they can all warn that some process faults have happened in the system. The FARs of the statistics using TKPLS are over $90 %$ , so TKPLS fails to identify the true natures of these faults. For MKLS, although its FARs of the statistics $T_{mkls}^{2}$ are all less than $1 %$ , they are still not equal to $0$ . However, using KPLS-k-NN, the FARs of the statistics $D_{y}^{2}$ are all $0$ , so it achieves the best performance in comparison methods.

Table 2.

Detection results of KPLS-k-NN, MKLS, and TKPLS for quality-unrelated Fault 3 and Fault 4 (%).

Fault 1								Fault 2
f	KPLS-k-NN		MKLS		TKPLS		f	KPLS-k-NN		MKLS		TKPLS
	$D_{y}^{2}$	$D_{x}^{2}$	$T_{mkls}^{2}$	$Q_{mkls}$	$T_{ky}^{2} & Q_{kr}$	$T_{ko}^{2} & T_{kr}^{2}$		$D_{y}^{2}$	$D_{x}^{2}$	$T_{mkls}^{2}$	$Q_{mkls}$	$T_{ky}^{2} & Q_{kr}$	$T_{ko}^{2} & T_{kr}^{2}$
0.2	${(0)}^{b}$	${(100)}^{a}$	${(0)}^{b}$	${(100)}^{a}$	${(100)}^{b}$	${(100)}^{a}$	0.002	${(0)}^{b}$	${(92)}^{a}$	${(1)}^{b}$	${(92)}^{a}$	${(91.5)}^{b}$	${(92)}^{a}$
0.4	${(0)}^{b}$	${(100)}^{a}$	${(0.5)}^{b}$	${(100)}^{a}$	${(100)}^{b}$	${(100)}^{a}$	0.003	${(0)}^{b}$	${(96)}^{a}$	${(0.5)}^{b}$	${(96.5)}^{a}$	${(96)}^{b}$	${(96)}^{a}$
0.6	${(0)}^{b}$	${(100)}^{a}$	${(0.5)}^{b}$	${(100)}^{a}$	${(100)}^{b}$	${(100)}^{a}$	0.004	${(0)}^{b}$	${(95.5)}^{a}$	${(0.5)}^{b}$	${(95.5)}^{a}$	${(95.5)}^{b}$	${(96)}^{a}$
0.8	${(0)}^{b}$	${(100)}^{a}$	${(0)}^{b}$	${(100)}^{a}$	${(100)}^{b}$	${(100)}^{a}$	0.005	${(0)}^{b}$	${(97)}^{a}$	${(0.5)}^{b}$	${(97)}^{a}$	${(96.5)}^{b}$	${(97.5)}^{a}$

KPLS-k-NN: kernel partial least squares-k-nearest neighbor.

where “a” refers to DRs and “b” refers to FARs.

We set $f = 0.003$ in Fault 2 to valid the fault detection and isolation performance for quality-unrelated faults. The values of three methods’ statistics are shown in Figure 4. From Figure 4, when Fault 2 occurs, quality-unrelated statistics of three methods are all over their own thresholds, so they all indicate some faults have occurred. Because $Q_{kr}^{2}$ of TKPLS is almost higher than its threshold in all faulty samples, it misunderstands that the faults are quality-related, so it fails. Since quality-related statistics of both the KPLS-k-NN method and MKLS are lower than the thresholds, they both can give the correct results: the system exists some quality-unrelated faults. Compared with $T_{mkls}^{2}$ of MKLS, it is very obvious that $D_{y}^{2}$ of KPLS-k-NN can achieve a better margin, because it is much lower than the threshold. Therefore, for quality-unrelated faults, KPLS-k-NN can make a clearer division between quality-related and quality-unrelated samples.

Figure 4.

Detection results of quality-unrelated Fault 2 with $f = 0.003$ by (a) KPLS-k-NN, (b) MKLS, and (c) TKPLS.

Fault isolation

When $f = 0.003$ , the Fault 2 isolation results of two methods are displayed in Figure 5. Variable $x_{2}$ is the root cause of Fault 2. Although $x_{5}$ is not root cause, it will be affected by $x_{2}$ . In Figure 3, VCk-NN considers $x_{2}$ and $x_{5}$ both as root cause variables, while our MVCk-NN only determines that $x_{2}$ is the root cause of Fault 2, which is consistent with the fact.

Figure 5.

Isolation results of quality-related Fault 2 with $f = 0.003$ by (a) MVCk-NN and (b) VCk-NN.

Through the above analysis of the experimental results, our method KPLS-k-NN does a much excellent job for quality-related fault detection task than other comparison methods. Besides, when KPLS-k-NN indicates some faults happen, our proposed MVCk-NN has high accuracy in fault root cause diagnosis and has prominent advantages over VCk-NN.

TE benchmark

TE process is a real simulation benchmark of industrial process, which has been widely utilized for the simulation and verification of various control and MSPM approaches,³⁹ and its structure flowchart is shown in Figure 6. The variables in this process contain two blocks of variables: the XMV block of 11 manipulated variables and the XMEAS block of 41 measured variables which include 22 process and 19 analysis variables. In this simulation, 22 process variables (XMEAS (1–22)) and 11 manipulated variables (XMV (1–11)) are chosen to be process input $X$ , and select purge gas analysis component G (XMEAS (35)) as the quality output $Y$ .

Figure 6.

The structure flowchart of the TE process.

The training data set and validation data set contain 500 and 960 fault-free data samples, respectively. For the test data set, it is composed of 21 different fault sets, each of which includes 960 data samples and they are displayed in Table 3. The fault categories can be roughly divided into the following ones: step faults, random variation faults, slow drift faults, sticking faults, constant position faults, and some unknown faults. The detailed fault information is described in Downs and Vogel⁴⁰ and the website (http://depts.washington.edu/control/LARRY/TE/download.html).

Table 3.

Fault types in the TE process.

Fault	Description	Type
IDV(1)	A/C Feed ratio, B composition constant (Stream 4)	Step
IDV(2)	B composition, A/C ratio constant (Stream 4)	Step
IDV(3)	D feed temperature (Stream 2)	Step
IDV(4)	Reactor cooling water inlet temperature	Step
IDV(5)	Condenser cooling water inlet temperature	Step
IDV(6)	A feed loss (Stream 1)	Step
IDV(7)	C header pressure loss (Stream 4)	Step
IDV(8)	A, B, C feed composition (Stream 4)	Random variation
IDV(9)	D feed temperature (Stream 2)	Random variation
IDV(10)	C feed temperature (Stream 4)	Random variation
IDV(11)	Reactor cooling water inlet temperature	Random variation
IDV(12)	Condenser cooling water inlet temperature	Random variation
IDV(13)	Reactor kinetics	Slow drift
IDV(14)	Reactor cooling water valve	Sticking
IDV(15)	Condenser cooling water valve	Sticking
IDV(16)	Unknown	Unknown
IDV(17)	Unknown	Unknown
IDV(18)	Unknown	Unknown
IDV(19)	Unknown	Unknown
IDV(20)	Unknown	Unknown
IDV(21)	Valve (Stream 4)	Constant position

TE: Tennessee Eastman.

To classify the process faults into the category of affecting $Y$ and that of not affecting, the criterion of Zhou et al.¹⁶ is adopted, that is, if $n_{y} / n_{t} > 10 %$ , then the faults are considered to be quality-related, otherwise quality-unrelated, where $n_{y} / n_{t}$ denotes the affected rate of $Y$ . According to the validation data set, the model parameters are $N = 500$ , $m = 33$ , and $l = 1$ and the hyper-parameters are set as $A = 10$ , $c = 2 \times 10^{4}$ , and $k = 10$ .

The results of fault detection

For the quality-related faults, detection results of KPLS-k-NN, MKLS and TKPLS are presented in Table 4. We can see that TKPLS performs better than KPLS-k-NN and MKLS since all its FDRs are higher than the corresponding ones of the other methods. However, KPLS-k-NN and MKLS still provide satisfactory results, with most of their statistics above being $10 %$ . The DRs of KPLS-k-NN are significantly superior to MKLS in IDV(6) and IDV(18).

Table 4.

Detection results of KPLS-k-NN, MKLS, and TKPLS for quality-related faults in the TE process (%).

Fault number	$n_{y} / n_{t}$	KPLS-k-NN		MKLS		TKPLS
		$D_{y}^{2}$	$D_{x}^{2}$	$T_{mkls}^{2}$	$Q_{mkls}$	$T_{ky}^{2} & Q_{kr}$	$T_{ko}^{2} & T_{kr}^{2}$
IDV(1)	22.72	${(82.25)}^{a}$	${(99.75)}^{a}$	${(34.88)}^{a}$	${(99.88)}^{a}$	${(99.75)}^{a}$	${(99.75)}^{a}$
IDV(2)	72.16	${(85.75)}^{a}$	${(98.63)}^{a}$	${(86.88)}^{a}$	${(98.50)}^{a}$	${(98.50)}^{a}$	${(98.75)}^{a}$
IDV(5)	14.98	${(13.88)}^{a}$	${(36.63)}^{a}$	${(23.50)}^{a}$	${(33.38)}^{a}$	${(33.38)}^{a}$	${(33.25)}^{a}$
IDV(6)	95.63	${(94.13)}^{a}$	${(100)}^{a}$	${(17.75)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(99.25)}^{a}$
IDV(7)	18.23	${(29.88)}^{a}$	${(100)}^{a}$	${(32.63)}^{a}$	${(100)}^{a}$	${(100)}^{a}$	${(100)}^{a}$
IDV(8)	58.93	${(62.25)}^{a}$	${(98.63)}^{a}$	${(79.00)}^{a}$	${(97.75)}^{a}$	${(98.50)}^{a}$	${(97.75)}^{a}$
IDV(10)	11.99	${(60.12)}^{a}$	${(63.88)}^{a}$	${(26.75)}^{a}$	${(59.38)}^{a}$	${(73.25)}^{a}$	${(87.25)}^{a}$
IDV(12)	61.67	${(67.38)}^{a}$	${(99.63)}^{a}$	${(76.63)}^{a}$	${(99.25)}^{a}$	${(99.50)}^{a}$	${(99.13)}^{a}$
IDV(13)	66.67	${(83.00)}^{a}$	${(95.38)}^{a}$	${(79.13)}^{a}$	${(95.25)}^{a}$	${(95.50)}^{a}$	${(94.75)}^{a}$
IDV(18)	85.39	${(86.00)}^{a}$	${(90.88)}^{a}$	${(24.38)}^{a}$	${(90.50)}^{a}$	${(91.13)}^{a}$	${(89.88)}^{a}$
IDV(20)	12.48	${(10.00)}^{a}$	${(65.25)}^{a}$	${(28.38)}^{a}$	${(63.50)}^{a}$	${(59.75)}^{a}$	${(50.25)}^{a}$
IDV(21)	24.97	${(46.88)}^{a}$	${(50.75)}^{a}$	${(30.50)}^{a}$	${(48.63)}^{a}$	${(64.00)}^{a}$	${(42.88)}^{a}$

KPLS-k-NN: kernel partial least squares-k-nearest neighbor.

where “a” refers to DRs.

For the quality-unrelated faults, Table 5 gives the detection results. We can see the corresponding statistics $D_{x}^{2}$ , $Q_{mkls}$ , and $T_{ko}^{2} & T_{kr}^{2}$ of all comparative methods have very close values, indicating that the three statistics are equally effective to detect system faults. TKPLS fails to judge the correlation between faults and quality, because its statistics for these faults are all above $10 %$ . For difficult-to-detect faults IDV(9), IDV(15)–IDV(17), both KPLS-k-NN and MKLS fail, but they enable to give correct results for the other faults. Compared with MKLS, KPLS-k-NN performs much better, because the FARs of KPLS-k-NN is much less than MKLS. Because KPLS-k-NN has good detection ability for different 21 kinds of faults, it indicates that the model trained by KPLS-k-NN has perfect generalization ability.

Table 5.

Detection results of KPLS-k-NN, MKLS, and TKPLS for quality-unrelated faults in the TE process (%).

Fault number	$n_{y} / n_{t}$	KPLS-k-NN		MKLS		TKPLS
		$D_{y}^{2}$	$D_{x}^{2}$	$T_{mkls}^{2}$	$Q_{mkls}$	$T_{ky}^{2} & Q_{kr}$	$T_{ko}^{2} & T_{kr}^{2}$
IDV(3)	7.74	${(1.13)}^{b}$	${(14.37)}^{a}$	${(7.25)}^{b}$	${(14.00)}^{a}$	${(15.38)}^{b}$	${(6.88)}^{a}$
IDV(4)	6.74	${(1.88)}^{b}$	${(99.75)}^{a}$	${(5.63)}^{b}$	${(98.50)}^{a}$	${(87.50)}^{b}$	${(99.50)}^{a}$
IDV(9)	4.99	${(0.75)}^{b}$	${(11.13)}^{a}$	${(5.63)}^{b}$	${(10.88)}^{a}$	${(12.13)}^{b}$	${(7.12)}^{a}$
IDV(11)	9.99	${(3.13)}^{b}$	${(78.00)}^{a}$	${(9.50)}^{b}$	${(74.00)}^{a}$	${(70.13)}^{b}$	${(75.63)}^{a}$
IDV(14)	6.24	${(7.63)}^{b}$	${(100)}^{a}$	${(3.38)}^{b}$	${(100)}^{a}$	${(100)}^{b}$	${(100)}^{a}$
IDV(15)	6.24	${(3.13)}^{b}$	${(19.25)}^{a}$	${(10.50)}^{b}$	${(14.63)}^{a}$	${(18.38)}^{b}$	${(12.63)}^{a}$
IDV(16)	3.75	${(39.25)}^{b}$	${(59.88)}^{a}$	${(20.00)}^{b}$	${(52.50)}^{a}$	${(63.13)}^{b}$	${(74.38)}^{a}$
IDV(17)	9.49	${(32.75)}^{b}$	${(95.50)}^{a}$	${(13.25)}^{b}$	${(93.50)}^{a}$	${(95.25)}^{b}$	${(90.38)}^{a}$
IDV(19)	8.36	${(0.50)}^{b}$	${(26.13)}^{a}$	${(1.00)}^{b}$	${(19.88)}^{a}$	${(43.13)}^{b}$	${(13.63)}^{a}$

KPLS-k-NN: kernel partial least squares-k-nearest neighbor.

where “a” refers to DRs and “b” refers to FARs.

The results of fault isolation

We select IDV(1) to verify the fault isolation effect of MVCk-NN by compared with VCk-NN. IDV(1) is a step fault which represents the change of A/C feed ration. The root cause of IDV(1) is the variable of $x_{25}$ . When IDV(1) happens, variables $x_{1}$ , $x_{26}$ , $x_{4}$ , and $x_{18}$ will deviate from the normal operation state in turn because of the influence of $x_{25}$ . Figure 7 displays the relative variable contributions of all process variables from 161th sample point to 960th sample point. From Figure 7(a), we can see that for the most part, VCk-NN identifies $x_{25}$ , $x_{1}$ , $x_{4}$ , and $x_{18}$ as the root cause of IDV(1). Whereas in Figure 7(b), MVCk-NN only needs to consider variables $x_{25}$ and $x_{1}$ as the root cause. Therefore, our proposed MVCk-NN has a relatively better ability to locate the variables causing faults than VCk-NN.

Figure 7.

Fault isolation results for IDV(1) by (a) MVCk-NN and (b) VCk-NN.

According to the above results, we can conclude that the proposed method is effective in quality-related non-linear fault diagnosis and has prominent advantages over traditional methods. Compared with the SVD-based methods, KPLS-k-NN directly utilizes the relationship among the predicted quality samples to design the statistics, which considers the information of the nearest neighbor structure. Besides, KPLS-k-NN directly detects the process space, avoiding the problem that the residual statistics may have a large component of variance, such as MKLS. Moreover, compared with VCk-NN, the proposed MVCk-NN has advantages in both variable contribution and threshold setting.

Conclusion

In this article, we present a novel quality-related non-linear fault diagnosis method based on k-NN, which is especially suitable for non-linear industrial processes with relatively small training samples. It consists of a new quality-related fault detection method KPLS-k-NN and a new fault isolation method MVCk-NN. First, KPLS is applied to establish a regression model between process variables and quality variables in order to obtain predictive quality samples. Then, FD-k-NN method is used to design two corresponding statistics, that is, $D_{x}^{2}$ and $D_{y}^{2}$ for the process space and the predicted quality space, respectively. They will make a judgment on whether there is a fault happening in the system, and whether the fault is related to quality when a fault occurs. When KPLS-k-NN detects faults, in order to locate root cause variables of faults, MVCk-NN is proposed by introducing the idea of ARCR into VCk-NN. Finally, the proposed method is applied to detect the faults in case study and the effectiveness is validated. Due to the superior fitting ability of the deep neural networks in comparison to KPLS, further work will focus on how to combine deep neural networks with k-NN rule to detect quality-related faults.

Footnotes

Handling Editor: Yanjiao Chen

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (No. 61976213, No. 61772525 and No. 61906191).

ORCID iD

Zelin Ren

References

Jiang

Yan

Huang

Review and perspectives of data-driven distributed monitoring for industrial plant-wide processes. Ind Eng Chem Res 2019; 58(29): 12899–12912.

Yin

Ding

Xie

, et al. A review on basic data-driven approaches for industrial process monitoring. IEEE Trans Ind Electron 2014; 61(11): 6418–6428.

Zhang

Peng

Chu

, et al. Implementing multivariate statistics-based process monitoring: a comparison of basic data modeling approaches. Neurocomputing 2018; 290: 172–184.

Wise

Richer

Veltkamp

, et al. A theoretical basis for the use of principal component models for monitoring multivariate processes. Proc Contr Qual 1990; 1(1): 41–51.

Xiu

Yang

Kong

, et al. Laplacian regularized robust principal component analysis for process monitoring. J Proc Contr 2020; 92: 212–219.

Xiu

Yang

Kong

, et al. Data-driven process monitoring using structured joint sparse canonical correlation analysis. IEEE Trans Circ Syst II Exp Brief 2021; 68(1): 361–365.

Ren

Zhang

A deep nonnegative matrix factorization approach via autoencoder for nonlinear fault detection. IEEE Trans Ind Inform 2020; 16(8): 5042–5052.

Xiu

Fan

Yang

, et al. Fault detection using structured joint sparse nonnegative matrix factorization. IEEE Trans Instrum Meas 2021; 70: 1–11.

Yan

Quality-driven autoencoder for nonlinear quality-related and process-related fault detection based on least-squares regularization and enhanced statistics. Ind Eng Chem Res 2020; 59(26): 12136–12143.

10.

Zhang

Shardt

YAW

Chen

, et al. A KPI-based process monitoring and fault detection framework for large-scale processes. ISA Trans 2017; 68: 276–286.

11.

Wang

Zhou

Chen

, et al. Weighted part mutual information related component analysis for quality-related process monitoring. J Proc Contr 2020; 88: 111–123.

12.

Chen

De Luca

Technologies supporting artificial intelligence and robotics application development. J Artif Intell Technol 2021; 1(1): 1–8.

13.

Dong

Peng

A novel hierarchical detection and isolation framework for quality-related multiple faults in large-scale processes. IEEE Trans Ind Electron 2020; 67(2): 1316–1327.

14.

Jiao

Zhao

Wang

, et al. A nonlinear quality-related fault detection approach based on modified kernel partial least squares. ISA Trans 2017; 66: 275–283.

15.

Yao

Shao

Hierarchical quality monitoring for large-scale industrial plants with big process data. IEEE Trans Neur Netw Learn Syst 2021; 32(8): 3330–3341.

16.

Zhou

Qin

SJ.

Total projection to latent structures for process monitoring. AIChE J 2010; 56(1): 168–178.

17.

MacGregor

Jaeckle

Kiparissides

, et al. Process monitoring and diagnosis by multiblock PLS methods. AIChE J 1994; 40(5): 826–838.

18.

Dong

Zhang

Huang

, et al. Adaptive total PLS based quality-relevant process monitoring with application to the Tennessee Eastman process. Neurocomputing 2015; 154: 77–85.

19.

Yin

Ding

Zhang

, et al. Study on modifications of PLS approach for process monitoring. IFAC Proc Vol 2011; 44(1): 12389–12394.

20.

Yin

Zhu

Kaynak

Improved PLS focused on key-performance-indicator-related fault diagnosis. IEEE Trans Ind Electron 2015; 62(3): 1651–1658.

21.

Schölkopf

Smola

Müller

KR.

Nonlinear component analysis as a kernel eigenvalue problem. Neur Comput 1998; 10: 1299–1319.

22.

Peng

Zhang

Quality-related process monitoring based on total kernel PLS model and its industrial application. Math Probl Eng 2013; 2013: 707953.

23.

Wang

Jiao

A kernel least squares based approach for nonlinear quality-related fault detection. IEEE Trans Ind Electron 2017; 64(4): 3195–3204.

24.

Satapathy

Loganathan

Kondaveeti

, et al. Performance analysis of machine learning algorithms on automated sleep staging feature sets. CAAI Trans Intell Technol 2021; 6(2): 155–174.

25.

Ullah

Ahmad

Sana

, et al. Comparative study for machine learning classifier recommendation to predict political affiliation based on online reviews. CAAI Trans Intell Technol 2021; 6(3): 251–264.

26.

Doreswamy, Hooshmand

Gad

Feature selection approach using ensemble learning for network anomaly detection. CAAI Trans Intell Technol 2020; 5(4): 283–293.

27.

Wang

Fault detection using the k-nearest neighbor rule for semiconductor manufacturing processes. IEEE Trans Semiconduct Manuf 2007; 20(4): 345–354.

28.

Zhang

Jiang

, et al. Automated feature learning for nonlinear process monitoring–an approach using stacked denoising autoencoder and k-nearest neighbor rule. J Proc Contr 2018; 64: 49–61.

29.

Zhang

Gao

, et al. Fault detection strategy based on weighted distance of k-nearest neighbors for semiconductor manufacturing processes. IEEE Trans Semiconduct Manuf 2019; 32(1): 75–81.

30.

Harrou

Taghezouit

Sun

Improved kNN-based monitoring schemes for detecting faults in PV systems. IEEE J Photovolt 2019; 9(3): 811–821.

31.

Feng

MRS-kNN fault detection method for multirate sampling process based variable grouping threshold. J Proc Contr 2020; 85: 149–158.

32.

Song

Tan

Shi

, et al. Fault detection and diagnosis via standardized k-nearest neighbor for multimode process. J Tai Inst Chem Eng 2020; 106: 1–8.

33.

Kariwala

Odiowei

Cao

, et al. A branch and bound method for isolation of faulty variables through missing variable analysis. J Proc Contr 2010; 20(10): 1198–1206.

34.

Yan

Yao

Huang

, et al. Reconstruction-based multivariate process fault isolation using Bayesian Lasso. Ind Eng Chem Res 2018; 57(30): 9779–9787.

35.

Peng

Zhang

, et al. Contribution rate plot for nonlinear quality-related fault diagnosis with application to the hot strip mill process. Control Eng Pract 2013; 21(4): 360–369.

36.

Wang

Jiao

Yin

Efficient nonlinear fault diagnosis based on kernel sample equivalent replacement. IEEE Trans Ind Inform 2019; 15(5): 2682–2690.

37.

Zhou

Wen

Yang

Fault isolation based onk-nearest neighbor rule for industrial processes. IEEE Trans Ind Electron 2016; 63(4): 2578–2586.

38.

Parzen

On estimation of a probability density function and mode. Ann Math Stat 1962; 33(3): 1065–1076.

39.

Chiang

Russell

Braatz

RD.

Fault detection and diagnosis in industrial systems. Berlin: Springer Science + Business Media, 2000.

40.

Downs

Vogel

EF.

A plant-wide industrial process control problem. Comput Chem Eng 1993; 17(3): 245–255.

41.

Rosipal

Kernel partial least squares for nonlinear regression and discrimination. Neur Netw World 2003; 13: 291–300.

42.

Mugdadi

Ahmad

IA.

A bandwidth selection for kernel density estimation of functions of random variables. Comput Stat Data Anal 2004; 47(1): 49–62.

Quality-related fault diagnosis based on k -nearest neighbor rule for non-linear industrial processes

Abstract

Keywords

Introduction

Preliminaries

Methodology

The proposed KPLS-k-NN for fault detection

Fault detection in process space

Model building

Fault detection

Fault detection in predicted quality space

The whole scheme of KPLS-k-NN

Offline modeling

Online detection

Detection logic

The proposed MVCk-NN for fault isolation

Hypothesis 1

Case study

Evaluation index

Typical numerical example

The results of quality-related faults

Fault detection

Fault isolation

The results of quality-unrelated faults

Fault detection

Fault isolation

TE benchmark

The results of fault detection

The results of fault isolation

Conclusion

Footnotes

Declaration of conflicting interests

Funding

ORCID iD

References