Sage Journals: Discover world-class research

Abstract

Fluid catalytic cracking (FCC) is an important process in petroleum processing. Effective monitoring of the status and quality of FCC is vital. Accurate description of the relationship between process and quality variables is the basis of quality-driven monitoring. Many process variables affect the quality of FCC; some of these effects are linear, and others are nonlinear. We propose a combination method from the perspective of linearity and nonlinearity to improve the monitoring performance of FCC quality. Partial least squares (PLS) is initially used to extract linear features, and its residual space is saved as the input of the deep feedforward neural network (DFNN). DFNN is then used to extract nonlinear features for the further decomposition of subspaces. The PLS-DFNN method accurately describes processes involving linearity and nonlinearity. We construct three monitoring statistics to characterize the types of faults. The proposed method proves its excellent effect on a numerical simulation data set. It effectively distinguishes the types of faults on the Tennessee Eastman process data set, and the fault detection rate is superior to other related methods. Finally, we apply this method to the actual FCC and verify the superiority of this combination.

Keywords

Fluid catalytic cracking multivariate statistical process monitoring partial least squares deep feedforward neural network quality-driven fault detection

Introduction

Fluid catalytic cracking (FCC) is a vital process to lighten heavy oil in petroleum processing (Ancheyta et al., 2004; Vistisen and Zeuthen, 2008). FCC, as the main process to produce transportation fuel and provide part of low carbon olefin, has outstanding advantages and irreplaceable role. FCC has the advantages of strong feedstock adaptability, high yield of light oil products, and mature technology; it is therefore an important source of profit for oil refining enterprises at present (Han and Chung, 2001; Lz and Hdl, 2019). The deterioration of heavy crude oil, the rising market demand for light oil products, and the increasing pressure on clean fuel production and environment in recent years have accelerated the process of refining integration. Global refineries speed up the adjustment of process equipment structure, while the core position of FCC, as the main process conversion unit, remains unchanged (Li et al., 2020; Lin and Zeng, 2013). However, the process of FCC, which includes reaction regeneration, fractionation, absorption and stabilization, and desulfurization and denitrification systems, is complicated; each system also involves multiple towers (Lopez-Zamora and de Lasa, 2019). System performance is not only closely related to economic benefits but also may lead to potential safety hazards (Jiang et al., 2020b). Process monitoring has been highly valued for ensuring the long-term reliable operation of such complicated processes. Classical monitoring methods based on models or prior knowledge have failed to work well; thus, data-driven methods are bound to develop rapidly because of the wide use of new measurement technologies and the considerable progress in data mining (Bounoua et al., 2019; Ge, 2017; Jiang et al., 2019; Li and Feng, 2020; Theisen et al., 2021; Yao et al., 2022). Multivariate statistical process monitoring (MSPM) methods are fairly representative of data-driven methods. They extract low-dimensional features from high-dimensional data, and then, they establish monitoring statistics in low-dimensional spaces (Bounoua et al., 2019; Ge, 2017; Jiang et al., 2019). The two main methods are principal component analysis (PCA) (Kano et al., 2001) and partial least squares (PLS) (Qin and Zhou, 2010). The former performs orthogonal projection on the feature space, while the latter performs oblique projection on the feature space.

Quality-driven methods have elicited increasing attention in recent years. The process can be better controlled by monitoring whether quality has been influenced (Huang and Yan, 2019; Yan et al., 2019). However, obtaining quality variables is costly and time consuming; we cannot guarantee the real-time acquisition of quality variables in online monitoring. Thus, the quality information contained in process variables should be found in offline modeling. PLS is commonly used to establish the relationship between process variables and quality variables. PLS extracts components that reflect the regression relationship between process variables $X$ and quality variables $Y$ . $X$ are divided into the dominant space that can estimate $Y$ and the residual space, whereas $Y$ are divided depending on whether they can be predicted. In order to detect quality-relevant faults, Zhou et al. (2010) proposed total partial least squares (TPLS), in which the dominant space of $X$ is further divided into two parts depending on whether it can be estimated by $Y$ . Wang et al. (2016b) proposed principal component regression (PCR) to establish regression relationship between quality variables and potential variables. Li et al. (2019) improved PLS by using the coefficient matrix between $X$ and $Y$ to project $X$ to two orthogonal spaces that are non-correlated and highly correlated with $Y$ , respectively.

However, the FCC process consists of multiple units and a large number of process variables. The effects of these variables on the product quality are uncertain; some of these effects are linear, and others are complex nonlinear. The above linear methods may not be applicable to complex nonlinear processes. Therefore, kernel tricks are introduced. Peng et al. (2013) first proposed total kernel partial least squares (TKPLS) to deal with nonlinear process. Jiao et al. (2017) and Zhou et al. (2019) proposed modified kernel partial least squares (MKPLS) and kernel principal component regression (KPCR), respectively. Yan et al. (2019) tried to divide quality-relevant and quality-irrelevant groups based on self-organizing mapping and kernel methods (SOM-KPLS/KPCA). This splits the relationship between quality-relevant variables and quality-irrelevant variables. What’s more, the choice of kernel function is vital to the quality-related fault detection. The selection of kernel functions also lacks reasonable proof. Jiang et al. (2020a) proposed a data-driven parity relation residual generator for quality-related fault detection. However, this method has obvious limitations on fault strength. Yan and Yan (2021) proposed the maximum correlation neural network (MCNN) to capture the change trends of quality variables. However, the kernel function is still used to construct the nonlinear mapping of process variables.

Through the analysis of the above problems and inspired by the relevant combination methods (Fan et al., 2014; Huang and Yan, 2019; Lin et al., 2014), we hope to adopt a method that can not only better characterize the linear process but also deal with complex nonlinear problems without separating the relationship between quality-relevant variables and quality-irrelevant variables. Therefore, we combine PLS and deep feedforward neural network (DFNN) to establish a hybrid model for fault detection. The linear relationships between $X$ and $Y$ are extracted by PLS. The residual space, which still contains nonlinear relationships, will be handled by DFNN. We also pursue to divide subspaces appropriately because $X$ will be divided into the linear-correlated portion, nonlinear-correlated portion, and final residual space. We select the neural network instead of the kernel method for the convenience of processing new samples to solve the nonlinear problem. The structure of our DFNN is specially designed based on a feedforward neural network (FNN) and an auto encoder (AE). FNN methods, which are combined with feature extraction, have been widely used as a soft sensor in prediction and regression problems. AE, which is a special FNN, is considered to be an effective approach to extract features in nonlinear process monitoring. What’s more, due to the constraints of the AE, the DFNN can avoid over fitting the data to some extent. Finally, three statistics that characterize the types of faults are proposed to monitor three portions of $X$ .

The main innovations of the paper are as follows:

A “nonlinear PLS” DFNN structure is designed. Combining PLS and DFNN to decompose dominant space and residual space, it can effectively divide linearity subspace, nonlinearity subspace, and remaining residual space.

Three monitoring statistics are constructed to detect and classify faults, which can effectively distinguish linear quality-relevant, nonlinear quality-relevant, and quality-irrelevant faults.

The remainder of the paper is organized as follows. We provide a brief description of PLS, FNN, and AE in section “Preliminary work.” Section “Fault detection based on PLS-DFNN model” elaborates our PLS-DFNN model and its application in fault detection. We test our model on a numerical simulation data set, which is the Tennessee Eastman (TE) process data set, in section “Experiments and discussion.” We formally apply our model to the FCC process in section “Applications on FCC process.” We provide our conclusions in section “Conclusion.”

Preliminary work

PLS

We assume that the process variables $X_{m \times n}$ have m samples with n variables corresponding m quality variables $Y_{m \times r}$ with dimension r. PLS calculates the largest eigenvalue of $X^{T} Y Y^{T} X$ and its unit eigenvector $ω_{1}$ . The first component is gained by $t_{1} = X ω_{1}$ .

Then, the component is used to reconstruct $X$ and $Y$ by equations (1) and (2).

{\begin{cases} p_{1} = \frac{X^{T} t_{1}}{t_{1}^{T} t_{1}} \\ q_{1} = \frac{Y^{T} t_{1}}{t_{1}^{T} t_{1}} \end{cases}

(1)

{\begin{matrix} X = t_{1} p_{1}^{T} + E_{1} \\ Y = t_{1} q_{1}^{T} + F_{1} \end{matrix}

(2)

We treat $X$ and $Y$ as $E_{0}$ and $F_{0}$ , respectively; $E$ and $F$ will repeat the abovementioned process. Given the number of components k, $X$ and $Y$ are finally decomposed as equation (3).

{\begin{matrix} X = \sum_{i = 1}^{k} t_{i} p_{i}^{T} + E = T P^{T} + E \\ Y = \sum_{i = 1}^{k} t_{i} q_{i}^{T} + F = T Q^{T} + F \end{matrix}

(3)

To establish the relationship between $X$ and $T$ directly, the projection matrix $R = [r_{1}, r_{2}, . . ., r_{k}]$ is calculated by equation (4).

r_{i} = Π_{j = 1}^{i - 1} (I_{m} - ω_{j} p_{j}^{T}) ω_{i}

(4)

Two matrices, $R$ and $P$ , are retained in offline modeling. When obtaining new process variables $X_{new}$ in online monitoring, we can deduce the dominant and residual spaces by equation (5).

{\begin{matrix} T_{new} = X_{new} \cdot R \\ {\hat{X}}_{new} = T_{new} \cdot P^{T} \\ E_{new} = X_{new} - {\hat{X}}_{new} \end{matrix}

(5)

FNN and AE

FNN has only one hidden layer, which is fully connected with the input and output layers. No connection exists between the nodes of two layers that are not adjacent or between nodes of the same layer. The hidden layer has the maximum number of nodes, $u > n > r$ . Even if only one hidden layer is present, FNN can learn any relationship between the input and the output as long as it has sufficient nodes (Figure 1).

Figure 1.

FNN structure.

$b_{1}$ and $b_{2}$ are the input and output bias, and they are always replaced by a constant node. $W_{1}$ and $W_{2}$ are weights. The value of hidden and output layers can be calculated as follows

{\begin{matrix} h^{i} = φ (W_{1} x^{i} + b_{1}), i = 1, 2, . . ., m \\ {\hat{y}}^{i} = φ (W_{2} h^{i} + b_{2}), i = 1, 2, . . ., m \end{matrix}

(6)

where $φ$ is a nonlinear activation function such as the sigmoid function. Weights and biases are adjusted constantly based on the output error until satisfactory accuracy is achieved, and the error backpropagation (BP) algorithm is used to solve them.

AE is an FNN with a special structure that is widely used in unsupervised learning. First, its input and output layers are of the same sizes because the output needs to be the same as the input. Second, the hidden layer has the least number of nodes because its target is to extract features. The BP algorithm is also used for training the network.

Fault detection based on PLS-DFNN model

The structure of our DFNN is specially designed to imitate PLS. Thus, first, we detail its construction process. Second, we introduce the decomposition of subspaces based on our model. Third, we provide the calculation of monitoring statistics and threshold determinations. Finally, we show the entire scheme of fault detection.

DFNN structure

The construction of DFNN can be divided into four steps. Figure 2 is the diagram of each neural network. We initially construct a simple FNN with the assumption that the input is $X_{m \times n}$ and the target is $Y_{m \times r}$ . The hidden layer becomes an expression of $X$ that can best predict $Y$ to minimize the output error. We make the size of the hidden layer greater than the two others to simulate the projection to high-dimensional space and further improve the accuracy simultaneously. We obtain the hidden features $G_{m \times u}$ after the first round of training. Then, $G$ is placed into an AE as the input to extract low-dimensional features $H_{m \times v}$ in the second round of training. In the third network, we connect the FNN and the AE. The third round is for fine-tuning, and the weights and bias of the first two networks are saved and used as the initial value. Finally, $H$ is updated and used to reconstruct $X$ by another FNN. The nodes in each layer have the relationship $u > n > w > v > r$ . Compared with stacked AE, DFNN further extracts the information from $Y$ , and the final step of reconstruction helps obtain the residual space.

Figure 2.

DFNN structure.

Subspaces based on PLS-DFNN

First, process variables $X_{m \times n}$ and quality variables $Y_{m \times r}$ are decomposed by PLS. The number of the component $(k)$ can be determined by experience or by minimizing the error of the PLS by cross-validation. The result is shown in equation (7).

{\begin{matrix} X = T P^{T} + E = {\hat{X}}_{L} + E \\ Y = T Q^{T} + F = {\hat{Y}}_{L} + F \end{matrix}

(7)

Subscript L is the linear estimation, and $T_{m \times k}$ is the linear feature. We also calculate the projection matrix $R_{n \times k}$ by equation (4).

Residual spaces $E$ and $F$ are turned into the input and the target of DFNN to further extract nonlinear features. We can gain the nonlinear feature $H_{m \times v}$ by training the networks. The value of v is continually adjusted to determine the best accuracy and the size of features. The reconstruction result $\hat{E}$ and $\hat{F}$ are also gained, and the residual spaces are recalculated by equation (8).

{\begin{matrix} \bar{X} = E - \hat{E} \\ \bar{Y} = F - \hat{F} \end{matrix}

(8)

We view $\hat{E}$ and $\hat{F}$ as the nonlinear estimation of $X$ and $Y$ , which are written as ${\hat{X}}_{NL}$ and ${\hat{Y}}_{NL}$ , respectively. Finally, $X$ and $Y$ are divided into three parts, as follows

{\begin{matrix} X = {\hat{X}}_{L} + {\hat{X}}_{NL} + \bar{X} \\ Y = {\hat{Y}}_{L} + {\hat{Y}}_{NL} + \bar{Y} \end{matrix}

(9)

Monitoring statistics and threshold determinations

We assume that only process variables can be obtained during online monitoring. Quality variables will be delayed. Thus, we use them to verify the monitoring results. We set three statistics to monitor the three subspaces of $X$ . They represent the parts of linear estimation of $Y$ , nonlinear estimation of $Y$ , and the residual, which can be expressed as $T_{L}^{2}$ , $T_{NL}^{2}$ , and Q, respectively. They can be calculated as follows

T_{Li}^{2} = t_{te}^{i} {(\frac{T_{tr}^{T} T_{tr}}{m - 1})}^{- 1} t_{te}^{iT}, i = 1, 2, \dots, m

(10)

T_{NLi}^{2} = {(h_{te}^{i} - {\bar{h}}_{tr})}^{T} (h_{te}^{i} - {\bar{h}}_{tr}), i = 1, 2, \dots, m

(11)

Q_{i} = {({\hat{x}}_{te}^{i} - x_{te}^{i})}^{T} ({\hat{x}}_{te}^{i} - x_{te}^{i}), i = 1, 2, \dots, m

(12)

where subscripts tr means the offline training samples and te means the online testing samples. $T_{te}$ , $H_{te}$ , and ${\hat{X}}_{te}$ can be calculated by the saved parameters in offline modeling, whereas $t_{te}^{i}$ , $h_{te}^{i}$ , and ${\hat{x}}_{te}^{i}$ are their ith rows, respectively. ${\bar{h}}_{tr}$ is the average of $h_{tr}^{i}$ .

We monitor the quality variables similarly as equation (10) to verify the results. The monitoring statistic is written as $T_{Y}^{2}$ .

T_{Yi}^{2} = y_{te}^{i} {(\frac{Y_{tr}^{T} Y_{tr}}{m - 1})}^{- 1} y_{te}^{iT}, i = 1, 2, \dots, m

(13)

where $y^{i}$ is the ith row of $Y$ . The effects of faults on quality can be changeable. We can further explore the relationship between them based on $T_{Y}^{2}$ .

The kernel density estimation (KDE) method (Chen et al., 2000; Gonzalez et al., 2015) is used to calculate threshold determinations. On the basis of the distribution of $T_{L}^{2}$ , $T_{NL}^{2}$ , Q, and $T_{Y}^{2}$ , four probability density functions are estimated by KDE. We integrate them and then determine the percentage $β$ . The values of the cumulative distribution functions at the probability $β$ are the threshold determinations.

When the monitoring statistics are calculated in online monitoring, they will be compared with threshold determinations. The samples with larger statistics than threshold determinations will be judged as a fault. If we can study prior knowledge before testing, then we can obtain the relationship between fault and quality to choose the correct monitoring statistics.

Scheme of fault detection

The detailed scheme of the proposed approach is shown in Figure 3. The implementation contains offline modeling and online monitoring. The detailed steps are summarized as follows:

Offline modeling:

Step 1: Normalize process and quality variables and save the parameters.

Step 2: Use PLS to extract linear features to obtain PLS dominant space and residual space.

Step 3: In PLS residual space, use DFNN to extract nonlinear features to obtain DFNN dominant space and residual space.

Step 4: Use formula equations (10)–(13) to obtain $T_{L}^{2}$ , $T_{NL}^{2}$ , Q, and $T_{Y}^{2}$ , and then KDE is used to calculate thresholds $δ_{T_{L}^{2}}$ , $δ_{T_{NL}^{2}}$ , $δ_{Q}$ , and $δ_{T_{Y}^{2}}$ .

Online monitoring:

Step 1: Use offline modeling parameters to normalize online process variables.

Step 2: Obtain $T_{L}^{2}$ , $T_{NL}^{2}$ , and Q of online samples with trained PLS-DFNN.

Step 3: Generate the detailed information when a fault occurs.

Because linear and nonlinear relationships may be difficult to distinguish in most cases, we design an “OR” logic as $T_{L}^{2} | | T_{NL}^{2}$ . If any one’s alarm occurs, then we think quality-relevant faults may exist. Subsequently, quality-irrelevant faults are detected by Q. If $T_{L}^{2} | | T_{NL}^{2}$ or Q exceeds threshold, we will consider it as a fault occurs. Finally, we can evaluate the validity of the statistics we selected based on $T_{Y}^{2}$ .

Figure 3.

Architecture of PLS-DFNN fault detection.

Experiments and discussion

Before our model is applied to FCC process, we test our model on two data sets. The first data set is a numerical simulation data set, and we prove the capability of our model to detect linear quality-relevant, nonlinear quality-relevant, and quality-irrelevant faults. The second data set is the TE process data set, and we compare our detection results with those of other related models to show the superiority of our model.

Numerical simulation data set

We initially set seven original variables $s_{1}, s_{2}, \dots, s_{7}$ . Their values are fixed, and 500 samples are used.

[\begin{matrix} 1.1 & 1.2 & 1.3 & 1.4 & 1.5 & 1.6 & 1.7 \\ | & | & | & | & | & | & | \\ 1.1 & 1.2 & 1.3 & 1.4 & 1.5 & 1.6 & 1.7 \end{matrix}]

(14)

Second, the process variables are the product of original variables and random matrix in equation (15).

{\begin{matrix} [m_{1}, m_{2}, m_{3}, m_{4}] = [s_{1}, s_{2}, s_{3}] \cdot [\begin{matrix} 1.05 & 1.72 & 1.90 & 1.21 \\ 1.72 & 1.11 & 1.33 & 1.66 \\ 1.37 & 1.52 & 1.42 & 1.10 \end{matrix}] \\ + [e_{1}, e_{2}, e_{3}, e_{4}] \\ [m_{5}, m_{6}, m_{7}, m_{8}] = [s_{3}, s_{4}, s_{5}] \cdot [\begin{matrix} 1.74 & 1.52 & 1.43 & 1.65 \\ 1.33 & 1.67 & 1.66 & 1.69 \\ 1.16 & 1.15 & 1.54 & 1.02 \end{matrix}] \\ + [e_{5}, e_{6}, e_{7}, e_{8}] \\ [m_{9}, m_{10}, m_{11}, m_{12}] = [s_{5}, s_{6}, s_{7}] \cdot [\begin{matrix} 1.29 & 1.39 & 1.07 & 1.10 \\ 1.84 & 1.89 & 1.23 & 1.92 \\ 1.53 & 1.31 & 1.63 & 1.83 \end{matrix}] \\ + [e_{9}, e_{10}, e_{11}, e_{12}] \end{matrix}

(15)

Third, we establish the quality variables according to equation (16).

{\begin{matrix} q_{1} = 1.43 \cdot m_{1} + 1.14 \cdot m_{2} + e_{13} \\ q_{2} = 1.53 \cdot m_{3} + 1.67 \cdot m_{4} + e_{14} \\ q_{3} = 1.53 \cdot \sin (m_{5}) + 1.46 \cdot m_{6} + e_{15} \\ q_{4} = 1.22 \cdot \sqrt{m_{7}} + 1.63 \cdot \log (m_{8}) + e_{16} \end{matrix}

(16)

The first two quality variables have a linear relationship with process variables, and the last two have a nonlinear relationship. Other process variables have nothing to do with quality. $e_{1}, e_{2}, \dots, e_{16}$ are different Gaussian noises with a mean value of 0 and a variance of 0.1, and they are randomly generated.

Our training set is as described above, and then, we start to obtain the testing sets by adding different noises to the process variables and set up different faults starting from the 201st sample. We set up three step changes with amplitudes of 0.04, 0.07, and 0.05 on $m_{1}$ , $m_{8}$ , and $m_{10}$ , respectively, to simulate three types of faults. Finally, we obtain three corresponding testing sets.

Extracting linear features by PLS

We want PLS to extract two components because we have two quality variables that have a linear relationship with process variables. If the two features can perfectly represent the two quality variables, then the project matrix should be equation (17). The actual project matrix is equation (18).

{[\begin{matrix} 0 & 0 & 0.68 & 0.73 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0.78 & 0.62 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}]}^{T}

(17)

{[\begin{matrix} - 0.10 & - 0.22 & - 0.62 & - 0.69 & 0.01 & 0.18 & - 0.15 & 0.01 & 0.15 & 0.01 & 0.05 & 0.05 \\ - 0.77 & - 0.58 & 0.18 & 0.13 & - 0.02 & - 0.01 & 0.03 & 0.07 & 0.06 & - 0.04 & 0.03 & 0.13 \end{matrix}]}^{T}

(18)

The results are close, which proves the effect of PLS. Then, we calculate residuals $E$ and $F$ as the input of DFNN.

Extracting nonlinear features by DFNN

The structures of two DFNNs that estimate $E$ and $F$ are 12-15-8-15-4 and 8-10-12, respectively. We do not set the number of nonlinear features to 2 because the nonlinear functions are different from our activation functions. We hope to adopt more hidden layer nodes to achieve higher accuracy. We set the maximum training epoch to 5000 and the accuracy requirement to 0.01 for matching the variance of noise.

Detecting results of three testing sets

We decide k by 10-fold cross-validation. The $β$ of KDE is 99.5%. The calculated monitoring statistics and threshold determinations are plotted in Figure 4. In an ideal status, the first 200 samples should be below the threshold determinations, and the last 300 samples should be above.

Figure 4.

Detection results of the numerical simulation data set: (a)–(c) Fault 1. (d)–(f) Fault 2. (g)–(i) Fault 3. (j)–(l) Verifying.

We calculate the false alarm rate (FAR) and the fault detection rate (FDR) as the standard to evaluate the performance of the model. Table 1 shows the results. The first fault occurs on $m_{1}$ , and it is linearly relevant to quality. Thus, $T_{L}^{2}$ should be the reference, and its performance is obviously the best. Then, the second fault should refer to $T_{NL}^{2}$ because $m_{8}$ is nonlinearly relevant to quality. Although Q provides good FDR, it causes inevitable FAR in Faults 1 and 2. However, Q shows an obvious advantage in the last quality-irrelevant fault.

Table 1.

FAR (%) and FDR (%) of three monitoring statistics on three faults.

Fault	$T_{L}^{2}$		$T_{NL}^{2}$		Q
Fault	FAR	FDR	FAR	FDR	FAR	FDR
1	0.0 ± 0.0	100.0 ± 0.00	9.50 ± 0.08	9.0 ± 0.15	4.50 ± 0.18	85.2 ± 1.75
2	0.1 ± 0.04	2.20 ± 0.16	2.50 ± 0.08	100.0 ± 0.00	10.50 ± 0.24	100.0 ± 0.00
3	0.1 ± 0.03	0.1 ± 0.05	1.00 ± 0.12	0.3 ± 0.21	4.00 ± 0.26	96.0 ± 1.47

Bold faced values are used to highlight the optimal values for the same experimental conditions.

Verification

The last line of Figure 4 shows the statistic $T_{Y}^{2}$ . Quality variables in Faults 1 and 2 suddenly change and exceed the threshold determination at the 201st sample, whereas $T_{Y}^{2}$ in Fault 3 is mostly below the threshold determination. This scenario is exactly what we establish, and it further demonstrates the effectiveness of $T_{Y}^{2}$ .

TE process data set

TE process simulates an actual chemical program. A total of 52 variables are set up in simulation, and they comprise 33 process variables that can be obtained in real time and 19 quality variables that must be analyzed (Downs and Vogel, 1993). The former is directly detected by process instruments; thus, they become $X$ . The latter represents the components of eight reactants; thus, they become $Y$ .

Faults of TE process data set

Chiang et al. (2000) improved the simulation process and sorted out 22 data sets, including normal condition, and 21 faults. Prior knowledge can provide great help for fault detection. We can classify those faults based on the description of each fault (Kong et al., 2018; Li et al., 2019; Wang and Jiao, 2017). Faults 1, 2, 5, 6, 7, 8, 10, 12, 13, and 21 are quality-relevant, and $T_{L}^{2} | | T_{NL}^{2}$ is used to monitor them because we cannot further obtain the linear or nonlinear information. Faults 3, 4, 9, 11, 14, and 15 are quality-irrelevant and monitored by Q. Given that other faults are unknown, we use $T_{L}^{2} | | T_{NL}^{2}$ and Q to compare their performance. We can also judge the correlation between unknown faults and quality variables.

Each of the 22 training sets has 500 samples, and each of the 22 testing sets has 960 samples. Our model is trained based on the normal training set and is tested on 21 fault testing sets. Those fault testing sets are particularly designed because the fault appears from the 161st sample; in this manner, FAR and FDR can be comprehensively examined.

Model establishment

When establishing the PLS model, we decide k by 10-fold cross-validation. The prediction error is minimized by the principle of $k = 4$ . The extent of dimensional reduction is large; thus, the dominant space of PLS cannot contain most of the information. The situation also proves the significance of the further division of the residual space. Our bold conjecture is that considerable information must still be extracted by DFNN.

The DFNN structures are 33-45-29-45-19 and 29-31-33. The maximum training epoch is 5000, and the accuracy requirement is 0.01. All of these factors are decided according to experience.

Fault detection and verification

We give FAR and FDR of all faults under $T_{L}^{2} | | T_{NL}^{2}$ and Q statistics. We compare the performance of our model with some quality-relevant fault monitoring approaches mentioned above. The comparison approaches include KPCR (Zhou et al., 2019), TPLS (Peng et al., 2013), SOM-KPLS/KPCA (Yan et al., 2019), MCNN (Yan and Yan, 2021), and a deep neural network method commonly used for quality monitoring—deep belief network (DBN) (Chadha and Schwung, 2017). Faults 3, 9, and 15 are consistently ignored because they barely have any effect on the overall process (Ge and Song, 2007; Tong et al., 2019). Except from them, the average FAR of all faults is controlled below 5%. Table 2 shows the FDRs under this premise. We use higher FDR in statistics to calculate the average for compared methods. The calculation is based on the type of fault for our method. In particular, TPLS has three different $T^{2}$ , and we show the best one.

Table 2.

FDRs (%) of seven methods on TE process data set.

Fault	KPCR		TPLS		SOM-KPLS/KPCA		DBN		MCNN		PLS-DFNN
Fault	${T_{y}}^{2}$	${T_{0}}^{2}$	$T^{2}$	$Q_{\max}$	$T_{y}^{2}$	$Q_{\max}$	$T^{2}$	Q	$T_{y}^{2}$	$T_{0}^{2}$	$T_{L}^{2} \| \| T_{NL}^{2}$	Q
1	43.6	99.8	99.1	99.9	27.6	98.8	99.0	99.9	26.6	100	99.6 ± 0.35	99.6 ± 0.24
2	77.8	98.8	98.4	97.9	98.3	100	98.0	98.8	95.5	98.4	98.5 ± 1.25	98.3 ± 0.69
5	17.1	26.9	21.2	19.0	13.8	29.8	21.9	36.3	6.5	100	48.8 ± 0.21	100 ± 0.0
6	91.8	99.0	99.5	100	94.9	99.1	99.0	100	96.5	100	100 ± 0.0	100 ± 0.0
7	29.4	100	98.6	100	42.5	100	99.6	100	17.8	100	100 ± 0.0	100 ± 0.0
8	84.9	99.8	92.6	94.4	87.1	100	94.0	97.8	51.4	98.0	99.4 ± 0.37	98.3 ± 0.98
10	22.9	53.6	30.4	21.9	10.8	94.8	18.0	56.0	14.8	89.6	81.6 ± 2.85	91.0 ± 1.96
12	80.5	99.8	93.0	92.1	92.0	100	92.1	98.8	40.9	99.9	99.9 ± 0.1	99.8 ± 0.13
13	83.0	95.4	92.4	94.7	85.4	96.8	88.0	95.9	59.4	95.3	95.0 ± 1.58	95.3 ± 1.69
18	87.9	98.9	89.0	90.1	88.6	100	88.4	90.6	82.6	90.6	89.4 ± 1.25	90.6 ± 2.14
21	19.2	64.1	44.6	45.7	27.1	67.6	36.8	50.4	11.4	57.0	72.1 ± 3.54	73.1 ± 3.21
Average	58.1	85.1	78.1 7	77.8	60.7	89.7	75.9	84.0	45.8	93.5	89.5 ± 1.05	95.1 ± 1.00
3	1.2	11.6	1.6	2.4	0.9	33.9	8.0	3.0	1.6	5.0	4.2 ± 2.14	27.5 ± 2.25
4	0.8	100	14.0	100	0.1	100	8.5	100	6.3	100	8.9 ± 0.86	100 ± 0.0
9	1.0	8.0	2.0	2.1	2.8	17.1	2.5	1.0	2.0	4.5	0.9 ± 0.57	20.0 ± 1.42
11	2.5	98.4	19.8	70.0	1.4	98.1	34.5	74.1	8.6	81.3	6.8 ± 0.89	98.6 ± 0.62
14	0.0	100	85.6	100	0.5	96.8	92.5	99.9	35.3	100	59.8 ± 0.85	100 ± 0.0
15	4.3	13.3	3.5	2.6	1.8	29.5	4.6	7.3	3.5	7.4	7.3 ± 3.74	29.4 ± 4.21
16	10.9	94.8	14.8	18.0	6.6	95.8	4.4	45.3	6.1	92.9	4.6 ± 1.59	95.9 ± 0.95
17	17.3	97.5	74.7	93.2	9.5	97.1	64.3	96.9	56.3	97.4	7.0 ± 0.78	97.9 ± 0.84
19	0.0	100	1.5	23.4	0.0	99.4	14.8	40.8	0.1	94.5	0.4 ± 0.15	98.6 ± 1.08
20	25.7	80.8	41.2	45.2	8.5	90.6	42.5	58.9	6.9	91.4	7.1 ± 1.24	90.8 ± 1.28
Average	6.4	70.4	25.9	45.7	3.2	75.8	27.7	52.7	12.7	67.4	10.7 ± 1.28	76.2 ± 1.27

Bold faced values are used to highlight the optimal values for the same experimental conditions.

According to the FDR of $T_{Y}^{2}$ , the table is divided into two parts: the upper part is quality-relevant, and the lower part is quality-irrelevant. In the first part, most of the faults obviously connect the quality. Therefore, we mainly focus on quality-relevant indicators $T_{L}^{2} | | T_{NL}^{2}$ . The first column of the comparison methods is also their respective quality-relevant indicator. By observing Table 2, it can be concluded that PLS-DFNN performs better than the comparison methods in quality-relevant faults. The performance of Faults 5 and 21 is not as good as others, and we will specifically analyze them. In the second part, the monitoring performance of the first column of almost all methods is not very good. This is because the quality is nearly unaffected. Therefore, we prefer to use the quality-irrelevant monitoring indicator Q to determine whether a quality-irrelevant fault occurs. The performance of PLS-DFNN proposed is also slightly better than that of other methods.

Figure 5 shows the result of Faults 5 and 21. The quality for Fault 5 is only affected for a period of time after the fault occurs, and then, it quickly returns to normal. $T_{L}^{2} | | T_{NL}^{2}$ follows the trend of $T_{Y}^{2}$ , and Q continues to provide a good monitoring effect to achieve 100% FDR when the quality becomes normal. The FDR of $T_{Y}^{2}$ for Fault 21 is extremely low because Fault 21 occurs gradually. Using $T_{L}^{2} | | T_{NL}^{2}$ to monitor this situation is reasonable.

Figure 5.

Detection and verification of results in TE process data set: (a) Fault 5 and (b) Fault 21.

Generally, our method effectively distinguishes quality-relevant and quality-irrelevant faults, which also reflects the rationality of decomposing the subspaces. Our method also provides better monitoring performance than the other methods.

Applications on FCC process

In this section, PLS, DFNN, and PLS-DFNN are applied to an actual FCC process, and their performance in process monitoring is analyzed and compared. All production data are derived from the FCC unit of a petrochemical enterprise in Beijing. The characteristics of the three methods above are compared and discussed using the data of the whole year in 2020.

FCC process

FCC unit is a secondary refining unit with heavy oil as raw material. The main products are dry gas, liquefied petroleum gas (LPG), gasoline, diesel, and slurry (Ancheyta et al., 2004; Han and Chung, 2001; Lin and Zeng, 2013; Vistisen and Zeuthen, 2008). The main unit consists of a reaction regenerator, a fractionator, two absorption towers, and a stabilizer tower. The flow chart is shown in Figure 6. After entering the unit, the raw material is preheated first and then sent to the reaction system for catalytic cracking reaction. The reaction oil and gas are sent to the fractionation system for separation to obtain crude products such as gasoline and diesel. The rich gas and crude gasoline at the top outlet of the fractionator enter the absorption and stabilization system for further separation to obtain dry gas, LPG, and stabilized gasoline (Chang et al., 2014; Naik et al., 2017; Wang et al., 2016a). Among these products, gasoline is the most important and the most productive fraction. Therefore, producing qualified gasoline is important for FCC process (Han and Chung, 2001). ASTM D86 and saturated vapor pressure are two important indexes of vehicle gasoline, and they reflect the evaporation performance of gasoline. The difficulty of gasoline engine starting is determined by 10% distillate temperature. A total of 50% distillate temperature determines the heating and acceleration times of gasoline engine. A total of 90% distillate temperature and final boiling point (FBP) determine whether gasoline can be completely evaporated and burned. Saturated vapor pressure determines the start-up loss of gasoline engine and plays an important role in the environment (Gupta et al., 2007). However, these data cannot be detected online in real time due to the difficulty of measurement and the inability of remote transmission of some instruments. In addition, we prefer to use process variables to reflect whether and what type of fault occurred in production conditions. Therefore, the data-driven method has more advantages.

Figure 6.

Flow chart of catalytic cracking process.

Fault detection of FCC process

According to the latest national standards for automotive gasoline, ASTM D86 and saturated vapor pressure should meet certain requirements, which are displayed in Table 3. The FCC process has failed when the produced gasoline does not meet these standards. We collected the data of the whole year in 2020, and 31 process variables can be used for state detection, as shown in Table 4. Quality variables are unconventional monitoring data. Thus, the data scale that can be collected is limited. After comprehensive consideration, 50% distillation temperature, FBP, and saturated vapor pressure are finally selected as quality variables. The saturated vapor pressure data are those on 1 May and solstice on 31 October. Given that ASTM D86 distillation curve and saturated vapor pressure are not measured simultaneously, we consider them as quality variables separately.

Table 3.

Standards of normal quality variable.

Automotive gasoline standard	Standard requirement
10% distillate temperature (°C)	$\leq 70$
50% distillate temperature (°C)	$\leq 110$
90% distillate temperature (°C)	$\leq 190$
FBP (°C)	$\leq 205$
Saturated vapor pressure $(KPa)$	45~85 (from 1 November to 30 April)40~65 (from 1 May to 31 October)

Table 4.

Process variables used in the FCC process.

No.	Variable
1	Riser reaction temperature
2	Feed temperature of raw material
3	Regenerator dilute phase temperature
4	Regenerator dense phase temperature
5	Riser reaction pressure
6	Regenerator pressure
7	Riser circulating oil flow
8	Cold residue flow
9	Hot residue flow
10	Total feed flow
11	Fractionator top temperature
12	Fractionator 20-layer temperature
13	Bottom temperature of fractionator
14	Overhead reflux of fractionator
15	Upper reflux of fractionator
16	Lower reflux of fractionator
17	Return temperature on fractionator
18	Lower return temperature of fractionator
19	Slurry return temperature
20	Upper reflux of slurry
21	Lower reflux of slurry
22	Absorption tower top temperature
23	Absorption tower bottom temperature
24	Desorption tower top temperature
25	Desorption tower bottom temperature
26	Reabsorption of tower top temperature
27	Reabsorption of tower bottom temperature
28	Stability tower top temperature
29	Stabilized tower bottom temperature
30	Gas return tower temperature of stabilizer
31	Overhead reflux of stabilizer

Faults of FCC process

According to the technical monthly report, the fault was caused mainly by the riser reaction temperature fluctuations. Faults have different effects on the three quality variables of 50% distillate temperature, FBP, and saturated vapor pressure, and the collected data correspond to different times. Thus, they can be regarded as three groups of faults, which are respectively called Faults 1, 2, and 3. Each training set consists of 500 samples. Table 5 lists the types of faults for the testing set and the number of samples where the faults occurred and ended. The fault types can be divided into quality-relevant and quality-irrelevant according to their influence on quality variables.

Table 5.

Description of the faults.

Qualityvariable	Faulttype	Total(st)	Faultstart (st)	Faultend (st)
Fault 1	quality-relevant	1247	471	830
Fault 2	quality-relevant	1247	449	840
Fault 3	quality-irrelevant	1147	422	783

st: sample time series.

Performance of PLS

We use 10-fold cross-validation to determine the number of components in PLS. Table 6 lists the prediction errors of selecting different quantity components for each fault. For the three kinds of faults, the prediction error is less than that in other cases when $k = 7, 11, 4$ , respectively. Figures 7(a), 8(a), and 9(a) show the unsatisfactory results of PLS method for three kinds of fault detection. As shown in Figure 7(a), although the FDR of PLS method on Q is excellent, it is FAR under normal conditions is up to 59.9%, which is intolerable. The detection performance of $T^{2}$ is also unacceptable. As shown in Figure 8(a), the FDR of PLS method is quite small in the case of either $T^{2}$ or Q, which indicates that this method has a poor successful alarm rate for such a fault. As shown in Figure 9(a), the FDR of PLS method on $T^{2}$ is only 3.0%. Moreover, although its FDR of Q can reach 75.1%, the detection performance is still poor compared with those of DFNN and PLS-DFNN. These findings prove that only extracting linear features cannot achieve the fault detection of complex industrial processes.

Table 6.

Cross-validation in PLS on the FCC process.

k	1	2	3	4	5	6	7	8	9	10
Prediction error
Fault 1	2.483	2.224	2.139	2.109	2.154	2.119	2.108	2.140	2.129	2.131
Fault 2	2.338	2.143	2.105	2.043	2.000	2.024	1.995	1.992	1.992	1.986
Fault 3	1.898	1.719	1.666	1.649	1.696	1.680	1.687	1.671	1.677	1.667
k	11	12	13	14	15	16	17	18	19	20
Prediction error
Fault 1	2.128	2.122	2.124	2.133	2.128	2.133	2.130	2.130	2.130	2.130
Fault 2	1.984	1.986	1.987	1.987	1.985	1.985	1.985	1.985	1.985	1.987
Fault 3	1.677	1.675	1.673	1.671	1.660	1.663	1.663	1.660	1.663	1.662
k	21	22	23	24	25	26	27	28	29	30
Prediction error
Fault 1	2.130	2.131	2.130	2.130	2.130	2.130	2.130	2.130	2.130	2.130
Fault 2	1.987	1.986	1.986	1.987	1.987	1.986	1.986	1.986	1.986	1.986
Fault 3	1.661	1.660	1.661	1.661	1.661	1.661	1.661	1.661	1.661	1.661

Bold faced values are used to highlight the optimal values for the same experimental conditions.

Figure 7.

Detection results of the FCC process on Fault 1 by (a) PLS, (b) DFNN, and (c) PLS–DFNN.

Figure 8.

Detection results of the FCC process on Fault 2 by (a) PLS, (b) DFNN, and (c) PLS–DFNN .

Figure 9.

Detection results of the FCC process on Fault 3 by (a) PLS, (b) DFNN, and (c) PLS–DFNN.

Performance of DFNN and PLS-DFNN

We keep the structures of DFNN consistent as 31-55-12-55-1 and 12-20-31 to test the performance of DFNN and PLS-DFNN. The maximum training epoch is 5000, and the accuracy requirement is 0.01. Figures 7(b), 8(b), and 9(b) are the results of DFNN corresponding to three kinds of fault detection. Meanwhile, Figures 7(c), 8(c), and 9(c) are the results of PLS-DFNN. Comparison statistics of the detection performance of the three methods are shown in Table 7. Through $T_{y}^{2}$ , we can conclude that Faults 1 and 2 are quality-relevant faults, while Fault 3 is quality-irrelevant. For Fault 1, the FAR/FDR of DFNN on $T^{2}$ and Q is better than that of PLS, which indicates the necessity of extracting nonlinear features. However, the detection performance of DFNN for Fault 2 is not as good as that of PLS, which means that direct extraction of nonlinear features may not bring good performance. PLS-DFNN is not only superior to the two methods above in terms of detection performance but also can more accurately determine what types of faults occur in the process. The $T_{NL}^{2}$ of Faults 1 and 2 has the best detection performance. Thus, Faults 1 and 2 can be considered nonlinear quality-relevant faults. Q of Fault 3 has the optimal detection performance, which means that Fault 3 is quality-irrelevant fault. Therefore, we can draw the conclusion that the division of subspace is crucial, and the combination of PLS and DFNN to reasonably extract features can better complete the detection task.

Table 7.

FARs (%) and FDRs (%) of three methods on the FCC process.

Faults	PLS		DFNN		PLS-DFNN
Faults	$T^{2}$	$Q$	$T^{2}$	$Q$	$T_{L}^{2}$	$T_{NL}^{2}$	$Q$	$T_{y}^{2}$
Fault 1	16.6/56.7	59.9/100	1.2/79.2	14.4/100	16.6 ± 0.65/56.7 ± 0.74	1.5 ± 0.24/99.7 ± 0.12	14.3 ± 2.26/100 ± 0.0	2.6 ± 0.54/100 ± 0.0
Fault 2	0.2/29.9	0.5/56.1	0.1/17.4	13.7/66.6	0.2 ± 0.04/29.9 ± 0.68	0.8 ± 0.11/83.2 ± 1.35	0.7 ± 0.15/31.1 ± 1.21	0.9 ± 0.27/99.5 ± 0.36
Fault 3	3.8/3.0	1.0/75.1	0.1/0	2.6/65.2	3.8 ± 0.54/3.6 ± 0.44	0.0 ± 0.0/7.7 ± 0.58	0.1 ± 0.02/81.8 ± 2.58	0.8 ± 0.03/1.9 ± 0.75

Bold faced values are used to highlight the optimal values for the same experimental conditions.

Conclusion

We propose PLS-DFNN to effectively monitor the quality of FCC process. From the perspective of linearity and nonlinearity, we decompose three subspaces for further extracting information from variables. We prove that three monitoring statistics for three subspaces of PLS-DFNN can excellently monitor three types of faults. However, the specific division of three subspaces depends on the number of components of PLS that are determined by cross-validation and the network structure that is determined by experience. An improved division method can be further studied, and the performance may also be further improved.

We can choose the appropriate monitoring statistic for the fault on the basis of the detection of quality variables. PLS-DFNN provides good performance on the TE process data set and the FCC process.

Footnotes

Appendix A

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: this work was supported by the National Natural Science Foundation of China (grant no. 21878081).

ORCID iD

Xuefeng Yan

References

Ancheyta

Morales

Betancourt

, et al. (2004) Individual hydrotreating of FCC feed components. Energy & Fuels 18(4): 1001–1004.

Bounoua

Benkara

Kouadri

, et al. (2019) Online monitoring scheme using principal component analysis through Kullback-Leibler divergence analysis technique for fault detection. Transactions of the Institute of Measurement and Control 42(6): 1225–1238.

Chadha

Schwung

(2017) Comparison of deep neural network architectures for fault detection in Tennessee Eastman process. In: 2017 22nd IEEE international conference on emerging technologies and factory automation (ETFA), Limassol, Cyprus, 12–15 September, pp. 1–8. New York: IEEE.

Chang

Cai

Zhang

, et al. (2014) Computational investigation of the hydrodynamics, heat transfer and kinetic reaction in an FCC gasoline riser. Chemical Engineering Science 111: 170–179.

Chen

Wynne

Goulding

, et al. (2000) The application of principal component analysis and kernel density estimation to enhance process monitoring. Control Engineering Practice 8(5): 531–543.

Chiang

Russell

Braatz

(2000) Fault diagnosis in chemical processes using Fisher discriminant analysis, discriminant partial least squares, and principal component analysis. Chemometrics and Intelligent Laboratory Systems 50(2): 243–252.

Downs

Vogel

(1993) A plant-wide industrial process control problem. Computers & Chemical Engineering 17(3): 245–255.

Fan

Qin

Wang

(2014) Online monitoring of nonlinear multivariate industrial processes using filtering KICA–PCA. Control Engineering Practice 22: 205–216.

(2017) Review on data-driven modeling and monitoring for plant-wide industrial processes. Chemometrics and Intelligent Laboratory Systems 171: 16–25.

10.

Song

(2007) Process monitoring based on independent component analysis–principal component analysis (ICA-PCA) and similarity factors. Industrial & Engineering Chemistry Research 46(7): 2054–2063.

11.

Gonzalez

Huang

Lau

(2015) Process monitoring using kernel density estimation and Bayesian networking with an industrial case study. ISA Transactions 58: 330–347.

12.

Gupta

Kumar

Srivastava

(2007) A new generic approach for the modeling of fluid catalytic cracking (FCC) riser reactor. Chemical Engineering Science 62(17): 4510–4528.

13.

Han

Chung

(2001) Dynamic modeling and simulation of a fluidized catalytic cracking process. Part I: Process modeling. Chemical Engineering Science 56(5): 1951–1971.

14.

Huang

Yan

(2019) Quality-driven principal component analysis combined with kernel least squares for multivariate statistical process monitoring. IEEE Transactions on Control Systems Technology 27(6): 2688–2695.

15.

Jiang

Yan

Huang

(2019) Review and perspectives of data-driven distributed monitoring for industrial plant-wide processes. Industrial & Engineering Chemistry Research 58(29): 12899–12912.

16.

Jiang

Yin

Kaynak

(2020a) Optimized design of parity relation based residual generator for fault detection: Data-driven approaches. IEEE Transactions on Industrial Informatics 17(2): 1449–1458.

17.

Jiang

Yin

Kaynak

(2020b) Performance supervised plant-wide process monitoring in industry 4.0: A roadmap. IEEE Open Journal of the Industrial Electronics Society 2: 21–35.

18.

Jiao

Zhao

Wang

, et al. (2017) A nonlinear quality-related fault detection approach based on modified kernel partial least squares. ISA Transactions 66: 275–283.

19.

Kano

Hasebe

Hashimoto

, et al. (2001) A new multivariate statistical process monitoring method using principal component analysis. Computers & Chemical Engineering 25(7–8): 1103–1113.

20.

Kong

Cao

, et al. (2018) Quality-related and process-related fault monitoring with online monitoring dynamic concurrent PLS. IEEE Access 6: 59074–59086.

21.

Zhang

Jiao

, et al. (2019) Quality-related fault diagnosis based on improved PLS for industrial process. In: 2019 34rd youth academic annual conference of Chinese Association of Automation (YAC), Jinzhou, China, 6–8 June.

22.

Feng

(2020) A nonlinear method for monitoring industrial process. Transactions of the Institute of Measurement and Control 43(2): 400–411.

23.

Sun

Qin

, et al. (2020) Gasoline to diesel consumption ratio: A new socioeconomic indicator of carbon dioxide emissions in China. Sustainability 12(14): 1–14.

24.

Lin

Zeng

(2013) The elasticity of demand for gasoline in China. Energy Policy 59: 189–197.

25.

Lin

Lan

(2014) Independent component analysis—Based sparse autoencoder in the application of fault diagnosis. In: Proceeding of the 11th world congress on intelligent control and automation, Shenyang, China, 29 June–4 July, pp. 1378–1382. New York: IEEE.

26.

Lopez-Zamora

de Lasa

(2019) A mid-infrared region (MIR) lumped Group Contribution based method for monitoring light gases and gasolines in fluid catalytic cracking. Chemical Engineering Science 212: 115324.

27.

Naik

Karthik

Kumar

, et al. (2017) Kinetic modeling for catalytic cracking of pyrolysis oils with VGO in a FCC unit. Chemical Engineering Science 170: 790–798.

28.

Peng

Zhang

(2013) Quality-related process monitoring based on total kernel PLS model and its industrial application. Mathematical Problems in Engineering 2013(Pt. 17): 1–14.

29.

Qin

Zhou

(2010) Geometric properties of partial least squares for process monitoring. Automatica 46(1): 204–210.

30.

Theisen

Drg

Abonyi

, et al. (2021) Sparse PCA support exploration of process structures for decentralized fault detection. Industrial & Engineering Chemistry Research 60(22): 8183–8195.

31.

Tong

Lan

, et al. (2019) Distributed partial least squares based residual generation for statistical process monitoring. Journal of Process Control 75: 77–85.

32.

Vistisen

PØ

Zeuthen

(2008) Reactions of organic sulfur and nitrogen compounds in the FCC pretreater and the FCC unit. Industrial & Engineering Chemistry Research 47(21): 8471–8477.

33.

Wang

Fang

(2016a) Co-processing of catalytic pyrolysis derived bio-oil with VGO in a pilot scale FCC riser. Industrial & Engineering Chemistry Research 55(12): 3525–3534.

34.

Wang

Jiao

(2017) A kernel least squares based approach for nonlinear quality-related fault detection. IEEE Transactions on Industrial Electronics 64(4): 3195–3204.

35.

Wang

Luo

Peng

(2016b) Quality-related fault detection using linear and nonlinear principal component regression. Journal of the Franklin Institute 353(10): 2159–2177.

36.

Yan

(2021) Nonlinear quality-relevant process monitoring based on maximizing correlation neural network. Neural Computing Applications 33(2): 10129–10139.

37.

Yan

Huang

Yan

(2019) Monitoring of quality-relevant and quality-irrelevant blocks with characteristic-similar variables based on self-organizing map and kernel approaches. Journal of Process Control 73: 103–112.

38.

Yao

Zhao

, et al. (2022) Batch process monitoring based on global enhanced multiple neighborhoods preserving embedding. Transactions of the Institute of Measurement and Control 44(3): 620–633.

39.

Zhou

Gang

Qin

(2010) Total projection to latent structures for process monitoring. AICHE Journal 56(1): 168–178.

40.

Zhou

Ren

Wang

(2019) Quality-relevant fault monitoring based on locally linear embedding orthogonal projection to latent structure. Industrial Engineering Chemistry Research 58(3): 1262–1272.

Fluid catalytic cracking process quality-driven fault detection based on partial least squares and deep feedforward neural network

Abstract

Keywords

Introduction

Preliminary work

PLS

FNN and AE

Fault detection based on PLS-DFNN model

DFNN structure

Subspaces based on PLS-DFNN

Monitoring statistics and threshold determinations

Scheme of fault detection

Experiments and discussion

Numerical simulation data set

Extracting linear features by PLS

Extracting nonlinear features by DFNN

Detecting results of three testing sets

Verification

TE process data set

Faults of TE process data set

Model establishment

Fault detection and verification

Applications on FCC process

FCC process

Fault detection of FCC process

Faults of FCC process

Performance of PLS

Performance of DFNN and PLS-DFNN

Conclusion

Footnotes

Appendix A

Declaration of conflicting interests

Funding

ORCID iD

References