Sage Journals: Discover world-class research

Abstract

The integrated navigation system highly relies on the accuracy of measurements of sensors that are susceptible to unknown disturbances. In order to improve the reliability and safety of the navigation system, there is an increasing need for the fault detection of the sensors. In the present study, a hybrid data-driven fault detection strategy is proposed, which is based on residual sequence analysis. Currently, the one-class support vector machine is one of the most popular fault detection methods for navigation systems with many successful cases. Therefore, the one-class support vector machine is combined with time-series similarity measure and modified principal components analysis approaches. The similarity measurement of multi-sequence residuals between a real-time sample and normal condition samples is computed to construct learning features for one-class support vector machine. Similarly, the modified principal components analysis scheme is applied to project residuals onto subspaces and obtain learning features. Moreover, the one-class support vector machine model is applied for abnormal detection if unexpected sensor faults exhibit in measurements and residuals. Finally, experiments are carried out to evaluate the performance of the proposed strategy for abrupt faults and soft faults on navigation sensors. Experimental results show that the hybrid data-driven fault detection strategy can effectively detect these faults with short time delay and high accuracy.

Keywords

Date-driven method fault detection one-class support vector machine principal components analysis real-time system time-series similarity measure

Introduction

With the considerably increasing demands for reliability and stability in the complex multi-sensor systems, fault detection (FD) has become an essential research field to ensure the precise and accurate performance of sensors. In addition to the FD research hotspot in industrial processes, the data-driven FD methodology which originates from insufficient knowledge about complex systems and unknown fault types ^1–3 has attracted many scholars. Recently, the model-based FD method has been applied in diverse applications, and remarkable results have been applied for process monitoring and system fault diagnosis.^4,5 However, the model-based FD method needs physical and mathematical prior knowledge of the system. On the contrary, when the structure of the engineering system is complicated and the operational requirements vary in different conditions, the model-based approaches confront significant limitations for obtaining satisfactory FD results.⁶ In order to resolve this problem, a data-driven strategy has been proposed to collect the data of various operating states. The normal input/output variables and their correlations in this condition can be obtained based on data training of a “particular” operation state. Then, the training model can be used for process monitoring and abnormal detection. Reviewing the literature indicates that the data-driven FD schemes have attracted many scholars in recent years for investigating dynamic systems.

The integrated navigation system that contributes a significant role in various carriers, including aircraft, vehicle, ship, and modern weapon system, should provide accurate geographical position, velocity, and attitude information.⁷ In order to maintain the navigation sensors (i.e. gyroscopes, accelerometers, and global positioning system (GPS) receiver) at stable performance and monitor the corresponding estimation states, many FD strategies have been proposed so far by the researchers.^8–18 Conventional model-based approaches and data-driven approaches were proposed accordingly for intelligent FD and fault diagnosis.⁸ Moreover, considering extensive applications of unmanned operation systems and complicated integrated navigation systems, finding a reliable real-time FD method is of great importance. The integrated navigation system faults can be mainly divided into two categories: (1) abrupt faults caused by hardware failures or strong impulse disturbance, and (2) soft faults that widely exist in inertial sensors, which may come from severe drift errors. Abrupt faults cause a serious deviation of the navigation system in a short time. However, these faults can be detected by simple analysis. On the other hand, soft faults that affect other subsystems or even the whole system by slow changes can be hardly detected and isolated. This is especially more pronounced when soft faults originate from minor errors where there is insufficient knowledge about these faults. In order to resolve this shortcoming, investigating the FD of integrated navigation systems has received significant attention in the past decades.^9–11

The existing FD methods for the navigation systems can be generally divided into two groups, including analytical model-based methods and data-driven methods.⁹ Analytical model-based methods depend on the constructed physical model. However, analytical methods require lots of prior knowledge such as accurate parameters of the dynamic model.^12,13 For instance, the state chi-square test (SCST) is the most classic method for an integrated navigation system that detects the fault through a constructed statistic between measurements and predictions of the recursion filter. Monteriu et al.¹³ presented a model-based multiple sensors fault detection and isolation (FDI) by using the “structural analysis” that includes the residual generation and ad hoc residual evaluation for unmanned ground vehicles (UGVs). Moreover, $H_{\infty}$ estimation is another effective way for the navigation system FD by the residual generator design.¹⁵ However, residual evaluation is still an enormous challenge for nonlinear systems.

By contrast, the data-driven method solves FD problems by multivariate statistical methods and training machine learning models from the historical input/output dataset. From this point of view, the residual chi-square test, which detects the various faults by a mathematical statistics method, can be classified into a data-driven method. Although the residual chi-square test has more reasonable dynamic and real-time performance, it lacks sensitivity to soft faults and heavily relies on system parameters. On the contrary, the intelligent methods depending on the nonanalytic model, such as the artificial neural network (ANN), support vector machine (SVM), Markov models (MMs), or other models, provide powerful approaches to implement data-driven FD. Reviewing the literature shows that studies on data-driven FD methods for integrated navigation systems have received much attention recently. Guo et al.¹⁶ proposed an active FD method based on one-class support vector machine (OC-SVM) and deep neural network (DNN). They effectively applied the OC-SVM to detect faults of navigation sensors, and the DNN predicts the running data to replace fault time data. Moreover, Zhao et al.¹⁷ established an FD model by using the belief rule base (BRB). Then, an expectation–maximization (EM) algorithm is adopted to investigate parameter recursive estimation and online update. The model can track the fault state and investigate the FD in real time. Xu and Lian¹⁸ proposed a multi-channel single-dimensional fully convolutional neural network (MS-FCN) FD method. This method extracts the features from measuring residual sequences of the sensors and discriminates the operating state with the prior information. These methods utilize sensor sampling data or residual data as a mathematical model training the dataset directly. However, it is a challenging task for an integrated navigation system due to its complicated multi-sensors, limited training dataset, and lack of prior knowledge of fault states. It is worth noting that as the accuracy of the recursive filter algorithm is improved, it is expected to make full use of residual data to implement FD in a data-driven way.^19,20 Therefore, the hybrid method based on model and data-driven has been proposed in the literature. For example, Liu et al.²¹ combined the SCST and simplified the fuzzy Adaptive Resonance Theory Map (ARTMAP) neural (SFAM) network to overcome the problem of the FD in a noisy environment.

According to these studies and their successful applications in the navigation system, it is intended to propose a new hybrid data-driven FD strategy. The proposed strategy combines the OC-SVM model with time-series similarity measurement (SIM) and modified principal components analysis (MPCA) approaches. The residual sequence from the Kalman filter (KF) is preprocessed using SIM and MPCA approach to investigate accumulated fault errors. Then, the dataset of the normal condition is collected and trained in the SVM model for detecting sensor faults. The proposed data-driven FD method for the integrated navigation system is formulated as an abnormal detection problem when prior knowledge of fault types is difficult to obtain. It is expected that the proposed strategy can provide a real-time FD method based on status monitoring for both abrupt and soft faults.

The present study is organized as follows: The “Framework of hybrid data-driven FD” section presents the hybrid data-driven FD framework for the integrated system. Then, SIM and MPCA methods are introduced for analyzing the multi-sequence residuals. In the “Residual characterization” section, the FD is formulated as an abnormal detection problem using OC-SVM. Experimental validation of the proposed strategy is presented in the “FD based on OC-SVM” section, and then follows the “Simulation experiments and results” section. Finally, concluding remarks are given iresulting inn the “Conclusion” section.

Framework of hybrid data-driven FD

Figure 1 illustrates the main framework of the hybrid data-driven FD strategy for the integrated system. It is observed that the FD sub-filter consists of the measurement, preprocessing, and FD units. The measurement unit provides the sampling data of sensors (inertial measurement unit and GPS) in real time and residual data of KF. When the system is operating at the normal condition, the residual sequences of KF are stored as multivariate time-series dataset beforehand and then these datasets are processed as the training dataset L . Moreover, the preprocessing unit consists of two main approaches, including SIM and MPCA approaches. Since the soft fault changes slowly, SIM and MPCA approaches are utilized to characterize residual errors over long intervals. To this end, fault-free residuals have initially been recorded for long enough periods and then these residuals are processed by the SIM and MPCA modules to determine the characteristics of the faultless dataset. Second, the corresponding real-time residual errors are calculated through the SIM and MPCA methods and then these errors are used as a testing vector for the FD unit. Then, possible faults and failures are identified in the FD unit by the OC-SVM model, which is responsible for abnormal detections and gives feedback if faults are determined. The framework shows that this data-driven FD strategy heavily relies on the performance of the KF and can effectively give the state estimations and measurement predictions. Unlike other FD methods, the residual sequences are normally applied for constructing learning features instead of setting a threshold directly.

Figure 1.

Main framework of data-driven fault detection strategy.

It should be indicated that the modular of the KF, SIM and MPCA processor, and OC-SVM model can be considered as sub-filters in more integrated navigation systems.¹⁶ Meanwhile, the sub-filter design can be applied in a federal filter structure. The FD module serves as a filter and controls a switch to determine whether the connected sensor is in a good condition or not. The main filter of the navigation system can adjust the filtering mode to generate reliable navigation data. Therefore, the FD module plays a significant role in the integrated navigation system. Furthermore, the hybrid data-driven FD method provides a basis for the fault diagnosis and fault-tolerant techniques to meet the reliability requirements of navigation sensors.

Residual characterization

Residual characterization based on SIM

In the residual chi-square test method, the soft faults coincide with minor errors. Moreover, the forecasting value ${\hat{X}}_{k / k - 1}$ traces the output of fault, resulting in the small residual remaining, which can be hardly detected. In fact, finding patterns from residuals for navigation system FD and a promising alternative method is a challenging task. Therefore, the method of detecting residuals at a single time is changed to the method of analyzing the variation characteristics of residuals in a certain period. The main purpose of residual characterization is to discover meaning features at different fault conditions. Based on analyzing multivariate time series in real time, it is found that FD methods mainly require three research contents as the following: (1) appropriate representation, (2) SIM, and (3) suitable pattern recognition.²² It is worth noting that the clustering algorithm²³ and the time-series fitted method²⁴ are successful data-mining approaches in diverse FD tasks. Accordingly, it is intended to introduce the SIM method based on multivariate time series into the FD strategy of the integrated navigation system. It is worth noting that the SIM method serves as a preprocessor to characterize the informative fault patterns.

Assume that the dynamic model of a discrete integrated navigation system with a fault can be formulated in the form below

{\begin{matrix} X (k) = ϕ (k, k - 1) X (k - 1) + Γ (k - 1) W (k - 1) \\ Z (k) = H (k) X (k) + V (k) + f (k, φ) γ \end{matrix}

(1)

where $Z (k) \in R^{m}$ , $X (k) \in R^{n}$ , and $ϕ (k, k - 1) \in R^{m \times n}$ denote the measurements of the system, system state, and the transition matrix states, respectively. Moreover, $Γ (k - 1) \in R^{n \times r}$ is the noise matrix. $W (k - 1) \in R^{r}$ and $V (k) \in R^{m}$ are independent Gauss white noise sequences. $γ$ is random fault sequence and $f (k, φ)$ is piecewise function, which can be mathematically expressed as follows

f (k, φ) = {\begin{matrix} 1, k \geq φ \\ 0, k < φ \end{matrix}

(2)

where $φ$ donates the time once a fault occurs.

The recursive state vector $\hat{X} (k, k - 1)$ and prediction of measurements $\hat{Z} (k, k - 1)$ of the system at time k can be computed recursively as follows

{\begin{matrix} \hat{X} (k, k - 1) = ϕ (k, k - 1) \hat{X} (k - 1) \\ \hat{Z} (k, k - 1) = H (k) \hat{X} (k, k - 1) \end{matrix}

(3)

In the residual chi-square test method, a statistic is constructed using predicted measurements $\hat{Z} (k, k - 1)$ and real measurements $Z (k)$ of one epoch. Assume that the residual at normal condition is represented as $r_{k}$ and obviously, it is Gauss white noise with zero means. When a fault occurs at time $φ$ , the system state can be expressed in the form below

{\begin{matrix} r_{k} = Z (k) - \hat{Z} (k, k - 1), k < φ \\ r_{k} + γ_{k} = Z (k) - \hat{Z} (k, k - 1), k \geq φ \end{matrix}

(4)

where $γ_{k}$ represents errors added to the residual vector at time k. Based on discussions in the foregoing section, when a soft fault is presented by $γ_{k}$ , then the chi-square test method cannot effectively detect the fault. Because once a fault occurs, the state estimation follows the soft fault recursively. In order to improve the sensitivity of soft FD, the SIM module is applied in the present study to characterize accumulative fault errors over a period of time. Assuming that the SIM module can store the residual sequences with length a, then the multivariate time-series dataset $D$ at time k can be obtained as follows

D = {\begin{matrix} [\begin{matrix} r_{k - a + 1} \\ \begin{matrix} ⋮ \\ r_{k - 1} \end{matrix} \\ r_{k} \end{matrix}] a, k < φ \\ [\begin{matrix} r_{k - a + 1} \\ \begin{matrix} ⋮ \\ r_{φ - 1} \\ (r_{φ} + γ_{φ}) \end{matrix} \\ \begin{matrix} ⋮ \\ (r_{k} + γ_{k}) \end{matrix} \end{matrix}] a < φ, k \geq φ \end{matrix}

(5)

Equation (5) indicates that the dataset $D \in R^{a \times m}$ is time-series collection with multi-attributes. Once a fault occurs in navigation system sensors, the fault error is accumulated and presented in this multivariate time series of residuals. Subsequently, the SIM module is applied for constructing the learning features of the OC-SVM model, and the dynamic time warping (DTW) method is applied in the proposed strategy. A residual sequence dataset L of the system in good condition, called the normal multi-sequence, can be measured and collected in the form below

L = [\begin{matrix} r_{1} \\ \begin{matrix} r_{2} \\ ⋮ \end{matrix} \\ r_{N} \end{matrix}], N > 0, L \in R^{N \times m}

(6)

When the system performs the actual navigation task, a real-time residual at each discrete epoch k is generated based on the local filter and the obtained result is stored as multi-sequence $D_{a}$ over a period of time. These processes can be mathematically expressed as

D_{a} = [\begin{matrix} r_{k - a + 1} \\ \begin{matrix} ⋮ \\ r_{k - 1} \end{matrix} \\ r_{k} \end{matrix}], a > 0, D_{a} \in R^{a \times m}

(7)

Then, a elements of the residual should be selected from normal multi-sequence dataset $L$ . In the present study, the random selection method is utilized to ensure that the validity of the proposed method is not affected by a certain time. The selected normal multi-sequence $L_{a}$ can be expressed as

L_{a} = [\begin{matrix} r_{i + 1} \\ \begin{matrix} r_{i + 2} \\ ⋮ \end{matrix} \\ r_{i + a} \end{matrix}], i + a < N, L_{a} \in R^{a \times m}

(8)

Subsequently, the DTW distance between the same variables of $D_{a}$ and $L_{a}$ can be computed. The SIM value of $D_{a}$ and $L_{a}$ is described as

Si m_{k} = dwt (D_{a}, L_{a})

(9)

D_{a} = [\begin{matrix} r_{k - a + 1} \\ \begin{matrix} ⋮ \\ r_{k - 1} \end{matrix} \\ r_{k} \end{matrix}] = [V_{D 1}, V_{D 2}, \dots, V_{Dm}]

(10)

L_{a} = [\begin{matrix} r_{i + 1} \\ \begin{matrix} r_{i + 2} \\ ⋮ \end{matrix} \\ r_{i + a} \end{matrix}] = [V_{L 1}, V_{L 2}, \dots, V_{Lm}]

(11)

dwt (D_{a}, L_{a}) = [dwt (V_{D 1}, V_{L 1}), \dots, dwt (V_{Dm}, V_{Lm})]

(12)

Based on the measured distance through the DTW method, accumulative error during fault occurring time can be integrated into similarity measurement $Si m_{k}$ , which is also the learning feature vector of the SVM module.

A modified PCA for residual characterization

Studies show that the PCA method is a basic and efficient statistical method that can effectively extract and preserve a significant amount of information for the data variability and proposes originally of the dimension reduction. On the contrary, the PCA method has a simple structure, which is more appropriate for handling a large number of stationary process data with a Gaussian distributed variable. Furthermore, the PCA scheme has been widely and successfully employed as a multivariate statistical tool in many status monitoring and fault diagnosis applications.^25–27 Based on a hybrid linear–nonlinear statistical modeling, Deng et al.²⁸ proposed a serial PCA (SPCA) for nonlinear process monitoring. Furthermore, Peng et al.²⁹ reported a kernel independent and principal components analysis (kernel ICA-PCA) for the hot strip mill process. As an effective data-driven FD and diagnosis tool based on multivariate statistical process monitoring, PCA and its extension have been investigated by many researchers.¹ In this section, a modified PCA is proposed to obtain residual characterization vector to efficiently characterize residuals as learning features for the SVM method.

Similar to the SIM method, a recorded residual dataset D at the normal condition is collected with zero mean and normalized with the unit variance for training purposes. In the proposed hybrid FD framework, the multivariate dataset L can be shared by SIM and PCA methods as follows

L = [\begin{matrix} r_{1} \\ \begin{matrix} r_{2} \\ ⋮ \end{matrix} \\ r_{N} \end{matrix}] = [\begin{matrix} r_{11} & \dots & r_{1 m} \\ ⋮ & ⋱ & ⋮ \\ r_{N 1} & \dots & r_{Nm} \end{matrix}], L \in R^{N \times m}

(13)

The covariance matrix is defined as

ϕ = \frac{1}{N - 1} L^{T} L

(14)

Then, singular value decomposition (SVD) is performed on the covariance matrix $ϕ$

\frac{1}{N - 1} L^{T} L = P Λ P^{T}

(15)

where $Λ = diag (λ_{1}, \dots, λ_{m}) λ_{1} \geq \dots \geq λ_{m} \geq 0$ denotes singular matrix. Based on the magnitude of singularities, $P$ and $Λ$ can be mathematically expressed as

P = [\begin{matrix} P_{pc} & P_{res} \end{matrix}], Λ = [\begin{matrix} Λ_{pc} & 0 \\ 0 & Λ_{res} \end{matrix}]

(16)

where $P_{pc} \in R^{m \times β}$ and $P_{res} \in R^{m \times (m - β)}$ contain the singular vectors correlated to the first $β$ large singular values and last $(m - β)$ small singular values in $Λ_{pc}$ and $Λ_{res}$ . Therefore, the subspace of singular vectors $P_{pc}^{T}$ and $P_{res}^{T}$ are called principal subspace and “residual” subspace, respectively. For the basic FD method by the PCA model, the measured variable z is projected onto the two orthogonal subspaces $P_{pc}^{T}$ and $P_{res}^{T}$ and evaluation thresholds are defined by the aforementioned projections. However, in order to avoid missing a fault in one subspace, a combined method is adopted in the proposed strategy that simultaneously uses both test statistics. In the proposed modified PCA method, the multivariate residual sequence in real time is applied as the input matrix to form a learning vector for the SVM model. Assume $D_{a}$ is a $a \times m$ -dimensional measured multivariate residual dataset. Then, the corresponding residual characterization can be expressed as

T_{ReC}^{2} = λ_{m} (T^{2} + T_{H}^{2}) = D_{a} P Ξ P^{T} D_{a}^{T}

(17)

Ξ = diag (\begin{matrix} \frac{λ_{m}}{λ_{1}}, & \dots & , \begin{matrix} \frac{λ_{m}}{λ_{m - 1}} & 1 \end{matrix} \end{matrix}) \in R^{m \times m}

(18)

D_{a} = [\begin{matrix} r_{k - a + 1} \\ \begin{matrix} ⋮ \\ r_{k - 1} \end{matrix} \\ r_{k} \end{matrix}] \in R^{a \times m}

(19)

The matrix $T_{ReC}^{2} \in R^{a \times a}$ is characterized and obtained by the principal subspace $P_{pc}^{T}$ and “residual” subspace $P_{res}^{T}$ . Then, the diagonal elements of the matrix $T_{ReC}^{2}$ is chosen as the learning features for the SVM model

T_{ReC}^{2} = [\begin{matrix} t_{11} & \dots & \dots \\ ⋮ & ⋱ & ⋮ \\ \dots & \dots & t_{aa} \end{matrix}]

(20)

PC A_{k} = [\begin{matrix} t_{11} & \dots & t_{aa} \end{matrix}]

(21)

Similarly, the modified PCA method can be effectively applied to characterize accumulative errors in residual sequences in the vector $PC A_{k}$ during a epochs. It should be indicated that these residual sequences can provide additional learning features for the SVM method.

Theoretical analysis for residual characterization

As mentioned in the foregoing sections, the residual characterization based on the SIM and MPCA methods can be applied to construct the learning features for the SVM method, which drives the OC-SVM model to implement abnormal detection. The common advantage of these two methods is the ability to characterize error $γ_{k}$ at each measurement epoch into learning vectors. In this case, the obtained vector at the fault condition is different from that of the normal condition. Therefore, a theoretical analysis for the advantages of SIM and MPCA characterization methods is described as the following.

Assume that real-time multi-residual sequences at fault-free condition $D_{a_FF}$ and fault condition $D_{a_F 0}$ are mathematically expressed in the form below

D_{a_FF} = [\begin{matrix} r_{1} \\ \begin{matrix} r_{2} \\ ⋮ \end{matrix} \\ r_{a} \end{matrix}] = [\begin{matrix} v_{11} \\ \begin{matrix} v_{21} \\ ⋮ \end{matrix} \\ v_{a 1} \end{matrix} \dots \begin{matrix} v_{1 m} \\ \begin{matrix} v_{2 m} \\ ⋮ \end{matrix} \\ v_{am} \end{matrix}] = [V_{D 1} \dots V_{Dm}]

(22)

\begin{matrix} D_{a_F 0} = [\begin{matrix} r_{1} + γ_{1} \\ \begin{matrix} r_{2} + γ_{2} \\ ⋮ \end{matrix} \\ r_{a} + γ_{a} \end{matrix}] \\ = [\begin{matrix} v_{11} + γ_{1} \\ \begin{matrix} v_{21} + γ_{2} \\ ⋮ \end{matrix} \\ v_{a 1} + γ_{a} \end{matrix} \dots \begin{matrix} v_{1 m} + γ_{1} \\ \begin{matrix} v_{2 m} + γ_{2} \\ ⋮ \end{matrix} \\ v_{am} + γ_{a} \end{matrix}] \\ = [V_{D 1}^{'} \dots V_{Dm}^{'}] \end{matrix}

(23)

When the multi-sequences $D_{a_FF}$ and $D_{a_F 0}$ are characterized with the selected normal multi-sequence $L_{a}$ , the characterization vectors at different conditions can be obtained as

L_{a} = [\begin{matrix} r_{i + 1} \\ \begin{matrix} r_{i + 2} \\ ⋮ \end{matrix} \\ r_{i + a} \end{matrix}] = [\begin{matrix} l_{11} \\ \begin{matrix} l_{21} \\ ⋮ \end{matrix} \\ l_{a 1} \end{matrix} \dots \begin{matrix} l_{1 m} \\ \begin{matrix} l_{2 m} \\ ⋮ \end{matrix} \\ l_{am} \end{matrix}] = [V_{L 1}, \dots, V_{Lm}]

(24)

Si m_{k_FF} = dwt (D_{a_FF}, L_{a})

(25)

Si m_{k_F 0} = dwt (D_{a_F 0}, L_{a})

(26)

dwt (V_{D 1}, V_{L 1}) = dwt ([v_{11}, \dots, v_{a 1}], [l_{11}, \dots, l_{a 1}])

(27)

dwt (V_{D 1}^{'}, V_{L 1}) = dwt ([v_{11} + γ_{1}, \dots, v_{a 1} + γ_{a}], [l_{11}, \dots, l_{a 1}])

(28)

Comparing the sequences $V_{D 1}$ and $V_{D 1}^{'}$ indicates that each element in $V_{D 1}^{'}$ adds a different error $γ_{i} (1 \leq i \leq k)$ , which results in a remarkable difference between similarity measurements of $Si m_{k_FF}$ and $Si m_{k_F 0}$ . It is worth noting that such difference increases as the measuring time extends. Consequently, when errors originating from soft faults accumulate over a period of time to a certain extent, an abnormal SIM occurs, which can be detected by the OC-SVM method.

Similarly, the multi-residual sequences $D_{a_FF}$ and $D_{a_F 0}$ can be characterized by the MPCA method. Assume that $PC A_{k_FF}$ and $PC A_{k_F 0}$ are two vectors after characterization of the MPCA. Then, the following equations can be obtained accordingly

PC A_{k_FF} = [\begin{matrix} t_{11_FF} & \dots & t_{aa_FF} \end{matrix}]

(29)

PC A_{k_F 0} = [\begin{matrix} t_{11_F 0} & \dots & t_{aa_F 0} \end{matrix}]

(30)

\begin{matrix} t_{11_FF} = r_{1} P Ξ P^{T} r_{1}^{T} (r_{1} = [r_{11}, \dots, r_{1 m}]) \\ t_{11_F 0} = (r_{1} + γ_{1}) P Ξ P^{T} {(r_{1} + γ_{1})}^{T} \end{matrix}

(31)

r_{1} + y_{1} = [(r_{11} + γ_{1}) + \dots + (r_{1 m} + γ_{1})]

(32)

Since fault errors affect the results of features learned by the PCA model, the elements of characterized vectors $PC A_{k_FF}$ and $PC A_{k_F 0}$ are significantly different. Moreover, equations (28) and (32) indicate that errors $[γ_{1}, \dots, γ_{a}]$ can be characterized into elements of vector $Si m_{k_F 0}$ , while $m \times γ_{i} (1 \leq i \leq k)$ errors are characterized into the elements of $PC A_{k_F 0}$ . These two different methods of characterizing errors can also be investigated from vertical and horizontal directions. Therefore, the detection strategy is a hybrid strategy (HS) of the characterization methods.

FD based on OC-SVM

FD based on the OC-SVM method is an anomaly detection approach. It is one of the most popular data-driven FD methods with wide applications in diverse areas.^30–33 Studies show that this technique is especially effective for the situation where normal operation samples are easily accessible, while the fault samples are expensive to be understood. Therefore, since the prior knowledge of unknown faults is rare, the OC-SVM method is a powerful scheme to FD of the multi-sensor navigation system.

OC-SVM

The OC-SVM method is a kernel based on a support vector description with a training dataset (target class) consisting of positive examples only. It computes the smallest sphere in the feature and finds a unique optimal hyperplane that separates the training dataset from the origin with maximum margin. In other words, the origin is treated as an outlier from the target class. In the proposed hybrid data-driven FD strategy, the characterization vectors of normal condition dataset $Si m_{K}$ and $PC A_{K}$ based on the SIM and MPCA methods, respectively, are treated as training samples with positive labels, where K denotes the number of training samples. The training dataset can be mathematically expressed as

{\begin{matrix} Si m_{K} = {s_{1}, s_{2} \dots s_{k}} \\ PC A_{K} = {s_{1}, s_{2} \dots s_{k}} \end{matrix}

(33)

Then, the optimal hyperplane is described as the following

f (s) = (w \cdot ψ (S)) - ρ = 0

(34)

Where $ψ$ donates a mapping function. Therefore, this optimization problem can be solved as follows

\begin{matrix} \begin{matrix} \min \\ w, ρ, ξ \end{matrix} {\frac{1}{2} | | w | |^{2} - ρ + \frac{1}{vK} \sum_{i} ξ_{i}} \\ s . t . (w \cdot ψ (s_{i})) \geq ρ - ξ_{i} \\ ξ_{i} \geq 0, i = 1, \dots, K \end{matrix}

(35)

where $w$ and $ρ$ are normal vector and offset, respectively. Moreover, $v \in (0, 1]$ and $ξ_{i} \geq 0$ donate the regularization parameter and slack variable, respectively. Introducing the Lagrange multipliers $α_{i}$ to equation (34) and deriving $w$ , the dual optimization problem is described as

\begin{matrix} \begin{matrix} \max \\ α \end{matrix} - \frac{1}{2} α^{T} F α \\ s . t . 0 \leq α \leq \frac{1}{vK} \\ \sum_{i} α_{i} = 1, i = 1, \dots, K \end{matrix}

(36)

where $F$ is a positive definite function (kernel function), such as polynomial kernel, radial basis function (RBF) kernel, or Gaussian kernel. Moreover, its expression is described as follows

F (s_{i}, s_{j}) = 〈 ψ (s_{i}), ψ (s_{j}) 〉

(37)

The kernel function induces the OC-SVM working in the feature space and we focus on RBF kernel in our strategy. After obtaining the optimal solution $α$ , the constant $ρ$ can be given as

ρ = 〈 w \cdot ϕ (s_{i}) 〉

(38)

where $s_{i}$ is the sample with the corresponding $α_{i} \in (0, 1 / vK)$ . Moreover, the hyperplane function of OC-SVM is determined, and a new sample $s_{x}$ can be estimated by $sgn (f (s_{x}))$ . Tax and Duin³⁴ proposed the support vector domain description (SVDD), which is an equivalent formulation of the OC-SVM method. The main purpose of the SVDD is to find a hypersphere to surround the training samples with the lowest volume.

Abnormal detection algorithm

In practical applications, a statistic detection amount should be determined for the FD problem. More specifically, $T^{2}$ and Q statistics are used in the classical FD method to monitor the process data. Similarly, a corresponding threshold should be developed for the OC-SVM abnormal detection. In the present study, $- f (s)$ is selected as the distance metric, which can be formulated as

F (s) = - f (s) = - \sum_{i = 1}^{K} α_{i} F (s_{i}, s) + ρ

(39)

where $f (s)$ donates the normal distance of the sample data from the decision boundary in the feature space.³⁵ For the obtained real-time multi-sequence residual data, when the corresponding $Si m_{K}$ or $PC A_{K}$ feature vector locates inside the boundary of the training space, the term $F (s)$ takes negative values so that it is classified as a normal point. In contrast, the residual data are considered as outliers when the corresponding term $F (s)$ becomes greater than zero. It is observed that zero should be determined as the threshold of the distance metric for the abnormal detection. However, it requires to tune the parameter $θ$ and the RBF kernel $σ$ precisely. The cross-validation method proposed by Mahadevan and Shah³⁶ is adopted in the present work to obtain a suitable distance metric. Based on the discussed data, Figure 2 shows the abnormal detection scheme for the integrated navigation system FD. Based on discussions in the forgoing sections, the pre-collected multi-sequence residual dataset is applied for constructing learning vectors using the SIM and MPCA methods. Then, these vectors are applied for training the OC-SVM model. The real-time multi-sequence residual samples are processed by the SIM and MPCA module, and the new vector is used for the abnormal detection to distinguish the status of the system.

Figure 2.

Abnormal detection scheme based on OC-SVM.

Simulation experiments and results

Experiments setting and FD results

In this section, an inertial navigation systems/global navigation satellite system(INS/GNSS)-integrated navigation system of the unmanned aerial vehicle (UAV) is designed in the MATLAB environment to evaluate the validity of the proposed hybrid data-driven FD strategy. The abrupt faults and soft faults are both simulated to occur on the integrated navigation system. The training dataset is initially generated by simulating the normal operation of the system. Then, several faults are set into navigation sensors successively at different times. The multi-sequence residuals of fault condition are selected as the testing dataset. Table 1 shows the specifications of the UAV integrated navigation system. Moreover, Table 2 presents details of specific faults.

Table 1.

Specifications of the simulated UAV integrated navigation system.

Parameters	Values
Initial position [longitude latitude height]	[114° 38° 50 m]
Initial velocity (body axis) $[x, y, z]$	[0,0,0]
Initial attitude angle [azimuth roll pitch]	[90°,0°,0°]
Accelerometer bias error [initial, in-run]	[300 µg, 50 µg]
Gyro bias error [initial, in-run]	[0.2°/h, 0.03°/h]
Output frequency of INS	100 Hz
GNSS position error, velocity error	15 m, 0.3 m/s
Output frequency of GPS	1 Hz
Simulation time	600 s

UAV: unmanned aerial vehicle; GNSS: global navigation satellite system; GPS: global positioning system.

Table 2.

Details of specific faults.

Faults source	Type	Value	Duration
GNSS positioning error	Abrupt	40 m	150–151 s
GNSS positioning error	Soft-1	0.5 m/s	200–230 s
Gyro (x-axis)	Soft-2	0.08°/h	300–350 s
Accelerometer (y-axis)	Soft-3	60 µg	400–475 s

GNSS: global navigation satellite system.

In order to obtain more ideal Gaussian distribution residual data, the data in the stable state of trajectory at normal conditions are collected as prior data. It should be indicated that each simulation is conducted twice with the same trajectory to obtain the training dataset in reasonable condition. The duration of each simulation is 20 min and $2 \times 1000$ groups of residual samples are collected accordingly. In other words, the number of samples is $N = 1000$ in the multivariate datasets $L_{1}$ and $L_{2}$ . The two datasets are used for generating the learning features of the OC-SVM scheme. Therefore, it is concluded that the residual data have ideal normal distribution.

Based on the continuous residual samples, the feature vectors for training the OC-SVM model should be computed through the proposed SIM and modified PCA methods. In order to compare the obtained results from the two methods, the same parameter setting should be set applied in both SIM and MPCA methods. More specifically, length a of multi-sequence residuals $D_{a}$ is set accordingly. However, as aforementioned, the way of selecting multi-sequence residuals dataset is different. In fact, in the SIM method, the real-time multi-sequence residuals dataset $L_{a}$ is selected as $L_{1}$ . The comparative residual dataset $D_{a}$ is selected randomly from $L_{2}$ . However, in the MPCA method, $D_{a}$ is selected from $L_{2}$ and $L_{1}$ is determined as the prior multivariate dataset $L$ , which is applied to perform the SVD scheme.

After generating feature vectors, the OC-SVM model is trained using an RBF with the RBF kernel $σ = 2$ and regularization parameter $v = 0.1$ . First, the validity of these two methods should be verified.

The faults discussed in Table. 2 are injected into the navigation sensors, and 600 groups of real-time samples for each fault are collected during the failure period. Figures 3 and 4 show the FD results of SIM + OC-SVM and MPCA + OC-SVM methods, respectively, where x- and y-axes represent the detection time and distance metric by the OC-SVM method, respectively. The obtained results reveal that both SIM + OC-SVM and MPCA + OC-SVM methods can successfully detect the faults with short delay time (DT). Moreover, for an abrupt fault, both SIM + OC-SVM and MPCA + OC-SVM methods can directly detect the fault without delay. However, for soft faults, a short DT of 4 to 11 s exists in the detection process, which is mainly caused by the insufficiency of error accumulation. Moreover, it is worth noting that there are some detection points that are below the detection threshold during the failure period. These points are presented in Figures 3(c), 4(b), and 4(d). However, this does not affect the detection effectiveness FD of the proposed method. Therefore, data of navigation sensors should be verified prior to the use. To this end, experiments are carried out to verify the validity of SIM + OC-SVM and MPCA + OC-SVM methods.

Figure 3.

Fault results based on SIM + OC-SVM: (a) abrupt fault detection based on SIM + OC-SVM, (b) soft fault (GNSS) detection based on SIM + OC-SVM, (c) soft fault (Gyro) detection based on SIM + OC-SVM, and (d) soft fault (accelerometer) detection based on SIM + OC-SVM.

Figure 4.

Fault results based on SIM + OC-SVM: (a) abrupt fault detection based on MPCA + OC-SVM, (b) soft fault (GNSS) detection based on MPCA + OC-SVM, (c) soft fault (Gyro) detection based on MPCA + OC-SVM, and (d) soft fault (accelerometer) detection based on MPCA + OC-SVM.

Comparison study with HS

Based on the foregoing section, it is found that SIM + OC-SVM and MPCA + OC-SVM methods can be effectively applied to obtain effective FD with a short time delay. In this section, it is intended to apply several OC-SVM FD methods for the navigation system to evaluate the FD efficiency of the HS. To this end, the OC-SVM method based on the phase space reconstruction (PSR + OC-SVM)¹⁶ and the OC-SVM method based on multiple kernel anomaly detection (MKAD + OC-SVM)³⁷ are applied in the navigation experiment. Generally used FD indices, including fault detection rate (FDR) and false alarm rate (FAR), are initially introduced to act as the performance evaluation standards³⁸

FDR = \frac{No . of samples (F > F_{th} | l \neq 0)}{Total samples (l \neq 0)}

(40)

FAR = \frac{No . of samples (F > F_{th} | l = 0)}{Total samples (l = 0)}

(41)

where $F > F_{th} | l \neq 0$ and $F > F_{th} | l = 0$ denote samples whose distance metric exceeds the threshold and normal samples whose distance metric exceeds the threshold, respectively.

The proposed methods, including SIM + OC-SVM and MPCA + OC-SVM, utilize the multi-sequence residuals to construct the learning features for OC-SVM. It should be indicated that the abnormal detection methods using continuous data have been the mainstream in the past decade. Similarly, MKAD + OC-SVM is also a data-driven method by using multivariate continuous data to detect anomalies, which is derived from multiple kernel learning. The resultant kernels can be constructed over discrete sequences and discretized continuous time series for OC-SVM constructing an optimal hyperplane. It should be indicated that the process of constructing kernels is to measure the similarity between the discrete sequences, in other words, to find the representation of time series, which is inversely proportional to the distance. This is similar to the proposed SIM + OC-SVM method. In the proposed method, it is intended to verify whether the MKAD + OC-SVM method can find the similarity in the multivariate residual sequences. However, the PSR + OC-SVM utilizes a single sample for detection rather than multi-sequence data. The reason for this comparison is that it is intended to test whether the faults can be detected by constructing features at one point. In other words, several dimensional features are constructed from time-series navigation signals for OC-SVM training in the PSR + OC-SVM method. However, in the detection stage, a sample point x will be mapped into the feature space.

In the simulation, 50 groups of the real-time multi-sequence residual datasets are included for comparison. The injected faults are selected from Table 2, and the corresponding parameter is set as according to the “OC-SVM” section. In other words, the length of the multi-sequence is $a = 15$ . Fifty FD simulation experiments are carried out for each kind of fault. Table 3 presents the detailed FD results by utilizing all methods on the simulated navigation system. Moreover, the HS informs whether SIM + OC-SVM or MPCA + OC-SVM detect the outlier in the FD process.

Table 3.

FDR and FAR (%) results of different faults (Table 2) utilizing all the methods.

Fault	Hybrid strategy		SIM+OC-SVM		MPCA+OC-SVM		PSR+OC-SVM		MKAD+OC-SVM
Fault	FDR	FAR	FDR	FAR	FDR	FAR	FDR	FAR	FDR	FAR
Abrupt	100	0.87	100	0.55	100	0.38	100	2.43	100	0.45
Soft-1	99.88	1.96	95.23	1.54	94.41	1.09	90.75	6.43	94.75	2.74
Soft-2	98.37	2.01	96.85	1.32	95.85	1.62	93.38	8.57	92.31	2.98
Soft-3	98.85	1.12	95.38	0.97	97.46	0.88	91.04	4.32	96.54	1.87

The comparison study demonstrates that the proposed SIM + OC-SVM and MPCA + OC-SVM methods offer high FDRs and low FARs in contrast with PSR + OC-SVM and MKAD + OC-SVM, especially in soft faults FD. From the HS results in Table 3, the HS shows superior performance than any other method on FDRs with paying for higher FARs cost. The fourth and sixth columns of Table 3 show that FARs of SIM + OC-SVM and MPCA + OC-SVM methods provide the lowest false detection performance over all other methods with better FD results. In comparison, the proposed methods have superior characteristics in detecting accumulative error from successive residuals.

Study on residual sequence length

Since the parameter would make a difference to the results of FD, in this section the length of multi-sequence residuals is investigated. In order to consider both sensitivity and FDR performances, the appropriate length of the residual sequence should be selected for constructing features. The too-short sequence length would not be detected easily and the too-long sequence length would cause larger DT. Therefore, another simulation test is performed with different design parameters selected by SIM + OC-SVM and MPCA + OC-SVM. The soft fault of an accelerometer is determined to be injected to the system with a length of multi-sequence residuals ranging from 5 to 20. Table 4 summarizes the detailed FDRs, FARs, and time-delay indices of the simulation results.

Table 4.

FDRs, FARs (%), and DT (s) based on different design parameters.

Length	SIM + OC-SVM			MPCA + OC-SVM			Hybrid strategy
Length	FDR	FAR	DT	FDR	FAR	DT	FDR	FAR	DT
$a = 5$	87.57	1.97	5.6	86.34	1.86	5.5	89.02	2.32	5.5
$a = 6$	88.23	1.84	5.8	86.65	1.67	5.9	89.54	2.18	5.8
$a = 7$	88.63	1.72	5.5	87.09	1.45	6.1	90.47	1.98	5.5
$a = 8$	90.58	1.53	5.3	89.74	1.36	5.6	90.65	1.87	5.2
$a = 9$	91.48	1.49	5.1	90.83	1.22	5.8	91.20	1.65	5.1
$a = 10$	93.08	1.37	5.4	92.27	1.13	5.8	93.23	1.54	5.4
$a = 11$	94.54	1.28	5.9	94.85	1.03	5.4	95.87	1.40	5.4
$a = 12$	95.83	1.13	6.4	94.05	1.03	5.3	96.43	1.35	5.3
$a = 13$	96.74	1.09	6.6	95.65	0.91	5.8	97.17	1.19	5.7
$a = 14$	97.41	1.08	6.8	96.95	0.84	6.2	98.44	1.07	6.1
$a = 15$	97.38	0.97	6.7	97.46	0.88	6.7	98.85	1.12	6.5
$a = 16$	96.34	1.06	7.4	97.46	0.95	7.3	98.23	1.27	7.3
$a = 17$	96.83	1.21	8.2	97.19	0.92	8.1	98.54	1.47	7.9
$a = 18$	96.39	1.35	8.7	96.74	1.10	8.7	98.10	1.62	8.6
$a = 19$	97.28	1.44	9.0	96.67	1.14	9.0	97.75	1.59	8.8
$a = 20$	97.03	1.51	9.6	97.33	1.23	9.9	98.06	1.88	9.6

FDR: fault detection rate; FAR: false alarm rate; SIM: similarity measure; OC-SVM: one-class support vector machine; MPCA: modified principal components analysis; DT: delay time; The bold indicates the optimal result obtained by the method in this index.

According to the results of FDRs and FARs given in Table 4, it is observed that the different design parameters, including the length of the multi-sequence residual, significantly affect the FD performance of SIM + OC-SVM and MPCA + OC-SVM methods. It should be indicated that the length of the multi-sequence residual will generate a different accumulative error in learning features. Correspondingly, the performance of the HS will be affected due to different lengths. In the column of FDRs, all lengths of multi-sequence residuals obtain similar FD performance after exceeding 15 points, which means that sufficient length of the sequence is essential to ensure FD results. The FARs column shows that too long or short length would cause more false alarms. Therefore, an appropriate length ( $14 \leq l \leq 17$ in the proposed simulation system) should be determined for a specific system. Moreover, the column DT indicates that the time delay will fundamentally become larger as the length increases.

Conclusion

In the present study, a hybrid data-driven FD strategy is proposed. The proposed strategy is based on multi-sequence residual analysis and OC-SVM, which is applied to navigation sensors. First, the basic data-driven fault diagnosis methods and their recent developments are reviewed. Then, the HS framework is presented and the FD is formulated as an outlier-detection problem. The SIM and modified PCA are adopted to construct the learning features in which the fault errors over a period of time can be accumulated. Moreover, OC-SVM is applied for implementing outlier detection by training the learning features.

Furthermore, the proposed strategy is validated on the simulated integrated navigation system. The training dataset is obtained on free-faults conditions and four typical faults are added to the simulation system. The experimental results show that both SIM + OC-SVM and MPCA + OC-SVM methods can detect the abrupt and soft faults with high accuracy in real time. The HS can improve the FD rate by paying a small false alarm cost. Furthermore, the selection of multi-sequence residuals length in SIM + OC-SVM and MPCA + OC-SVM methods is discussed. Compared with previous studies, the data-driven FD strategy is more efficient and accurate. In the near future, it is intended to validate this method on real navigation sensors and integrate it with other FD approaches to improve reliability and stability.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported in part by the National Natural Science Foundation of China (61501493).

ORCID iD

Huahui Yang

References

Yin

Ding

Xie

, et al. A review on basic data-driven approaches for industrial process monitoring. IEEE T Ind Electron 2014; 61(11): 6418–6428.

Ding

Yang

, et al. A fault detection approach for nonlinear systems based on data-driven realizations of fuzzy kernel representations. IEEE T Fuzzy Syst 2018; 26(4): 1800–1812.

Wang

Liu

Zhang

. Neural network-based model-free adaptive fault-tolerant control for discrete-time nonlinear systems with sensor fault. IEEE T Syst Man Cybern Syst 2017; 47(8): 2351–2362.

Najafabadi

Salmasi

Jabehdar-Maralani

. Detection and isolation of speed-, DC-link voltage-, and current-sensor faults based on an adaptive observer in induction-motor drives. IEEE T Ind Electron 2011; 58(5): 1662–1672.

Hilton

, et al. Adaptive sliding mode control for nonlinear active suspension vehicle systems using T-S fuzzy approach. IEEE T Ind Electron 2013; 60(8): 3328–3338.

Liu

, et al. Analysis of Kalman filter innovation-based GNSS spoofing detection method for INS/GNSS integrated navigation system. IEEE Sensors J 2019; 19(13): 5167–5178.

Dai

Gao

. From model, signal to knowledge: a data-driven perspective of fault detection and diagnosis. IEEE T Ind Inf 2013; 9(4): 2226–2238.

Cui

. A robust graph optimization realization of tightly coupled GNSS/INS integrated navigation system for urban vehicles. Tsinghua Sci Technol 2018; 23(6): 724–732.

Yang

. An enhanced MEMS-INS/GNSS integrated system with fault detection and exclusion capability for land vehicle navigation in urban areas. GPS Solut 2014; 18(4): 593–603.

10.

Lai

Liu

, et al. Homologous fault monitoring technology of redundant INS in airborne avionics systems. J Syst Eng Electron 2013; 24(6): 1038–1044.

11.

Park

Jeong

Kim

, et al. Magnetic compass fault detection method for GPS/INS/magnetic compass integrated navigation systems. Int J Contr Automat Syst 2011; 9(2): 276–284.

12.

Song

Zhang

Chan

. Fuzzy fault tree analysis based on T-S model with application to INS/GPS navigation system. Soft Comput 2009; 13(1): 31–40.

13.

Monteriù

Asthana

Valavanis

, et al. Real-time model-based fault detection and isolation for UGVs. J Intell Robot Syst 2009; 56: 425–439.

14.

Jaradat

Abdel-Hafez

Saadeddin

, et al. Intelligent fault detection and fusion for INS/GPS navigation system. In: Proceedings of the 9th international symposium on mechatronics and its applications, Amman, Jordan, 9–11 April 2013. New York: IEEE.

15.

Zhong

Guo

, et al. An extended Hi/H optimization approach to fault detection of INS/GPS-integrated system. IEEE Trans Instru Meas 2016; 65(11): 2495–2504.

16.

Guo

Tian

, et al. Intelligent active fault-tolerant system for multi-source integrated navigation system based on deep neural network. Neur Comput Appl. Epub ahead of print 5 January 2019. DOI: 10.1007/s00521-018-03975-z.

17.

Zhao

Wang

Zhang

, et al. Real-time fault detection method based on belief rule base for aircraft navigation system. Chin J Aeronaut 2013; 26(3): 717–729.

18.

Lian

. Fault detection for multi-source integrated navigation system using fully convolutional neural network. IET Radar Sonar Naviga 2018; 12(7): 774–782.

19.

Serdio

Lughofer

Pichler

, et al. Residual-based fault detection using soft computing techniques for condition monitoring at rolling mills. Inf Sci 2014; 259: 304–320.

20.

Jesussek

Ellermann

. Fault detection and isolation for a nonlinear railway vehicle suspension with a hybrid extended Kalman filter. Veh Syst Dyn 2013; 51(10): 1489–1501.

21.

Liu

Wang

, et al. Sensor fault diagnosis of GPS/INS tightly coupled navigation system based on state chi-square test and improved simplified fuzzy ARTMAP neural network. In: Proceedings of the 2017 IEEE international conference on robotics and biomimetics, Macau SAR, China, 5–8 December 2017, pp. 2527–2532. New York: IEEE.

22.

Sun

Cheng

Wang

, et al. A novel online data-driven algorithm for detecting UAV navigation sensor faults. Sensors 2017; 17(10): 22431–22412.

23.

Wang

Liu

, et al. Least-squares fault detection and diagnosis for networked sensing systems using a direct state estimation approach. IEEE T Ind Inf 2013; 9(3): 1670–1679.

24.

Baek

Kim

. Empirical sensitivity analysis of discretization parameters for fault pattern extraction from multivariate time series data. IEEE T Cybern 2017; 47(5): 1198–1209.

25.

Vong

Wong

. A new framework of simultaneous fault diagnosis using pairwise probabilistic multi-label classification for time-dependent patterns. IEEE T Ind Electron 2013; 60(8): 3372–3385.

26.

Ding

Zhang

Ding

, et al. On the application of PCA technique to fault diagnosis. Tsinghua Sci Technol 2010; 15(2): 138–144.

27.

Wong

Jang

, et al. Fault detection based on statistical multivariate analysis and microarray visualization. IEEE T Ind Inf 2010; 6(1): 18–24.

28.

Deng

Tian

Chen

, et al. Nonlinear process fault diagnosis based on serial principal component analysis. IEEE T Neural Netw Learn Syst 2016; 29(3): 560–572.

29.

Peng

Zhang

, et al. New kernel independent and principal components analysis-based process monitoring approach with application to hot strip mill process. IET Contr Theor Appl 2014; 8(16): 1723–1731.

30.

Fontes

Pereira

. Pattern recognition in multivariate time series: a case study applied to fault detection in a gas turbine. Eng Appl Artif Intel 2016; 49: 10–18.

31.

Chen

Tino

Rodan

, et al. Learning in the model space for cognitive fault diagnosis. IEEE T Neural Netw 2014; 25(1): 124–136.

32.

Erfani

Rajasegarar

Karunasekera

, et al. High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning. Patt Recogn 2016; 58: 121–134.

33.

Muñoz-Marí

Bovolo

Gómez-Chova

, et al. Semisupervised one-class support vector machines for classification of remote sensing data. IEEE T Geosci Remote Sensing 2010; 48(8): 3188–3197.

34.

Tax

DMJ

Duin

RPW

. Support vector domain description. Patt Recogn Lett 1999; 20(11–13): 1191–1199.

35.

Xiao

Wang

Zhang

, et al. Two methods of selecting Gaussian kernel parameters for one-class SVM and their application to fault detection. Knowl-Based Syst 2014; 59: 75–84.

36.

Mahadevan

Shah

. Fault detection and diagnosis in process data using one-class support vector machines. J Process Contr 2009; 19(10): 1627–2693.

37.

Das

Matthews

Srivastava

, et al. Multiple kernel learning for heterogeneous anomaly detection: algorithm and aviation safety case study. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, Washington, DC, 25–28 July 2010. New York: ACM.

38.

Yin

Ding

Haghani

, et al. A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark Tennessee Eastman process. J Process Contr 2012; 22(9): 1567–1581.

A hybrid data-driven fault detection strategy with application to navigation sensors

Abstract

Keywords

Introduction

Framework of hybrid data-driven FD

Residual characterization

Residual characterization based on SIM

A modified PCA for residual characterization

Theoretical analysis for residual characterization

FD based on OC-SVM

OC-SVM

Abnormal detection algorithm

Simulation experiments and results

Experiments setting and FD results

Comparison study with HS

Study on residual sequence length

Conclusion

Footnotes

Declaration of conflicting interests

Funding

ORCID iD

References