A rolling bearing fault diagnosis method based on LSSVM

Abstract

The fault characteristic signals of rolling bearings are coupled with each other, thus increasing the difficulty in identifying the fault characteristics. In this study, a fault diagnosis method of rolling bearing based on least squares support vector machine is proposed. First, least squares support vector machine model is trained with the samples of known classes. Least squares support vector machine algorithm involves the selection of a kernel function. The complexity of samples in high-dimensional space can be adjusted through changing the parameters of kernel function, thus affecting the search for the optimal function as well as final classification results. Particle swarm optimization and 10-fold cross-validation method are adopted to optimize the parameters in the training model. Then, with the optimized parameters, the classification model is updated. Finally, with the feature vector of the test samples as the input of least squares support vector machine, the pattern recognition of the testing samples is performed to achieve the purpose of fault diagnosis. The actual bearing fault data are analyzed with the diagnosis method. This method allows the accurate classification results and fast diagnosis and can be applied in the diagnosis of compound faults of rolling bearing.

Keywords

Rolling bearing particle swarm optimization and 10-fold cross-validation particle swarm optimization and 10-fold cross-validation method pattern recognition fault diagnosis

Introduction

A rolling bearing is easily damaged and widely used in rotating machinery. According to the statistical data, 30% of rotating machinery failures are caused by bearing failures. Therefore, the fault diagnosis of rolling bearings is important.¹

The essence of fault diagnosis is the pattern recognition of fault features, whereas the key to pattern recognition is to design a reasonable classifier. In recent years, intelligent pattern recognition methods have been developed, such as expert system, neural network, support vector machine (SVM), and least squares support vector machine (LSSVM).

An expert system is the earliest intelligent system. Knowledge and experiences of experts in a certain class are saved into a computer to establish a database, which enables the computer to simulate the thinking and judgment processes of experts, and then deal with some complex problems.^2,3 However, an expert system still has some defects. The expert system has no self-learning ability and is highly dependent on experts. It is necessary to acquire and update expert knowledge. The level of experts largely determines the level of system fault diagnosis, thus seriously affecting the efficiency of the expert system.

Neural network refers to artificial neural network, which completes the logical expression between input and output by imitating the structure of animal neurons and has a strong nonlinear mapping ability. It is widely used in pattern recognition, fault diagnosis, automatic control, and other fields.^4,5 However, the defects of neural network, such as long training time, poor generalization ability, and slow convergence speed, are not conducive to the pattern recognition of small samples.

SVM was first introduced in 1995 by Cortes and Vapnik⁶ to deal with the problem of dichotomy and showed unique advantages in pattern recognition of nonlinear small samples.^7,8 Then its application scope was extended to machine learning. However, it takes time to solve the quadratic programming problem of SVM, which leads to a long time for the fault diagnosis with SVM model. Widodo and Yang⁹ used the wavelet support vector machine to analyze the current signals of many faults for an induction motor and obtained a high fault identification rate. Zhang et al.¹⁰ used an SVM to obtain an ideal diagnostic effect of cutting tool faults. However, when the noise in the signal was stronger than the actual fault signal, the characteristic parameters of the vibration signal for the state monitoring and fault diagnosis were not clear and it is difficult to express the complicated mapping relationship between the health status of equipment and the measured signal with traditional SVM. Moreover, a large amount of data collected by the collection system for mechanical system fault diagnosis led to the problem of high dimensionality and it is difficult to obtain a high accuracy rate for fault diagnosis.^11–14

Based on SVM, Suykens and Vandewalle¹⁵ proposed an extension of Standard SVM, LSSVM algorithm. Inequality constraints in SVM are changed into equality constraints in LSSVM algorithm, thus greatly facilitating the solution of Lagrange multiplier. It changes the quadratic programming problem into the problem of solving linear equations, thus reducing the computational complexity and accelerating the solution process. S Zhang¹⁶ used LSSVM as the classifier and combined the detection coils with LSSVM for bearing fault detection. The detection coils provided signals for LSSVM and LSSVM reported fault status through statistical learning algorithm. Wang et al.¹⁷ explored the nonlinear and non-stationary properties of vibration signals of bearings and identified the faults with particle swarm optimization least square support vector machine (PSO-LSSVM) algorithm. Li et al.¹⁸ used the LSSVM to recognize different fault types of wheel bearings. PSO algorithm is a kind of evolutionary algorithm and similar to the simulated annealing algorithm. It starts from a stochastic solution, evaluates the solution by fitness, and finds the optimal solution by iteration. The rule of PSO is simpler since it has no crossover or mutation operation. PSO searches for the global optimum based on the current optimal value. PSO has been widely concerned due to its significant advantages of easy implementation, high accuracy fast convergence, and its high performance in solving practical problems.

In this article, the particle swarm optimization and 10-fold cross-validation (PSO-10-fold CV) and the LSSVM are combined together to identify various failure modes of rolling bearings. First, with LSSVM as the classifier of bearing fault signals, the parameters of the classifier are optimized by combining the PSO with 10-fold cross-validation method. Then, the performance of the method in multi-classification is analyzed. Finally, the LSSVM multi-classifier is used to identify the failure modes of a rolling bearing.

Principle of LSSVM

SVM is a good method to minimize a structural risk and also the core of statistical learning theory. It can get a better generalization ability from the limited sample information and deal with nonlinear and small samples well. SVM is widely applied in the fields of fault diagnosis and regression estimation.¹⁹

SVM is a process of searching for the optimal hyperplane in a linearly separable state. For a binary classification problem, $n$ sample training sets are given as: $D = {(x_{i}, y_{i}) | i = 1, 2, \dots, n}$ , $x \in R^{n}$ , $y_{i} \in [- 1, 1]$ . A hyperplane is obtained as $H : ω \cdot x + b = 0$ . It can correctly classify the sample points in the training set. The distance between the nearest vector and the hyperplane is the maximum value and the hyperplane is the optimal hyperplane, as shown in Figure 1.

Figure 1.

SVM optimal hyperplane schematic diagram.

However, in practice, most of the sample sets are linearly inseparable, thus generating the problem of nonlinear sample classification. In order to solve this kind of problem, the sample set is mapped to a high-dimensional space by nonlinear mapping, so that the problem becomes a linearly separable classification problem in high-dimensional space. Then the problem is solved through searching for the optimal hyperplane. Finally, the optimal hyperplane in the high-dimensional space is mapped back to the original space, so that it becomes the optimal hyperplane in the nonlinear problem. The process of nonlinear classification is shown in Figure 2.

Figure 2.

Nonlinear classification process.

According to the strategy of SVM, linearly inseparable sample points are mapped to high-dimensional space to obtain linearly separable sample points. Then the problem is equivalent to the problem of finding the optimal hyperplane in high-dimensional space. Therefore, the introduction of an appropriate nonlinear mapping $Φ$ is the key of solving the problem. After introducing the nonlinear mapping $Φ$ , the original sample set is mapped to the high-dimensional space $x_{i} \to Φ (x_{i})$ .

Then the optimal hyperplane in the original space is $(ω^{*} \cdot x) + b^{*} = 0 \to (ω^{*} \cdot Φ (x)) + b^{*} = 0$ . The classification constraint conditions in the feature space are also converted into

y_{i} [(ω \cdot Φ (x_{i})) + b] - 1 + ξ_{i} \geq 0

(1)

The objective function is

min_{ω, b} \frac{1}{2} ω^{T} ω + C \sum_{i = 1}^{N} ξ_{i}

(2)

According to equation (2), the Lagrangian function is constructed as

\begin{matrix} L (ω, b, ξ, α, β) = \frac{1}{2} ω^{T} ω + C \sum_{i = 1}^{n} ξ_{i} \\ - \sum_{i = 1}^{n} α_{i} [y_{i} ((ω \cdot Φ (x_{i}) + b) + ξ_{i} - 1)] - \sum_{i = 1}^{n} β_{i} ξ_{i} \end{matrix}

(3)

Take partial derivatives respectively with respect to $ω$ , $b$ , and $ξ_{i}$ and set the partial derivative to 0

{\begin{matrix} \frac{\partial L}{\partial ω} = 0 \Rightarrow ω = \sum_{i = 1}^{n} α_{i} y_{i} Φ (x_{i}) \\ \frac{\partial L}{\partial b} = 0 \Rightarrow \sum_{i = 1}^{n} α_{i} y_{i} = 0 \\ \frac{\partial L}{\partial ξ_{i}} = 0 \Rightarrow C - α_{i} - β_{i} = 0 \end{matrix}

(4)

Substituting equation (4) into equation (3) gives

L (ω, b, ξ, α, β) = \sum_{i = 1}^{n} α_{i} - \frac{1}{2} \sum_{i = 1}^{n} \sum_{j = 1}^{n} α_{i} α_{j} y_{i} y_{j} [Φ (x_{i}) \cdot Φ (x_{j})]

(5)

According to the Karush–Kuhn–Tucker (KKT) condition, the optimal solution should meet the following conditions

{\begin{matrix} α_{i} [y_{i} ((ω \cdot Φ (x) + b) - 1 + ξ_{i})] = 0 \\ β_{i} ξ_{i} = 0 \Rightarrow (C - α_{i}) ξ_{i} = 0 \end{matrix}

(6)

The kernel function $K (x_{i}, x_{j})$ is used to replace $Φ (x_{i}) \cdot Φ (x_{j})$ in equation (5)

\begin{matrix} max \sum_{i = 1}^{n} α_{i} - \frac{1}{2} \sum_{i = 1}^{n} \sum_{j = 1}^{n} α_{i} α_{j} y_{i} y_{j} K (x_{i}, x_{j}) \\ s . t . {\begin{matrix} \sum_{i = 1}^{n} y_{i} α_{i} \\ 0 \leq α_{i} \leq C i = 1, 2, \dots, n \end{matrix} \end{matrix}

(7)

The final optimal classification function is

\begin{matrix} f (x) = sgn [(ω \cdot Φ (x) + b)] \\ = sgn [\sum_{i = 1}^{n} α_{i} y_{i} K (x_{i}, x) + b] \end{matrix}

(8)

In order to solve multiple-scale problems and accelerate the calculation, LSSVM¹⁵ is used. The least squares linear system is set as the loss function to replacing the quadratic programming method adopted in traditional SVM and inequality constraints are replaced with equality constraints. Although the sparsity and robustness of LSSVM are slightly decreased, the algorithm is greatly simplified and the calculation cost is reduced. LSSVM shows high performance in fault diagnosis and pattern recognition.

In general, the nonlinear optimal classification function is set as $f (x) = ω \cdot Φ (x) + b$ and n sample training sets are given as $D = {(x_{k}, y_{k}) | k = 1, 2, \dots, n}$ , where $x_{k} \in R^{n}$ is the input of the function; $y_{k} \in R^{n}$ is the output of the function; $ω$ is the weight vector; $b$ is the deviation.

The optimization problem of LSSVM is expressed as

\begin{matrix} min J (ω, ξ) = \frac{1}{2} ω^{T} ω + C \sum_{k = 1}^{n} ξ_{k}^{2} \\ s . t . y_{k} [ω^{T} Φ (x_{k}) + b] - 1 + ξ_{k} = 0 \end{matrix}

(9)

where $ξ_{k}$ is the error variable and $C$ is the penalty factor. Then, the Lagrange function is obtained as

L (ω, b, ξ, α) = J (ω, ξ) - \sum_{k = 1}^{n} α_{k} {ω \cdot Φ (x) + b + ξ_{k} - y_{k}}

(10)

Take partial derivative respectively with respect to $ω$ , $b$ , $ξ_{k}$ , and $α_{k}$ and set the partial derivative to 0

{\begin{matrix} \frac{\partial L}{\partial ω} = 0 \Rightarrow ω = \sum_{k = 1}^{n} α_{k} Φ (x_{k}) \\ \frac{\partial L}{\partial b} = 0 \Rightarrow \sum_{k = 1}^{n} α_{k} = 0 \\ \frac{\partial L}{\partial ξ_{k}} = 0 \Rightarrow α_{k} = C ξ_{k} \\ \frac{\partial L}{\partial α_{k}} = 0 \Rightarrow ω \cdot Φ (x) + b + ξ_{k} + y_{k} = 0 \end{matrix}

(11)

After eliminating the variables $ω$ and $ξ$ , the following matrix equation can be obtained

[\begin{matrix} 0 & {I_{V}}^{T} \\ I_{V} & Ω + C^{- 1} I \end{matrix}] [\begin{matrix} b \\ α \end{matrix}] = [\begin{matrix} 0 \\ y \end{matrix}]

(12)

where $I_{v} = [1; \dots; 1]$ ; $y = [y_{1}; \dots; y_{n}]$ ; $α = [α_{1}; \dots; α_{n}]$ ; $Ω_{kl} = Φ (x_{k})^{T} Φ (x_{l})$ , $k, l = 1, 2, \dots n$

According to Mercer condition, there are mapping functions $Φ (x)$ and kernel functions $K (x, x_{k})$

K (x_{k}, (x_{l})) = Φ (x_{k})^{T} Φ (x_{l})

(13)

Substituting equation (13) into the nonlinear optimal classification function $f (x) = ω \cdot Φ (x) + b$ , the optimal classification function is obtained as

f (x) = \sum_{k = 1}^{n} α_{k} y_{i} K (x_{k}, x) + b = 0

(14)

Kernel function selection and parameter optimization

Selection of Kernel functions

According to the LSSVM algorithm, the kernel function $K (x_{i}, x_{j})$ corresponds to the mapping function $Φ (x_{i})$ . After an appropriate kernel function is selected, the mapping function is also established to determine the high-dimensional feature space.

The selection of kernel function has great flexibility. The complexity of samples in high-dimensional space can be adjusted through changing the parameters of kernel function, thus affecting the search of the optimal function and the final classification effect. The commonly used kernel functions are introduced as follows:

1. Linear kernel function

K (x_{i}, x_{j}) = x_{i} \cdot x_{j}

2. Sigmoid kernel function

K (x_{i}, x_{j}) = \tan (λ (x_{i} \cdot x_{j}) + b)

3. Polynomial kernel function

K (x_{i}, x_{j}) = (x \cdot x_{i} + c)^{l}, l = 1, 2, \dots

4. Radial basis function (RBF)

K (x_{i}, x_{j}) = \exp [- \frac{| | x - x_{i} | |^{2}}{2 σ^{2}}]

The characteristics of LSSVM vary with the selected kernel function. Both Sigmoid kernel functions and polynomial kernel functions have certain limitations in parameter selection. Inappropriate parameters may increase the calculation load and sometimes lead to improper results. Linear kernel function is a special form of RBF kernel function, which has a good performance and the strong learning ability, and has been widely used in pattern recognition and fault diagnosis. RBF kernel function has universal features. By adjusting the appropriate parameter $σ$ , RBF kernel function can make any sample linearly separable in higher dimensions and has good generalization ability and learning ability. Based on the above analysis, RBF is selected as the kernel function in the study.

Parameter optimization

LSSVM classifier based on RBF kernel function has two main structural parameters: width $σ$ in RBF kernel function and penalty factor $C$ in equation (9). If $σ$ is selected improperly, the phenomenon of “over-learning” or “under-learning” occurs. As an important parameter of the fitting error and prediction error of LSSVM classifier, penalty factor $C$ can be used to adjust the punishment degree of misclassification. The range of $C$ is $0 ~ + \infty$ . When $C \to 0$ , the classification hyperplane has the maximum generalization and the minimum classification error rate, which allows the minimum fitting accuracy of samples and affects the prediction error. When $C \to + \infty$ , the number of misclassified samples of LSSVM is 0 and the complexity and training time gradually increase, but the generalization may be ignored. Therefore, the compromised parameters should be selected to obtain the optimal classification effect.

In order to further improve the performance of LSSVM classifier, PSO-10-fold CV methods are adopted to improve the selection of parameters. PSO^20,21 is characterized by a simple procedure, fast convergence, and high global optimization and constraint processing ability. Its steps are provided as follows:

The given sample set is divided into 10 independent subsets of the same size.

Two parameters in the kernel function that need to be optimized, $(C, σ)$ and the parameters in the particle swarm optimization algorithm, are initialized, including the speed and position of each particle.

The precision of the classifier of each particle is obtained with current parameters by the 10-fold cross-validation algorithm and the fitness value of each particle is taken as the fitness value of the current particle. One subset is taken as the verification set, whereas the other subset is taken as the training set. Then, the accuracy of the LSSVM classifier is calculated.

The fitness value of each particle is compared with the fitness value of the particle at the current optimal position. If the fitness value is better than the current optimal value, the current optimal position is updated.

The fitness value of each particle is compared with the fitness value of the particle at the global optimal position. If the fitness value is better than the global optimal value, the global optimal position is updated.

The position and speed of particles are updated. The accurate classification rate of more than 90% is set as the termination conditions. When the termination conditions are not satisfied, go to Step 3. When the termination conditions are satisfied, the process ends and the optimal parameter $(C, σ)$ is output.

The flow chart of LSSVM parameter optimization is shown in Figure 3.

Figure 3.

Parameter optimization process of LSSVM.

Multi-classification problem analysis

It can be seen from the above basic theories that both SVM and LSSVM are binary classification models, which use the optimal hyperplane to divide original samples into two classes. However, in practice, many problems are not limited to dichotomy, so it is necessary to extend SVM and LSSVM to solve multi-classification problems. The most intuitive idea is to use multiple classifiers for multiple classifications. Common methods include one-versus-one classification algorithm, one-versus-rest classification algorithm, and one-versus-others classification algorithms.

The main idea of the one-versus-one classification algorithm is to establish $m (m - 1) / 2$ binary classifiers in $m$ classes of samples for classifying the testing samples, as shown in Figure 4. Then, the decision is made through “voting.” For a sample with an unknown class, the class with the most votes is identified as the class of the sample.

Figure 4.

One-versus-one classification method.

The main idea of the one-versus-rest classification algorithm is to establish m classifiers in the m classes of samples and set them in the ith classifier. The ith class of samples is positive and the rest are negative. Each classifier can separate a class of samples. The classification method is shown in Figure 5.

Figure 5.

One-versus-rest classification method.

The main idea of the one-versus-others classification algorithm is to establish $m - 1$ classifiers in the m classes of samples and set them in the ith classifier. The ith class of samples is positive, whereas the rest of the $i + 1$ class of samples are negative. After a class of samples is isolated, they will be removed from the original samples. Then, other samples are classified, as shown in Figure 6.

Figure 6.

One-versus-others classification method.

Multiple classification of LSSVM can be realized through the above three methods. However, according to the one-versus-one classification method, two classes may have the same number of votes for a sample in the voting, indicating that the sample will be inseparable. For one-versus-rest classification method, overlapping regions or unknown regions often occur. Since the classified samples are removed from original samples in one-versus-others classification method, the above phenomena do not occur. Therefore, one-versus-others classification method is adopted to classify bearing faults and the classification order is from single fault to composite fault.

Experimental verification and analysis

Previous studies focused on a single rolling bearing fault in the field of pattern recognition. In this article, a rolling bearing fault diagnosis method based on LSSVM classifier is studied. For single faults and compound faults, it has more accurate classification and requires less time. The following experiments showed that this method was applicable for rolling bearing fault diagnosis.

Case 1: Case Western Reserve University Bearing Data

In order to verify the classification effect of LSSVM, the bearing failure data from Case Western Reserve University Bearing Data Center were selected for the verification experiment. The data center only provided the fault data of a single bearing, so the fault data of composite bearings were obtained through the principle of vector superposition simulation.

When SVM or LSSVM is used as a classifier for pattern recognition, a training model should be established first. Therefore, the data set should be divided into training sample subset and testing sample subset. According to the above analysis, the classifier used RBF as the kernel function. Sampling points of time-domain signals could be directly inputted into the classifier as fault eigenvectors, so 1000 sampling points were selected for each set of eigenvectors and 50 sets of data were collected for each fault, including 15 sets of data as training samples and 35 sets of data as testing samples. The two parameters $σ$ and $C$ were first set as $σ = 0.1$ and $C = 2$ . The feature vectors were input into the SVM and LSSVM classifiers and then the classification results were compared. Classes 1–6 corresponded to the outer fault, inner fault, ball fault, outer-inner fault, outer-ball fault, and inner-ball fault, respectively. Fault classification results are shown in Figures 7 and 8. The comparison of fault diagnosis results is provided in Table 1.

Figure 7.

Fault classification of rolling bearing based on SVM.

Figure 8.

Fault classification of rolling bearing based on LSSVM.

Table 1.

Comparison of fault diagnosis results.

Methods	Accuracy (%)	Time (s)
SVM	56.67	5.6229
LSSVM	68.57	2.5102

SVM: support vector machine; LSSVM: least squares support vector machine.

Compared with SVM, LSSVM had 11.9% higher fault diagnosis accuracy and reduced diagnosis time by 3.1127 (Figures 7 and 8 and Table 1), indicating that the modeling speed and accuracy of LSSVM were significantly higher than those of SVM. LSSVM had a better classification effect than SVM.

In order to improve the classification accuracy, the PSO-10-fold CV method was adopted to optimize the two parameters of the classifier and the optimal parameters were obtained as $σ = 0.04$ and $C = 6.45$ . The LSSVM fault diagnosis results are shown in Figure 9. The fault diagnosis accuracy rate was 70.48%. After selecting the optimal parameters, the fault diagnosis accuracy of LSSVM classifier was improved by 1.91%.

Figure 9.

LSSVM classification with optimized parameters.

Case 2: data of MFS mechanical fault simulation bench

In order to further verify the classification method in this article, the MFS mechanical fault comprehensive simulation test bench produced by SQI Company in the United States was used in the experiment. The structure and device of the test bench are shown in Figure 10. The test bench can simulate the common mechanical equipment failure and be used to study fault diagnosis.

Figure 10.

Mechanical fault simulation experiment device.

By changing fault bearings, MFS mechanical fault comprehensive simulation test bench can simulate outer fault, inner fault, ball fault, outer-inner fault, outer-ball fault, and inner-ball fault of rolling bearings to obtain the actual bearing fault vibration signals. The six types of fault bearings are shown in Figure 11(a)–(f). The acceleration sensor was installed on the bearing seat to measure the vibration signal of the bearing. The bearing model is KR-12K and the inner diameter is 19.05 mm. The outer diameter is 46.99 mm and the number of steel balls is 8. The ball diameter is 7.87 mm and the contact angle is 0.

Figure 11.

Six types of fault bearing: (a) outer fault bearing, (b) inner fault bearing, (c) ball fault bearing, (d) outer–inner fault bearing, (e) outer-ball fault bearing, and (f) inner-ball fault bearing.

When the rolling bearing vibration signals were collected on the MFS mechanical fault comprehensive simulation laboratory bench, the sampling frequency was 2.56 kHz and the rotation frequency was 30 Hz. Normal bearing signals and six kinds of bearing failure signals were collected through the module conversion process.

LSSVM was used as a classifier for pattern recognition of the above six bearing fault signals. With RBF as the kernel function, 100 sets of vibration signals were selected for each fault. Each set of vibration signals involved 4000 sampling points. Twenty sets of signals were used as training samples, and the remaining 80 sets of signals were used as testing samples. Since there are six failure states and one normal state in the experiment, it was necessary to construct six LSSVM classifiers, which respectively correspond to outer fault, inner fault, ball fault, inner ball bearing fault, outer ball bearing fault, and outer inner fault, and the rest signals were normal signals. The two parameters were set as $σ = 0.1$ and $C = 2$ . The input eigenvectors of the classifier included multiscale sample entropy (MSE), energy, and multiscale sample entropy energy (MSEE). The output fault diagnosis results were compared. The classification results are shown in Figure 12 and the bearing fault diagnosis rate is shown in Table 2.

Figure 12.

Fault classification results based on three feature vectors: (a) fault classification diagram of MSE, (b) fault classification diagram of energy, and (c) fault classification diagram of MSEE.

Table 2.

Fault diagnosis rate of LSSVM.

Feature vectors	MSE	Energy	MSEE
False rate (%)	4.5833	5.2083	1.4583
Missing rate (%)	7.7084	10.0417	3.500
Accuracy (%)	87.7083	84.7500	95.0417

LSSVM: least squares support vector machine; MSE: multiscale sample entropy; MSEE: multiscale sample entropy energy.

$σ$ and $C$ corresponding to the highest accuracy could be found through the three-dimensional diagram of $σ$ , $C$ , and accuracy ( $σ = 1$ and $C = 16$ ). The process of parameter optimization is shown in Figure 13. Under the parameters ( $σ = 1$ and $C = 16$ ), the fault diagnosis accuracy of LSSVM was 96.4583%, whereas the false rate was 1.0417%, and the missing rate was 2.500%. The fault classification results are shown in Figure 14. Compared with the accuracy obtained with given parameters, the accuracy obtained with the optimized parameters was improved by 1.4166%, whereas the false rate and the missing rate were decreased by 0.4166% and 1%, respectively. The accuracy obtained with the optimized parameters met the demands of practical engineering, thus verifying the method in this article.

Figure 13.

Parameter optimization process.

Figure 14.

LSSVM classification obtained with optimized parameters.

Conclusion

In this article, a fault diagnosis method based on LSSVM is proposed to solve the problem that the fault characteristics of rolling bearing are coupled with each other and the difficulty in identifying fault characteristics of single faults and composite faults. Before establishing the training model, the algorithm combined with PSO-10-fold CV was used to optimize the parameters, so that the model has a better classification effect. Through the processing and comparative analysis of the actual bearing fault vibration signals, the experimental results showed that the fault diagnosis classifier based on LSSVM had a more accurate classification function, shorter classification time, and higher diagnosis performance of rolling bearing faults.

In this article, a supervised fault diagnosis method and the sample data of known classes were adopted. However, the classification of samples of unknown classes still needs to be further studied.

Footnotes

Handling Editor: Andreas Rosenkranz

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: National Natural Science Foundation of China (grant nos. 61803005, 61640312, 61763037) Beijing Natural Science Foundation (grant nos. 4192011, 4172007) Shandong Province Key Research and Development Program Project (grant no. 2018CXGC0608) and Education Commission of Beijing Funding.

ORCID iD

Hongfei Wei

References

Zheng

Liu

, et al. Sound-aided vibration weak signal enhancement for bearing fault detection by using adaptive stochastic resonance. J Sound Vib 2019; 449: 18–29.

Ramesh

Shum

Davis

JF.

A structured framework for efficient problem solving in diagnostic expert systems. Comput Chem Eng 1988; 12: 891–902.

Wang

Bai

MR.

Development of an expert system for fault diagnosis in scooter engine platform using fuzzy-logic inference. Expert Syst Appl 2007; 33: 1063–1075.

Wang

Song

, et al. An enhanced intelligent diagnosis method based on multi-sensor image fusion via improved deep learning network. IEEE T Instrum Meas. Epub ahead of print 12 July 2019. DOI: 10.1109/TIM.2019.2928346.

Wang

Ren

Song

, et al. A novel weighted sparse representation classification strategy based on dictionary learning for rotating machinery. IEEE T Instrum Meas. Epub ahead of print 15 July 2019. DOI: 10.1109/TIM.2019.2906334.

Cortes

Vapnik

Support-vector networks. Mach Learn 1995; 20: 273–297.

Sun

Chen

Analog circuits fault diagnosis using support vector machine. In: Proceedings of the 2007 international conference on communications, circuits and systems, Kokura, Japan, 11–13 July 2007, pp. 1003–1006. New York: IEEE.

Wang

Zhu

, et al. Design of online monitoring and fault diagnosis system for belt conveyors based on wavelet packet decomposition and support vector machine. Adv Mech Eng 2013; 5: 1–10.

Widodo

Yang

B-S.

Wavelet support vector machine for induction machine fault diagnosis based on transient current signal. Expert Syst Appl 2008; 35: 307–316.

10.

Zhang

Zhou

Guo

, et al. Vibrant fault diagnosis for hydroelectric generator units with a new combination of rough sets and support vector machine. Expert Syst Appl 2012; 39: 2621–2628.

11.

Wang

Song

, et al. A novel feature enhancement method based on improved constraint model of online dictionary learning. IEEE Access 2019; 7: 17599–17607.

12.

Hao

Song

Ren

, et al. Step-by-step compound faults diagnosis method for equipment based on majorization-minimization and constraint SCA. IEEE-ASME T Mech. Epub ahead of print 5 November 2019. DOI: 10.1109/TMECH.2019.2951589.

13.

Cui

Jin

Huang

, et al. Fault severity classification and size estimation for ball bearings based on vibration mechanism. IEEE Access 2019; 7: 56107–56116.

14.

Cui

Wang

, et al. Research on remaining useful life prediction of rolling element bearings based on time-varying Kalman filter. IEEE T Instrum Meas. Epub ahead of print 8 July 2019. DOI: 10.1109/TIM.2019.2924509.

15.

Suykens

JAK

Vandewalle

. Least squares support vector machine classifiers. Neural Process Lett 1999; 9: 293–300.

16.

Zhang

. Bearing fault detection of induction motors using detection coils and LSSVMA. In: Proceedings of the 2017 IEEE 2nd information technology, networking, electronic and automation control conference (ITNEC 2017), Chengdu, China, 15–17 December 2017. New York: IEEE.

17.

Wang

Jia

. A study of fault diagnosis method for the train axle box based on EMD and PSO-LSSVM. In: Proceedings of the 2013 3rd international conference on instrumentation, measurement, computer, communication and control, Shenyang, China, 21–23 September 2013. New York: IEEE.

18.

Yang

, et al. A fault diagnosis scheme for planetary gearboxes using modified multi-scale symbolic dynamic entropy and mRMR feature selection. Mech Syst Signal Pr 2017; 91: 295–312.

19.

Basudhar

Missoum

Adaptive explicit decision functions for probabilistic design and optimization using support vector machines. Comput Struct 2008; 86: 1904–1917.

20.

Chen

An intelligent fault identification method of rolling bearings based on LSSVM optimized by improved PSO. Mech Syst Signal Pr 2013; 35: 167–175.

21.

Yan

. Fault diagnosis of rolling bearing based on WP reconstructed energy entropy and PSO-LSSVM. In: Proceedings of the 2019 prognostics and system health management conference (PHM), Paris, 2–5 May 2019. New York: IEEE.