Abstract
To effectively extract the typical features of the bearing, a new method that related the local mean decomposition Shannon entropy and improved kernel principal component analysis model was proposed. First, the features are extracted by time–frequency domain method, local mean decomposition, and using the Shannon entropy to process the original separated product functions, so as to get the original features. However, the features been extracted still contain superfluous information; the nonlinear multi-features process technique, kernel principal component analysis, is introduced to fuse the characters. The kernel principal component analysis is improved by the weight factor. The extracted characteristic features were inputted in the Morlet wavelet kernel support vector machine to get the bearing running state classification model, bearing running state was thereby identified. Cases of test and actual were analyzed.
Keywords
Introduction
Bearing is commonly used in rotary machinery; the bearing failing will cause the machine to break down. So it is very important to recognize the fault features of the bearing, and then distinguish the running condition of the bearing.
In order to realize the bearing running state identification, there are two key issues. First, the typical features need to be extracted from the original signals, which need to take advantage of a variety of information and identify the bearing state precisely. Second, based on the features, we need a useful model to realize the running state recognition of bearing. Several methods such as time domain analysis, time–frequency domain analysis, and frequency domain analysis are used in feature extraction. 1 And the time–frequency domain analysis methods such as the wavelet and empirical mode decomposition (EMD) are widely reported. However, the wavelet is difficult to select the mother coefficient. The EMD 2 has the problems of the modal aliasing and endpoint leak. In this article, the local mean decomposition (LMD) is introduced to process the signals.3,4 In order to extract the features, the entropy is usually used. Traditional entropy is derived from the concept of probability and measures the discrimination of criteria. 5 De Luca 6 used the nonprobabilistic entropy and introduced some requirements to capture intuitive comprehension of the degree of fuzziness. Scmidt and Kacprzyk 7 researched the nonprobabilistic entropy. Following are the definitions of Shanon entropy, fuzzy entropy, and intuitionistic fuzzy entropy, where the Shannon entropy is approved as more powerful. 8 In this research, the Shannon entropy is used to extract the typical features.
After the feature is extracted by the LMD Shannon entropy, the features have the character of high dimension and the features cannot be processed by the artificial intelligence model precisely, because the information hidden in the feature are not mapped adequately and the artificial intelligence model is mainly for processing the low-dimensional features. So the feature dimension reduction method is needed to process the dimension of the features. There are many methods for processing the features, such as the principal component analysis (PCA), 9 the kernel principal component analysis (KPCA), 10 and the manifold-learning algorithm methods such as local tangent space alignment (LTSA). 11 But the PCA is mainly used for dealing with the linear data set, while the bearing vibration features are usually suppressed by the nonlinear characteristic features, so the PCA cannot work effectively. The KPCA and the LTSA can be used to extract the nonlinear characteristic features, and the LTSA has the problem of unsteadily and difficult to choose the neighborhood. In this research, the KPCA is selected and in addition the weight factor is added to the model to further improve the dimension reduction and feature selection effort.
Based on the typical feature extraction by the LMD Shannon entropy and the weighted KPCA method, the support vector machine (SVM) is served as a classifier. 12 But the SVM model is not useful to process the nonlinear feature, some researcher proposed the combination method of SVM and wavelet theories to achieve the classification, and get better performance than other leaning machine models; in this research, the new SVM model is constructed based on the Morlet kernel. 13
This article is organized as follows: In section “Feature extraction,” the LMD energy entropy concept is introduced and the LMD energy entropies are calculated. And the improved nonlinear embedding method weight KPCA is described. In section “The Morlet wavelet kernel SVM model,” the Morlet wavelet kernel SVM model is constructed. In section “Validation,” the proposed method is verified by bearing test data. Conclusion of the research is detailed in section “Conclusion.”
The proposed method flowchart is shown in Figure 1.

The proposed paper flowchart.
Feature extraction
In the LMD definition, any signal
From equation (1), it is clear that, following LMD, features from the vibration signal are better retained.
Once the k PF components and
where
After the LMD energy entropy is calculated, from one signal we can get one entropy value, and the bearing vibration signals are collected at different conditions and status, the resulting entropy will be in high dimensional, which will affect the classification effect of the SVM model, and thus the KPCA is used to reduce the feature dimension.
In KPCA, a set of multi-dimensional signals,
where
where
The eigendecomposition of
where
In this research, the Relief
16
method is used to determine the weight factor. In all the features set,
Setting the P represents the difference of the feature
Setting the Q represents the difference of the feature
where
The weight factor can be got from
The Morlet wavelet kernel SVM model
Based on Mercer’s Theorem,17,18 there are few number of wavelet kernel functions that meet the condition. In this research, the Morlet wavelet kernel as a wavelet kernel that meets the condition is given. The Morlet wavelet function can be defined as follows
The Morlet wavelet kernel function is defined as follows
In this research, the Morlet wavelet kernel function is worked as the support vector’s kernel function, and the SVM is defined as follows
From equations (11) and (12), the Morlet wavelet kernel SVM is constructed, and the new SVM will work as the model to separate the different bearing fault features.
Validation
Case 1
The Case Western Reserve University bearing fault signals is used to validate the proposed method. 19 The 2-hp reliance electric motor is used in the experiments. The SKF 6205-2RS JEM type bearing in the experiments is used. The test is done in order to simulate the bearing normal running state and fault running states, with fault depth of 0.18, 0.36, 0.53, and 0.71 mm at the outer raceway, inner raceway, and the ball to reflect the bearing deteriorating state. Electro-discharge machining is used to achieve the bearings faults seeded. In this case, the inner raceway fault signals were chosen. The vibration collection rate is set as 12,000/s, and 4096 data points were chosen for analyzing. 50 groups of bearing vibration data of each running states were chosen, with 30 groups for testing, the other 20 groups for training. Figure 2 shows the collected vibration signals at different states.

The collected vibration signals of normal state and inner-race four different fault depths. (a) normal; (b) the 0.18-mm depth of inner-race; (c) the 0.36-mm depth of inner-race; (d) the 0.53-mm depth of inner-race; and (e) the 0.71-mm depth of inner-race.
Based on the collected vibration signals, the LMD model is used to decompose the vibration signal, so as to get the PFs of all signals. The original signal is shown in Figure 3, and the 0.36-mm inner raceway fault PFs decomposed by LMD is shown in Figure 4.

The original signal of 0.36-mm inner-race fault.

The PFs decomposed by the LMD of the 0.36-mm inner-race signal.
The decomposed signals are 6PFs and 1; the result based on the LMD method is calculated based on the residue which is less than the set amplitude value. Based on the decomposed PFs, the Shannon entropy is used to calculate the value of every PFs, and the calculated Shannon entropy values are worked as the original features of the bearing running state indicator. However, those features still with high dimension and the features are so complex for the recognition model to achieve the recognition. The weighted KPCA is used to extract the typical features: normalized calculated entropy feature energy input to the KPCA, and reduction of mutual correlation between the features and characteristics of the fault further purification (Table 1).
The contribution rate of the Shannon entropy of the decomposed PFs.
PFs: product functions.
From a statistical viewpoint, the main contribution component corresponding to the training sample rate is greater than 75%. And through calculation, the first three eigenvalues account for 96.1% contribution rate of the total contribution rate; therefore, the first three eigenvalues corresponding component is chosen as a main input features. In order to compare the effort of the weighted KPCA on feature extraction, the PCA method, the KPCA method, and the weighted KPCA method are used to extract the features. The result are shown in Figures 5–7.

The feature extraction result based on PCA model.

The feature extraction result based on KPCA model.

The feature extraction result based on weighted KPCA model.
Thus, we have normalized characteristic energy input to the KPCA, reduction of mutual correlation between the features and characteristics of the fault further purification, and the main contribution component corresponding to the training sample rate.
From the results we can see that, the PCA method cannot deal with the features given in this case, because the features in this case are nonlinear character, the PCA is excellent in dealing with the linear data set, so the result of the PCA is poor. The KPCA can deal with the features better than the PCA model of nonlinear features; however, the results showing that for some features, the KPCA cannot separate them, while the weight KPCA can deal them well, so this proved the effect of the weight KPCA.
Based on the extracted feature, the Morlet wavelet kernel SVM is used identify the bearing running state. The five types of features are input into the new SVM (
The recognition rate of different methods.
RBF: radial basis function; KPCA: kernel principal component analysis; SVM: support vector machine.
From Table 2 we can see that, if the feature extraction based on the Shannon entropy directly been entered into the Morlet wavelet kernel SVM model, then the results are not satisfied. This is because the features are in high dimension and the main character is burned, so the recognition model cannot recognize the different types of features. The RBF SVM model also has some problem in recognizing the different characters, this is because for the local minimum, the RBF SVM is not excellent in dealing with the problem, while the Morlet wavelet kernel is good at finding the minor feature difference and it can solve the problem.
Case 2
In order to further validate the proposed method, the actual application is tested. The used rig is shown in Figure 8.

The test rig.
The rolling bearings driven by 0.55 KW AC motor are hosted on the shaft, the speed control, and AC inverter controller. The rotation speed is set as 1000 r/min. The brake, radial booster, using the magnetic clutch and brake, maximum torque is 5 N m, the radial load added to the bearing is set as 29.4 N. For every 2 h, the vibration data are collected. The data sampling rate is 25,600 Hz and 102,400 points are collected every time, as shown in Figure 9. The bearing is run for 12 months. For every 2 months, a set of data is chosen; 60 groups of collected data of different faults are obtained, with 30 groups for testing and the other 30 groups for training. In all, 4096 data points are selected to analyze. The decomposed method of LMD is used to process the collected signal, and the result is shown in Figure 10.

The collected vibration signal.

The decomposed vibration signal.
Then, the proposed method is used to achieve the bearing running state recognition, the original features are extracted by the LMD, and the decomposed PFs are shown in Figure 10.
Then, the features of the PFs are calculated by the Shannon entropy method, based on the calculated Shannon entropy values, the weight KPCA is used to deal with the features. A group of features of different fault conditions are obtained, as shown in Table 3 (not normalized beforehand).
A group of LMD energy entropy of different running states of the actual signal.
LMD: local mean decomposition.
Based on the features dealt by the weight KPCA, the Morlet wavelet kernel SVM is used to achieve the running state recognition. The results are shown in Table 4.
The recognition results based on the proposed method (recognition rate, η (%)).
From Table 4, we can see that the recognition rate is good; for some special machine, for example, the bearing work in the space, the method can achieve the running state recognition based on the collected vibration signals, so this validate the effect of the methods.
Conclusion
In this research, the LMD Shannon entropy method is used to extract the original features of signals. In order to reduce the feature dimension of the entropy, the weight KPCA was used. The Morlet kernel SVM was constructed. And the recognition results are validated by the collected signals in the research. However, this research has not been used in the industry, which will be done in the further research.
Footnotes
Acknowledgements
The authors are grateful to the anonymous reviewers for their helpful comments and constructive suggestions.
Academic Editor: Amir Alavi
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research is supported by the National Natural Science Foundation of China (Nos 51405047, 51305472), The Scientific Research Fund of Chongqing Municipal Education Commission (No. KJ1500529), Chongqing Postdoctoral Science Foundation funded project (No. xm2015001), Science application research project of COSCO, China (Grant No. 2010-1-H-001), The Key Laboratory of Road Construction Technology and Equipment (Chang’an University), (No. 2014SZS11-K02), China Postdoctoral Science Foundation funded this research, Project no. 2014M552316, and the basic and advanced research key project of Chongqing (No. cstc2015jcyjBX0140).
