Bearing fault diagnosis based on spectrum image sparse representation of vibration signal

Abstract

Bearings are crucial components of rotating machines in industrial production and are prone to failure. Image analysis gives a comprehensive description of a vibration signal and has therefore attracted growing attention in the fault diagnosis field. However, along with rich fault information, a single spectrum image matrix carries a large amount of redundant information, and a massive number of spectrum image samples aggravates the problem, which readily lowers the accuracy when diagnosing bearings with multiple local defects. To solve this issue, a novel feature extraction method based on image sparse representation is proposed. The original spectrum images are obtained from the raw signals through the fast Fourier transform. The sparse coefficients, which reveal the underlying structure of the spectrum image, are extracted as features by combining orthogonal matching pursuit with the K-singular value decomposition algorithm, and two-dimensional principal component analysis is then applied to reduce the dimension of these features. Finally, fault types are identified using a minimum distance strategy. Experimental results demonstrate the effectiveness of the proposed method.

Keywords

Fault diagnosis, sparse representation, image, vibration signal

Introduction

Bearings are general-purpose components used across a broad range of rotating machinery and play a crucial role in it. Reported data indicate that bearing failure is one of the major causes of machine failure,¹ which may lead to serious breakdowns, significant economic losses, and even casualties.²⁻⁴ Bearing fault diagnosis is therefore necessary, and it has attracted sustained attention over the past decades as machinery has become increasingly automated.

Vibration signal analysis is generally the preferred approach to local bearing fault diagnosis, and bearing fault diagnosis is in essence a classification task. On this basis, plenty of signal processing methods have been designed to extract features, such as the Hilbert–Huang transform (HHT)⁵ and wavelet analysis.⁶ Various features extracted from the vibration signal in the form of scalars or vectors, such as amplitude, spectral kurtosis, and skewness, are then used for fault classification.⁷,⁸ To improve classification performance, research on fault diagnosis and classification has paid much attention to multi-feature information fusion,⁹⁻¹⁴ which involves the time domain, frequency domain, or time–frequency domain. For example, Khelf et al.¹⁵ selected several relevant features via indicator evaluation for fault diagnosis of rotating machines. However, end effects and the mode-mixing problem limit the accuracy of signal mode decomposition in HHT, and energy leakage is inevitable whatever wavelet basis is chosen.¹⁶ Features extracted by these methods as fault indicators are insensitive to local damage and easily influenced by even weak noise.

Recently, computer vision technology has been applied in the field of fault diagnosis.¹⁷ Visual identification by image may be the most direct and efficient approach.¹⁸ Images of vibration signals contain rich information related to the time domain, frequency domain, or time–frequency domain, and image-based diagnosis has received more and more attention. Li et al.¹⁹ first realized bearing fault type recognition directly from the spectrum image of the vibration signal. However, due to interference factors in the images, several diagnostic results are still unsatisfactory. In general, these methods have limitations in mining useful information from signals for bearing fault diagnosis and classification.

There is currently growing interest in sparse representation, the core theory of signal compressed sensing (CS), and it offers a possible solution to the above problem. Signals are described as sparse linear combinations of prototype atoms by using an over-complete dictionary and sparse coefficients.²⁰ Rubinstein et al.²¹ made important contributions to sparse representation theory and proposed the K-singular value decomposition (K-SVD) algorithm in this field. Mallat and Zhang²² pointed out that a signal can be regarded as a linear combination of a few atoms drawn from an over-complete dictionary. The effectiveness of sparse representation of vibration signals has been preliminarily explored for mechanical diagnosis,²³ and sparse representation has been applied to bearing fault classification.²⁴ Qin et al.²⁵ constructed a synthetic transform basis dictionary and then extracted the fault components of the vibration signal by iteratively applying the basis pursuit algorithm. Liu et al.²⁶ employed matching pursuit (MP) with time–frequency atoms for feature extraction, which proved to be a promising method compared with the continuous wavelet transform and envelope detection.

In this article, a novel bearing fault diagnosis method is proposed that uses the sparse coefficients of the spectrum image of the vibration signal as features. First, the frequency spectrum images of normal and faulty bearings are obtained by applying the fast Fourier transform (FFT) to the vibration signals, with all images at the same resolution. Then, the sparse coefficients of each spectrum image are extracted as features by combining the orthogonal matching pursuit (OMP)²⁷ and K-SVD algorithms. In essence, over-complete dictionary learning serves as an adaptive feature extraction method that requires no prior knowledge. Finally, after feature dimension reduction by two-dimensional principal component analysis (2DPCA), the bearing fault categories are classified with a minimum distance method.

The rest of this article is organized as follows. Section “The principle of sparse representation” sketches out the principle of sparse representation. Section “Fault diagnosis based on the spectrum image sparse representation” introduces fault diagnosis based on the spectrum image sparse representation, including image generation with FFT, feature extraction based on sparse representation and classification strategy. Section “Experimental verification” presents the experimental analysis. The conclusions are given in section “Conclusion.”

The principle of sparse representation

In recent years, a considerable amount of published work has applied sparse representation to fault diagnosis. Sparse representation is a model of the signal that is used to represent the signal characteristics. Formally, if x is a column signal, the sparsity assumption²⁸ can be described by the following sparse approximation problem

$$\min_{α} \| α \|_{0} \quad \text{subject to} \quad x = D α \tag{1}$$

In the above formulation, $α$ denotes the sparse representation of x, D is the dictionary, and $\| \cdot \|_{0}$ is the $\ell_{0}$ pseudo-norm, which counts the non-zero elements. To realize sparse coding efficiently, an over-complete dictionary is created. The error-constrained form of equation (1) using the over-complete dictionary²⁹ can then be written as

$$\min_{x} \| x \|_{0} \quad \text{subject to} \quad \| y - D x \|_{2}^{2} \leq ε \tag{2}$$

In this formula, x is the sparse decomposition coefficient vector and D is the over-complete dictionary matrix, whose columns are called atoms $d_{i}$; that is, the signal y is represented as a combination of atoms $d_{i}$. This problem is known to be NP-hard, but an approximate solution for the signal y can be computed efficiently with the OMP algorithm.³⁰ First, an initial residual $r^{(0)} = y$ and an initial approximation ${\hat{y}}^{(0)} = 0$ (the current linear combination of atoms) are defined; each atom in the dictionary D is normalized so that $\| d_{i} \|_{2} = 1$. Then $r^{(j)}$ and ${\hat{y}}^{(j)}$ are updated in each iteration $j = 1, \dots, K$ so that $y = {\hat{y}}^{(j)} + r^{(j)}$ holds, until a stop condition is met. Specifically, in the $j$th iteration, the atom $d_{i_{k}}$ that is most strongly correlated with $r^{(j - 1)}$ is selected and added to the current linear combination for signal representation. The correlation is measured by

$$| \langle r^{(j - 1)}, d_{i_{k}} \rangle | \geq \sup_{i} | \langle r^{(j - 1)}, d_{i} \rangle | \tag{3}$$

With the newly selected atom added, the signal model ${\hat{y}}^{(j)}$ at this stage is given by³⁰

$${\hat{y}}^{(j)} = \sum_{p = 1}^{n} d_{ip} x_{ip}^{k} \tag{4}$$

where the coefficients $x_{ip}^{k}$ are obtained by least squares, that is, by minimizing $\| y - {\hat{y}}^{(j)} \|_{2}^{2}$, which is the squared norm of the residual $r^{(j)}$. The recursion terminates when one of the following conditions is met: (1) the current residual $r^{(j)}$, which would be used in the next iteration, falls below a set threshold, that is, $\| r^{(j)} \|_{2} < ε$; (2) the number of non-zero elements in the coefficients $x_{ip}^{k}$ reaches an upper limit, that is, $\| x_{ip}^{k} \|_{0} = c$. OMP yields a highly sparse solution in the sparse coding stage and is employed in our work.
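The OMP iteration described above can be sketched in a few lines of NumPy. This is a minimal illustration under our own assumptions, not the authors' implementation: the function name `omp`, the arguments `max_atoms` and `tol`, and the toy dictionary are all hypothetical, and the dictionary columns are assumed to be unit-norm.

```python
import numpy as np


def omp(D, y, max_atoms, tol=1e-6):
    """Orthogonal matching pursuit sketch.

    D: (n, K) dictionary whose columns (atoms) have unit l2 norm.
    y: (n,) signal. Returns a length-K coefficient vector x with at most
    max_atoms non-zero entries such that D @ x approximates y.
    """
    y = np.asarray(y, dtype=float)
    x = np.zeros(D.shape[1])
    support = []                 # indices of the atoms selected so far
    coeffs = np.zeros(0)
    residual = y.copy()          # r^(0) = y
    for _ in range(max_atoms):   # stop rule (2): sparsity limit reached
        if np.linalg.norm(residual) < tol:   # stop rule (1): small residual
            break
        correlations = np.abs(D.T @ residual)
        if support:
            correlations[support] = 0.0      # never reselect an atom
        support.append(int(np.argmax(correlations)))
        # Least-squares fit of y on all selected atoms (the "orthogonal" step)
        coeffs, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coeffs
    if support:
        x[support] = coeffs
    return x


# Toy dictionary: four standard basis atoms plus one extra unit-norm atom
D = np.hstack([np.eye(4), np.full((4, 1), 0.5)])
y = np.array([0.0, 3.0, 0.0, -2.0])
x = omp(D, y, max_atoms=2)
```

In the toy call, the signal is built from two atoms of the dictionary, so two iterations are enough to drive the residual to zero. Refitting all selected coefficients by least squares at every step is what distinguishes OMP from plain matching pursuit.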

An essential issue in equation (2) is the choice of the dictionary. Early dictionaries were predefined by a fixed set of basis functions, such as discrete cosine transforms (DCTs), curvelets, and wavelets, and could not adapt to the data. The K-SVD algorithm²¹ is a dictionary-learning method that adapts the dictionary to the given example data, and the sparse representation is then generated from the learned over-complete dictionary in a sparse coding stage. It is flexible for training dictionaries and works in conjunction with any pursuit algorithm, for example, OMP. Given a set of signal samples $Y = \{y_{i}\}_{i = 1}^{N}$ and an initial dictionary $D \in R^{n \times K}$, the K-SVD algorithm finds the optimal dictionary D by solving³¹

$$\min_{D, X} \| Y - DX \|_{F}^{2} \quad \text{subject to} \quad \forall i, \ \| x_{i} \|_{0} \leq T \tag{5}$$

$\| \cdot \|_{F}$ is the Frobenius norm and T is the maximum number of non-zero elements. The solution procedure can be roughly divided into two stages: sparse coding and dictionary update. In the sparse coding stage, the over-complete dictionary D is held fixed and X is obtained with the OMP algorithm by approximating the solution of equation (4). The pivotal steps of the dictionary update stage of K-SVD³¹ are outlined briefly below. The dictionary update problem can be expressed as

$$\| Y - DX \|_{F}^{2} = \left\| Y - \sum_{j = 1}^{K} d_{j} x_{T}^{j} \right\|_{F}^{2} = \left\| E_{k} - d_{k} x_{T}^{k} \right\|_{F}^{2} \tag{6}$$

where $x_{T}^{j}$ denotes the $j$th row of X and $E_{k} = Y - \sum_{j \neq k} d_{j} x_{T}^{j}$ is the error over all signal samples when the atom $d_{k}$ is removed. Singular value decomposition (SVD) is applied to $E_{k}$ to obtain a low-rank approximation, from which suitable $d_{k}$ and $x_{T}^{k}$ are selected to reduce the error over all signal samples. Applying SVD directly, however, would generally fill $x_{T}^{k}$ with non-zero entries and destroy its sparsity, so a key step must be performed before SVD. An index set is introduced and defined as follows

$$w_{k} = \{ i \mid 1 \leq i \leq N, \ x_{T}^{k} (i) \neq 0 \} \tag{7}$$

The samples in $\{y_{i}\}$ that are represented using the atom $d_{k}$ are indexed by $w_{k}$; equivalently, the indices i in equation (7) give the positions of the non-zero entries in $x_{T}^{k}$. $Ω_{k}$ is defined as a matrix of size $N \times | w_{k} |$, with ones at the $(w_{k} (i), i)$ entries and zeros elsewhere. The multiplication $x_{R}^{k} = x_{T}^{k} Ω_{k}$ shrinks $x_{T}^{k}$ to length $| w_{k} |$ by removing the zero entries. Similarly, $Y_{k}^{R} = Y Ω_{k}$, $Y_{k}^{R} \in R^{n \times | w_{k} |}$, contains the subset of samples that currently use the atom $d_{k}$, and $E_{k}^{R} = E_{k} Ω_{k}$, $E_{k}^{R} \in R^{n \times | w_{k} |}$, selects the error columns that correspond to those samples. Minimizing equation (6) is equivalent to minimizing

$$\| E_{k} Ω_{k} - d_{k} x_{T}^{k} Ω_{k} \|_{F}^{2} = \| E_{k}^{R} - d_{k} x_{R}^{k} \|_{F}^{2} \tag{8}$$

SVD decomposes $E_{k}^{R} = U Δ V^{T}$; $d_{k}$ is updated with the first column of U, and $x_{R}^{k}$ with the first column of V multiplied by $Δ (1, 1)$. These steps are repeated so that the atoms in D are updated one by one until the $K$th atom has been replaced; hence the algorithm is referred to as K-SVD. In our work, dictionary updating essentially serves as an adaptive feature extraction step, and the left singular value sequence of the updated dictionary matrix is used to extract the feature vector.
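A single atom update from the dictionary update stage, following equations (6) to (8), might look as follows. This is a hedged sketch rather than the authors' code: the function name `ksvd_update_atom` is hypothetical, and leaving an unused atom untouched is our own simplification.

```python
import numpy as np


def ksvd_update_atom(Y, D, X, k):
    """Update atom k of D and row k of X in place, following equations (6)-(8).

    Y: (n, N) samples, D: (n, K) dictionary, X: (K, N) sparse coefficients.
    """
    omega = np.flatnonzero(X[k, :])    # w_k: samples that currently use atom k
    if omega.size == 0:
        return D, X                    # unused atom: left untouched in this sketch
    # E_k^R: error on those samples with the contribution of atom k added back
    E_R = Y[:, omega] - D @ X[:, omega] + np.outer(D[:, k], X[k, omega])
    U, s, Vt = np.linalg.svd(E_R, full_matrices=False)
    D[:, k] = U[:, 0]                  # new atom: first left singular vector
    X[k, omega] = s[0] * Vt[0, :]      # new coefficients: largest singular value times first right singular vector
    return D, X
```

Because only the columns indexed by `omega` are touched, the zero pattern of row k of X is preserved, which is the purpose of the restriction by $Ω_{k}$. A full dictionary update would call this function for every k and alternate with a sparse coding pass.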

Fault diagnosis based on spectrum image sparse representation

The framework of spectrum image sparse representation is illustrated in Figure 1. Note that NO, IF, BF, and OF represent normal bearing, inner-race fault, ball fault, and outer-race fault, respectively; more details are given in the following sections.

Figure 1.

The framework of the proposed method.

Image generation with FFT

Presenting a vibration signal in the form of an image is intuitive. In this research, the FFT spectra of vibration signals are captured as the original images. To keep the proposed method general, the x-axis of the spectrum is set to frequency in hertz and the y-axis to amplitude. The sampling rate therefore directly affects the x-axis of the spectrum image for a selected signal, while the y-axis uses a fixed, suitable scale when the frequency spectrum is generated. The only remaining parameter of the original image is thus its resolution: once the pixel size is selected, the original image is created.
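One possible way to turn an FFT spectrum into a fixed-resolution image is sketched below. The article does not describe how its images are rendered, so everything here is an assumption made for illustration: the function name `spectrum_image`, the filled-bar rendering, the binary pixel values, and the reading of the image size as 260 pixels wide by 220 pixels high.

```python
import numpy as np


def spectrum_image(signal, amp_max, height=220, width=260):
    """Rasterize the one-sided FFT amplitude spectrum into a binary image.

    The x-axis spans 0 to half the sampling rate; amp_max is the fixed top
    of the y-axis; height and width are the image size in pixels.
    """
    signal = np.asarray(signal, dtype=float)
    n = signal.size
    amplitude = np.abs(np.fft.rfft(signal)) * 2.0 / n   # approximate one-sided amplitude
    n_bins = amplitude.size
    # Map every frequency bin to a pixel column and keep the column maximum
    col_of_bin = (np.arange(n_bins) * width) // n_bins
    col_amp = np.zeros(width)
    np.maximum.at(col_amp, col_of_bin, amplitude)
    # Convert amplitudes to bar heights in pixels on the fixed y scale
    bar = np.clip(np.round(col_amp / amp_max * height), 0, height).astype(int)
    image = np.zeros((height, width))
    for col, h in enumerate(bar):
        image[height - h:, col] = 1.0    # fill upward from the bottom row
    return image


# Example: a unit-amplitude sine sampled at 12 kHz, 1024 points
fs, n = 12000, 1024
t = np.arange(n) / fs
img = spectrum_image(np.sin(2 * np.pi * (100 * fs / n) * t), amp_max=1.0)
```

Taking the maximum over the bins that fall into each pixel column, rather than interpolating, keeps narrow spectral peaks visible when the image has fewer columns than the spectrum has bins.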

Feature extraction based on sparse representation

These original images contain rich structural information about the spectrum, for example, FFT amplitudes and characteristic frequencies. To mine the underlying information, sparse representation based on an over-complete dictionary is applied to extract features from the original images, which can be done efficiently by combining OMP and K-SVD. Details of the proposed feature extraction method are as follows:

Step 1: Suppose that there are Q compressed training original image samples (with $w \times h$ pixels). The $p$th training image sample $M_{p} \in R^{m \times n}$, $p = 1, 2, \dots, Q$, is divided into blocks ($8 \times 8$ blocks) to form the input matrix $Y^{p} = [y_{1}^{p}, y_{2}^{p}, \dots, y_{N}^{p}]$.

Step 2: Use the DCT to generate the initial over-complete dictionary $D^{p} = [d_{1}^{p}, d_{2}^{p}, \dots, d_{i}^{p}]$.

Step 3: Given the over-complete dictionary $D^{p}$, the sparsity K, and the input matrix $Y^{p}$, compute the sparse decomposition coefficient $X^{p}$, which contains abundant spectral structure information, with the OMP algorithm.

Step 4: Update the over-complete dictionary $D^{p}$ column by column according to the K-SVD algorithm.

Step 5: Compute the final sparse decomposition coefficient $X^{p}$ with the OMP algorithm from the updated dictionary $D^{p}$, the sparsity K, and the input matrix $Y^{p}$.
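Steps 1 and 2 can be illustrated with the short sketch below. It rests on assumptions the article does not state: the blocks are taken as non-overlapping, any partial border block is cropped, and the initial dictionary is a separable over-complete DCT with 11 frequencies per dimension (a common construction in the dictionary-learning literature). The names `image_to_blocks`, `dct_dictionary`, and `atoms_per_dim` are hypothetical.

```python
import numpy as np


def image_to_blocks(image, block=8):
    """Split an image into non-overlapping block x block patches.

    Returns a (block*block, N) matrix whose columns are flattened patches.
    """
    h, w = image.shape
    h, w = h - h % block, w - w % block      # crop any partial border block
    patches = (image[:h, :w]
               .reshape(h // block, block, w // block, block)
               .swapaxes(1, 2)
               .reshape(-1, block * block))
    return patches.T


def dct_dictionary(block=8, atoms_per_dim=11):
    """Over-complete 2D DCT dictionary of shape (block**2, atoms_per_dim**2)."""
    t = np.arange(block)
    D1 = np.cos(np.outer(t, np.arange(atoms_per_dim)) * np.pi / atoms_per_dim)
    D1[:, 1:] -= D1[:, 1:].mean(axis=0)      # remove the mean from non-constant atoms
    D1 /= np.linalg.norm(D1, axis=0)         # unit-norm 1D atoms
    return np.kron(D1, D1)                   # separable 2D atoms, still unit-norm
```

With these defaults each column of the block matrix has 64 entries and the dictionary has 121 atoms, so the dictionary is over-complete. Steps 3 to 5 would then alternate a sparse coding pass and a dictionary update on this input.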

Classification strategy

To classify bearing faults, the features obtained from the sparse representation of the frequency spectrum images of bearings with unknown fault categories must be identified. First, several training original image samples are processed with the above feature extraction method to obtain the corresponding training features for the different faults. Because these features are two-dimensional (2D), the 2DPCA method is used for feature space dimension reduction in our work. The 2DPCA procedure and the proposed classification strategy are described as follows:

Step 1: Suppose that each extracted feature matrix satisfies $X \in R^{K \times z}$. The average of the sparse coefficient matrices over all training feature samples is denoted by $\bar{X}$. The global sparse coefficient scatter matrix F is defined by

$$F = \frac{1}{Q} \sum_{p = 1}^{Q} {(X^{p} - \bar{X})}^{T} (X^{p} - \bar{X})$$

where $(\cdot)^{T}$ denotes the matrix transpose.

Step 2: Find the eigenvectors $v_{q}$ and eigenvalues $λ_{q}$ of the scatter matrix F by solving $Fv = λ v$. The eigenvectors $v = [v_{1}, v_{2}, \dots, v_{z}]$ are normalized and sorted in decreasing order of the eigenvalues $λ_{q}, q = 1, 2, \dots, z$.

Step 3: Choose the eigenvectors corresponding to the c largest eigenvalues to form the projection operator. The sparse coefficient feature finally used for classification is given by

$$X_{FO}^{p} = X^{p} V_{g}$$

where $V_{g} = v_{\max g}$ is the eigenvector associated with the $g$th largest eigenvalue, and $g = 1, 2, \dots, c$. The projected sparse decomposition coefficient feature vectors $X_{FO}^{p} = [X_{FO 1}^{p}, X_{FO 2}^{p}, \dots, X_{FOg}^{p}] \in R^{K \times g}$ are thus obtained.
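The three 2DPCA steps above can be sketched as follows. The function names `fit_2dpca` and `project_2dpca` and the array layout (training features stacked into a `(Q, K, z)` array) are our own assumptions. As in the projection formula above, the features are projected without subtracting the mean.

```python
import numpy as np


def fit_2dpca(features, c):
    """Fit 2DPCA on training features of shape (Q, K, z).

    Returns the mean feature matrix (K, z) and a projection matrix (z, c)
    whose columns are the eigenvectors of the scatter matrix F that belong
    to the c largest eigenvalues.
    """
    X_bar = features.mean(axis=0)
    centered = features - X_bar
    # F = (1/Q) * sum_p (X^p - X_bar)^T (X^p - X_bar), shape (z, z)
    F = np.einsum('pki,pkj->ij', centered, centered) / features.shape[0]
    eigvals, eigvecs = np.linalg.eigh(F)      # F is symmetric; eigenvalues ascending
    order = np.argsort(eigvals)[::-1][:c]     # indices of the c largest eigenvalues
    return X_bar, eigvecs[:, order]


def project_2dpca(X, V):
    """Project one (K, z) feature matrix onto the c selected eigenvectors."""
    return X @ V
```

Unlike ordinary PCA, the feature matrices are never flattened into vectors, so the scatter matrix is only z by z and stays cheap to decompose even when K is large.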

Then fault identification based on these extracted sparse features will be carried out via a minimum distance strategy.

Suppose that the $p$th feature matrix extracted from the Q training original image samples is $E_{p} = [X_{FO 1}^{(p)}, X_{FO 2}^{(p)}, \dots, X_{FOg}^{(p)}]$, where $p = 1, 2, \dots, Q$, and that the $w$th test feature is $T_{w} = [X_{FO 1}^{(w)}, X_{FO 2}^{(w)}, \dots, X_{FOg}^{(w)}]$. In this article, the Euclidean distance is used to measure the distance between $E_{p}$ and $T_{w}$ as follows

$$de_{p} (E_{p}, T_{w}) = L_{l = 2} (E_{p}, T_{w}) = \sum_{r = 1}^{g} \| X_{FOr}^{(p)} - X_{FOr}^{(w)} \|_{2}$$

$L = \{ s_{1}, s_{2}, \dots, s_{N} \}$, $(N \leq M)$, is defined as the classification label set of the Q training samples. To classify the $w$th test feature, it is essential to find the subscript $η$ that satisfies

$$de_{η} = \min_{p} (de_{p})$$

That is, if $s_{δ}$ $(s_{δ} \in L)$ is the label of the $η$th training feature, then the $w$th test feature is assigned the label $s_{δ}$. The workflow of fault diagnosis based on the spectrum image sparse representation is illustrated in Figure 2.
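The minimum distance rule can be written compactly as below. This is an illustrative sketch: the function name `min_distance_label` and the array layout (projected training features stacked into a `(Q, K, c)` array) are assumptions, and the toy data are made up for the example.

```python
import numpy as np


def min_distance_label(train_feats, train_labels, test_feat):
    """Assign the label of the training feature closest to test_feat.

    train_feats: (Q, K, c) projected training features.
    test_feat:   (K, c) projected test feature.
    The distance is the sum over the c columns of the Euclidean norm of
    the column difference.
    """
    diffs = train_feats - test_feat              # broadcast over the Q samples
    col_norms = np.linalg.norm(diffs, axis=1)    # (Q, c): one norm per column
    distances = col_norms.sum(axis=1)            # (Q,): one distance per sample
    eta = int(np.argmin(distances))              # index of the nearest training feature
    return train_labels[eta], distances


# Toy example with two training features labeled NO and IF
train = np.stack([np.zeros((4, 2)), np.ones((4, 2))])
label, dist = min_distance_label(train, ["NO", "IF"], 0.9 * np.ones((4, 2)))
```

In the toy example the test feature lies much closer to the second training feature, so it receives that feature's label.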

Figure 2.

Flow chart of bearing fault diagnosis based on the spectrum image sparse representation.

Experimental verification

Experimental setup and database preparation

In order to validate the performance of the proposed fault diagnosis scheme experimentally, the public rolling bearing data provided by the bearing data center of Case Western Reserve University are used.³² The test rig, shown in Figure 3, mainly consists of a 2 hp motor, a torque sensor/encoder, a power meter, accelerometers, and an electronic control unit. The vibration data used in this article were sampled at 12 kHz from the drive end bearing by accelerometers attached to the rack with magnetic bases. Single-point faults were created by electrical discharge machining with three fault diameters (0.007, 0.014, and 0.021 in), and data were recorded under four load conditions: L0 = 0 hp/1797 r/min, L1 = 1 hp/1772 r/min, L2 = 2 hp/1750 r/min, and L3 = 3 hp/1730 r/min. Each fault degree corresponds to three fault types: inner-race fault (IF), outer-race fault (OF), and ball fault (BF). In this experiment, there are four groups of health condition data under each load condition: the normal bearing (NO) and the three kinds of faulty ones. To simulate actual diagnosis, we focus on the scenario in which the labeled training samples, including NO, IF, OF, and BF, are collected under one load condition (called the training load condition) and the unlabeled test samples are obtained under another load condition (called the test load condition). In total, 12 different diagnostic tests are performed, and the details of the experimental scenario are presented in Table 1.

Figure 3.

Bearing test rig of Case Western Reserve University Data Center.

Table 1.

Description of the scenario setup.

No. of test	Training	Test	Fault type	Fault degree
1	L0	L0, L1, L2, L3	NO, IF, BF, OF	0.007 in
2	L1	L0, L1, L2, L3	NO, IF, BF, OF	0.007 in
3	L2	L0, L1, L2, L3	NO, IF, BF, OF	0.007 in
4	L3	L0, L1, L2, L3	NO, IF, BF, OF	0.007 in
5	L0	L0, L1, L2, L3	NO, IF, BF, OF	0.014 in
6	L1	L0, L1, L2, L3	NO, IF, BF, OF	0.014 in
7	L2	L0, L1, L2, L3	NO, IF, BF, OF	0.014 in
8	L3	L0, L1, L2, L3	NO, IF, BF, OF	0.014 in
9	L0	L0, L1, L2, L3	NO, IF, BF, OF	0.021 in
10	L1	L0, L1, L2, L3	NO, IF, BF, OF	0.021 in
11	L2	L0, L1, L2, L3	NO, IF, BF, OF	0.021 in
12	L3	L0, L1, L2, L3	NO, IF, BF, OF	0.021 in

NO: normal bearing; IF: inner-race fault; BF: ball fault; OF: outer-race fault.

The spectrum images are generated by applying the FFT to 1024 sampling points. The y-axis of the frequency spectrum is fixed at a suitable scale, and each spectrum image is produced with a size of $260 \times 220$ pixels. A total of 800 sparse coefficient matrices are obtained from the original images of NO, IF, OF, and BF under one load condition. The training features from these four kinds of original images are sampled randomly under a certain load condition, and in the test stage all features under the other load condition are used directly to classify the bearing fault types. Every test in Table 1 is repeated 40 times, and the average classification accuracies are taken as the diagnostic results.
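As a quick worked check on the settings above (plain arithmetic from the stated sampling rate and segment length, not a result reported in the article), the frequency resolution of each spectrum and the number of one-sided frequency bins are

$$Δf = \frac{f_{s}}{N} = \frac{12000\ \text{Hz}}{1024} \approx 11.72\ \text{Hz}, \qquad \frac{N}{2} + 1 = 513 \ \text{bins over} \ 0 \ \text{to} \ 6\ \text{kHz}$$

Here $f_{s}$ and $N$ are our own symbols for the sampling rate and the number of sampling points. If the whole one-sided spectrum were drawn across the horizontal pixels of the image, each pixel column would cover roughly two frequency bins, although the article does not state the exact plotting range.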

To further demonstrate the superiority of the proposed method, two baseline approaches and one successful published method are evaluated alongside it.

Baseline 1: A linear support vector machine (SVM) classifier without any projection technique; that is, the original FFT amplitudes are used without learning a new representation.

Baseline 2: A nearest-neighbor classifier applied to a new representation learned by a projection technique.

NN SPI: A nearest-neighbor classifier based on the spectrum image.¹⁹

Baseline 1 and Baseline 2 are classical methods that have achieved success in many fault diagnosis applications. NN SPI is one of the state-of-the-art methods in the image recognition field.

Diagnosis results of the proposed method

The diagnosis results of the four methods, with fault sizes of 0.007, 0.014, and 0.021 in, are illustrated in Figures 4–6, respectively.

Figure 4.

The results with fault size being 0.007 in.

Figure 5.

The results with fault size being 0.014 in.

Figure 6.

The results with fault size being 0.021 in.

Each of Figures 4–6 consists of four subfigures that share the same fault size. In each subfigure, the right side of the symbol “->” represents the training load condition, and the left side represents the test load condition. To illustrate the diagnostic procedure of the proposed method, “L2->L1” in Figure 4(b) is taken as an example and shown in Figure 7. First, all spectra are divided into blocks, and then the sparse decomposition coefficients are obtained through the dictionary update and sparse coding stages. Figure 7 shows clearly that the sparse decomposition coefficients extracted from the labeled training spectra of different fault categories have distinctly different three-dimensional (3D) structures. Second, the projected space is established from the sparse decomposition coefficients of the training spectra through 2DPCA. Afterwards, the unlabeled test coefficients are projected onto this space. Figure 7 shows that the features extracted in this space cluster and separate well, and the predicted results in Figures 4(b) and 7 verify the effectiveness of the method.

Figure 7.

Illustration of the proposed method for bearing fault diagnosis.

From the diagnostic results in Figures 4–6, it is clear that the highest fault diagnosis accuracies are always obtained when the training set is the same as the test set. Meanwhile, when the training set and test set differ considerably, the classification accuracy of the compared methods is generally low. The performance of Baseline 1 is very poor, and it fails to detect faults in this scenario. For example, in Figures 5 and 6, many classification accuracies are about 75%, and in Figure 4 in particular, many classification accuracies are only around 50%. Baseline 2 is superior to Baseline 1. However, in Figures 4(b) and 6(d), its classification accuracies are about 75%. The nearest-neighbor classifier based on the spectrum image (NN SPI) is slightly better than Baseline 2. Unfortunately, in Figure 4(a), its classification accuracy only reaches 55.95% when using L0 as training samples and L3 as test samples. By contrast, the proposed method performs far better than the other three methods, with accuracies of almost 100%. Although the results of four cases (L0->L1 in Figure 4(b), L0->L2 in Figure 4(c), L0->L3 in Figure 4(d), and L0->L1 in Figure 5(b)) are not 100%, they all exceed 98%. We can conclude that the proposed method effectively distinguishes bearing fault categories in the scenario where the working conditions of the training set and test set are very different.

Discussion

The FFT-based spectrum image is a very intuitive display of the vibration signal, and signals in different health conditions have different distributions. However, due to the interference of redundant information, it is difficult to extract useful fault signatures directly and efficiently from the spectrum image. Sparse representation provides a different way to mine the useful information from the spectrum image of the vibration signal. Taking “L2->L1” in Figure 4(b) as an example, shown in Figure 8, the features extracted by the different methods are visualized via t-distributed stochastic neighbor embedding (t-SNE).³³ The right side of the symbol “-” indicates whether a sample belongs to the training or test set, and the left side represents the health condition. Figure 8 shows that the features extracted by the proposed method reflect the health condition of the bearings more accurately and cluster better than those of the compared methods. In addition, extensive comparative experiments are carried out to verify the effectiveness of the proposed method.

Figure 8.

Feature visualization via t-SNE³³ based on different methods.

To further illustrate the superiority of the proposed method in the field of bearing fault diagnosis, a comparison between the current work and published literature is given in Table 2.

Table 2.

Comparisons between the current work and some published work.

References	Load condition	Fault degree	No. of training samples	Average test accuracy (%)
Li et al.³⁵	Single	Single	100	98.33
Yang et al.³⁶	Single	Multiple	118	95.253 (0.014 in)
				99.368 (0.021 in)
Abbasion et al.³⁴	Multiple	Single		100
Li et al.¹⁹	Multiple	Multiple	10	95.65 (0.014 in)
				99.90 (0.021 in)
Zhang et al.³⁷	Multiple	Multiple	6600	95.9
The proposed method	Multiple	Multiple	10	99.77 (0.007 in)
				99.99 (0.014 in)
				100 (0.021 in)

To make a fair comparison, all of these works use the same bearing data³² as used in this article. The majority of previous work focuses only on fault diagnosis under a single load condition and fault degree. Under the 3-hp load condition, Li and Zhang³⁵ used a fault diagnosis method based on the supervised locally linear embedding projection (SLLEP) to identify faults with a fault size of 0.021 in. The features extracted by this method are strongly influenced by the regularization parameter, and the number of training samples is considerably higher than that of the proposed method. In Abbasion et al.,³⁴ features extracted by wavelet denoising were used with an SVM to diagnose faults under the 2-hp condition with a fault size of 0.007 in. In a study by Yang et al.,³⁶ SVMs and the fractal dimension feature (FDF) were applied to bearing fault diagnosis under a single load condition with fault sizes of 0.014 and 0.021 in. Taking different load conditions into account, Li et al.¹⁹ directly used the spectrum image of the vibration signal as a feature to diagnose the bearing fault types; unfortunately, several cases still reach only around 75%. In a study by Zhang et al.,³⁷ a method based on deep convolutional neural networks with a wide first-layer kernel (WDCNN) was proposed to diagnose three data sets. The three data sets include 10 kinds of health conditions (NO and IF, BF, OF with fault sizes of 0.007, 0.014, and 0.021 in) under three load conditions (Load1, Load2, and Load3), respectively, which is similar to the L0, L1, L2, and L3 conditions in this article. The average accuracy of WDCNN is 95.9%, whereas the average accuracy of the proposed method is 99.92%. It is worth mentioning that the proposed method outperforms the other methods listed in Table 2.

Conclusion

In this article, a novel fault diagnosis approach for bearings based on the spectrum image sparse representation technique has been proposed. Sensitive features were extracted from a small number of spectrum image samples and then used to diagnose faults. Specifically, the original spectrum image is obtained through FFT. Then, the sparse decomposition coefficient feature is extracted from the original spectrum image by combining the OMP and K-SVD algorithms. Finally, fault identification is realized by a similarity measurement that minimizes the Euclidean distance between the eigen-features of the sparse decomposition coefficient matrices obtained by the 2DPCA method. The proposed method provides a novel perspective for mining useful information from a small number of spectrum image matrix samples while reducing the interference of redundant information, and it thereby addresses the accuracy-dropping problem in diagnosing bearings with multiple local defects. Experimental tests under different loads and motor speeds demonstrated the effectiveness and feasibility of the proposed method.

Footnotes

Handling Editor: Zengtao Chen

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work is supported by National Key R&D Program of China (2016YFC0802900), National Natural Science Foundation of China (No. 51475455), Natural Science Foundation of Jiangsu Province (No. BK20160251), and the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).

ORCID iDs

Zhe Tong

Fan Jiang

Gongbo Zhou

References

Bianchini

Immovilli

Cocconcelli

et al . Fault detection of linear bearings in brushless AC linear motors by vibration analysis. IEEE T Ind Electron 2011; 58: 1684–1694.

Ding

Sparse representation based on local time-frequency template matching for bearing transient fault feature extraction. J Sound Vib 2016; 370: 424–443.

Chen

et al . Compressed sensing based on dictionary learning for extracting impulse components. Signal Process 2014; 96: 94–109.

Bellini

Filippetti

Tassoni

et al . Advances in diagnostic techniques for induction machines. IEEE T Ind Electron 2008; 55: 4109–4126.

Babu

Srikanth

Sekhar

AS.

Hilbert-Huang transform for detection and monitoring of crack in a transient rotor. Mech Syst Signal Pr 2008; 22: 905–914.

Yang

Nagarajaiah

Structural damage identification via a combination of blind feature extraction and sparse representation classification. Mech Syst Signal Pr 2008; 45: 1–23.

Jardine

AKS

Lin

Banjevic

A review on machinery diagnostics and prognostics implementing condition-based maintenance. Mech Syst Signal Pr 2006; 20: 1483–1510.

Antoni

The spectral kurtosis: a useful tool for characterising non-stationary signals. Mech Syst Signal Pr 2006; 20: 282–307.

Zhou

et al . Mechanical fault diagnosis based on redundant second generation wavelet packet transform, neighborhood rough set and support vector machine. Mech Syst Signal Pr 2012; 28: 608–621.

10.

Xian

Zeng

BQ.

An intelligent fault diagnosis method based on wavelet packer analysis and hybrid support vector machines. Expert Syst Appl 2009; 36: 12131–12136.

11.

Lei

Lin

et al . Application of an improved kurtogram method for fault diagnosis of rolling element bearings. Mech Syst Signal Pr 2011; 25: 1738–1749.

12. Liu, et al. Feature fusion using kernel joint approximate diagonalization of Eigen-matrices for rolling bearing fault identification. J Sound Vib 2016; 385: 389–401.

13. Liu, Zhang, Cheng, et al. Fault diagnosis of gearbox using empirical mode decomposition and multi-fractal detrended cross-correlation analysis. J Sound Vib 2016; 385: 350–371.

14. Yang. A new rolling bearing fault diagnosis method based on GFT impulse component extraction. Mech Syst Signal Pr 2016; 81: 162–182.

15. Khelf, Laouar, Bouchelaghem AMR, et al. Adaptive fault diagnosis in rotating machines using indicators selection. Mech Syst Signal Pr 2013; 40: 452–468.

16. Chandra, Sekhar AS. Fault detection in rotor bearing systems using time frequency techniques. Mech Syst Signal Pr 2016; 72–73: 105–133.

17. Klein, Masad, Rudyk, et al. Bearing diagnostics using image processing methods. Mech Syst Signal Pr 2014; 45: 105–113.

18. Hong ZQ. Algebraic feature extraction of image for recognition. Pattern Recogn 1991; 24: 211–219.

19. Qiu, Zhu, et al. Bearing fault diagnosis based on spectrum images of vibration signals. Meas Sci Technol 2015; 27: 1–17.

20. Guo, Gao, et al. Machinery vibration signal denoising based on learned dictionary and sparse representation. J Phys Conf Ser 2015; 628: 012124.

21. Rubinstein, Faktor, Elad. K-SVD dictionary-learning for the analysis sparse model. In: Proceedings of the IEEE international conference on acoustics, Kyoto, Japan, 25–30 March 2012; 22: 5405–5408.

22. Mallat, Zhang. Matching pursuits with time-frequency dictionaries. IEEE T Signal Proces 1993; 41: 3397–3415.

23. Ding, Lin. Fault feature extraction of rolling element bearings using sparse representation. J Sound Vib 2016; 366: 514–527.

24. Tang, Yang, Wang, et al. Sparse classification of rotating machinery faults based on compressive sensing strategy. Mechatronics 2014; 31: 60–67.

25. Qin, Mao, Tang. Vibration signal component separation by iteratively using basis pursuit and its application in mechanical fault detection. J Sound Vib 2013; 332: 5217–5235.

26. Liu, Ling, Gribonval. Bearing failure detection using matching pursuit. NDT&E Int 2002; 35: 255–262.

27. Tropp, Gilbert AC. Signal recovery from random measurements via orthogonal matching pursuit. IEEE T Inform Theory 2007; 53: 4655–4666.

28. Tang, Chen, Dong. Sparse representation based latent components analysis for machinery weak fault detection. Mech Syst Signal Pr 2014; 46: 373–388.

29. Rubinstein, Peleg, Elad. Analysis K-SVD: a dictionary-learning algorithm for the analysis sparse model. IEEE T Signal Proces 2013; 61: 661–677.

30. Pati YCC, Rezaiifar, Krishnaprasad PSS. Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition. In: Proceedings of 27th Asilomar conference on signals, systems and computers, Pacific Grove, CA, 1–3 November 1993, pp.1–5. New York: IEEE.

31. Aharon, Elad, Bruckstein AM. The K-SVD: an algorithm for designing of overcomplete dictionaries for sparse representations. IEEE T Signal Proces 2006; 54: 4311–4322.

32. Case Western Reserve University Bearing Data Center. Bearing Data Center Fault Test Data, http://csegroups.case.edu/bearingdatacenter/pages/download-data-file

33. Maaten, Hinton. Visualizing data using t-SNE. J Mach Learn Res 2008; 9: 2579–2605.

34. Abbasion, Rafsanjani, Farshidianfar, et al. Rolling element bearings multi-fault classification based on the wavelet denoising and support vector machine. Mech Syst Signal Pr 2007; 21: 2933–2945.

35. Zhang. Supervised locally linear embedding projection (SLLEP) for machinery fault diagnosis. Mech Syst Signal Pr 2011; 25: 3125–3134.

36. Yang, Zhang, Zhu. Intelligent fault diagnosis of rolling element bearing based on SVMs and fractal dimension. Mech Syst Signal Pr 2007; 21: 2012–2024.

37. Zhang, Peng, et al. A new deep learning model for fault diagnosis with good anti-noise and domain adaptation ability on raw vibration signals. Sensors 2017; 17: 425.