Abstract
The traditional bearing fault diagnosis method is achieved often by sampling the bearing vibration data under the Shannon sampling theorem. Then, the information of the bearing state can be extracted from the vibration data, which will be used in fault diagnosis. A long-term and continuous monitoring needs to sample and store large amounts of raw vibration signals, which will burden the data storage and transmission greatly. For this problem, a new bearing fault diagnosis method based on compressed sensing is presented, which just needs to sample and store a small amount of compressed observation data and uses these data directly to achieve the fault diagnosis. First, several over-complete dictionaries are trained by dictionary learning method using the historical operating data of the bearings. Each of these dictionaries can be effective in signal sparse decomposition for a particular state, while the signals corresponding to other states cannot be decomposed sparsely. According to this difference, the bearing states can be identified finally. The fault diagnosis results of the proposed method with different parameters are analyzed. The effectiveness of the method is validated by experimental tests.
Introduction
Considering the material defects, manufacturing errors, working conditions, and other factors such as fatigue, aging, and so on, the damages and faults of the rotating machinery occur inevitably during operation. The bearing is one of the most common and most important key components in rotating machinery; in case of failure, this will lead to the equipment downtime which affects productivity and results in economic loss. Thus, it is particularly important for rotating machinery to execute bearing condition monitoring and fault diagnosing.
The traditional fault diagnosis method is achieved often by sampling the bearing vibration data under the Shannon sampling theorem. The long-term and continuous monitoring needs to sample and store large amounts of raw vibration signal, which will burden the data storage and transmission greatly. The compressed sensing theory can provide a new idea in solving this problem. In 2006, Candès proved in mathematic principle that the original signal could be reconstructed using parts of its Fourier transform coefficients, which would be the theoretical foundation for compressed sensing. 1 Then, Donoho 2 and Candès 3 proposed the concept of compressed sensing formally based on the related work. The main process of compressed sensing can be divided into two steps. First, combining the sampling with the compressing, we can acquire the nonadaptive linear projections (or measurements) of the original signal. Then, the original signal can be reconstructed directly with these measurements by the appropriate recovery algorithms.4–7 With this strategy, the amount of data in monitoring will be reduced greatly, and the burden in data transmission and storage will be alleviated effectively. Since it is possible to recover the original high-dimensional signal from the compressed measurements with low dimension, which means that most of the bearing state information is contained in these low-dimensional measurements,8–12 then we can consider achieving the bearing fault diagnosis just using the compressed measurements directly, without recovering the original signal. This is the starting point of our proposed method.
The other sections of this article are organized as follows: in section “Compressed sensing theory,” we will briefly introduce the basic theory of the compressed sensing; the theory of the proposed bearing fault diagnosis method using the compressed measurements directly will be presented in section “Fault diagnosis method”; in section “Experimental test,” the proposed method will be tested with different bearing vibration signals; and, finally, in section “Conclusion,” this article will be summarized.
Compressed sensing theory
For a signal
Then, we can acquire the compressed measurements (or observations)
Actually, the signal x is not sparse in general, but it can be represented sparsely using proper ways such as orthogonal transformation. If we expand
where
where
The compressed measurements can be represented in matrix form as Figure 1. Owing to the fact that the vector

Matrix form of the compressed measurements.
In the previous introduction, the original signal x is represented sparsely on the dictionary with orthogonal basis. However, this kind of dictionary has limited capacity to the signal sparse representation. Therefore, other types of the dictionaries such as a variety of over-complete dictionaries are often used in practice. According to the difference in the applications, over-complete dictionaries can be divided into two categories: fixed dictionaries which can be used for nonspecific signals and trained dictionaries which can be used only for specific signals. In general, the fixed dictionary can be used for many different kinds of signals, but it is difficult to decompose the signal very sparsely, and the signal representation error would be larger. While the trained dictionary can decompose the signal very sparsely since the structure and characteristics information of the training samples is used in dictionary learning process; therefore, the signal sparse representation result can be very good. However, this kind of dictionaries can be used only for the signals which have the same state with the training samples. The most frequently used dictionary learning methods are the method of optimal directions (MODs)26,27 and the K-singular value decomposition (K-SVD) method.28,29 In this article, the over-complete dictionaries corresponding to different bearing states will be trained by the K-SVD method.
Fault diagnosis method
For the acquisition of the bearing vibration signal, we denote the collected high-dimensional signal by

Relationship between the traditional sampling and compressed sampling.
We define
where
Defining
Then, we can obtain n + 1 representation errors as
For a low-dimensional compressed signal y, the representation errors
According to the above analysis, the bearing states can be identified by comparing the representation errors of the low-dimensional compressed signal on different dictionaries. The process of the proposed fault diagnosis method is shown in Figure 3, and the corresponding steps are as follows:
Acquiring the high-dimensional vibration signals when the bearing works in different states, which will be used as the training samples in dictionary learning;
Training the over-complete dictionaries (
Setting the thresholds (
Acquiring the low-dimensional vibration signals
Representing the low-dimensional signal y on all the known dictionaries (
Estimating the state of the bearing:
When
When
When

Flowchart of the proposed bearing fault diagnosis method.
In the diagnostic process shown in Figure 3, we determine that the bearing is in some unknown state and confirm that the bearing is in the fault state but not a known fault state. Actually, if more samples corresponding to different fault states can be acquired, then we can obtain more dictionaries by training, which can be used to recognize the unknown bearing state. The dictionaries corresponding to different states play a role similar to a sieve, which can filter the signals with the corresponding state and achieve the identification of the bearing state ultimately. For the error thresholds corresponding to different states, they can be set according to the prior knowledge.
The reason why we set different thresholds of signal representation errors is that there have been many different types of bearing faults, and we cannot obtain enough dictionaries by training to describe all the possible bearing states. For some signal which is not in any one of the known states, if we decompose this signal on all the known dictionaries (
The error
The error
As can be seen from the fault diagnosing process, the proposed method is mainly affected by the following factors: the principle to determine the fault state, the signal reconstruction algorithm, and the compressed sampling way. The signal reconstruction algorithm is used to solve the expansion coefficient vector
Experimental test
The proposed fault diagnosing method is validated using the vibration signals from the 6205-2RS JEK SKF deep groove ball bearings (data sources are from Case Western Reserve University Bearing Data Center Website,
30
and the signal sampling frequency is 12K). The data used in our tests can be divided into two types: the training samples used in dictionary learning and the test samples used for the validation of the proposed method. The signals sampled in a traditional way should be high-dimensional. For dictionary learning, the high-dimensional samples can be used directly. While the test samples should be low-dimensional, in our tests, they will be obtained by simulating the compressed sampling. Defining
The training samples can be divided into four categories: normal state samples, inner ring fault samples, outer ring fault samples, and rolling element fault samples. Each type of the samples contains 20,480 signals, which can be also divided into four groups, and each group contains 5120 signals according to different motor speeds and loads. The training samples are shown in Table 1.
Training samples.
Then, the over-complete dictionaries (
The original high-dimensional test samples contain 800 signals corresponding to the normal state and 1200 signals corresponding to the fault states. The fault can be divided into three categories: the inner ring fault, outer ring fault, and rolling element fault. For each type of fault, we acquire 400 signals. Considering that the effects of variable speed operation or variable loads are extremely common in industrial applications, we will analyze the data with different levels of speed and motor load in our experimental tests. In accordance with different motor speeds and loads, the test samples corresponding to each bearing state contain four groups of signals, as shown in Table 2. The faults were designed as single point in inner ring, outer ring, and rolling elements, which were introduced to the test bearings using electrodischarge machining with fault diameter of 0.021 in and fault depth of 0.011 in.
Original high-dimensional test samples.
In order to set appropriate error thresholds, we will refer to the prior knowledge. For each bearing state, we can select some training samples randomly in Table 1, taking 1000 signals, for instance, and then the corresponding low-dimensional signals can be obtained by simulating the compressed sampling. Denoting the observation matrix by

Representation errors of the low-dimensional signals on different dictionaries: (a) normal state, (b) inner ring fault, (c) outer ring fault, and (d) rolling element fault.
Ideally, the two parts of the data in Figure 4 should be separated completely according to the principle of the proposed fault diagnosis method. This means that for each state, the corresponding error threshold should be more than all of the values indicated by the circles and less than all of the values indicated by the points. Actually, this ideal case is extremely difficult to achieve in practice. Therefore, the following principle will be used in setting the thresholds: for each state, the corresponding error threshold should be more than most of the values indicated by the circles and less than most of the values indicated by the points in the corresponding figure. In all of our tests, we will follow the “95% principle,” meaning that the threshold will be set as the value which is more than and just more than 95% representation errors of the signals. For example, for the error threshold corresponding to the normal state, we can set it as the value which is more than and just more than 950 data indicated by the circles in Figure 4(a). Defining
Taking Gaussian random matrix 14 as the observation matrix in compressed sampling and setting the amount (M) of the compressed measurements as 80, then we can obtain the low-dimensional signals corresponding to the original test samples as shown in Table 2. As mentioned in our article, in order to reconstruct the sparse representation vector, the observation matrix should be irrelevant to the dictionary used in signal sparse representation. In our tests, the dictionaries are always trained with different training samples, meaning that these dictionaries are not constant and can be changed when the training samples change. Due to the fact that Gaussian random matrix can be irrelevant to most of the transform matrices, it can be used as the compressed observation matrix in most of the cases. That is why we take Gaussian random matrix as the compressed observation matrix in our tests.
In our proposed method, the bearing states can be determined just using these low-dimensional signals directly. The expansion coefficient vector is solved by the MP algorithm, where the sparsity of the coefficient vector is set as 10. Then, the bearing fault diagnosing can be achieved, and the results are shown in Table 3. The recognition rate to some state characterizes the ratio of the amount of the samples identified accurately to the amount of all the test samples in this state. It can be seen from Table 3 that the recognition rates of the proposed method to the normal state reach to 97.88%, and for the fault states, the recognition rates can be up to 80%. Meanwhile, the low-dimensional signals which cannot be identified correctly are determined to be in some unknown state. These results validate the effectiveness of the proposed method to the bearing fault diagnosing.
Fault diagnosing results.
According to the theory and process of the fault diagnosing used above, the thresholds of the signal representation errors have a significant impact on the diagnosing results. Keeping other parameters invariable, Figure 5 shows the recognition rates of the four bearing states when the corresponding error thresholds change. Figure 5(a) shows the recognition rates of the normal state when the error threshold

Recognition rates of the bearing states with different error thresholds: (a) normal state, (b) inner ring fault, (c) outer ring fault, and (d) rolling element fault.
It can be seen from Figure 5 that with the increase in the error thresholds, the recognition rates of the four bearing states will increase gradually. However, it should be noted that the thresholds should not be very large, although a larger threshold can improve the recognition rate of the corresponding state. A very large threshold will increase the possibility of misjudging when the signal is actually in some unknown state. In next test, we take the 400 signals corresponding to the outer ring fault as shown in Table 2 as the original high-dimensional test samples. Taking Gaussian random matrix as the observation matrix in compressed sampling and setting the amount (M) of the compressed measurements as 80, then we can obtain the low-dimensional signals corresponding to these original test samples. Suppose we obtain only three over-complete dictionaries as
The expansion coefficient vector is solved by the MP algorithm, where the sparsity of the coefficient vector is set as 10. If we keep the threshold

Misjudging rates of the bearing state with different error thresholds
It can be seen from Figure 6 that with the increase in the error threshold
In the above tests, the sparsity set in MP algorithm was set as 10 all the time, meaning that the number of the atoms involved in the representation of the low-dimensional signals is kept 10. According to the fault diagnosis theory in this article, the number of the atoms used in signal representation affects the sparse representation errors directly. Then, the fault diagnosing results will also be affected. Similar to the previous tests, we keep the thresholds of signal representation errors as follows: the threshold

Recognition rates of the different states with different sparsities in MP algorithm.
As can be seen from Figure 7, with the increase in the sparsity, the recognition rates of the normal state, inner ring fault, and outer ring fault will gradually improve, while the recognition rates of the rolling element fault show a trend of increasing first and then declining slightly. In general, a larger sparsity in MP algorithm is of benefit to bearing fault diagnosing. This result should relate to the character of the dictionaries. Taking the dictionary corresponding to the normal state for example, with the increase in the sparsity, the atoms involved in signal representation will increase. In this case, the signals in the normal state can be represented more accurately, and the corresponding representation errors will become smaller. While this dictionary is not available for the signals in other states, therefore, with the sparsity increasing, the corresponding representation errors cannot improve significantly. According to the fault diagnosis theory of the proposed method, this difference in representation errors will be of benefit to the state identification. That is why the state recognition rates improved with the increasing sparsity. For the trend that the recognition rates of the rolling element fault declined when the sparsity becomes larger, the following analysis may be taken as an explanation. For the dictionary corresponding to the rolling element fault, with the increase in the sparsity, not only the representation errors of the signals in the rolling element fault on this dictionary decline but also the representation errors of the signals in other states decline. In this case, the difference in the representation errors corresponding to different states will shrink, which may result in the declining of the recognition rates.
Although a larger sparsity in MP algorithm is propitious to the bearing state identification, we must note that the sparsity should not be very large, for a too large sparsity will bring in the increasing of the computation, which should be a disadvantage to bearing fault diagnosing. Therefore, we should set the sparsity as a moderate size. In general, the sparsity in MP algorithm can be set as the value which is equal to the sparsity set in dictionary learning, just as what we did in this article.
In the previous analysis, we just used 80 compressed measurements in fault diagnosing. The following analysis will focus on the bearing fault diagnosis with different amount of the compressed measurements. The error thresholds will always be set by the “95% principle” introduced above. In this test, the Gaussian random matrix is taken as the observation matrix. The expansion coefficient vector is solved by MP algorithm, and the sparsity of the coefficient vector is set as 10. Then, the recognition rates of the four states with different amount (M) of the compressed measurements are calculated, and the results are shown in Figure 8.

Recognition rates of the four states with different amount (M) of the compressed measurements.
It can be seen from Figure 8 that when M is less than 100, with the increase in the compressed measurements, the recognition rates of the four states will improve first and then being stabilized gradually on the whole, which means that more compressed measurements is of benefit to the bearing fault diagnosing. The more the compressed measurements, the more the information of the bearing operation will be contained, which is propitious to the bearing state identification. The less compressed measurements will alleviate the burden of the data storage and transmission, which is also a disadvantage to the fault diagnosing in the meantime. The more compressed measurements will be an advantage to the fault diagnosing, but the effectiveness in alleviating the burden of the data storage and transmission will be weaken accordingly. In practice, we should give full consideration to both of the two factors and set a moderate amount of the compressed measurements.
In the experimental tests of this section, we obtained the compressed measurements by taking the Gaussian random matrix as the observation matrix. Actually, different compressed sampling systems will correspond to different observation matrices, and of course, we can also use other compressed sampling ways. To validate our proposed method with different compressed sampling ways, we will analyze and compare the fault diagnosing results in several typical compressed sampling ways (based on Gaussian random matrix, 14 partial orthogonal matrix, 6 and Toeplitz and circulant matrices, 31 respectively). The error thresholds will always be set by the “95% principle” introduced above. The expansion coefficient vector is solved by MP algorithm, and the sparsity is set as 10. Then, the recognition rates of the four bearing states when using different observation matrices are calculated, respectively, and the results are shown in Figure 9. Figure 9(a) shows the recognition rates of the normal state in the three compressed sampling ways. Similarly, Figure 9(b)–(d) shows the recognition rates of the inner ring fault, outer ring fault, and rolling element fault, respectively.

Fault diagnosing results corresponding to different observation matrices: (a) normal state, (b) inner ring fault, (c) outer ring fault, and (d) rolling element fault.
As can be seen from Figure 9, when M is less than 100, with the increase in the compressed measurements, the recognition rates of the four bearing states will show an overall trend of improving gradually and then being stabilized in all of the three compressed sampling ways. This means that more compressed measurements are of benefit to the fault diagnosing in most cases, no matter which compressed sampling way we use. This results show that the proposed method is effective when taking any one of the three compressed sampling ways. The results in Figure 9 also indicate that the fault diagnosing result when using Toeplitz and circulant matrices is slightly better than that when using other two observation matrices. The fluctuations of the recognition rates in each figure should be attributed to the randomness introduced when building the observation matrix and setting the thresholds.
Conclusion
A bearing fault diagnosis method based on the low-dimensional compressed measurements is proposed in this article. In the proposed method, it is not necessary to reconstruct the original high-dimensional signals. The bearing fault diagnosis can be achieved using a small amount of compressed measurements directly and then the data we need to acquire for diagnosing can be reduced greatly, which will alleviate the burden in data storing and transmission. The main parameters which affect the fault diagnosing results include the threshold of signal representation error, the sparsity set in MP algorithm, the amount of the compressed measurements, and the type of the observation matrix. The impacts of these parameters on the fault diagnosing results are analyzed, respectively, in this article, and some conclusions can be summarized as follows:
The error thresholds in the proposed method can be set by referring to the prior knowledge;
The sparsity in MP algorithm can be set as the value which is equal to the sparsity set in dictionary learning;
More compressed measurements are propitious to the state identification;
The Toeplitz and circulant matrices in compressed sampling has better performance in fault diagnosing.
Footnotes
Acknowledgements
The authors gratefully acknowledge the Bearing Data Center of Case Western Reserve University for providing the bearing test data. Valuable comments on this article from anonymous reviewers are very much appreciated.
Academic Editor: Fakher Chaari
Declaration of conflicting interests
The authors declare that there is no conflict of interest.
Funding
This study was financially supported by National Natural Science Foundation of China under grant nos 51375484, 51205401, and 51475463.
