Abstract
Nowadays, multi-fault diagnosis has become the most interesting topic for researchers, since it has lately attracted a substantial attention. The most published works recently have considered defects detection, identification, and classification as the toughest challenge for rotating machinery monitoring. As feature extraction requires robust techniques for online inspection with a high level of expertise to make automatic decisions on the running machine health status, a robust approach is required to adjust the misclassification of the extracted features, especially under various working conditions. In this paper, we propose the combination of two Time Domain Features (TDFs) in tandem with Singular Value Decomposition (SVD) and Fuzzy Logic System (FLS) to build an enhanced fault diagnosis technique for rolling bearing. The original vibration signal is divided first into several data samples. Thereafter, TDFs are applied on each sample to construct a feature matrix during the feature extraction step. Afterwards, SVD is performed on the obtained matrices in order to reduce their dimension and select the most stable vectors (singular values). Finally, FLS is employed as a powerful tool for automatic feature classification. Experimental results confirm that our suggested approach can enhance the ability to assess the degradation of bearing faults with a higher recognition sensitivity even under different working conditions.
Keywords
Introduction
As key mechanical parts in rotating machines, bearings are often the heart of a wide range of industrial mechanisms, such as wind turbines and helicopters. However, they are considered as the most vulnerable parts due to their harsh working conditions. A sudden bearing failure may decrease the performance of the entire system and even lead to fatal damages. Therefore, the recent attention to condition-based and predictive maintenance of industrial assets, as well as product safety, have led to a growing focus on bearings condition monitoring. Vibration analysis has been and remains widely used as a predictive maintenance strategy because vibration signals carry a lot of valuable information that gives an insight about the operating machine health condition. When the shaft rotates inside a given machine, both frictional and rotational forces are created. The vibrations produced by these forces are transmitted through the bearings to the gearbox housing. In ordinary working conditions, each specific machine element produces its own relatively stable vibration patterns which are known as the vibration signatures. 1 These signatures are considered as a reference in order to determine any increase in the vibration level that is probably caused by a gear or a bearing failure. The change in the vibration signature can be used to detect emerging faults at an early stage before they become critical. Vibration monitoring techniques use the modified vibration signature to detect, locate, and diagnose the affected part. Vibration analysis is a more powerful tool than other condition monitoring methods like oil analysis, ultrasonic signals, etc, since a small bearing defect increases the amplitude of the acquired vibration signal. 1 However, these signals are often affected by contributions from many different components in addition of the background noise. Therefore, the choice of an appropriate technique becomes crucial and depends significantly on the nature of the captured data. Consequently, many signal processing techniques have been proposed by researchers during decades. The most popular ones include time domain, frequency domain and time–frequency domain as well. Several studies have focused on time domain analysis using pattern recognition basing on statistical characteristics which contribute to describe the machine’s state. 2 Time domain methods are applied directly on vibration signals in order to detect the change in their amplitude and energy distribution in the present of a bearing failure. 3 Frequency or spectrum domain analysis is probably the most common tool used to identify a defective gear. 4 The vibration signals produced by a machine component are composed of certain frequencies that remain unchanged, although their levels may change. The defective tooth generates a periodic signal with a unique characteristic frequency. 5 Unlike temporal analysis, frequency analysis identifies the position of the defect by tracking its characteristic frequency. However, spectrum domain analysis cannot handle the non-stationary signals, which are often very frequent in the event of machine part failure. Time-frequency domain methods allow a simultaneous vibration signals analysis in both time and frequency domains. Vibration signals consist of three important elements: “a sinusoidal component due to time varying loading, a broad-band impulsive component due to the fault impact, and random noise. 6 ” Time-frequency analysis has been developed for these non-stationary signals. Non-stationary signals are better described by a time-frequency distribution to show the signal’s energy distribution over a two-dimensional space (time and frequency). 7 Researchers have developed several time-frequency analysis methods to monitor the rotating machine health state. Short-Time Fourier Transform (STFT),8,9 Wigner Ville Distribution (WVD),10–12 Wavelet Transform (WT),13,14 and Hilbert–Huang Transform (HHT)15,16 are the most commonly used time-frequency techniques and recently Fast Kurtogram17–19 and Autogram.20–22 However, time-frequency methods have the ability to detect the occurrence of a bearing failure, but without giving any information about its nature. Moreover, the variation in load and speed of the rotating machine will eventually lead to inaccurate classification results. 23 Currently, intelligent classification techniques are the outstanding methods for bearings condition monitoring.23–26 Intelligent classification strategies contain two main steps: feature extraction and intelligent classification. The features extraction is the key step part in fault recognition and identification; an accurate features extraction process allows the extraction of the most relevant information from the raw vibration signal. Where, the secret of the accurate classification is directly related to the choice of the most advanced signal processing algorithms. Many researchers have suggested efficient techniques to extract rolling bearing defect features using vibration signals. Zair et al. 24 have proposed a combination between Empirical Mode Decomposition (EMD), Principal Component Analysis (PCA) and Fuzzy Entropy as a feature extraction method to identify bearing faults using experimental vibration signals. However, EMD suffers from two disadvantages: mode mixing, in which waves having identical frequencies are assigned to separate IMFs, and the end effect which gives inaccurate instantaneous values on either signal sides.27,28 To resolve this issue, Zhao et al. 29 have proposed a new fault feature extraction approach based on Ensemble EMD (EEMD) and multi-scale fuzzy entropy for bearing condition monitoring. EEMD adds white noise with finite amplitude to the raw signal during the decomposition step. This procedure is repeatedly performed with several white noise sets to obtain the corresponding IMFs ensemble averages reducing by that the chance of the mode mixing.27,28 Usually, vibration signals are often filled with real background noise which obscures the defect signature and the white noise series will raise the noise amplitude making it even harder to detect the gear failure. 30 An alternative decomposition method, Discrete Wavelet Transfer (DWT), has been extensively used in several works and literatures for the feature extraction phase.31–34 DWT decomposes the vibration signal using a band-pass filter in time and frequency into a series of signals with a specific frequency band. However, the dyadic step in the sub-sampling process appears to be the major limit in DWT. Maximal Overlap Discrete Wavelet Transform (MODWT) came out as an optimized version of DWT to handle the sub-sampling step, yet it still suffers from low frequency resolution as DWT does. 26 Maximal Overlap Discrete Wavelet Packet Transform (MODWPT) replaces MODWT and DWT for a better resolution. 26 Afia et al. 26 developed a new intelligent technique based on MODWPT, entropy for the feature extraction stage and artificial neural network (ANN) to detect and distinguish different types of defects under several loads and speeds. However, MODWPT is not a multi-rate decomposition method, making the feature extraction process very time consuming, which is not recommended for online condition monitoring of industrial devices. 20 In machine learning and artificial intelligence, the main purpose remains to find the most relevant features that play a dominant role in determining and influencing output results. Time energy indicators, such as Standard Deviation (SD), entropy (EN), kurtosis (KU), variance (VAR), etc. were used as time domain features (TDFs) through advanced signal processing algorithms to extract the different features matrices.23,24,26 However, these features matrices have a huge dimension which may lead to serious problems in the classification process and disrupt the most powerful classification algorithms, as well as, it makes the process considerably slow. Bellman, 35 named this issue by ‘the curse of dimensionality’ for their harmful influence on the features classification step. Therefore, dimensionality reduction plays a major role in feature classification. A smaller and relevant dimensional size might be helpful in identifying data patterns in feature classification phase. Basically, dimensionality reduction is a mapping from a multidimensional feature space to a reduced dimensions space. Principal component analysis (PCA)26,36,37 and singular value decomposition (SVD)38,39 are the most used methods for dimensional reduction. SVD is an excellent matrix analysis technique which has been extensively used for noise reduction, data compression, and feature extraction.38,39 After feature extraction procedure, feature classification seems to be the second most difficult task. It is a complex procedure which requires a powerful classification tools based on an artificial intelligence. Based on fuzzy set memberships, FLS is a simple classification method which is considered to be one of the most efficient techniques for fault diagnosis that makes a non-autonomous learning by the intervention of an expert.40,41 FLS is computer software application which simulates the expert’s inference process and performs an expert’s action manner.40,41 FLS uses fuzzy logic, fuzzy numbers and sets to reflect and handle inaccurate and unclear knowledge.40,41 Information about the rotating machine condition exhibit some degree of imprecision and uncertainty, that cannot reasonably be solved by traditional expertizing systems like neural networks. Thus, FLS has proven to be more adequate to diagnose defects in rotating equipment’s without depth knowledge and big database.
In this paper, a new intelligent automatic bearing fault diagnosis technique is proposed. Our hybrid technique combines two TDFs (Standard deviation and entropy), SVD and FLS in order to detect and to distinguish between different defects types under several loads and speeds. The paper is organized as follows: section “Introduction” involves an introduction of the work and a brief literature review. Section “Mathematical background” contains mathematical background, and further a straightforwardly experimental procedure. Results and discussion are given in section “Experimental study.” And finally the paper is ended with a conclusion and references.
Mathematical background
Feature extraction based on temporal analysis
In this sub-section, we will discuss the concept of feature extraction for assessing bearing performance degradations. Statistical features provide a valuable tool that discovers the change in vibration signal amplitude and energy distribution due to the event of a bearing failure.
In order to extract the defect information from the bearing time data, time energy indicators such as standard deviation (SD), entropy (EN), variance (VAR) and kurtosis (KU) are presented in equations (1) to (4), respectively 42 :
Where i is the sample of signal x, N represents the length.
In our approach, the original vibration data is first divided using equation (5) into many slices, whereby the time of each slice is greater than the bearing defect frequency period to avoid any loss of relevant information. Afterwards, SD and EN indicators are applied to every signal sample in order to calculate the feature matrices.
Where; Nt is the total signal length, fs is the sampling frequency while Ni is the number of slices.
Singular value decomposition
Singular value decomposition (SVD) is an excellent matrix analysis technique which has been extensively used for noise reduction, data compression and feature extraction. The singular value decomposition is a matrix transformation algorithm that decomposes any given matrix M(m×n) into three matrices U, Σ and V as follows38,39:
In which, U (m × m) and V (n × n) are orthogonal matrices while Σ is an (m × n) diagonal matrix of singular values (σ1 > σ2··· > σn).
The matrix M is represented as:
The columns of the orthogonal matrix U are the left singular vectors while the columns of the matrix V are the right singular vectors. The matrix Σ singular values can be used to understand the amount of variance explained by each of the singular vectors.
Fuzzy logic system
In classical set theory, the elements are either considered as part of a set specific or not without any intermediate ground. However, in fuzzy set theory, the element has a degree of set membership in which it might be a member in more than one set. Thus, fuzzy logic is used to define values between 0 and 1 based on membership set functions.43,44 These functions are characterized by linguistic variables that are commonly found in real life.
Fuzzy inference is composed of four essential parts: Fuzzification, rules, inference system and defuzzification, where the most used structure is Mamdani’s. 45 Figure 1 gives a short description about the main parts of the mamdani fuzzy system structure.

Mamdani fuzzy process.
Fuzzification
In the fuzzification step, crisp values are converted to linguistic variables using the membership function that is used for input fuzzification in which the membership degree is between 0 and 1. A set of membership functions are given to the inputs fuzzification such as: trapezoidal, Gaussian, triangular, etc. where the choice of these functions depends on the problem solving strategy (reasoning). The trapezoidal and triangular membership functions are described by four boundary values and three values, respectively (Figure 2). Equations (8) and (9) define the membership’s degree of fuzzy sets between every boundary values.

Trapezoidal (a) and triangular (b) membership functions.
Rules
Rules are the most important aspect of the classifier learning system, since they are based on each fuzzified membership input. These rules are defined using the IF-THEN expressions, for example if:
Defuzzification
In contrast to fuzzification, defuzzification converts the fuzzy output of the inference engine to a crisp value using membership functions similar to those used by fuzzifier. The center of gravity method is the most used technique for defuzzification as it averages the area of the resulting fuzzy sets using equation (10).
Figure 3 presents an example of defuzzification using the center of gravity procedure, where

Example of FLS defuzzication method (center of gravity).
Experimental study
Test rig description
In order to confirm the efficiency of our proposed method using experimental vibration signals. Vibration data sets have been provided by the Case Western Reserve University Bearing Data Center. 46 Figure 4 presents the basic design of the used test bench, which is composed of a 2 horsepower (HP) Reliance electric motor that drives a shaft on which a torque sensor and an encoder are installed. The torque is applied to the shaft via a dynamometer and an electronic control system. For the tests, only the Drive end bearing (DE) is considered in our study and the ball bearing details are given in Table 1.

Experimental bearing test rig with a schematic illustration.
Bearing parameters.
Single point defects of 0.007 and 0.028 inches in the inner race (IR) and rolling element (B) and single point defects of 0.007 inches in the outer race (OR) were introduced separately into the test bearings by electro-discharge machining (EDM). Vibration data were recorded with a sampling frequency of 12 KHZ. Experimental signals have been collected for several work conditions under different loads and speeds (0 HP [1797 rpm], 1 HP [1772 rpm], 2 HP [1750 rpm], 3 HP [1723 rpm]) as shown in Table 2.
Works conditions.
Experimental results and discussion
The data sets provided by Case Western Reserve University Bearing Data Center 46 will be used to evaluate the effectiveness and accuracy of our proposed approach for bearing diagnosis. Figure 5 displays the acceleration vibration signals recorded with healthy and three different OR, IR and B defects of 0.007 and 0.028 inches under 0 HP as load.

Vibration signal of various bearing defects under 0 HP load and 1797 rpm speed.
Temporal analysis
When the rolling elements pass over the faulty areas, small impacts occur, producing mechanical shock waves in the time data. These small impacts excite the natural mechanical resonance frequencies in the motor. Whenever a defect affects other bearing parts, the same phenomena appear, and its repetition rate is equal to one of the characteristic defect frequencies (Tables 3 and 4). However, in Figure 5, we can clearly see that the three defects OR, IR and B with 0.007 and 0.028 inches do not have a significant impact on the vibration signal, since there is no significant increase in the time signal energy and no repetitive shock waves are observed at each revolution period. The measured signals can be very noisy, due to electrical and mechanical sources. This makes it even more difficult to detect incipient the defect. Therefore, it is practically impossible to detect the presence of faults by simply viewing the signals in time domain, since the fault impulses are masked by noise. Thus, a further analysis is required in this case since is difficult to identify the bearing defects from the time data.
Formula of expected bearing faults frequencies.
Theorical frequencies at load of 0 Hp and speed of 1797 rpm.
Feature extraction based on TDFs and SVD
Our proposed method uses the TDFs-SVD combination to extract the defect feature in the feature extraction step while the FLS is used to automatically diagnose bearing failures. Experimental vibration signals with faultless and different bearing defects (OR, IR and B) under various loads as shown in the Table 2 are first divided into 270 groups of data sets using equation (11).
Where; Nt is the total signal length with 120,000 points, fs is the sampling frequency and it is equal to 12KHZ while the initial matrix contains 270 Ni slices. The 270 slices are divided into 90 groups of three and each group of three is arranged in a row, so that in total we have a (90×3=270) matrix with every value in the matrix is in fact a signal slice (for instance x1,1 is a signal slice). For six bearing states under various operating conditions, as shown in Table 2, we have in total 24 matrices of (90×3) presenting every case.
Figure 6 shows two time signal slices for OR (a) and IR (b) defects with 0.007 inches under no-load case with 1797 rpm. From Figure 6, the period of two successive maximum values is equal to the period obtained in equation (12). The obtained results match the theoretical frequencies in Table 4 avoiding by that any valuable information loss from the original signal.

Time vibration signal slices for OR (a) and IR (b) defects with 0.007 inches under 0 HP load and 1797 rpm speed.
Thereafter, entropy (EN) and standard deviation (SD) indicators are computed for each matrix obtained in the previous step in order to construct the feature matrix for each bearing state.
In order to avoid the dimensionality curse, SVD is then further used to reduce the dimensionality of the feature matrix and provide more stable vectors by extracting singular values for each bearing states under every working condition. The basis reason behind structuring the matrix in three columns is to limit the number of eigenvalues to three values. Finally, those three singular values will be selected to be used as inputs for the classifier system. Figures 7 and 8 give a better illustration about our proposed method.

Flowchart of proposed method.

Block diagram of proposed method.
Figures 9 and 10 show, respectively, the first, second and third singular values of the SD and EN feature matrices for the six different bearing states under variable speed and loads.

Standard deviation features of the first, second and the third singular values.

Entropy features of the first, second and the third singular values.
As shown in Figures 9 and 10, there is almost no overlapping between the six bearing states for all four speeds for the first singular values for both SD and EN. However, in the second and the third singular values almost all points overlap at every bearing states for the same speeds. From these figures, it appears that only the first singular values for both EN and SD are more stable and separable than the second and third singular values of the same indicators, which means that only the first singular values take ahead step in the feature classification, and it can be used to produce a new robust automatic classification method for bearing fault diagnosis.
In order to assess the impact of our proposed method before further investigation, a comparison is made between EN-SD and two of the most commonly used indicators; kurtosis (KU) and variance (VAR). Through the same feature extraction process, Figure 11 displays samples distribution using first singular value of EN-SD (a) and KU-VAR (b) are implemented in a two-dimensional representation. EN-SVD and SD-SVD features are plotted in Figure 11(a) respectively in the vertical and horizontal axes, while KU-SVD and VAR-SVD features are plotted in Figure 11(b).

Distribution samples (EN-SD) (a) and (KU-VAR) (b) with first singular values.
As shown in Figure 11, KU-VAR sample distribution using the first singular value gives a mixed states sample (the healthy sample with a 0.007 inch ball defect) while the EN-SD gives more appreciable results without any confusion. Moreover, the defects separability is highly satisfactory which will facilitate the input fuzzification.
Figure 12 shows another comparison between EN-SD and KU-VAR, but this time, the second singular values are plotted. As shown in Figure 12 and as mentioned previously, a poor feature extraction results using the second singular values have been obtained, which explains the decision of using only the first singular values for further analysis, since they are more appropriate for bearing fault diagnosis.

Distribution samples (EN-SD) (a) and (KU-VAR) (b) with second singular values.
State classification based on fuzzy logic system (FLS)
In this session, the fault feature vectors obtained by TDFs-SVD are used to identify and classify the bearing states by using the FLS. The proposed approach is tested under 24 different operating modes with 6 different bearing states (Table 2). Each vibration signal is divided into 270 data sets. The 270 slices are divided into 90 groups of three, and each group of three is arranged in a row, so that in total we have a (90×3=270) matrix. Thereafter, entropy and standard deviation indicators are applied for every matrix in order to obtain the feature matrix for each bearing state. Afterward, SVD is then further used to reduce the dimensionality of the feature matrix and to provide more stable vectors by extracting singular values for each bearing states under every working condition. Only the first singular values will be selected to be used as inputs for the classifier system. Finally, the FLS classifier provides an excellent tool for detecting, identifying and classifying different bearing states, even under different working conditions using well-defined memberships. The selected features of the first singular values of SD and EN are chosen as inputs of fuzzy classifier system as indicated in Figure 13.

Fuzzification of inputs.
The inputs are fuzzified on the trapezoidal membership function by calculating the min-max feature inputs as indicated in Figure 14. The choice of the trapezoidal function returns to a certain interval between the min-max input boundaries. The fuzzification of the inputs and outputs are presented in Figures 15 and 16, respectively.

Details of inputs-fuzzification.

Memberships type of inputs-fuzzification.

Memberships type of outputs-fuzzification.
Our classification system has two outputs as shown in Figure 16; the first is fuzzified for four memberships (H,IR,OR,B) to identify the fault type, while the second is fuzzified for three memberships, healthy, defects with sizes of 0.007 and 0.028 inches. The second output is used to show the bearing defect severity. The data sets are divided into training data (30 % of data sets) and test data (70 % of data sets) to test the effectiveness of the proposed approach. Table 5 provides detailed information on training and testing data numbers while Figure 17 summarizes the classification section.
Testing and training data.

Classification part.
The rules defined for our model are listed as follow:
If EN is H and SD is H then: Output 1 is H Output 2 is H.
If EN is IR7 and SD is IR7 then: Output 1 is 0.007 Output 2 is IR.
If EN is OR7 and SD is OR7 then: Output 1 is 0.007 Output 2 is OR.
If EN is B7 and SD is B7 then: Output 1 is 0.007 Output 2 is B.
If EN is IR28 and SD is IR28 then: Output 1 is 0.028 Output 2 is IR.
If EN is B28 and SD is B28 then: Output 1 is 0.028 Output 2 is B.
Figure 18 shows the surface view of the first (a) and second (b) output; where the inputs are structured in X and Y axes and the output is in Z axis. The axis boundaries are defined from the fuzzification part of the input–output boundaries (crisp values).

Surface view of the first (a) and the second (b) output.
Figure 19 shows the first output which is fuzzified for four memberships (H,IR,OR,B) to identify the bearing fault type while Figure 20 shows the second output which is fuzzified for three memberships, healthy, defects with 0.007 and 0.028 inch sizes. Figures 19 and 20 clearly show that in a noisy environment and even under different working conditions, especially in the no load case, all samples are correctly classified and the actual outputs meet the target output classes. Experimental results highlight the efficiency and the accuracy of our proposed approach, which is based on TDFs, SVD and FLS. Our method offers a high classification rate, which implies a good decision for bearing fault diagnosis, even under varying working conditions with several speeds and loads.

Tested samples of the first output.

Tested samples of the second output.
Recently, ANN has been extensively used for feature classification for its efficiency and accuracy. However, the major problem with machine learning methods is the small size of the databases, which affects the neural network performance. On the other hand, fuzzy logic integrates the learning process by using human (expert) logic to solve non-complex data. A comparative study with one of the most used ANN tools, multi-layer perceptron (MLP) neural network, is made to confirm the advantage of our hybrid method. The same datasets which are divided into training data (30% of the datasets) and test data (70% of the datasets) are taken as input to the neural network.
Figures 21 and 22 show the performance of MLP network for the first and the second outputs, respectively.

Performance evaluation of proposed MLP Network for the first output.

Performance evaluation of proposed MLP Network for the second output.
As shown in Figures 21 and 22, MLP offers a performance of 0.01638 Mean Squared Error (MSE) with 340 epochs for the first output and 0.0030129 MSE with 73 epochs for the second one. Clearly, FLS has an advantage over MLP in terms of classification accuracy (−1.5357e−4 for the first output and −2.7817e−4 for the second output). The obtained results illustrate the feasibility and efficiency of the proposed algorithm.
Conclusion
Bearing fault diagnosis using vibration analysis has been and remains widely used as a predictive maintenance strategy, since vibration signals carry a lot of valuable information that gives an insight about the health condition of an operating machine. However, vibration signals exhibit nonlinear and non-stationary behaviors buried in high background noise. This considerably increases the difficulty of defect detection, identification and classification using a conventional condition monitoring tool. Extracting bearing defect features require a robust automatic fault diagnosis technique for online inspection, a superb approach that could detect, identify and classify several bearing a defects especially under variable working condition with different speeds and loads. Therefore, a hybrid bearing fault diagnosis method has been developed in this paper to classify several bearing states from raw experimental vibration data. The proposed method combines between some TDFs tools and SVD for the feature extraction process and Fuzzy Logic System (FLS) for intelligent fault classification. Experimental results from the reference test bench provided by Case Wastern University (CWU) prove that our hybrid method is a reliable technique that shows great accuracy concerning the bearing failures classification even under different operating conditions.
Some future studies will be conducted to improve the field of bearing condition monitoring. For example, the proposed method has been studied for the case of different stable RPM therefore a natural extension of it will integrate an application to time-varying speed data.
Footnotes
Acknowledgements
The authors would like to thank the Solid Mechanics and Systems Laboratory (LMSS), University M’Hamed Bougara of Boumerdes, Faculty of Technology, Boumerdes, and the General Directorate for Scientific Research and Technological Development (DGRSDT), Algeria.
Handling Editor: James Baldwin
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
