Intelligent multi-fault identification and classification of defective bearings in gearbox

Abstract

Bearing faults in gearbox systems pose critical challenges to industrial operations, needing advanced diagnostic techniques for timely and accurate identification. In this paper, we propose a new hybrid method for automated classification and identification of defective bearings in gearbox systems with identical rotating frequencies. The method successfully segmented the signals and captured specific frequency components for deeper analysis employing three distinct signal processing approaches, ensemble empirical mode decomposition EEMD, wavelet packet transform WPT, empirical wavelet transform EWT. By decomposing vibration signals into discrete frequency bands using WPT, relevant features were extracted from each sub-band in the time domain, enabling the capturing of distinct fault characteristics across various frequency ranges. This extensive set of features is then served as inputs for machine learning algorithm in order to identify and classify the defective bearing in the gearbox system. Random forest RF, decision tree DT, ensemble tree ET classifiers showcased a notable accuracy in classifying different fault types and their localizations. The new approach shows the high performance of the diagnostic gearbox with a minimum of accuracy (Min = 99.95 %) and higher stability (standard deviation = 0.1).

Keywords

Bearing diagnosis gearbox signal processing fault classification machine learning

Introduction

Bearings are an essential component used in a broad range of industries, including automotive, wind turbine power generation, agriculture, manufacturing, and many others. They are widely employed in rotating machinery¹ and gearboxes^2,3 and being crucial transmission devices. In some situations, rotating machinery and gearbox systems may need to operate in conditions that are not ideal. These conditions may include situations where the system is subjected to high levels of stress, such as when it is overloaded or required to work at variable speeds. Additionally, the system may also have to work in environments with high ambient temperatures, which can further increase the stress placed on the system. A defective in a bearing, if it is not detected in time, this can lead to catastrophic machinery failure, prolonged machine downtime, and higher maintenance costs, resulting in significant financial loss. Further, the statistics that have been made show that the bearings defects represent about 76% in wind turbine gearbox.⁴ In order to ensure the safety of the system and to avoid damage, condition monitoring refers to the process of monitoring the condition of a system to identify any changes or abnormalities. On the other hand, fault diagnosis identifies the cause of a fault in a system. Applying the concepts of condition monitoring and fault diagnosis can improve the reliability and performance of a system, and the key challenge is to detect any defects at an early stage.

Signal processing approach can be employed to diagnose the bearing fault by utilizing the motor current,⁵ acoustic emission,⁶ and thermal image.⁷ However, numerous research studies have shown that the vibration signal is the most practical method to identify the bearing fault.^8–10 In modern industry with rapid development of sciences and technologies, Artificial Intelligence (AI) becomes a powerful tool for diagnosing faults. Therefore, essentially, two crucial steps have to be developed in AI tool: Features extraction, and fault recognition.

Several studies have been developed to improve feature extractions and to describe the bearing conditions. Most of frequency methods applied for nonstationary signals,¹¹ allow the extraction of the most relevant information from the raw vibration signal. To decompose the signal into an intrinsic mode function (IMF), some authors^12,13 use the empirical mode decomposition (EMD) method. However, this method has a disadvantage called mode mixing,¹⁴ which occurs when the intrinsic mode functions (IMFs) obtained from the decomposition process are contaminated by components from other modes. To mitigate the issue of the mixing mode, several techniques have been proposed, such as ensemble empirical mode decomposition (EEMD),^15–17 these technique use various methods, such as adding white noise to improve the accuracy and robustness of the decomposition.

Wavelet transforms analyze signals at different scale or resolution through a wavelet basis.^18,19 It is used to decompose signals into series of signal-components. This technique is a powerful tool for signal denoising,²⁰ but its weakness is the split of the high-frequency band which does not occur when the fault information exists.²¹ To conquer this issue, wavelet packet transform²² enable detection of both high and low frequency components with varying levels of decomposition.²³ By decomposing the signal into sub-bands using the wavelet packet transform, we can effectively separate the signal into different frequency components, allowing us to analyze and understand the signal in more detail. The decomposed signal is widely employed in fault diagnosis, particularly in extracting significant feature for the identification and analysis of fault.²⁴

Now, the challenge for researchers is to build an automatic fault diagnosis system, so that it is able to detect the defect in an early stage, identify and classify fault. The recent work on automatic fault diagnosis have focused on improving accuracy to the fault identification and localization. A variety of studies have been successfully employ artificial intelligent technique for bearing fault classification. Different machine learning algorithms are used such as: Artificial Neuron Network (ANN),²⁵ Fuzzy Logic System (FLS),²¹ Support Vector Machine (SVM),²⁶ and Random Forest (RF).²⁷

The structure of this paper is as follows: Section “Introduction” provides an introduction to the study and concise the overview of the existing literature. In Section “Gearbox System Description,” the mathematical model of the examined gearbox is presented. Section “Methodology and Fault Scenario,” we delve into the signal decomposition of both healthy and faulty modes, achieved by employing wavelet packet transform to extract pertinent signal features. Section “Results and Discussions” encompasses the outcomes and discourse on fault classification. Finally, the paper is ended with a conclusion and references.

Gearbox system description

In this paper, a new Gearbox system that has been realized in the laboratory is used to validate the fault detection and classification performance of the data-based methods versus different rolling bearings faults modes.

In this study, a Gearbox system was proposed, the physical model given in Figure 1, is described by the laws of physics to obtain the mathematical model. Hence, the major assumptions of the dynamic model are based upon^28–30 and are as follows:

Neglect the resonances of the gear case and the shaft transverse resonances;

Inertia and shaft mass are lumped at the bearings;

Ignore the shaft torsional stiffness (because the flexible coupling torsional stiffness is very low) and inter-tooth friction;

Gear teeth profiles are perfect involutes curves, with no geometrical, pitch or run out errors.

Figure 1.

Considered gearbox system: (a) real gearbox system and (b) physical gearbox system.

The system is presented in order to simulate the effects of the bearings faults, over the dynamical behavior of our model. The corresponding mathematical model has been developed.

With 10 degrees of freedom and 10 generalized coordinates the physical model has four angular variables $θ_{1}, θ_{2}, θ_{3}, θ_{4}$ , the motor rotational angle, the pinion and the wheel rotational angles and the load rotational angle. We have also six displacement variables $x_{1}, x_{2}, x_{3}, x_{4}, x_{5}, x_{6}$ , which are the radial displacement of 1st bearing, 2nd bearing, 3rd bearing, and 4th bearing respectively, and the radial displacement of pinion and wheel respectively. The mathematical equation describing the system to be diagnosed is given in equations (1) to (10).

Gearbox mechanical equations

First, it is necessary to write the gear model system in state space representation. In such a way that each second-order differential equation is written in the form of two first order differential equations. Thus, 10 nonlinear first order differential equations with time varying coefficients are obtained. These equations are written such that each equation contains the time derivative of only one variable. Thus, the obtained equations are simulated under Matlab software. The main notation used in the obtained equations are given in Table 1.

Table 1.

Gearbox data parameters.

Parameter	Description
$I_{m}$	Moments of inertia for electric motor (I_m).
$I_{p}$	Moments of inertia for pinion (I_p).
$I_{g}$	Moments of inertia for output gear (I_g).
$I_{r}$	Moments of inertia for driven machine (I_r).
$m$	Mass of the bearing and part of the shaft (m)
$m_{p}$	Mass of the input pinion (m_p).
$m_{g}$	Mass of the gear (m_g).
$T_{em}$	Output torque from load (T_em).
$K_{c}$	Torsional stiffness of the flexible coupling (k_c).
$C_{c}$	Viscous damping coefficient of flexible coupling (c_c).
$R_{p}$	Base circle radius of pinion (R_p).
$R_{g}$	Base circle radius of output gear (R_g).
$K_{b}$	Radial stiffness of the bearing (K_b).
$C_{b}$	Viscous damping coefficient of the bearing (C_b).
$K_{s}$	Shaft transverse stiffness (K_s).
	Number of teeth on pinion and gear.
$T_{r}$	Rotor time constant (T_r).
$p$	Number of poles pair (p).

Circular motion

{\overset{\cdot\cdot}{θ}}_{1} = (\frac{1}{I_{m}}) (T_{i} - k_{c} (θ_{1} - θ_{2}) - c_{c} ({\overset{\cdot}{θ}}_{1} - {\overset{\cdot}{θ}}_{2}))

(1)

\begin{array}{l} {\overset{\cdot\cdot}{θ}}_{2} = (\frac{1}{I_{1}}) (k_{c} (θ_{1} - θ_{2}) + c_{c} ({\dot{θ}}_{1} - {\dot{θ}}_{2}) \\ + R_{p} k (t) (R_{w} θ_{3} - R_{p} θ_{2} + x_{2} - x_{5}) \\ + R_{1} C (t) (R_{w} {\dot{θ}}_{3} - R_{p} {\dot{θ}}_{2} + {\dot{X}}_{2} - {\dot{X}}_{5})) \end{array}

(2)

\begin{array}{l} {\overset{\cdot\cdot}{θ}}_{3} = (\frac{1}{I_{p}}) (k_{c} (θ_{4} - θ_{3}) + C (t) ({\dot{θ}}_{4} - {\dot{θ}}_{3}) \\ + R_{2} k (t) (R_{p} θ_{2} - R_{w} θ_{3} + x_{5} - x_{2}) \\ + R_{w} C (t) (R_{p} {\dot{θ}}_{2} - R_{w} {\dot{θ}}_{3} + {\dot{x}}_{5} - {\dot{x}}_{2})) \end{array}

(3)

{\overset{\cdot\cdot}{θ}}_{4} = (\frac{1}{I_{r}}) (k_{c} (θ_{3} - θ_{4}) - c_{c} ({\overset{\cdot}{θ}}_{3} - {\overset{\cdot}{θ}}_{4}) - T_{r})

(4)

Movement straight

\begin{array}{l} {\overset{\cdot\cdot}{x}}_{1} = (\frac{1}{m}) (K_{b} (x_{1} + A_{1} \cos (ω_{s} t + \emptyset_{1}) \\ - f_{s} ω_{s} \sum_{m = 1}^{N_{1}} A_{2}^{m} m \cos (ω_{s} m t + \emptyset_{1}^{m}) \\ + f_{s} \sum_{m = 1}^{N_{2}} A_{3}^{m} \cos (ω_{c} m t + \emptyset_{2}^{m})) \\ + C_{b} ({\dot{x}}_{1} - A_{1} ω_{s} \sin (ω_{s} t + \emptyset_{1}) \\ - f_{s} ω_{s} \sum_{m = 1}^{N_{1}} A_{2}^{m} m \sin (ω_{s} m t + \emptyset_{1}^{m}) \\ - f_{s} ω_{c} \sum_{m = 1}^{N_{2}} A_{3}^{m} m \sin (ω_{c} m t + \emptyset_{2}^{m})) \\ + K_{S} (x_{1} - x_{2})) \end{array}

(5)

\begin{matrix} {\overset{\cdot\cdot}{x}}_{2} = (\frac{1}{m_{p}}) (K_{S} (x_{1} - x_{2}) + K_{S} (x_{3} - x_{2}) + k (t) \end{matrix} (R_{p} θ_{2} - R_{w} θ_{3} + x - x_{5}) + C (t) (R_{p} {\overset{\cdot}{θ}}_{2} - R_{w} {\overset{\cdot}{θ}}_{3} + {\overset{\cdot}{x}}_{2} - {\overset{\cdot}{x}}_{5}))

(6)

{\overset{\cdot\cdot}{x}}_{3} = (\frac{1}{m}) (K_{b} x_{3} + C_{b} {\overset{\cdot}{x}}_{3} + K_{S} (x_{3} - x_{2}))

(7)

{\overset{\cdot\cdot}{x}}_{4} = (\frac{1}{m}) (K_{b} x_{4} + C_{b} {\overset{\cdot}{x}}_{4} + K_{S} (x_{4} - x_{5}))

(8)

\begin{matrix} {\overset{\cdot\cdot}{x}}_{5} = (\frac{1}{m_{w}}) (K_{S} (x_{4} - x_{5}) + K_{S} (x_{6} - x_{5}) + k (t) \end{matrix} (R_{w} θ_{w} - R_{p} θ_{p} + x_{2} - x_{5}) + C (t) (R_{w} {\overset{\cdot}{θ}}_{w} - R_{p} {\overset{\cdot}{θ}}_{p} + {\overset{\cdot}{x}}_{2} - {\overset{\cdot}{x}}_{5}))

(9)

{\overset{\cdot\cdot}{x}}_{6} = (\frac{1}{m}) (K_{b} (x_{6} + F) + C_{b} ({\overset{\cdot}{x}}_{6} + \overset{+}{F)} K_{S} (x_{6} - x_{5}))

(10)

Vibration in bearings can stem from different origins, each of which can be distinguished by specific force factors. The primary culprits are bearing flaws, imbalances, and misalignments. Table 2 outlines the distinctive force factors linked to the prevailing causes of vibrations.

Table 2.

Fault functions parameters.

Vibration source	Force function
Imbalanced masse	$F_{1} = A_{1} \cos (2 π f_{s} t + \emptyset_{1})$
Misalignment	$F_{2} = \sum_{k = 1}^{\infty} A_{2}^{k} δ (t - \frac{k}{f_{s}}) \approx f_{s} \sum_{m = 1}^{N_{1}} A_{2}^{m} \cos (2 π f_{s} m t + \emptyset_{1}^{m})$
Bearing defects	$F_{3} = \sum_{k = 1}^{\infty} A_{3}^{k} δ (t - \frac{k}{f_{c}}) \approx f_{s} \sum_{m = 1}^{N_{2}} A_{3}^{m} \cos (2 π f_{c} m t + \emptyset_{2}^{m})$

Table 3 provides the principal symbols employed to describe the functions related to bearing defects.

Table 3.

Notation of the fault functions.

Parameter	Description
$A_{1}$	Amplitude value created by the imbalance
$A_{i}^{m}, i = 1, 2, 3$	Amplitude value for the mth harmonic
$A_{2}^{k}, A_{3}^{k}$	Initial magnitude value for the kth harmonic
$f_{s}$	Rotation frequency
$f_{c}$	Fault frequency associated with the bearing
$\emptyset_{1}$	Phase angle appropriate
$\emptyset_{i}^{m}, i = 1, 2, 3$	Value of the phase for the mth harmonic
$N_{i}, i = 1, 2, 3$	Harmonics number in the impulse train

Methodology and fault scenario

Our objective is to diagnose the bearings in the gearbox and classify the defective bearings using vibration signal. The study focuses on two bearings positioned within the shaft that is linked to the motor. As a result, both bearings rotate at the same speed and have identical dimensions. In order to improve feature extraction and create meaningful attributes for training classifiers, the vibration signal is decomposed.

The developed approach is based on two methods: signal processing technique versus Hybrid signature method. In the first step, we utilize the Autogram to extract the AM-FM modes from the vibration signatures. This technique allows us to precisely decompose the signal into a fixed number of modes. This will show significant results for detecting defects in bearings. The second step, consists of our developed study where we combine between the wavelet packet transform, the statistical feature extraction, and the feature classification by using Random Forest algorithm. Firstly, we decompose the signal using wavelet packet transform into high-frequency band and low frequency components to select the effective modes capture the characteristics of the vibration signatures. Then we calculate the set of features listed in Table 4 for 15 modes, resulting in 180 attributes. Hence, we obtain a more detailed understanding of the underlying characteristics of the vibration signature, which enables us to identify subtle changes and patterns that may be indicative. Finally, using the Random Forest classifier for faults identification and classification.

Table 4.

Features extracted from the WPT.

Feature	Equation
Root mean square	$\sqrt{\frac{1}{n} \sum_{i = 1}^{n} {\| x_{i} \|}^{2}}$
Crest factor	$\frac{{‖ x ‖}_{\infty}}{\sqrt{\frac{1}{n} \sum_{i = 1}^{n} {\| x_{i} \|}^{2}}}$
Peak to peak	Max(x) − Min(x)
Skewness	$E [{(\frac{x - μ}{σ})}^{3}]$
Kurtosis	$E {(\frac{x - μ}{σ^{4}})}^{4}$
Entropy	$- \sum_{i} p_{i} lo g_{2} (p_{i})$
Mean	$μ = \frac{1}{N} \sum_{i = 1}^{N} A_{i}$
Std	$σ = \sqrt{\frac{1}{N - 1} \sum_{i = 1}^{N} {\| A_{i} - μ \|}^{2}}$
Var	$\frac{1}{N} \sum_{i = 1}^{N} {(x_{i} - \tilde{x})}^{3}$
Root sum square	$X_{rss} = \sqrt{\sum_{n = 1}^{N} {\| x_{n} \|}^{2}}$
Max	$Max \| x_{i} \|$
Min	$Min \| x_{i} \|$
Mean square value	$\frac{1}{n} \sum_{i = 1}^{n} {\| x_{i} \|}^{2}$

Signal processing technique using Autogram

However, the proposed signals processing method, namely Autogram method developed by Moshrefzadeh and Fasana³¹ has proven to be more efficient than wavelet analysis in many applications.^32–34 The Autogram is released fourth-order spectral analysis tool for the detection and characterization of transients in a signal. The paradigm is based on the affirmation that each type of transient is combined with an optimal frequency and frequency resolution (dyad {f, Bw}) that maximizes its kurtosis, and therefore its detection. This could describe the Autogram in the following steps:

Step 1: data signal is filtered and divided in frequency bands and center frequencies using decimated wavelet packet transform (MODWPT).

Step 2: using periodicity of the autocovariance function, we calculate the unbiased autocovariance AC of the squared envelope for each node from signal filtered in step 1.

Step 3: find the most suitable frequency band for demodulating signal to have the successful diagnosis of bearings faults.

Diagnostics using Autogram is one of the most relevant methods; it demonstrated significant results for the detection of defects in bearings.^31,35 We will apply Autogram on the signal to shows the squared envelope spectrum for different states of rolling element bearing (healthy/faulty) in the gearbox. The healthy envelope signal is show in Figure 2. There is no frequency that characterizes the rolling element bearing failure in this figure; which means that the system is in a healthy state. The meshing frequency f_mesh is the only frequency that appears.

Figure 2.

Healthy state of gearbox system.

Furthermore, in order to ensure the information provided by Autogram the squared envelope spectrum of the signal with fault for three type of defects of the rolling element bearing (ball, inner race, and outer race faults) are presented in Figures 3 to 5 respectively. This is done under three different cases: case 1: defect in bearing 1, case 2: defect in bearing 2, and case 3: defect in both bearings at faith.

Figure 3.

Bearings fault in different scenario at ball fault: (a) bearing 1, (b) bearing 2, and (c) both bearings simultaneously.

Figure 4.

Bearings fault in different scenario at inner race fault: (a) bearing 1, (b) bearing 2, and (c) both bearings simultaneously.

Figure 5.

Bearings fault in different scenario at outer race fault: (a) bearing 1, (b) bearing 2, and (c) both bearings simultaneously.

According to the obtained results based on the squared envelope spectrum, this approach allows us the detection of the fault in bearing whatever the type of defect either for ball, inner race or outer race. The gearbox system considered by this study contains four bearings with identical dimensions (same frequency characteristics of the defect) and the same rotation frequency of each pair of bearings (the rotation frequency of the motor shaft and the rotation frequency of the load).

As a consequence, the current approach is not be able to locate the fault. From these figures we can see that it is not possible to determine the exact location of the defect using envelope analysis, as this approach does not provide information about whether the defect is present in bearing 1, bearing 2, or both bearings simultaneously.

A hybrid signature methods

In this section, the decomposition process of the proposed method starts by applying the wavelet packet transform to the original signal and then dividing the resulting into high-frequency band and low frequency components until the desired level of decomposition is reached. The proposed study-based wavelet packet decomposition and machine learning are presented in the following flowchart given in Figure 6.

Figure 6.

Flowchart of the proposed approach.

Ensemble empirical mode decomposition

Empirical mode decomposition serves as a self-adaptive approach for analyzing nonlinear and non-stationary signals. It works by breaking down complex signals into a set of intrinsic mode functions (IMFs), which are determined based on the signal’s local characteristic time scale. An IMF is defined by two key conditions: firstly, within the entire dataset, the count of extrema and zero-crossings must be either equal or differ by at most one; secondly, at any given point, the mean value between the envelopes formed by local maxima and minima is zero. The EMD process of a signal x(t) can be described as follows:

x (t) = \sum_{j = 1}^{n} c_{j} - r_{n}

(11)

Where $r_{n}$ is a residual of signal $x (t)$

(1) Identify the local maxima and local minima to produce upper envelope and lower envelope.

(2) Obtain the first component $h$ by calculated the difference between the signal and local mean of the two envelopes.

(3) Consider $h$ as the dataset and iterate through steps 1 and 2 until the envelopes achieve symmetry around zero mean according to specific criteria. Repeat this process as many times.

The final result, denoted as $c_{j}$ , is the final output of the sifting process. The process is considered complete when the residue, $r_{n}$ , transforms into a monotonic function, indicating that no additional intrinsic mode functions (IMFs) can be extracted.

To address mode mixing in EMD, Wu and Huang¹⁵ introduced ensemble empirical mode decomposition (EEMD), an enhanced variant of EMD. EEMD employs a noise-assisted approach by introducing finite white noise to the signal under investigation. This addition effectively resolves the mode mixing issue across all scenarios automatically. Consequently, EEMD stands as a significant advancement over traditional EMD methods.

Considering the characteristics of EMD, the proposed EEMD is formulated as follows:

add a white noise series to the targeted data;

decompose the data with added white noise into IMFs;

repeat step 1 and step 2 again and again, but with different white noise series each time; and

obtain the (ensemble) means of corresponding IMFs of the decompositions as the final result.

Based on the principle and observations above, the EEMD algorithm can be given as follows:

(1) Set the initial values for the ensemble number $M$ , the amplitude of the white noise to be added, and initialize $m$ to 1.

(2) Execute the $m th$ iteration by introducing white noise to the signal.

(a) Add a white noise series with the given amplitude to the investigated signal

x_{m} (t) = x_{m} (t) + n_{m} (t)

(12)

where $n_{m} (t)$ indicates the $m th$ added white noise series, and $x_{m} (t)$ represents the noise-added signal of the $m th$ trial.

(b) Apply the EMD method, as described earlier, to decompose the signal $x_{m} (t)$ with added noise into $I$ (IMFs) intrinsic mode functions $c_{i . m}$ ( $i = 1; 2; \dots, I$ ), where $c_{i . m}$ denotes the $i th$ IMF of the $m th$ trial, and $I$ is the number of IMFs.

(c) If $m < M$ then go to step (a) with $m = m + 1$ . Repeat steps (a) and (b) again and again, but with different white noise series each time.

(3) Compute the ensemble mean $\bar{c_{i}}$ for each intrinsic mode function (IMF) across all $M$ trials

\bar{c_{i}} = IM \sum_{m = 1}^{M} c_{i, m}, i = 1, 2, \dots, I, m = 1, 2, \dots, M

(13)

(4) Report the mean $\bar{c_{i}} (i = 1, 2, \dots, I)$ of each of the I IMFs as the final IMFs.

Empirical wavelet transform

Gilles³⁶ recently introduced the Empirical Wavelet Transform (EWT) within the realm of non-stationary signal processing. The primary aim of the EWT is to construct adaptive wavelets capable of effectively extracting amplitude modulation-frequency modulation (AM-FM) components from a signal. To achieve adaptability concerning the analyzed signal, the selection of the wavelet filter bank is guided by Fourier supports. The information extracted from the processed signal spectrum is used to identify Fourier supports. This involves locating local maxima and defining the support boundaries $ω_{i}$ as the middle points between successive maxima.

We begin by partitioning the Fourier support $[0, π]$ into $N$ consecutive segments. Each segment is denoted $Λ_{n} = [ω_{n - 1}, ω_{n}]$ , thus $\cup_{n - 1}^{N} Λ_{n} = [0, π]$ . Centered around each $ω_{n}$ , there exists a transient phase $T_{n}$ extending over a range of $2 τ_{n}$ (Figure 7). EWT are defined as bandpass filters on each $Λ_{n}$ .

Figure 7.

Segmentation of the Fourier axis and construction of EWT wavelets.

Hence, establish the empirical scaling function and empirical wavelets using the expressions provided in equations (14) and (15), respectively.

ϕ_{n} (ω) = {\begin{matrix} 1 if | ω | \leq ω_{n} - τ_{n} \\ \cos \begin{matrix} [\frac{π}{2} β (\frac{1}{2 τ_{n}} (| ω | - ω_{n} + τ_{n}))] \\ if ω_{n -} τ_{n} \leq | ω | \leq ω_{n} + τ_{n} \end{matrix} \\ 0 Otherwise \end{matrix}

(14)

ψ_{n} (ω) = {\begin{matrix} 1 if | ω | \leq ω_n - τ_n \\ \cos \begin{matrix} [\frac{π}{2} β (\frac{1}{2 τ_{n + 1}} (| ω | - ω_{n + 1} + τ_{n} + 1))] \\ if ω_{n + 1 -} τ_{n + 1} \leq | ω | \leq ω_{n + 1} + τ_{n + 1} \end{matrix} \\ \sin \begin{matrix} [\frac{π}{2} β (\frac{1}{2 τ_{n}} (| ω | - ω_{n} + τ_{n}))] \\ if ω_{n -} τ_{n} \leq | ω | \leq ω_{n} + τ_{n} \end{matrix} \\ 0 Otherwise \end{matrix}

(15)

The function $β (x)$ is given as follows:

β (x) = {\begin{matrix} 0 \\ β (x) + β (1 - x) = 1 \forall x \in [0, 1] \\ 1 \end{matrix}

(16)

Note that the most used function that satisfies this property:

β (x) = x^{4} (35 - 84 x + 70 x^{2} - 20 x^{3})

(17)

The detail coefficients $W_{f}^{ε} (n, t)$ are obtained by the inner products of the input signal with the empirical wavelets:

W_{f}^{ε} (n, t) = 〈 f, ψ_{n} 〉 = \int f (τ) \bar{ψ_{n} (τ - t)} dt = {(\hat{f} (ω) \bar{{\hat{ψ}}_{n} (ω)})}^{\lor}

(18)

The approximation coefficients with scaling function is given:

W_{f}^{ε} (0, t) = 〈 f, ϕ_{1} 〉 = \int f (τ) \bar{ϕ_{1} (τ - t)} dt

(19)

Finally, the original signal is decomposed into various empirical modes $f_{k} (t)$ , which is given by:

{\begin{matrix} f_{0} (t) = W_{f}^{ε} (0, t) * ϕ_{1} (t) \\ f_{k} (t) = W_{f}^{ε} (k, t) * ψ_{k} (t) \end{matrix}

(20)

The signal can be reconstructed as follows:

f (t) = W_{f}^{ε} (0, t) * ϕ_{1} (t) + \sum_{n = 1}^{N} W_{f}^{ε} (n, t) * ψ_{n} (t)

(21)

Wavelet packet transforms

The wavelet transform has been widely used in signal processing and signal denoising.¹⁸ The wavelet method uses the time-frequency representation of a signal through a set of wavelets.

The wavelet packet transform is an extension of the discrete wavelet transform (DWT).³⁷ In the DWT the signal (S) is decomposed using two complementary filters, high-pass and low-pass filter (Figure 8), can be extracted in approximation signal A from low-pass filter (large-scale and low frequency) and extracted in detailed signal D from high-pass filter (small-scale and low frequency).

Figure 8.

(a) Decomposition signal using DWT and (b) tree decomposition of signal.

In practice, the Discrete Wavelet Transform (DWT) can be realized using a set of low-pass and high-pass wavelet filters, represented as $h (k)$ and $g (k)$ respectively, where $g (k) = (- 1)^{k} h (1 - k)$ .

These filters are derived from the chosen wavelet function $ψ (t)$ and its associated scaling function $ϕ (t)$ , expressed as follows³⁸:

{\begin{matrix} ϕ (t) = \sqrt{2} \sum_{k} h (k) ϕ (2 t - k) \\ ψ (t) = \sqrt{2} \sum_{k} g (k) ϕ (2 t - k) \end{matrix}

(22)

Using the wavelet filters, the signal is decomposed into a set of low and high frequency components as³⁸

{\begin{matrix} a_{j, k} = \sum_{k} h (2 k - m) a_{j - 1, m} \\ d_{j, k} = \sum_{k} g (2 k - m) a_{j - 1, m} \end{matrix}

(23)

Equation (23) defines $a_{j, k}$ as the approximation coefficient, representing the signal’s low-frequency components, and $d_{j, k}$ as the detail coefficient, corresponding to the signal’s high-frequency components. To compute the approximation and detail coefficients at wavelet scale 2j (where j indicates the level), we convolve the approximation coefficients from the previous level (j−1) with the low-pass and high-pass filter coefficients, respectively.

The WPT splits the approximations and details into finer components. Therefore, it can decompose the high frequency part.

To conduct Wavelet Packet Transform (WPT) of a signal at a specific level (e.g. level 3), the functions described in equation (22) are combined as follows:

{\begin{matrix} U_{2 n} (t) = \sqrt{2} \sum_{k} h (k) U_{n} (2 t - k) \\ U_{2 n + 1} (t) = \sqrt{2} \sum_{k} g (k) U_{n} (2 t - k) \end{matrix}

(24)

where $U_{0} (t) = ϕ (t)$ , and $U_{1} (t) = ψ (t)$ . Correspondingly, the signal is decomposed as³⁹

{\begin{matrix} d_{j + 1, 2 n} = \sum_{m} h (m - 2 k) d_{j, n} \\ d_{j + 1, 2 n + 1} = \sum_{m} g (m - 2 k) d_{j, n} \end{matrix}

(25)

where $d_{j, n}$ denotes the wavelet coefficients at the $j$ level, $n$ sub-band, $d_{j + 1, 2 n}$ and $d_{j + 1, 2 n + 1}$ denotes the wavelet coefficients at the $j + 1$ level, $2 n$ and $2 n + 1$ sub-bands, respectively, and $m$ is the number of the wavelet coefficients. As illustrated in Figure 9, a 3-level WPT generates a total of eight sub-bands, and each sub-band covers one eighth of the frequency information successively.

Figure 9.

Illustration of wavelet packet transform.

The vibration signals analyzed in this study covering both bearings for the normal (Healthy H) operating condition and three distinct bearing defects of each one: Inner Race Fault (IRF), Outer Race Fault (ORF), and Ball Fault (BF). In Figure 10, the vibration data corresponding to various fault types, along with the healthy state, is presented.

Figure 10.

Original vibration signals of different bearings health state.

In this situation, we choose to decompose the signal into three levels of decomposition, which means that the resulting wavelet packet tree will have three levels of sub-bands. Each sub-band in the tree represents a different frequency range and resolution of the original signal. By decomposing the signal into sub-bands using the wavelet packet transform, we can effectively separate the signal into different frequency components. Signal decomposition with the Daubechies 4 mother wavelet is given in Figures 11 and 12.

Figure 11.

Signal decomposition results using WPT for bearing 1.

Figure 12.

Signal decomposition results using WPT for bearing 2.

The vibration acceleration signals of rolling bearing in different conditions modes are shown in Figures 11 and 12, which represent the bearings in different modes of faults, outer race, inner race, and ball faults. These figures show that the vibration signals in the four conditions mode are highly complex, which lead to the incapability of making difference between the faults in each case.

Features extraction based on signal processing

Feature extraction is a critical step in data processing that involves identifying and transforming relevant variables from raw data. This technique is essential for preparing data for further analysis, modeling, or visualization. Raw data often contains numerous variables that can lead to increased computational complexity and memory usage. Feature extraction aims to reduce data dimensionality while retaining essential information.

At this level of development, we focus our study by determining 12 statistical feature parameters through the approximation and detail of decomposition until level 3. To calculate the statistical feature parameters, a range of statistical measures are employed for both the approximation and detail signals at each level of decomposition. These measures encompass a variety of statistical indicators, such as max, min, rms, mean, variance, skewness, kurtosis, energy, entropy …, etc.

Feature selection and classification

In the last decay, machine learning has produced considerable results in a number of application areas, including Rolling bearing problems. In this contribution, we have used a random forest algorithm, which is a supervised machine learning method adapted to both binary and multiclass failures.⁴⁰

This algorithm has been successfully used to solve a number of complex applications and has several desirable features. The main features of this algorithm are:

Ensemble method combined with several decision trees

Each forest tree selects best prediction

Random feature selection avoids over-fitting by de-correlating trees

Keeps track of incomplete data and non-linear variables between features and target variable.

Handle both binary-faults and multi-faults issue.

The standard equation employed in Random Forest is the Gini impurity, which is designed to identify the quality of the decision tree split. Gini impurity is a probability a function of randomly selected components in the data set being misclassified. It is defined as follows:

Gini Index = 1 - \sum_{i = 1}^{n} {P_{i}}^{2}

(26)

Where $P_{i}$ denote an element’s probability to be classified for a distinct class.

The random forest algorithm is an ensemble learning technique that has been developed to combine several decision trees to form a more robust model (Figure 13). Several trees are trained on different subsets of data and features, which reduces over-fitting and improves accuracy.⁴¹

Figure 13.

Random forest flowchart.

Results and discussions

This section presents the outcome achieved through three signal processing techniques, namely ensemble empirical mode decomposition (EEMD), wavelet packet transform (WPT), and empirical wavelet transform (EWT). For each technique, we generate a feature matrix based on the time domain parameter listed in Table 4, which is then prepared for classification defective bearing in gearbox.

The classification process is performed using six different classifiers, which include random forest (RF), decision tree (DT), k-nearest neighbors (KNN), support vector machine (SVM), ensemble tree (ET), and Gaussian mixture models (GMM). The classifiers are evaluated through a 10 iteration loop, while the stability of the machine learning techniques is assessed using the standard deviation formula (STD).

The used ML tools demonstrate the varying outcomes regarding to the accuracy and stability. Where the ML inputs are the time domain feature obtained from EEMD, WPT, and EWT. Knowing that the output are the bearing states which can be in healthy mode, either defectivity in 1st or 2sd bearing, or defectivity in both bearings. Then we locate the faults at the ball, in the inner race or the outer race. Eighty percent of data are used for training and 20% of data are used for testing the model.

Table 5 shows the classification accuracy results and the classifier stability of the model. We observe that signal processing with WPT gives the best classification results compare to EEMD and EWT. This table, we notice that it indicates only the global accuracy. There is no credible indicator which allows us to conclude that the model gave the best classification.

Table 5.

Classification results using WPT with different classifiers.

		RF	DT	KNN	SVM	ET	GMM
EEMD	Max	20.58	32.35	5.88	5.88	26.47	20.58
	Min	11.76	14.70	0	0	11.76	8.82
	Mean	16.17	20.29	2.94	2.94	18.82	14.70
	Std	2.49	5.08	1.96	2.40	4.42	4.38
WPT	Max	100	100	14.79	17.51	100	69.50
	Min	99.97	99.95	13.98	15.36	99.95	68.32
	Mean	99.99	99.98	14.41	16.63	99.97	68.98
	Std	0.01	0.02	0.26	0.59	0.01	0.34
EWT	Max	22.50	22.5	20	20	27.50	25
	Min	17.50	15	5	5	15	10
	Mean	20.25	18.75	13.25	13.75	20.50	18
	Std	1.84	2.94	4.57	5.17	3.68	4.83

We know that generally, sometimes we may have a high global accuracy, which may contain classes with a low precision. To enhance the performance accuracy and stability of the system, we have implemented four key metrics which are the mean, max, min, and standard deviation. By doing, this we can appreciate better the performance of the RF classifier’s. This RF classifier yields us to the best outcome in terms of stability (Std = 0.01) and mean accuracy (99.99%). Even though, the RF, DT, and ET, also show best classification in terms of accuracy and stability.

To assess the capability of fault prediction detection we have provided various types of fault to test the proposed approach. The obtained prediction confusion matrixes for different considered faults are given in Figures 14 to 16. Through these figures we can see the accuracy from classifiers RF, DT, and ET of each class for fault classification of bearing defect.

Figure 14.

Prediction confusion matrix for fault diagnosis based WPT-RF.

Figure 15.

Prediction confusion matrix for fault diagnosis based WPT-DT.

Figure 16.

Prediction confusion matrix for fault diagnosis based WPT-ET.

Figure 14 shows the WPT in tandem with the RF algorithms, that the vast existing numbers of sample for the considered bearing faults in the proposed approach are predicted successfully with highest percentage scales between 99.76% and 100%.

We can conclude that the proposed approach based on WPT in tandem with RF classifier, enables us to improve the system performance in terms of reliability, stability, and accuracy.

Conclusion

In this study, the dynamic model of a gear-box bearings defective system was developed to examine the vibration behaviors in presence of bearings faults. We modeled these two faults by an amplitude reduction and phase change in gear stiffness. Firstly, the fault diagnosis of bearings is performed using the squared envelope spectrum obtained from Autogram. The analysis was conducted for healthy bearings showing only the gear mesh frequencies (Figure 2). These frequencies happen to appear also in the first bearing or the second bearing or for both bearing defects (Figure 3). Nevertheless, we see the appearance of new frequencies providing information about the fault types (ball fault, inner race fault, and outer race fault). As a result, the obtained spectrums based on Autogram cannot differentiate between these bearing defects

Secondly, the proposed approach based on hybrid method for bearing fault diagnosis is based on wavelet packet decomposition and machine learning. Moreover, the vibration signal is decomposed into different frequencies bands with WPT. The decomposition at each level provides a set of sub-bands that allows us capturing specific frequencies components of the signal. From the time domain, relevant features are extracted from each sub-band to capture the fault characteristics at different frequencies ranges. Furthermore, the extracted features are then used as inputs to the machine learning algorithm to train the different classifiers to build the model for defective bearing classification in the gearbox.

The obtained results from the random forest (RF), decision tree (DT), and ensemble tree (ET) classifiers show that the approach is highly sensitive and efficient method for locating and differentiating defective bearings in gearbox systems. The classifiers enhance the accuracy in identifying the faulty bearings. This approach can help modern industries to diagnose the gearbox system and enable timely maintenance, and reduce the risk of catastrophic failure.

Footnotes

Handling Editor: Chenhui Liang

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Ali Damou

References

Zhang

Zhou

Huang

, et al. Fault diagnosis of rotating machinery based on recurrent neural networks. Measurement 2021; 171: 108774.

Ratni

Rahmoune

Benazzouz

. A new method to enhance of fault detection and diagnosis in gearbox systems. J Vibroeng 2017; 19: 176–188.

. New sparse regularization approach for extracting transient impulses from fault vibration signal of rotating machinery. Mech Syst Signal Process 2024; 209: 111101.

Saufi

Ahmad

Leong

, et al. An intelligent bearing fault diagnosis system: a review. MATEC Web Conf 2019; 255, 06005.

Ratni

Benazouz

. Mathematical modelling of rolling element bearings fault for the diagnosis in the gearbox-induction machine. Math Models Eng 2020; 6: 1–12.

Liu

Wang

Zhang

. Fault diagnosis of industrial wind turbine blade bearing using acoustic emission analysis. IEEE Trans Instrum Meas 2020; 69: 6630–6639.

Choudhary

Goyal

Letha

. Infrared thermography-based fault diagnosis of induction motor bearings using machine learning. IEEE Sensors J 2020; 21: 1727–1734.

Barbieri

GDSAV

Martins

, et al. Analysis of automotive gearbox faults using vibration signal. Mech Syst Signal Process 2019; 129: 148–163.

Glowacz

Kozik

, et al. Detection of deterioration of three-phase induction motor using vibration signals. Meas Sci Rev 2019; 19: 241–249.

10.

Malla

Panigrahi

. Review of condition monitoring of rolling element bearing using vibration analysis and other techniques. J Vib Eng Technol 2019; 7: 407–414.

11.

Zhang

Jiang

Feng

. Research on variational mode decomposition in rolling bearings fault diagnosis of the multistage centrifugal pump. Mech Syst Signal Process 2017; 93: 460–493.

12.

Devendiran

Mathew

. Bearing fault diagnosis using empirical mode decomposition, entropy-based features and data mining techniques. Mater Today Proc 2018; 5: 11460–11475.

13.

Zair

Rahmoune

Benazzouz

. Multi-fault diagnosis of rolling bearing using fuzzy entropy of empirical mode decomposition, principal component analysis, and SOM neural network. Proc IMechE, Part C: J Mechanical Engineering Science 2019; 233: 3317–3328.

14.

Lei

Lin

, et al. A review on empirical mode decomposition in fault diagnosis of rotating machinery. Mech Syst Signal Process 2013; 35: 108–126.

15.

Huang

. Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv Adapt Data Anal 2009; 1: 1–41.

16.

Chen

Lin

. Modified complementary ensemble empirical mode decomposition and intrinsic mode functions evaluation index for high-speed train gearbox fault diagnosis. J Sound Vib 2018; 424: 192–207.

17.

Lang

Rehman

Zhang

, et al. Median ensemble empirical mode decomposition. Signal Process 2020; 176: 107686.

18.

Yan

Gao

Chen

. Wavelets for fault diagnosis of rotary machines: a review with applications. Signal Process 2014; 96: 1–15.

19.

Jayakumar

Thangavel

. Industrial drive fault diagnosis through vibration analysis using wavelet transform. J Vib Control 2017; 23: 2003–2013.

20.

Dautov

ÇP

Özerdem

. Wavelet transform and signal denoising using Wavelet method. In: 2018 26th signal processing and communications applications conference (SIU), Izmir, Turkey, 2–5 May 2018, pp.1–4. New York: IEEE.

21.

Gougam

Rahmoune

Benazzouz

, et al. Bearing fault diagnosis based on feature extraction of empirical wavelet transform (EWT) and fuzzy logic system (FLS) under variable operating conditions. J Vibroeng 2019; 21: 1636–1650.

22.

Gao

Yan

. Wavelets: theory and applications for manufacturing. Heidelberg: Springer, 2010.

23.

Plaza

López

. Application of the wavelet packet transform to vibration signals for surface roughness monitoring in CNC turning operations. Mech Syst Signal Process 2018; 98: 902–919.

24.

Laala

Guedidi

Guettaf

. Bearing faults classification based on wavelet transform and artificial neural network. Int J Syst Assur Eng Manag 2023; 14: 37–44.

25.

Agrawal

Jayaswal

. Diagnosis and classifications of bearing faults using artificial neural network and support vector machine. J Inst Eng (India) Ser C 2020; 101: 61–72.

26.

Parmar

Pandya

. Experimental investigation of cylindrical bearing fault diagnosis with SVM. Mater Today Proc 2021; 44: 1286–1290.

27.

Roy

Dey

Chatterjee

. Autocorrelation aided random forest classifier-based bearing fault detection framework. IEEE Sensors J 2020; 20: 10792–10800.

28.

Liang

Zuo

Feng

Dynamic modeling of gearbox faults: a review. Mech Syst Signal Process 2018; 98: 852–876.

29.

Lei

Gang

, et al. Dynamic modeling method of transmission gear system for pure electric vehicle. In: 2017 2nd International Conference on Automation, Mechanical Control and Computational Engineering (AMCCE 2017), Beijing, China, 2017, pp.620–629. Amsterdam: Atlantis Press.

30.

Omar

Moustafa

Emam

. Mathematical modeling of gearbox including defects with experimental verification. J Vib Control 2012; 18: 1310–1321.

31.

Moshrefzadeh

Fasana

. The Autogram: an effective approach for selecting the optimal demodulation band in rolling element bearings diagnosis. Mech Syst Signal Process 2018; 105: 294–318.

32.

Afia

Rahmoune

Benazzouz

. Gear fault diagnosis using Autogram analysis. Adv Mech Eng 2018; 10: 1–11.

33.

Gougam

Rahmoune

Benazzouz

, et al. Health monitoring approach of bearing: application of adaptive neuro fuzzy inference system (ANFIS) for RUL-estimation and Autogram analysis for fault-localization. In: 2020 prognostics and health management conference (PHM-Besançon), Besancon, France, 4–7 May 2020, pp.200–206. New York: IEEE.

34.

Zheng

Zhu

. Feature extraction of the hydraulic pump fault based on improved Autogram. Measurement 2020; 163: 107908.

35.

Liu

Yang

, et al. An improved Autogram and its application in bearing fault diagnosis. In: 2020 11th International Conference on Prognostics and System Health Management (PHM-2020 Jinan), Jinan, China, 23–25 October 2020, pp.276–280. New York: IEEE.

36.

Gilles

. Empirical wavelet transform. IEEE Trans Signal Process 2013; 61: 3999–4010.

37.

Bouzida

Touhami

Ibtiouen

, et al. Fault diagnosis in industrial induction machines through discrete wavelet transform. IEEE Trans Ind Electron 2010; 58: 4385–4395.

38.

Mallat

. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Mach Intell 1989; 11: 674–693.

39.

Coifman

Wickerhauser

. Entropy-based algorithms for best basis selection. IEEE Trans Inf Theory 1992; 38: 713–718.

40.

Imane

Rahmoune

Zair

, et al. Bearing fault detection under time-varying speed based on empirical wavelet transform, cultural clan-based optimization algorithm, and random forest classifier. J Vib Control 2023; 29: 286–297.

41.

Sahraoui

Rahmoune

Meddour

, et al. New criteria for wrapper feature selection to enhance bearing fault classification. Adv Mech Eng 2023; 15: 1–15.