Abstract
Nowadays, when it comes to bearing faults monitoring, several solutions and implementations are in vogue due to the interest they arouse. It can be seen both in the types of hardware solutions used and, in the signal-processing algorithms and techniques employed. However, for a fault such as the bearing fault, which accounts for 41% of all faults in these systems and is also the source of the majority of other faults, it appears that the approaches used until now are insufficient for containing this fault and the losses it generates. The aim of this work is to present a new system dedicated exclusively to bearings, while also conducting a comparative study of the various algorithms currently used for mechanical fault detection, based on the mathematical model of the stator current signal used in the MCSA method, which is very close to the induced current signal. At the end, results of all simulations demonstrated that, in addition to the ESPRIT-TLS method, which is currently the best in terms of accuracy, the Varpro method could be a promising alternative.
Keywords
Introduction
Today, the use of electromechanical machines in many areas has made their monitoring a matter of great importance. This is due to the significant damage that can be caused by failures of their mechanical and electrical components. However, of all these components, the bearing is the one that causes the most losses.1,2 It accounts for 41% of all faults occurring in these systems.3–6 Several signal acquisition techniques have been used for decades. The most widely used for online monitoring are the technique based on vibration and electrical sensors.7–10 The big problem is that these techniques have considerable limitations in terms of developing an accurate, real-time system. In the case of the vibration analysis technique, the biggest problem is that the vibration sensors capture all vibrations around the component where they are placed. In other words, in addition to the vibration caused by the appearance and evolution of the fault, they also sense all the surrounding vibrations and noises. It means that, when analysing the signal, it is necessary to go through several processes to retain useful signal. Also, because of others vibrations, this can lead to errors in detection and decision-making. So, what’s the point in taking a lot of time to laminate the signal and end up with useless information, without mentioning the fact that during the time it takes to decide whether the fault is present or not, not only may the fault appear, but it may also have taken significant proportions. 11 On the other hand, a potential drawback of the approach based on stator current analysis is that it includes the participation of all other faults that may occur in the machine in addition to the fault being investigated. As a result, the signal captured is a combination of all other potential fault signals, including those originating from the bearing fault.12–14 So, in this situation, signal decomposition becomes the first thing to do. Then comes the identification of the bearing fault. It can be time-consuming and distort the real-time nature of detection. Coming to the signal processing techniques combined with them, methods such as fast Fourier transform FFT,15,16 STFT,17,18 WVD,19–22 wavelet analysis was the first to be used. But because of their limitations, such as poor signal comprehension, a frequency resolution problem, a limited number of analysis points for FFT; a fixed windowing size, poor simultaneous frequency and temporal resolution for the STFT and the WVD, and the failure to decompose the signal into its elementary signals for the WT, they have pushed researchers to explore new methods. This is where algorithms such as EMD and VMD23–25 become relevant for the proper decomposition of the composite signal, as well as algorithms capable of extracting essential signal characteristics such as frequencies and amplitudes. These techniques include ESPRIT,26,27 SAGE,28–30 CLEAN31,32 and Varpro.33–35 From this panoply of literature, it is clear that techniques and methods exist in large numbers. However, making the right choice requires evaluating them based on some metrics such as computation time or complexity, detection accuracy, and sometimes memory space. It is evident that the old acquisition techniques are inadequate for detecting only bearing faults. This means no improvement in detection performance side, but also a side of designing appropriate systems.
The main objective of this work is to present this new system dedicated exclusively to the monitoring of bearing defects but also to compare the latest signal processing algorithms used in the detection of defects in electromechanical systems such as variable projection (VarPro), SAGE, Clean, mode decomposition algorithm (EMD, VMD), and ESPRIT-TLS for the detection of mechanical bearings only.
This work is performed as follows,
First, we give a brief overview of the bearing defect. Then we present the proposed system for the detection of bearing fault only. Next, we provide an overview of these algorithms. Finally, after presenting the simulation and the results obtained using MATLAB software, we conclude with the conclusion and future perspectives.
Bearing fault modelling and detection
In the literature, the major causes of rotating machine failures are faults caused by fatigue spalling, lubricant contamination, excessive load, or other causes or electrical causes.36,37 Faults in bearings are based on several effects in rotating machines, namely, an increase in the noise level and the appearance of vibrations owing to the rotor’s movements around the longitudinal axis of the machine. 36 This type of fault also causes variations (oscillations) in the torque of the machine load, thereby blocking the rotor. It should also be mentioned that bearings with balls are composed of two rings, internal ring and external ring between which there are several balls or rotating rollers. When the bearing is operating under normal conditions, the fatigue defect starts with small cracks under the raceway and rolling element surfaces, and progressively spreads over the contact surface, increasing the number of harmonics over time. 38 In addition, bearing faults are most often related to eccentricity faults, which induce rotor asymmetry. These bearing defects occur most often when the inner or outer surfaces of the inner or outer ring are affected by damage or ball defects. 36 In addition, because the axis that links the rotor axis and load-bearing axis passes through the centre of the bearing, any defect within the bearing will directly affect this axis. Thus, the system placed on the axis underwent the same sudden changes, generating a variation in the received flux. 39 In Figure 1, we show what a bearing looks like and its various faults. The expression that gives the characteristic frequencies of the bearing fault is as follows:
where
k = 1, …, N
and
where

Bearing fault. (a) Ball fault, (b) inner ring fault, (c) outer ring fault, (d) cage fault.
Presentation of the system used
The system proposed in this work for detecting only bearing faults exploits the principle of the induced current created by the variation of the magnetic flux through a copper coil. Thus, a sensor connected to the end of these coils captures this induced current and a vibration sensor placed on bearing axis captures all other variations. This is the case when the bearing carrying the magnets is defective. Once acquired, this induced current is used by algorithms embedded on the DSP card to extract the parameters that can then determine whether the bearing is in good condition or not through artificial intelligence algorithms. This intelligent algorithms can be deployed locally or online in a cloud as show on Figure 2. The main parts of this system are as follows: the bearing with the two magnet poles placed on internal ring, the copper coil placed on external ring that allows to receive the variations of the magnetic flux according to the state of the bearing, the analog-digital converter for the acquisition and conversion of the signal, the DSP card or processing unit, and the RF module for the transmission of the data online.

Architecture of monitoring system.
Modelling of the perceived system signal
The signal model used in this study has the same nature as the stator signal used in the MCSA method and its variants. The principle of this method is the analysis of the current to detect the presence of a possible anomaly in the entire machine or one of its components. However, the difference with the system shown in Figure 3 is that the signal is not captured from the electrical line coming from the stator, but only from the interaction (magnet field) between the movement of the magnets and copper coil on the bearing, which not allows any other fault detection. And this avoid to have a detction system that may be too costly for the development of a real-time monitoring system. Thus, knowing the characteristics of a bearing fault that can manifest itself by the appearance of special harmonics in the normal signal (signal composed only of the frequency
Where s[n] denotes a sample of the signal at time n, w[n] the additive Gaussian noise at time n, L the exact sinusoidal number contained in the signal, fk are the frequencies of bearing fault, ak the amplitudes and Φ k the phase of fault’s harmonics contained in the n-th sample.

Fault detection procedure.
Determining the number of harmonics in the fault signal
The methods presented in this work require prior knowledge of the number of harmonics containing inside the signal to be analysed to extract parameters such as frequencies and amplitudes. Techniques for doing this are known as Model Order Selection (MOS). These methods, based on the retention of essential information from a signal, enable us to know the number of parameters to be analysed in advance. These include the AIC, MDL, FPE, KIC, and RAE. 41 This is the first step in all the parametric methods. In the simulation and results section, we present how MDL and AIC are applied to determine the number of harmonics in the bearing fault signal at low and high amplitudes for certain SNR values.
Overview of the fives algorithm used
EMD method
Similar to Fourier decomposition, the empirical mode decomposition (EMD) algorithm decomposes a given signal into an elementary sum of signals and a residual. The method was introduced by Huang et al.23,42–44 The mathematical expression is as follows:
where
For a sample at time n of the signal X,
Take the maxima and minima
draw the lower and upper envelopes
Calculate the average of these two envelopes
Subtract the average of the time series of the sample as follows
Verify conditions for stopping the extraction of modes:
Repeat the instructions from 1 to 5 as long as SC < 0.3.
After obtaining the modes, the last step is to determine the frequency representations to extract the frequencies.
VMD method
Similar to EMD, the Variational Mode Decomposition (VMD) algorithm finds all the series that make up a given signal.45,46 Unlike EMD, with VMD, the spectral bandwidth of each mode is chosen as the extraction unit. In other words, VMD assumes that each mode is fixed around a central frequency, which must be determined during the same decomposition period.47,48 The following steps must be taken to determine the bandwidth of a mode: (1) Exploit the Hilbert transform to compute the analytical signal of each mode; (2) Each mode was shifted to its ‘baseband’ using an exponential factor; (3) Estimate the bandwidth. 23 The algorithm used by VMD to decompose the signal is called the Alternating Direction Method of Multipliers. The detailed structure of this algorithm can be found in Reference 25 .
ESPRIT-TLS
The ESPRIT-TLS algorithm is the most robust and accurate variant of the ESPRIT high-resolution signal processing algorithm. Unlike others, the TLS algorithm is used to estimate the parameters of a given signal. ESPRIT is one of the signal processing algorithms that, starting from a given signal, subdivides it into two spaces called signal and noise in order to extract characteristics such as the amplitudes and frequencies of each harmonic present that compose it.26,27,49,50 It is used in the resolution of TLS problems. 51 It is based on the decomposition using SVD.52,53 The work in Reference 54 provides a more detailed understanding of the use cases of the TLS algorithm. Its combination with ESPRIT can be found in the following publication. 55
CLEAN method
This algorithm, exploiting the principle of deconvolution over several iterations of the spectral band of the signal domain, allows the estimation of the signal parameters of most so-called nonlinear square (NLS) problems. It is largely used for situations where several pieces of data are missing to clarify the signal, as well as when the signal is strongly affected by noise, as in geoscience and astrophysical science.31,32,56,57 This algorithm is very robust, easy to implement, and manages to filter the signal from the additive noise that composes it to allow a better estimation of these parameters. The structure of the algorithm was described in the following reference. 31
SAGE method
This algorithm, which spares signals in single harmonics by eliminating the interference that may exist between them based on interference cancellation to separate any signal into harmonics in the so-called waiting stage (expectation step), is an improvement of the EM algorithm. 58 Using the maximum likelihood principle, 59 the cost function was optimized to extract the required parameters. Instead of using the usual EM algorithm process, the optimization is performed using two unidimensional optimization procedures in series.28–30 To initialize the parameters before starting the optimization process, the algorithm uses a Fourier transform-based frequency parameter estimation and serial separation of the given signal. By updating the searched parameters sequentially, the algorithm alternates between several small data spaces defined by the designer. This algorithm is commonly used in the telecom domain to determine the direction of arrival of signals on a receiver. Over time, it has been adapted to composite signals of the same nature as the mathematical model presented in equation (3). The detailed structure of the SAGE algorithm can be found in this study. 58
Variables projection method
To analyse a signal from its parameters, parametric methods are most often used. These methods are then satisfied by predicting an almost perfect model that allows obtaining the desired signal to model the observed signal. This was achieved by determining these parameters according to a fixed error threshold.33–35,60,61 This is sometimes referred to as the minimization of a cost function. This is what Golub and Pereyra did in their 1973 work, where they showed how to solve most of the so-called separable nonlinear least squares problem (SNLLS) using an algorithm called variable projection (VarPro). Moreover, what is interesting with their work is that they showed that for any problem of estimation of more than two frequencies, that this problem could be considered as an SNLLS problem.62,63 The structure of this algorithm can be found in this work by Dianne P. O’Leary-Bert W. Rust. 35 The key steps are as follows.
Solving Constrained Nonlinear Least-Squares Problems
Solving Constrained Linear Least-Squares Problems using Matlab’s lsqlin function
Computing the Jacobian Matrix of the cost function and how to use it along the algorithm.
Results of simulation
Simulation’s parameters
In our present study, the parameters used for the simulation part and the results obtained are presented in Table 1 below. These parameters are used in the mathematical model of the signal presented in equation (3) to obtain the signal to be analysed. For the amplitudes, we used different values for the small and large ones. The small ones were obtained by dividing the large ones by 10, as shown in Table 2. The frequencies of the defect are obtained from the mathematical model and the value of k, which is the factor determining the number of harmonics of the defect in addition to the fundamental frequency. As mentioned above, MDL and AIC were applied to three, five, and seven harmonics to extract the number of harmonics that is not known a priori in the estimation of the actual parameters. The value on the n-axis corresponding to the minimum cost function is the number of harmonics constituting the observed signal. The following figures provide an overview of the results.
Simulation parameters.
Amplitudes and frequencies used for simulation.
From Figures 4 and 5, it is clear that of the two algorithms, the MDL algorithm is the one that best estimates the number of harmonics composing the signal. However, it can be seen that when noise dominates, whether at low or high amplitude, the estimated number of n frequencies of the signals is different from the exact NH value of the signal. In addition, this parameter is very well estimated when there is less noise. Therefore, we deduce that noise is an important factor to reduce for the correct estimation of the number of harmonics. Otherwise, even in the presence of a defect, the system designed would indicate the absence of any defect, and by the time it realizes that it has a defect, the severity will have already changed enormously.

Estimation of Harmonics number NH with is n for large amplitudes for SNR = [−5 10 50] dB.

Estimation of Harmonics number NH with is n for small amplitudes for SNR = [−5 10 50] dB.
Simulation results
This part gives an overview of the results of the simulation performed with these six algorithms in the estimation of the frequency harmonics of the rolling defect contained in the signal synthesized in equation (3). We use the NMSE (Normalized mean square error) to measure the accuracies of this estimate for Nh equal to two, four and six harmonics of the bearing fault. But before presenting these results, we give in Figure 6 the representations of the signal in the frequency domain according to each number of harmonics.

Frequency domain representation of the signal. (a) Nh = 2, (b) Nh = 4 and (c) Nh = 6.
Simulation results for two harmonics
Simulation results for four harmonics
Simulation results for 6 harmonics
Discussions
Based on the above results (from Figures 7 to 12), the VMD, SAGE, ESPRIT-TLS, and VarPro algorithms can be selected in estimating the frequency of the bearing defect for accuracy. However, given the ‘real-time’ aspect, SAGE and VMD are time-consuming even though their memory sizes are minimal compared to TLS and VarPro. However, since the memory space of the latter two is on the order of a few KB, they can be perfect candidates for a real-time estimation of small memory sizes.

Simulation for two harmonics with large amplitudes. (a) mean of NMSE on f1f2f3 estimation, (b) average execution time of each algorithm, (c) memory size of each algorithm.

Simulation for two harmonics with small amplitudes. (a) mean of NMSE on f1f2f3 estimation, (b) average execution time of each algorithm, (c) memory size of each algorithm.

Simulation for 4 harmonics with large amplitudes. (a) Mean of NMSE on f1f2f3f4f5 estimation, (b) Average execution time of each algorithm, (c) Memory size of each algorithm.

Simulation for four harmonics with small amplitudes. (a) mean of NMSE on f1f2f3f4f5 estimation, (b) average execution time of each algorithm, (c) memory size of each algorithm.

Simulation for six harmonics with large amplitudes. (a) mean of NMSE on f1f2f3f4f5f6f7 estimation, (b) average execution time of each algorithm, (c) memory size of each algorithm.

Simulation for four harmonics with small amplitudes. (a) Mean of NMSE on f1f2f3f4f5f6f7 estimation, (b) average execution time of each algorithm, (c) memory size of each algorithm.
The last thing to analyse is the behaviour of these two for very close frequencies with
In addition, it is clear from Figure 13 that the ESPRIT-TLS algorithm outperforms all others, including Varpro, in estimating the frequencies of the single bearing fault. As far as Varpro is concerned, he needs to have knowledge of the signal model for the estimation process. In addition, not only does it use a notion of weight that must be well chosen, but also the choice of the initial values of the parameters can influence the estimate. However, with ESPRIT-TLS, even without knowing the signal model, it is possible to estimate the frequencies and amplitudes of the elementary signals contained in the signal by knowing only the number of harmonics estimable by the MOS method. This is also the case for the signal of rotating machines, where it is impossible to know in advance the pattern of the signal. This makes it a much more perfect choice than Varpro, although for the latter, a good choice of weights and a good initialization led to good estimates.

Simulation for close harmonics with small amplitudes. (a) Mean of NMSE on f1f2f3 estimation, (b) zoom on Mean of NMSE on f1f2f3.
Conclusion et perspectives
In view of these results, taking into account the size of the memory, in particular the accuracy in the estimation of the signal parameters for the correct detection of the bearing fault and the real-time character of the system, we can finally come to the end of this work by saying that the Varpro algorithm could have been a good choice. But due to conditions such as my knowledge of the observed signal pattern and the right choice of parameters, ESPRIT-TLS is much better not only in terms of accuracy but also in terms of simplicity. So, in conclusion, ESPRIT-TLS would be a good choice among all the algorithms to explore for bearing defect detection alone. However, as these results are experimental, we wish to continue this work by moving to a test bench with the proposed system mounted with defective bearings, in order to confirm the conclusion reached in this study of the analysis and comparison of these six algorithms. Since this work is only the beginning of a series of works, we intend to subsequently perform a comparison of the techniques present in this work and other techniques from the emerging fields of signal processing for fault diagnosis in various industrial applications, in particular in terms of the fault information-guided variational mode decomposition (FIVMD) method for the diagnosis of rolling element bearings, the Vold-Kalman filter bandwidth selection scheme based on the order spectrum for the diagnosis of gearbox faults in offshore wind turbines, and the new vibration-based prognostic scheme for managing the condition of gears in surface wear the evolution of the intelligent manufacturing system. In addition, because of the errors that can be made in the estimation of the number of harmonics even for the ESPRIT-TLS algorithm, we plan to look for another method that does not exploit this parameter in one of the next works.
Footnotes
Handling Editor: Aarthy Esakkiappan
Authors’ contributions
Pascal Dore: student Ph.D, producer of this work (contact the corresponding author).
Pr Saad Chakkor, Supervisor of this work.
Pr Ahmed El Oualkadi, Director of thesis.
Mostafa Baghouri Co-supervisor.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Ethics approval
Not applicable.
Consent to participate
We, authors of this paper consent to participate to the publication of this paper.
Consent for publication
All authors consent for publication.
Availability of data and material
Regarding the data used, they can be obtained from the mathematical model of the stator signal used for this purpose (see equation 3, 1 and 2) and the parameters specified in Tables 1 and 2.
Code availability
Codes are available on demand. The Persons can contact the corresponding author.
