Abstract
Health monitoring of train bearing is crucial to railway transport safety. More and more attention has elicited by the wayside acoustic monitoring technique in recent years than other defect detection techniques. However, wayside acoustic signal contains serious Doppler distortion and heavy background noise because of the high speed of trains. Thus, extracting fault-relevant information is difficult. A novel method for Doppler effect correction is proposed in this study by incorporating the traditional time-domain interpolation resampling with a novel kinematic parameters estimation method. In this kinematic parameters estimation method, an iterative algorithm based on least squares theory is proposed to improve the parameters estimation accuracy. After the Doppler effect correction, the ensemble empirical mode decomposition is employed to further enhance the fault-relevant information. The proposed iteration algorithm can improve the accuracy of kinematic parameters estimation significantly; thus the Doppler distortion can be corrected more accurately. The proposed ensemble empirical mode decomposition can further enhance the fault-relevant information and so that the accuracy and reliability of the diagnosis decision can be improved. The performance of this method has been verified in experimental and simulated cases.
Keywords
Introduction
Railway transport is one of the main transportation modes for national economic activities and general public travel. The safety, stability, effectiveness, and economy of railway transport are current concerns. Bearings are critical components in train wheels. If a bearing fault cannot be determined in time, a serious accident could occur. Therefore, monitoring the status of the train wheel bearing is necessary to prevent railway traffic accidents and reduce economic losses and casualties.1,2
Train bearing fault diagnosis can be performed through several methods, such as oil monitoring, 3 hot-box detection, 4 vibration signal analysis, 5 and acoustic signal analysis. 6 Wayside acoustic signal (WAS) diagnosis has a clear advantage over other methods because the acoustic signal acquisition device does not come into contact with the wheel bearings. As a train passes over a track, the WAS of wheel bearings can be obtained with a wayside microphone mounted next to the track. When the wheel bearing is faulty, fault-induced impulses exist in the acquired signal. Many mechanical faults can be detected from wheel bearing signals. The signal acquisition system of the acoustic analysis method is also simple and economical. This technique can predict incipient failures and conduct on-line monitoring. 6
However, given the relatively high moving speed between the microphone and the train, the Doppler effect will make the WAS a serious distortion. 7 Consequently, the signals undergo amplitude modulation and frequency band expansion. The collected WAS are also submerged in heavy noises, such as rolling noise, 8 mechanical structural borne noise radiated by car body vibration, 9 and aerodynamic noise. 10 In addition, many frequency interference components also exist in the WAS because of adjacent wheel bearing. These problems make extracting fault information from WAS difficult. Thus, the accuracy, validity, and stability of the method are affected. 11
In order to correct the Doppler effect using time-domain interpolation resampling (TIR) method, the kinematic parameters of the WAS must be identified. There are two ways to do it at present: one is kinematic parameters estimation (KPE) based on the data-driven matching pursuit method, and the other is KPE based on the instantaneous frequency (IF) extraction. 12 However, the algorithm of data-driven matching pursuit method usually takes a long time, and with the increase in the number of kinematic parameters, the required time will grow very fast. 13 The accuracy of KPE based on the IF extraction is generally low and should be improved further. 14 In addition, a narrow fitting interval should be set for each parameter beforehand in traditional method. This condition limits the traditional method because the fitting intervals for each parameter cannot be set in each fitting, due to several parameters that are unknown in advance. Thus, in this study, an iterative fitting algorithm that involves the iterative extraction of IF through iterative kinematic parameter fitting is proposed to improve the accuracy of KPE. Through iterative fitting, the accuracy of the obtained kinematic parameters increases until the optimal value is reached. The optimal kinematic parameters will be used to correct the Doppler effect of the WAS according to TIR.15,16 In TIR, the higher the accuracy of the obtained kinematic parameters is, the better the results of acoustic signal correction are. Thus, this study focuses on improving the accuracy of the obtained kinematic parameters.
Even after correcting the Doppler effect, identifying the fault information of the wheel bearing remains difficult because of the heavy background noise. The next task is to extract weak fault-relevant information (FRI) from WAS with heavy background noise. Many methods have been developed in recent years to enhance the FRI of rotating machinery. These methods include digital filters, 17 wavelet transform,18,19 stochastic resonance,20–22 manifold learning, 23 match pursuit, 24 and morphological analysis. 25 Empirical mode decomposition (EMD) is a classical method that could be used to extract the time–frequency feature of non-stationary and non-linear signals.26–28 The mode-mixing problem of EMD can be eliminated by ensemble empirical mode decomposition (EEMD),29,30 which is a noise-assisted method. In this study, the Doppler-free signal is first decomposed into a set of embedded natural oscillatory modes by EEMD; those modes are defined as intrinsic mode functions (IMFs). 31 Second, spectral kurtosis is calculated to select the IMF that represents the failure characteristic. Finally, the local fault frequency of the WAS can be identified by envelope spectrum analysis on the chosen IMF.
The contents of this article are arranged as follows: the research background, which includes the kinematic model, and theory of TIR is introduced in section “Research background.” Section “Proposed fault diagnosis strategy” introduces the iterative fitting algorithm of KPE, which is the core of the WAS monitoring technique. Section “Simulation analysis” presents the results of the simulation process to verify the effectiveness of the iterative fitting algorithm. Section “Experiment analysis” discusses the results of the experimental test using bearings with outer and inner race defects, and the result of the experiment verified the performance of the proposed method in WAS diagnosis. Section “Conclusion” presents the conclusions of the study. 1
Research background
Kinematic model
The kinematic model is shown in Figure 1. When a train passes over a track in reality, the WAS generated by the train wheel bearings are collected by the microphone installed at the wayside. The distance between the train and microphone varies, and the train speed is high. Thus, the WAS obtained from the wayside contains serious frequency distortion caused by Doppler effect. In Figure 1, v is the train speed, s is half the track length, the distance between the microphone and wheel bearing is represented by R, and the vertical distance between the microphone and wheel bearing is represented by r. Given the train movement, R shortens from a point to the midpoint of the track (O) and then lengthens after the train passes over the midpoint (O). f0 is the center frequency of the acoustic signal. It should be time-varying and can be denoted by f0(t). The kinematic model is described as equation (1)

Kinematic model of a train.
The time-varying center frequency f0(t) can be expressed as equation (2) according to the theory of dynamics and the kinematic model, where M is the Mach number in aerodynamics 32 defined as M = v/c, and c is the speed of sound spread in the air (i.e. generally, c is 340 m/s). After simplification, the Doppler frequency time-varying formula can also be expressed as equation (3). The proposed iterative fitting algorithm is implemented according to the following two frequency formulas
Doppler correction using TIR
TIR can provide a good result of Doppler distortion correction both in single- and multiple-frequency Doppler distortion signals. The amplitude and frequency structures of the distortion signal can be both corrected by TIR.
The following is a brief introduction of the TIR algorithm principle. The TIR framework is depicted in Figure 2, and the steps of the TIR method are described as follows:
First, the optimal kinematic parameters are used to calculate the resampling time vector
Second, the Doppler distorted signal
Third,

TIR flowchart.
Proposed fault diagnosis strategy
After pre-processing of the input signal, the main task of this method is to estimate the kinematic parameters. This method obtains the iterative kinematic parameters by fitting the iterative IF. The optimal kinematic parameters can be obtained by fitting the most accurate IF based on least squares theory according to equation (2). The most accurate IF can be extracted from the iterating kinematic parameters by setting a threshold beforehand. The optimal kinematic parameters will be used to correct the distorted signal according to TIR; then, the fault information will be identified by EEMD. The flowchart of this method is shown in Figure 3, and its detailed description is as follows:
The original signal X(t) has serious Doppler distortion and is mixed with heavy noise and interference from different sound sources.
Signal pre-processing—the signals with the frequency of interest from the original signals can be obtained through a Butterworth band-pass filter with a specific frequency band. The time–frequency distribution (TFD) of X(t) is obtained by time–frequency analysis.33–35 It uses short-time Fourier transform (STFT) in this study. The time–frequency matrix TF(t,f) of X(t) is constructed from TFD. An energy threshold of time–frequency matrix TF(t,f) is set to reduce the background noise and interference components with a mean filter.36,37 The energy threshold is defined as
IF ridge line extraction—the IF ridge line can be extracted by connecting the energy extreme points. From TFD of x(t), IF should be located where the high energy is, where the IF could be defined as
First KPE—the kinematic parameters (i.e. f0, v, r, and s) are estimated by curve fitting the IF ridge line extracted in step 3 with equation (2).
Kinematic parameter computation—the kinematic parameters are calculated according to the Doppler frequency variation formula (3). The first point of the IF extracted by step 3 is selected for computation because it is the simplest case. The IF is shown in formula (2). At this point, t = 1, fe is the emitting frequency expressed by
where
where t is half of the length of the sample time series. Once the sound acquisition device is installed, r becomes constant. Thus, the train speed v can be solved through equation (5). Given that the train speed v and time t(i) is known, the horizontal distance can be obtained as equation (6)
Doppler window construction—a set of highly accurate kinematic parameters (i.e. v, s, r, and f0) are obtained after step 5. The time series t(i) with the length of the signal generates a time–frequency curve with Doppler distortion in TFD according to equation (2). The Doppler windows are constructed with upper and lower boundaries. The boundaries of the Doppler window can be expressed as: (1 ± k*rf)·f0(t). The distance between upper boundary and the time–frequency curve is defined as range factor rf. The initial value of rf is 0.01. The value of k is generally defined as
Iterative termination judgment—rf is set as the iteration termination condition. If the condition is satisfied, the iteration will be terminated, and the output would be the optimal value. Otherwise, the process will return to step 3, and the IF ridge line will be extracted again. A new Doppler window with a reduced width is then constructed, where the width of the each reduction is 0.002 and the window width threshold is set as 0.001. In each iteration, a set of new parameters can be obtained, and the Doppler window can be constructed again based on this set. When searching for the extreme value of energy of each column in the Doppler window, IF can be obtained by connecting each extreme value. The accuracy of the obtained IF and kinematic parameters is then improved.
The obtained kinematic parameters are employed by TIR to correct the Doppler effect.
EEMD is adopted to decompose the corrected signal into a set of IMFs. One of the IMFs is selected as the diagnosis information based on the spectral kurtosis maximum principle.
Fault frequency is identified through envelope spectrum analysis.

Flow diagram of the WAS diagnosis method.
Simulation analysis
A simulation signal was designed to verify the proposed method. The kinematic parameters were designed as follows: s = 1.5 m, r = 1 m, v = 45 m/s, and c = 340 m/s. The simulated Doppler-shifted signal can be expressed as equation (7)
where the intensities of four frequency components are as follows: P1 = 0.7, P2 = 1.7, P3 = 0.9, and P4 = 1.1. fi is defined as the center frequency of the simulated distortion signal, and the simulated Doppler signal comprises four frequency components (i.e. f1 = 1000 Hz, f2 = 1300 Hz, f3 = 1500 Hz, and f4 = 1700 Hz). The sampling frequency in this simulation was 50 kHz. Considering the actual scenario where the acoustic signal contains heavy noises, a random noise n(t) was added to the simulation signal, where

(a) Simulative Doppler distortion signal and (b) spectrum.
Given that all frequency components follow the same distortion rule, one of the frequency components, that is, f2 = 1300 Hz, was selected to iterative IF extraction and curve fitting, in order to get the optimal kinematic parameters of the simulation signal. The simulation signal was pre-processed by a filter, and energy threshold was set to eliminate the background noise and interference components. Where, the energy threshold of simulation signal is 200.88 (in TFD). The processing of the iterative fitting algorithm is shown in Figure 5. The width of the Doppler window becomes increasingly narrow from Figure 5(a)–(f). Each iteration is a new process of IF extraction and parameters fitting, and the new kinematic parameters become increasingly accurate until a set of optimal values is obtained. The values of the kinematic parameters obtained in the first fitting were compared with the optimal values obtained by iterative fitting in Table 1. The improvement in the accuracy of the kinematic parameters is evident. The receive time vector

Iterative process demonstration with the simulation signal: (a) the renderings of first iteration, (b) the renderings of second iteration, (c) the renderings of third iteration, (d) the renderings of fourth iteration, (e) the renderings of fifth iteration, and (f) the renderings of sixth iteration.
Different fitting results of simulation signal.

(a) Amplitude demodulation factor, (b) resampling time series, and (c) TFD of Doppler-free signal.
Experiment analysis
Experiment setup and data acquisition
The experiment was divided into two separate steps. The microphone collects the acoustic signal of the faulty bearing from the bearing test workbench in the first step. A kinematics experiment that allowed the fault acoustic signal to generate Doppler distortion was conducted in the second step. The outer and inner race faults of the bearing were tested in the experiment. Figure 7 shows cracks on the outer and inner races, which were cut by a wire-electrode cutting machine. The width of the cracks all set as 0.18 mm.

Bearing with artificial defects: (a) artificial crack on outer race and (b) artificial crack on inner race.
The acoustic signal with the fault-induced impulses was collected by a microphone. The bearing test bench setup is shown in Figure 8(a). The bearing test workbench was composed of a drive motor, a tested bearing, and a mechanical radial loader. The microphone was installed next to the workbench.

Experimental setup: (a) bearing signal acquisition system and (b) the scene of Doppler signal acquisition experiment.
The acoustic signal obtained in the first step was played by a loud speaker on the minibus to simulate the relative motion between the train and microphone. The kinematic parameters in the second step were as follows: s = 4 m, r = 2 m, and v = 30 m/s. The experiment setup is shown in Figure 8(b). The acoustic signal was collected by microphone again, and then, the obtained signal will be mixed with Doppler effect. The sampling frequencies were all set to 50 kHz in the two experimental steps. The type of bearing used in this experiment is NJ(P)3226X1. The type of microphone is 4944-A (B&K Company). The frequency range of this microphone is 16–70,000 Hz. The fault frequency from outer race and the inner race are 137 and 195 Hz.
Outer race fault signal analysis
The time domain, frequency domain, and STFT of the outer race fault signal are shown in Figure 9(a). Frequency band expansion clearly existed in the frequency domain, which made the identification of the fault frequency difficult. Thus, investigating the kinematic parameters is necessary to correct the Doppler effect from the signals. The outer race signal was pre-processed by a band-pass filter with the pass-band of 1580 and 1950 Hz, and the background noise and interference components were eliminated with the energy threshold, which is 0.005 (in TFD). The KPE processing by iterative Doppler construction and curve fitting is shown in Figure 10. The width of the Doppler window becomes increasingly narrow from Figure 10(a)–(f), where the window width threshold was set to 0.001. The iterative fitting algorithm was run until rf obtained the threshold set in advance. Figure 10(f) shows that the Doppler window region approximates one curve, and the region is focused on areas with the highest signal energy. The said figure also shows that the interferences caused by other frequency components were effectively eliminated by the proposed method. The values of the kinematic parameters obtained by first fitting were then compared with the optimal values obtained by iterative fitting in Table 2. The accuracy of the kinematic parameters clearly improved because of the iterative algorithm. The Doppler effect can be corrected in the acoustic signals according to TIR and the optimal kinematic parameters. Figure 9(b) shows the time domain, frequency domain, and STFT of the correction acoustic signal. Frequency band expansion in the frequency domain was eliminated, and the signal obtained suitable correction in TFD.

(a) Original signal and (b) Doppler-free signal.

Iterative process demonstration with the outer race signal: (a) the renderings of first iteration, (b) the renderings of second iteration, (c) the renderings of third iteration, (d) the renderings of fourth iteration, (e) the renderings of fifth iteration, and (f) the renderings of sixth iteration.
Different fitting results of the outer race fault signal.
After the Doppler effect was corrected, the fault information was still submerged in heavy noises. Thus, the next step was to extract weak information through EEMD. First, the Doppler-free signal was decomposed into a set of IMFs. Second, spectral kurtosis of each IMF was calculated as an index to extract fault information, as shown in Figure 11(a). The larger the spectral kurtosis of IMF is, the more fault information the IMF contains.38,39 And it can be seen from Figure 11(a) that the third IMF value is the largest, so the third IMF was selected for the envelope spectrum analysis, where the three IMFs whose spectral kurtosis were bigger were displayed in Figure 11(b), and Figure 12(a) shows the envelope spectrum of the Doppler-free signal. Figure 12(b) shows the envelope spectrum of the third IMF. It can be seen that the fault frequency is clearer in Figure 12(b), which means that the signal was effectively de-noised.

(a) Spectral kurtosis values of each IMF and (b) Second, third, and fourth IMFs.

(a) Envelope spectrum of the Doppler-free signal and (b) envelope spectrum of IMF3.
Inner race fault signal analysis
The proposed WAS diagnosis method also can identify the fault information from inner race signal. The time domain, frequency domain, and STFT of the inner race signal are shown in Figure 13(a). Given the Doppler effect, the inner race fault signal also exhibits obvious frequency band expansion in the spectrum. In addition, different from the outer race signal, the TFD shows that the time–frequency region of the inner race signal is discontinuous. So, its KPE is more difficult than that of the outer race signal. After signal pre-processing with a band-pass filter of 1680 and 1950 Hz, set an energy threshold to reduce the background noise, where the calculate result of energy threshold of inner race is also 0.005 (in TFD). The iterative algorithm method was employed to analyze this signal, as shown in Figure 14(a)–(f). The Doppler window also became increasingly narrow, where the window width threshold was also set to 0.001. The iterative fitting algorithm continued until rf achieve the threshold. Interference can be eliminated easily because the Doppler window width became increasingly narrow through the iterative processing.

(a) Original signal and (b) Doppler-free signal.

Iterative process demonstration of the inner race signal: (a) the renderings of first iteration, (b) the renderings of second iteration, (c) the renderings of third iteration, (d) the renderings of fourth iteration, (e) the renderings of fifth iteration, and (f) the renderings of sixth iteration.
The two previous iterative processes of the iterative fitting are shown in Figure 15. Comparison of Figure 15(a) and (d) shows that the Doppler window width became narrow. Figure 15(a) shows two singular parts of the IF ridge line in this Doppler window. This phenomenon is caused by the interference component with high energy existing in the acoustic signal. It cannot be eliminated easily by pre-processing because the local extremum leads to a large deviation of the obtained IF ridge line. The singular part in Figure 15(a) with a gray mark converged with that in Figure 15(b) through the iterative processing. The other singular part was eliminated by a new extremum, as shown in Figure 15(b). Figure 15(c) and (d) illustrate similar scenarios. The extreme value of the effective signal, which was overwhelmed by the high-energy interference, was determined during the iterative algorithm.

Demonstration of the two previous iterative processes: (a) instantaneous frequency extraction with singular part, (b) correction the singular part, (c) instantaneous frequency extraction with singular part again, and (d) correction the singular part again.
The kinematic parameter values obtained by the first fitting were then compared with the optimal values obtained by iterative fitting in Table 3. The accuracy of the kinematic parameters also improved. The Doppler effect was corrected in the acoustic signal according to TIR and based on the obtained kinematic parameters. Figure 13(b) shows the time domain, frequency domain, and STFT of the correction signal. The frequency band expansion in the frequency domain was eliminated, and the signal obtained suitable correction in the TFD.
Different fitting results of inner race fault signal.
The fault information of the inner race Doppler-free signal was also difficult to identify because of the background noises. So, EEMD was adopted to extract weak diagnosis information. The spectral kurtosis values of each IMF are shown in Figure 16(a). The third IMF value is higher than others. It means that the third IMF contains more fault information than others. Considering that other IMFs only contained very little information about the original signal, only the IMFs from second to fourth are shown in Figure 16(b). Figure 17(a) shows the envelope spectrum of the Doppler-free signal. The envelope spectrum of third IMF is shown in Figure 17(b). The fault frequency is mixed in the background noises in Figure 17(a). Comparing Figure 17(a) and (b), the fault frequency and its harmonics can be seen clearly in the latter. It means that the fault information can be identified effectively from the inner race signal.

(a) Spectral kurtosis values of each IMF and (b) Second, third, and fourth IMFs.

(a) Envelope spectrum of the Doppler-free signal and (b) envelope spectrum of IMF3.
Conclusion
The proposed WAS diagnosis method can correct the Doppler effect first and then to identify the fault frequency effectively. The main innovation of the proposed method is the iterative fitting algorithm of KPE. The experiment analysis showed that the accuracy of the obtained kinematic parameters significantly improved with the iterative fitting, and the iterative fitting algorithm did not have to set a narrow fitting interval for each parameter beforehand. Doppler distortion was corrected effectively by TIR after obtaining the kinematic parameters. EEMD could extract the information from background noise. The fault frequency could be identified by envelope analysis on IMF. The proposed method may be applied to practical train bearing healthy monitoring systems to enhance their monitoring performance. However, the proposed method still has shortcomings. For example, the distance between the microphone and the middle of the track (r) should be constant in the iterative fitting algorithm to decrease the number of the unknown kinematic parameters. Furthermore, train speed increases in reality, so the distortion of the acoustic signal becomes increasingly serious. More interference could also occur from the different wheel bearings, so IF extraction becomes difficult. These problems should be addressed in future studies.
Footnotes
Handling Editor: Mario L Ferrari
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (nos 51675001, 51505001, 51605002, and 51637001) and the Natural Science Foundation of Anhui Province (nos 1508085SQE212 and 1608085QE110). The experiment was completed in Dr Qingbo He’s laboratory in University of Science and Technology of china. The valuable and constructive comments from the reviewers to further improve this paper are sincerely appreciated.
