Application of psychoacoustics for gear fault diagnosis using artificial neural network

Abstract

Identification of correct working of gearbox is a very important function during end of line inspection in the assembly line while manufacturing the gearbox. Such inspection is performed by an operator by listening to the sound of gearbox while running it on a test bench. Based on the sound emitted by the gearbox combined with experience and judgment of the operator, the gearbox is passed or rejected for fitting inside the vehicle. This paper makes an attempt to use artificial intelligence techniques to identify gearbox condition in the above environment by using psychoacoustic features to replace human hearing. Experiments are carried out on a gearbox test rig and sound data are acquired for good and faulty gear conditions. Psychoacoustic features and statistical indices are extracted from the data and these are then used as input to an artificial neural network. The artificial neural network output is the condition of gearbox. Performances of psychoacoustic and statistical indices are then compared. It is found that psychoacoustic features are able to predict gearbox condition with an accuracy of 99% and 98% for good and faulty conditions, respectively, whereas the statistical features are able to do the same with 97% and 98% accuracy. Therefore, it is concluded that psychoacoustic features have the potential to be used for the end of line inspection of gearbox in manufacturing environment and the process of inspection can be made objective by eliminating operator’s ability and judgment.

Keywords

Gear fault diagnosis psychoacoustics artificial neural network assembly line inspection

Introduction

Gearbox is a vital element in any power transmission system. Therefore, condition monitoring of gearbox has been a topic of wide interest. Monitoring by using vibration and acoustic emission is very popular and well established. The monitoring is carried out for various purposes: (1) to characterize the emitted sound, (2) to diagnose faults in gearboxes of running machinery and (3) to check correctness of gearbox assembly after manufacturing for identifying problems if any, before clearing it for use in any vehicle or machine. Whatever is the purpose of monitoring, the process involves data acquisition and analysis. The analysis techniques can be classical or intelligent. The intelligent techniques are used for effective classification and fault diagnosis. The following paragraphs present a review of some relevant works in this area.

In applications like automobiles, manufacturers are very much concerned about the noise, vibration and harshness (NVH) and sound quality parameters of all the components and subassemblies which contribute to annoying noise of the vehicle. Sound quality has become an important criterion for the design and manufacturing of products to attract and retain customers. It is evaluated using psychoacoustics. Sound quality evaluation is of great importance not only to comply with pollution control regulations but also for sales as sale volumes are affected by it. Caryer Cook and Ali¹ have discussed the trends and perspectives of the end-of-line inspection for annoying noises in automobiles. There are some studies to characterize sound for various domestic products such as vacuum cleaner, refrigerator compressors, automobile seat adjuster and car door closer.^2–6 Some of the researchers^7–10 have studied the dynamic response of the gearbox, i.e. vibration and noise for the various errors like backlash, misalignment, profile error, etc. using techniques such as finite element method (FEM), boundary element method (BEM), Monte Carlo simulation, equation of motion and incremental harmonic balance method (IHBM). Zhou and Wenlei⁷ established a dynamic model using FEM and BEM to show dynamic characteristics of gearbox where the time history of node dynamic response and noise spectrum of the gearbox were obtained. Shen et al.⁸ studied nonlinear dynamics of a spur gear pair based on the IHBM, where the time-varying stiffness and backlash were included. The frequency–response of the system was investigated by IHBM, and the effects of damping ratio and amplitude of excitation on the response were analysed. Driot and Perret-Liaudet⁹ studied the variability of modal behaviour in gear pair due to manufacturing error and shaft misalignment. Bonori and Pellicano¹⁰ analysed non-linear vibrations of spur gear in the presence of manufacturing errors where backlash and profile error distributions were stochastically modelled to find their effect on gear vibration. Such studies help to analyse and control dynamics of gearbox to make changes at the design stage or to modify the existing product to enhance its performance. Significant work has been carried out to develop signal processing and artificial intelligence techniques for condition monitoring of rotating and reciprocating machines. Wang et al.¹¹ proposed an advanced technique for engine fault diagnosis based on Hilbert–Huang transform (HHT) on the noise based samples and support vector machine (SVM) method. Han et al.¹² used fast-ICA (independent component analysis) and wavelet packets along with the SVM for bearing fault diagnosis. Kankar et al.¹³ used the complex Morlet wavelet based on minimum Shannon entropy criterion to extract the fault feature and presented a methodology for detection of bearing faults by classifying them using three artificial intelligence techniques where SVM is found to be superior to least vector quantization and self-organising maps. Advances in the field of condition monitoring indicators, signal processing and artificial intelligence techniques have been reported in the review papers by Nie et al.,¹⁴ Singh and Al Kazzaz,¹⁵ Peng and Chu,¹⁶ and Jardine et al.¹⁷ In addition to the above, there are some studies^18,19 on the cabin noise for tractor and car and it is found that the structure born sound due to gearbox is the most annoying noise and hence it is recommended that efforts should be made to control quality of gearbox on assembly line.

If the focus is on gear fault diagnosis in running machinery, then there is no dearth of literature on use of vibration and acoustic emission techniques for this purpose. These techniques generally use time and frequency domain methods for diagnosing faults. When in operation, the major causes of failure of the gearboxes are cracked gear tooth, pitting, wear, etc. Wang et al.,²⁰ Saravanan et al.,²¹ El Badaoui et al.,²² Wang et al.²³ and Loutas et al.²⁴ have demonstrated use of intelligent techniques for effective classification and gear fault diagnosis.

During inspection of newly manufactured gearboxes, for deciding the acceptability of the gearbox, the operator listens to the sound of gearbox on a test bench by running it at a constant speed by an electric motor. Based on the judged sound quality, the operator accepts or rejects the gearbox. This becomes a challenging task as the decision is subjective and depends on the opinion of the individual. Hence, psychoacoustics, a new evolving technique would be one of the solutions to eliminate subjectivity by extracting psychoacoustic parameters which are based on science of human hearing and physics of sound waves. For newly manufactured gearboxes, faults like misalignment, centre distance variation, dents on teeth, bearing misfit, profile errors, etc. may occur. These faults would manifest in different ways such as increased vibration, annoying sounds, rattle and high pitch whistle type sound. However, very limited work can be been found for evaluating gearbox quality using psychoacoustics to identify correct working of gearbox during end of line inspection. Shang et al.²⁵ have also reported similar situation of dependence on human hearing ability and expertise in gearbox inspection after making the assembly. They have proposed vibration based technique which makes use of time and frequency domain methods. Time synchronous averaging was applied to acquire data as driving, driven and counter shafts are involved. During visit to few gearbox manufacturing industries by the authors and subsequent interaction with the people involved in manufacturing, it was realized that there is a need to develop a technique which would work as a tool to identify correctness of the assembly and classify the gearbox as ‘OK’ or ‘Not OK’, based on some objective indices and artificial intelligence technique rather than the subjective opinion and hearing ability of the operator which is currently used. Hence, this paper focuses on developing a method by which the existing end of the line inspection of gearbox assembly by human operator can be replaced by an algorithm where the subjectivity involved in the decision on acceptability of gearbox can be eliminated. As human hearing ability is to be replaced, use of psychoacoustics has been made for fault identification.

In view of the above, experiments are conducted on a standard experimental gear set up for fault simulation. The main objective is to find the relationship between possible defects and manifested outcomes using psychoacoustic indices to ascertain their ability to classify faults. The statistical features of acoustic signal and psychoacoustic parameters are extracted from the measured data and used as input to artificial neural network (ANN). Details are discussed in the next section.

Experimental set up and procedure

Experiments were carried out on a spur gearbox test rig with layout as shown in Figure 1. It consists of single stage spur pinion and gear. The shafts are supported by deep groove ball bearings. The gearing system is lubricated by splash lubrication provided at the bottom of gear housing. The pinion on input shaft has 32 teeth and driven gear on the output shaft has 80 teeth. A three phase, 3 HP, 0–5000 rpm variable speed motor operates the input shaft. The output shaft is connected to magnetic particle brake. More than one test gears can be mounted on the input shaft at a time so that good and faulty gears can be engaged by sliding. Output shaft is connected with magnetic particle brake for applying the required load/torque. Magnetic brake can vary the load as required with the help of programmable controllers. Data acquisition (DAQ) card by National Instruments (NI9234) was used to acquire the acoustic signal with array microphone of type 40 PH of make of G.R.A.S. with sensitivity of 52.14 mV/Pa. Figure 2 shows a photograph of the experimental setup and Figure 3 shows the setup with position of microphone. The microphone, was located in free field at 1 m distance from the gearbox. It was connected to DAQ card and data were acquired and stored on a computer. LabVIEW software was used to acquire data and extract various psychoacoustic and signal statistical features. The acoustic signals and their features were obtained by varying speed and load of the gearbox for good and faulty (crack at the root) gears in mesh. In the experiment, 50 samples of the signals of each condition of speed, load and gear were acquired and their features were extracted. Thus there were 600 sample signals. The sampling frequency selected for acquiring data was set at 44,000 Hz keeping in mind the requirement of Nyquist theorem. The various features extracted are described in section ‘Acoustic signal feature extraction’.

Figure 1.

Experimental setup.

Figure 2.

Photograph of experimental setup.

Figure 3.

Photograph of experimental setup with microphone.

Acoustic signal feature extraction

Various features characterizing acoustic signal are discussed below.

Statistical features of acoustic signal

From the acoustics time domain signals, various statistical features were computed^26–29 using LabVIEW programme. Brief description of these features is presented in Table 1 and sample values shown in Table 2.

Table 1.

Expressions to compute statistical features of acoustic signal.

S. no.	Statistical indicator	Formula	Remark
1	RMS	$\sqrt{\frac{\sum_{n = 1}^{N} (x (n) - μ) 2}{N}}$	It is the normalized second statistical moment of the signal.
2	Kurtosis	$\frac{\sum_{n = 1}^{N} (x (n) - μ) 4}{N σ 4}$	It is the normalized fourth statistical moment of the signal. Provides measure of the impulsive nature of the signal.
3	Skewness	$\frac{\sum_{n = 1}^{N} (x (n) - μ) 3}{N σ 3}$	It is a measure of symmetry, or more precisely, the lack of symmetry.
4	Maximum	$max [x]$	Finds the highest point in a set of values.
5	Minimum	$min [x]$	Finds the lowest point in a set of values.
6	Range	$max [x] - min [x]$	It is the difference in maximum and minimum point values.
7	Crest Factor	$\frac{peak vlue}{RMS Value}$	Its ratio of peak level to RMS level. It indicates the presence of high amplitude peaks caused by local damages.
8	Form Factor	$\frac{RMS Value}{Mean Value}$	Ratio of RMS value to mean value. Indicates the overall status of signal.
9	Mean	$\frac{\sum_{n = 1}^{N} (x (n))}{N}$	Average of all the amplitude of digitized points sampled.
11	Variance	$\frac{\sum_{n = 1}^{N} (x (n) - μ) 2}{N}$	Indicates the spread of the amplitude of the values from its mean.

RMS: root mean square; x(n): amplitude of the nth digitized point in the time domain; N: number of points in time domain; μ: mean of the N points; σ: standard deviation.

Table 2.

Statistical signal features of acoustic signal.

Condition	RPM	Load%	RMS	Variance	Kurtosis	Max	Skewness	Min	Range	Mean	Form Factor	Crest Factor
Good	120	0	0.043	0.001	3.100	0.200	−0.012	−0.201	0.401	0.002	14.78	4.66
Good	120	50	0.049	0.002	4.599	0.265	−0.049	−0.269	0.534	0.003	15.766	10.38
Good	180	0	0.125	0.015	2.892	0.486	0.085	−0.452	0.939	0.003	41.25	9.96
Good	180	50	0.056	0.003	3.917	0.352	−0.111	−0.326	0.678	0.003	17.26	11.31
Good	240	0	0.094	0.008	2.976	0.424	−0.016	−0.403	0.82	0.002	32.42	10.85
Good	240	50	0.074	0.005	3.903	0.456	−0.053	−0.429	0.885	0.002	26.05	11.02
Faulty	120	0	0.046	0.002	3.760	0.250	−0.036	−0.263	0.513	0.002	18.25	17.21
Faulty	120	50	0.081	0.006	3.141	0.376	0.071	−0.338	0.715	0.002	29.26	17.94
Faulty	180	0	0.071	0.005	4.71	0.457	0.001	−0.449	0.907	0.002	28.45	19.18
Faulty	180	50	0.12	0.016	3.18	0.54	0.108	−0.526	1.07	0.002	48.35	20.46
Faulty	240	0	0.1	0.011	4.56	0.66	−0.06	−0.644	1.31	0.0025	43.03	21.55

RMS: root mean square.

Psychoacoustic features of acoustic signal

Psychoacoustics is a science which describes how sound is perceived by human. Zwicker, Fastle and Aures have made significant contributions in this field. The contributions of these scientists have resulted in methods to objectively quantify sound that the human perceives. The way human being extracts the information contained in the acoustic signals using natural senses is mimicked in the algorithms that have been accepted by the International Organisation for Standardisation (ISO). As per ISO523B objective indices specified are: Stationary loudness, Time varying loudness, Roughness, Sharpness, Tonality and Fluctuation strength.^30–37 These parameters are calculated using expressions described in Table 3. Brief discussion of these parameters is given in the subsequent paragraphs and detailed description is available in literature.³⁰

Table 3.

Psychoacoustic features and related expressions.

S. no.	Psychoacoustic indicator	Formula	Description
1	Loudness (Sone)	$N = \int_{0}^{24 Bark} N' dz$ Refer Figure 4	N = loudness in Sone N′ = specific loudness z = Bark scale [31]
2	Loudness (Phone)	$N = 2 \frac{LL - 40}{10}$ $LL = 40 + 10 \frac{log (N)}{log (2)}$ [31]	LL = loudness level in phone Refer Figure 4 [32]
3	Sharpness (acum)	$S = 0.11 \frac{\int_{0}^{24 Bark} N' g (z) dz}{\int_{0}^{24 Bark} N' g (z) dz}$ $z < 14, \to g' (z) = 1 z > 14, \to g' (z) = 0.00012 . Z 4 - 0.0056 Z 3 + 0.1 Z 2 - 0.81 Z + 3.51$ Refer Figure 5 for z value	Refer Figure 5 [32]
4	Roughness (asper)	$R = f_{\mod} \int_{0}^{24 Bark} \nabla L g (z) dz$	Refer Figure 6 [32]
5	Fluctuation strength (vacil)	$F = \frac{Δ L}{\frac{f_{\mod}}{4 Hz} + \frac{4 Hz}{f_{\mod}}}$	ΔL = perceived modulation depth f_mod = modulation frequency
6	Tonality (Tu) Aures Model	$T = c . W_{T}^{0.29} * W_{Gr}^{0.79}$ $W_{T} = {\sum_{i = 1}^{M} (W_{1} (Δ z_{i}) * W_{2} (f_{i}) * W_{3} ((Δ L_{i})) 2} 0.5$ $W_{1} (Δ z_{i}) = [\frac{0.13}{Δ zi + 0.13}] 1 / 0.29$ $W_{2} (f_{i}) = {\frac{1}{1 + 0.2 * (\frac{fi}{700 Hz} + \frac{700 Hz}{fi}) 2}} 0.5$ $W_{3} (Δ L_{i}) = 1 - exp (- \frac{Δ L_{i}}{15 dB})$ $Δ L_{i} = L_{i} - 10 {log}_{10} [[\sum_{k \neq i}^{n} A_{E} (f_{i})] 2 + E_{Gr} (f_{i}) + E_{HS} (f_{i})] dB$	c = calibrating constant and is adjusted so that the value of 1 kHz tonal component at 60 dB will be 1 W_GR = weight function signifies the loudness to tonal ratio of tonal element W_T = coherence function on the incitement of annoyance by tonal elements W₁ = the width of an individual tonal element W₂ = centre frequency W₃ = related to the value of tonal size Δz = width of tonal element f = centre frequency in the unit Hz ΔL_i = value of calibration size of the tonal element in the unit of dB A_E(f_i) = the effect of tonal element close to tonal element E_Gr(f_i) = intensity of noise in the critical band including the ith tonal element [33]

Loudness

Loudness is the effect of energy content of sound on the ear. It is related to decibel (dB) which is logarithmic scale used to quantify the power of sound. Doubling the sound power does not mean that the sound perceived is twice. The calculation of sound addition is more complex and is dependent on the critical bandwidth which is a measure of frequency resolution of ear. Loudness perception is a function of sound pressure level, frequency and the spectral shape of the sound. Hence the loudness is computed using the equation shown in Table 3. The calculations are based on the complex graphical method using 1/3rd octave band which is considered as equivalent to critical bandwidth as per ISO 532/R. It provides a graphical method in which specific loudness is integrated over 21 critical bands and specific loudness N’ is computed as a function of critical bandwidth in Bark (z).The loudness level is defined as sound pressure level of 1 kHz pure tone in a plane wave and frontal incident that is as loud as Phone. Loudness is a term referring to human perception of sound volume expressed in the units of Sone which corresponds to 40 dB sound at 1 kHz tone.^30–32

Sharpness

Sharpness is a measure of the relative loudness at high frequencies. Sharpness is used to characterise steady state noise and corresponds to the sharp, painful, high energy sound and is the comparison of amount of high frequency energy to the total energy. It is measured in acum. An acum is referenced to a narrow band noise centred at 1 kHz with the level of 60 dB_spl. Expressions for sharpness used in LabVIEW program are based on Aures' model given in Table 3.^30–32

Figure 4.

Calculation of Loudness [32].

Figure 5.

Weighting, g′(z), as a function of critical band rate [32].

Figure 6.

The effect of subjective duration on rapid amplitude modulated noise [32].

Roughness

Roughness is the subjective perception of fast amplitude modulation present in sound pressure signal. Roughness is used to characterise the dynamic noise by measuring the temporal deviation of the loudness spectrum due to frequency modulation between 20 Hz and 300 Hz. Roughness is measured in asper. An asper is referenced to a 1 kHz tone at 60 dB_spl that is frequency modulated by 70 Hz sine wave with a modulation factor of 1. It is the algorithm developed to measure energy in 24 barks, computes and filters the envelope of signal in each band and measures the amplitude modulation of each envelope and then weights the level in each band with the frequency dependent weighting function. This algorithm returns the roughness spectrum versus critical band and then integrates the roughness spectrum to measure the roughness.^30–32

Fluctuation strength

Fluctuation strength is a hearing sensation related to loudness modulation at low frequencies that is perceptible individually. It uses similar method to roughness versus time analysis except that it focuses on the signal variation with very low modulation frequencies between 0.25 Hz and 20 Hz. It is measured in vacil. A vacil is referenced to a 1 kHz tone at 60 dB_spl that is frequency modulated by 4 kHz sine wave with a modulation factor of one.^30–32

Tonality

Tonality is used to determine whether a sound consists mainly of tonal component of broadband noise. The algorithm for tonality measures the relative strength of the signal compared to the overall signal. For each time block, this algorithm first varies the frequency resolution according to the human frequency selectivity, searches the frequencies of likely tones and then compares the loudness of the sound. The expressions based on Aures’ model³³ are mentioned in Table 3. LabVIEW programme computes tonality of the sound pressure signal according to Aures' model on successive 160 ms blocks.³¹ Tonality measures the relative strength of the tones in a signal compared to the overall signal.

The psychoacoustics features – Loudness, Sharpness, Roughness, Fluctuation strength and Tonality were extracted by using the modules in LabVIEW and the values are tabulated for good and faulty gears for different speeds and loads on a standard test setup and sample values are shown in Table 4.

Table 4.

Sample values of psychoacoustics parameters.

Gear	Load	Speed	Loudness (Phone)	Loudness (Sone)	Sharpness (acum)	Roughness (asper)	Fluctuation strength (vacil)	Tonality (Tu)
Good gear	0%	2 Hz	79.75	15.72	2.26	0.31	0.41	0.10
Good gear	50%	2 Hz	81.28	17.49	2.10	0.31	0.47	0.12
Good gear	0%	3 Hz	81.83	18.16	2.14	0.45	0.47	0.11
Good gear	50%	3 Hz	83.84	20.88	1.97	0.35	0.49	0.18
Good gear	0%	4 Hz	83.63	20.59	2.18	0.52	0.55	0.05
Good gear	50%	4 Hz	87.80	27.48	2.16	0.40	0.46	0.03
Faulty gear	0%	2 Hz	80.77	16.88	2.17	0.76	0.88	0.11
Faulty gear	50%	2 Hz	86.19	24.57	2.01	0.44	0.39	0.20
Faulty gear	0%	3 Hz	84.78	22.29	2.29	1.30	1.48	0.05
Faulty gear	50%	3 Hz	88.10	28.06	2.10	0.88	1.03	0.01
Faulty gear	0%	4 Hz	88.40	28.65	2.49	1.14	1.85	0.02

Fault classification

The methodology adopted for experimentation and fault classification using acoustic signal’s statistical and psychoacoustic features is shown in Figure 7. After extracting the psychoacoustic and statistical indices from the acquired data, ANN was used for classification. First, ANN was trained with the psychoacoustic features and its ability to classify good and faulty condition was ascertained. Then, the ANN was trained using the various statistical features of acoustics signals like RMS, Kurtosis, Form Factor, Crest Factor, etc. and the ability to classify the fault was again tested. A graphical user interface was developed in MATLAB for using the ANN module. Trained ANN models were tested to compare the classification efficiency for psychoacoustic and statistical features. The details of the ANN architecture selected for classifying the faults are discussed in section ‘Artificial neural network’.

Figure 7.

Methodology.

Artificial neural network

Neural networks are based on biological nervous system composed of neurons where information is processed to learn from the given data. It can be used in several areas of engineering applications and eliminates the limitations of the classical approaches by extracting the desired information using the input data. The advantage of the usage of neural networks for prediction is that it is able to learn from given set of data and after its learning is finished, it is able to catch the hidden and strong non-linear dependencies, even when there is significant noise in the training set.^38–40 Among the different types of neural network, feedforward backpropagation multilayer perceptron neural network is used for the present work. It consists of an input layer of source node, two hidden layers of computation neurons and the output layer. The input layer nodes represent the normalized feature extracted from the measured acoustic signal. The number of input nodes is six, for the six psychoacoustic features used. Similarly 10 input nodes are used for the 10 statistical features of the acoustic signal. Output node is one in both cases. Numbers of nodes in two hidden layers are 6 and 10 respectively for both the networks. The target value of the output node can have binary value 1 and 0 representing good and faulty condition, respectively. In the ANN, activation function of tan-sigmoid (tanh) and logistic (log-sigmoid) was used in hidden and output layer, respectively. The ANN was created, trained and implemented using code written in MATLAB with training algorithm of Levenberg–Marquardt. Out of the 600-sample signals acquired, 40% were used for training, 30% for testing and 30% for validation. The ANN was trained iteratively to minimize the performance function of mean square error (MSE) between the network output and corresponding target values.⁴⁰ At each iteration, the gradient of performance function MSE was used to adjust the network weights and biases. In this work, an MSE of 10⁻⁵, a minimum gradient of 10⁻¹⁰ and maximum iteration number of 5000 were used. The training process was terminated when the error converged to specified condition within the specified iteration. The initial weights and biases of the network were generated automatically by the program.

The sample values of input features for training network are given in the Tables 2 and 4. The network architecture selected is shown in Figures 8 and 9 for statistical and psychoacoustic features, respectively. Figures 10 and 11 show regression plots of the correlation between the network output and target values. The value of correlation coefficient R was 0.97 for statistical features and 0.99 for training with psychoacoustic features. The value of correlation coefficient greater than 0.9 indicates a good fit of the data and perfect training.

Figure 8.

Architecture of ANN for statistical features of acoustic signal.

Figure 9.

Architecture of ANN for psychoacoustic features.

Figure 10.

Regression plot for statistical features of acoustic signal.

Figure 11.

Regression plot for psychoacoustic features.

Figures 12 and 13 show the performance plot for the ANN. The validation and test curves are very similar. The curves do not indicate any problems with the training and there is no possibility of overfitting of data. Performance plots and values of correlation coefficients indicate satisfactory training performance for the architecture selected and give good generalization.⁴⁰ The trained ANN was tested for ascertaining the classification efficiency. It was found that sound pressure signal statistical and psychoacoustic features can classify data with high efficiency. The details of testing efficiency of the ANN used with good and faulty gears are shown in Table 5.

Figure 12.

Performance plot for statistical features of acoustic signal.

Figure 13.

Performance plot for psychoacoustic features of acoustic signal.

Table 5.

Classification accuracy of ANN.

	Classification efficiency
Input parameters based on	Good gear data	Faulty gear data
Psychoacoustics features	99%	98%
Signal statistic features	97%	98%

Comparison of results with the other condition monitoring techniques

While comparing the results of this paper with other published work and the conventional vibration based condition monitoring techniques, following observations are made. Shang et al.²⁵ had carried out similar work for fault identification during end of line inspection using vibration based technique. They have highlighted the limitations of the various vibration based condition monitoring techniques such as HHT, wavelet transform, and envelop spectrum for fault identification in newly manufactured gearbox. It was pointed out that the HHT is difficult to apply for real time application and would lead to inferior performance due to end effects propagated in signal, whereas wavelet basis function is not self-adaptive and envelope analysis is difficult to use at variable speed. To overcome these limitations they had proposed method of acquiring vibration signal using time synchronous averaging whose time domain and frequency domain features are input to ANN to diagnose fault for the case mentioned. Therefore, the method used by Shang et al.²⁵ may also be used in place of the psychoacoustic analysis method which is proposed in this paper. However, the maximum accuracy of fault identification reported by Shang et al. after using improved genetic algorithm was 94.61% and 91% with no algorithm for feature selection. Comparing these results with the work presented in this paper, it is seen that psychoacoustics analysis technique gives the fault classification with 99% accuracy and hence it could be a better option for identification of fault during end of line inspection of gearbox.

An attempt is also made to apply conventional spectrum analysis for the diagnosis of crack at gear tooth in the gearbox used in this work. Figure 14 compares the spectrum for healthy, i.e. good gear and faulty gear with crack at root. It is observed that the spectrum of faulty gear shows increase in amplitude at the gear mesh frequency and its harmonics indicating presence of fault. However, use of this type of conventional method needs expert to analyse the fault and dependence on skilled person still persists. The work presented in this paper was taken up to remove dependence on any expert and hence use of conventional methods of vibration monitoring for fault diagnosis was not focused upon in this work.

Figure 14.

Vibration spectrum of good and faulty gear.

Conclusion

This paper has focused on specific situation of identifying assembly error in the gearbox at the end of assembly line during manufacturing. An objective method using ANN with psychoacoustic features as input has been proposed to substitute operator based inspection which is purely subjective. Ability of sound quality features based on psychoacoustics to classify fault has been ascertained using ANN as a classifier with efficiency of 98% in fault detection and 99% for healthy condition. These results have been compared with the classification using signal statistical features of acoustic signal and efficiency of 98% has been observed for fault detection and 97% for healthy gear condition. As the classification efficiency by psychoacoustic features has been found to be better than statistical features, it can be concluded that the proposed method can be used to accurately classify gear faults during end of line inspection.

Various gear faults manifest in different sounds, and this is judged by an operator at the end of line inspection. Statistical features operate on the acquired digitized signal without any concern of the spectral components involved in this complex signal, whereas psychoacoustic features can capture the deviations in sound emitted during operation of the gearbox. In these features sharpness characterizes steady state features, while roughness, fluctuation strength and tonality characterize dynamic and temporal features. Loudness characterizes both amplitude and frequency content of the acoustic signal. Therefore, they perform better than statistical features of acoustic signal in the specific situation considered.

Thus, it can be concluded that psychoacoustic indices can be used along with ANN as a substitute for human hearing ability and expertise for identifying the correctness of gearbox assembly during end of line inspection in the manufacturing environment.

Footnotes

Acknowledgement

Authors acknowledge Dr. Anand Parey, Associate Professor, Department of Mechanical Engineering, IIT, Indore, for providing Gear fault simulator to perform experiments.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

References

Caryer Cook

Ali

. End-of-line inspection for annoying noises in automobiles: Trends and perspectives. Appl Acoust 2012; 73: 265–275.

Parizet

Guyader

Nosulenko

. Analysis of car door closing sound quality. Appl Acoust 2008; 69: 12–22.

Nykänenandand

Sirkka

. Specification of component sound quality applied to automobile power windows. Appl Acoust 2009; 70: 813–820.

Lim

. Correlations between deficiencies in power window systems influencing sound quality and some psychoacoustic metrics. Appl Acoust 2001; 62: 1025–1047.

Cerrato-Jay BG, Dong J, Pickering DJ, et al. The development of a sound quality-based end-of-line inspection system for powered seat adjusters. SAE Technical Paper 2001-01-0040, 2001.

Yıldırımand

Eski

. Sound quality analysis of cars using hybrid neural networks. Simulat Model Pract Theory 2008; 16: 410–418.

Zhou

Wenlei

. Vibration and noise radiation characteristics of gear transmission system. J Low Freq Noise Vibr Active Control 2014; 33: 485–502.

Shen

Yang

Pan

. Non-linear dynamics of spur gear pair with a time varying stiffness and backlash. J Low Freq Noise Vibr Active Control 2004; 33: 179–187.

Driot

Perret-Liaudet

. Variability of modal behavior in terms of critical speeds of a gear pair due to manufacturing errors and shaft misalignments. J Sound Vibr 2006; 292: 824–843.

10.

Bonori

Pellicano

. Non-smooth dynamics of spur gears with manufacturing errors. J Sound Vibr 2007; 306: 271–283.

11.

Wang

Zhu

. An intelligent approach for engine fault diagnosis based on Hilbert–Huang transform and support vector machine. Appl Acoust 2014; 75: 1–9.

12.

Han

Guo

. Feature extraction method of bearing AE signal based on improved FAST-ICA and wavelet packet energy. Mech Syst Signal Process 2015; 62–63: 91–99.

13.

Kankar

Sharma

Harsha

. Rolling element bearing fault diagnosis using wavelet transform. Neurocomputing 2011; 74: 1638–1645.

14.

Nie

Wang

. Review of condition monitoring and fault diagnosis technologies for wind turbine gearbox. Proc CIRP 2013; 11: 287–290.

15.

Singh

Al Kazzaz

SAS

. Induction machine drive condition monitoring and diagnostic research—A survey. Electric Power Syst Res 2003; 64: 145–158.

16.

Peng

Chu

. Application of the wavelet transform in machine condition monitoring and fault diagnostics: A review with bibliography. Mech Syst Signal Process 2004; 18: 199–221.

17.

Jardine

AKS

Lin

Banjevic

. A review on machinery diagnostics and prognostics implementing condition-based maintenance. Mech Syst Signal Process 2006; 20: 1483–1510.

18.

Abd-El-Tawwab

El-Sayed

A-S

El-Hakim

. Characteristics of agriculture tractor interior noise. J Low Freq Noise Vibr Active Control 2000; 19: 73–81.

19.

Georgiev

Krylov

Winward

RETB

. Simplified modeling of vehicle interior noise: Comparison of analytical, numerical and experimental approaches. J Low Freq Noise Vibr Active Control 2006; 25: 69–92.

20.

Wang

Lee

C-M

Kim

D-G

. Sound-quality prediction for non-stationary vehicle interior noise based on wavelet pre-processing neural network model. J Sound Vibr 2007; 299: 933–947.

21.

Saravanan

Kumar Siddabattuni

VNS

Ramachandran

. Fault diagnosis of spur bevel gear box using artificial neural network (ANN), and proximal support vector machine (PSVM). Appl Soft Comput 2010; 10: 344–360.

22.

El Badaoui

Guillet

Daniere

. New applications of the real cepstrum to gear signals, including definition of a robust fault indicator. Mech Syst Signal Process 2004; 18: 1031–1046.

23.

Wang

Ismail

Golnaraghi

. Assessment of gear damage monitoring techniques using vibration measurements. Mech Syst Signal Process 2001; 15: 905–922.

24.

Loutas

Sotiriades

Kostopoulos

. Condition monitoring of gears and advanced signal processing techniques towards more effective diagnostic schemes. Noise Vibr Worldwide 2010; 41: 10–18.

25.

Shang

Zhou

Yuan

. An intelligent fault diagnosis system for newly assembled transmission. Expert Syst Appl 2014; 41: 4060–4072.

26.

Večeř

Kreidl

Šmíd

. Condition indicators for gearbox condition monitoring systems. Acta Polytechn – J Adv Eng 2005; 45: 35–43.

27.

Sharma V and Parey A. Review of gear fault diagnosis using various condition monitoring indicators. In: Proceedings of ICOVP 2015, 12th international conference on vibration problems, IIT, Guwahati, India, 14–17 December 2015.

28.

Zhu J, Nostrand T, Spigel C, et al. Survey of condition indicators for condition monitoring systems. In: Proceedings of annual conference of the prognostic and health management society, Texas, USA, 29 September–02 October 2014, pp.1–13.

29.

Klein R. Condition indicators for gear. In: Annual conference of prognostic and health management society, Minneapolis, USA, 23–27 September 2012, pp.1–8.

30.

Zwicker E, Fastl H. Psychoacoustics: Facts and models. 2nd ed. Berlin, Germany: Springer-Verlag, 2007.

31.

NI-Tutorial-1526. White paper on Measurement of Sound Quality, https://www.ni.com/tutorials and http://zone.ni.com/reference/en-XX/help/372416B-01/sndvibtk/aures_tonality/ (13 June 2013).

32.

Technical note. An introduction to sound quality testing. Manchester, UK: Acoustic Research Centre, School of Computing, Science and Engineering, University of Salford, http://www.salford.ac.uk (10 October 2015).

33.

Kim

E-Y

Lee

YG-J

Lee

SG-K

. Sound metric design for evaluation of tonal sound in laser printer. Int J Precis Eng Manuf 2012; 13: 1349–1358.

34.

Fastl H. Psychoacoustic basis of sound quality evaluation and sound engineering. In: Proceedings of the thirteenth international congress on sound and vibration, Vienna, Austria, 1–16, 2–6 July 2006.

35.

Fastl

. The psychoacoustics of sound quality evaluation. Acta Acust 1997; 83: 754–764.

36.

Wang

Shu

Wei

. Statistical evaluation and regression analysis of vehicle sound quality. Trans Tianjin Univ 2006; 12: 297–302.

37.

Bodden

. Instrumentation for sound quality evaluation. Acta Acust 1997; 83: 775–783.

38.

Bishop

. Neural networks for pattern recognition, Oxford: Clarendon Press, 1995.

39.

Duda

Hart

Storkv

. Pattern classification, 2nd ed. New York, USA: John Wiley & Sons, 2001.

40.

Demuth

Beale

. Neural network toolbox for use with MATLAB, Release 13, USA: Mathworks, 2002.