Abstract
The present study emphasizes an optimized deep learning algorithm for gearbox fault detection using vibration, sound, and acoustic emission signals. Statistical and acoustic features are extracted from these signals, and various neural network algorithms are explored. The supervised deep feed forward neural network (DFFNN) demonstrates excellent performance with vibration signals but limited accuracy with sound and acoustic emission signals. To address this, unsupervised algorithms are optimized and compared with vibration-based classification. The findings show that unsupervised neural networks, particularly the auto-encoder and stacked auto-encoder architectures, achieve improved classification accuracy by leveraging the unique characteristics of acoustic emission signals. The unsupervised models also effectively overcome the vanishing gradient problem via regularization, enhancing their training efficiency. The stacked auto-encoder, with multiple layers of encoders and decoders, reduces computation time by 40% and memory consumption. These optimized algorithms hold promise for automated fault detection systems. The auto-encoder and stacked auto-encoder, utilizing vibration, sound, and acoustic emission signals, offer enhanced classification accuracy and can facilitate real-time monitoring of rotating mechanical systems. However, further optimization is needed to maximize their performance. In a nutshell, the supervised DFFNN excels in utilizing vibration signals for fault detection, while the unsupervised models exploit the distinctive characteristics of acoustic emission signals. Future research will focus on refining these algorithms to enhance their effectiveness. Implementing these optimized deep learning approaches can lead to autonomous fault detection systems, eliminating the need for continuous human supervision.
Keywords
Introduction
Automobiles provide an easy and comfortable commute, which is necessary for daily life. The increased population and finite resources have demanded the implementation of Hybrid Electric Vehicles (HEV) and Electric Vehicles (EVs). The HEVs are the perfect balance between combustion and electric vehicles, where the HEVs use an electric motor for torque applications and an IC engine for high-speed applications. 1 Using multiple power sources demands complex rotational systems such as gearboxes, responsible for varying gear ratios to achieve higher speed without compromising initial torque. 2 The extreme working conditions of the gearbox lead to pitting, wear, misalignment, and specific other faults in their gears and bearings. This will lead to lesser efficiency and increased friction on the gears and bearings, even with proper lubrication. 3
The excessive use of the gearbox will induce stress extending over the Factor of Safety (FOS), causing deterioration of gears such as tooth break and tooth wear, pitting, scuffing, cracking, bearing wear, noise, and vibration. 4 Thus, the rotating machinery’s critical failures and health had to be detected to avoid significant failure and accidents. 5 Along with the gears, the gearbox bearings are also suspected of high bending and axial load and are commonly diagnosed using vibration analysis. 6 The vibration analysis can also be used for damage detection of multiple system components and condition-based monitoring of gears and bearings. 7
The condition-based monitoring of the rotating systems is developed to utilize the condition indicator in the vibration signal acquired from the gearbox. The primary purpose of using vibration signals is to determine the dynamic characteristics of the gearbox. 8 The vibration signals are produced by induced forces on the internal gears and bearing of the gearbox generated due to poor lubrication, gear pitting, bearing misalignment, etc. 9 Literature studies have proved that the vibration signals are most efficient for fault detection and even the raw vibration signals are efficient in fault diagnosis of gears.10–14 The cited literature data demonstrates that the higher dimension of vibration signals helps in the accurate and faster detection of faults compared to wear debris analysis. Besides vibration signals, the sound signals can also precisely detect faults in an early stage. The sound signals are also chosen for fault diagnosis as the sound analyzers are efficient with a wide bandwidth of microphones. 15
The sound signals are the propagation of sound waves, and they contain information about the behavior and working conditions of the system it is exhibited from. The faults of a system produce sound with the respective characteristic frequency that can be used to determine the condition of any system. For instance, Mohanraj and Kumar 16 used sound signals to predict the optimal processing condition of their tool in machining. Many non-intrusive methods using one or more microphones for condition monitoring of the internal combustion engine have been employed successfully and analyzed using machine learning techniques. 17 Apart from vibration analysis, the shafts and bearings can be diagnosed by sound signal analysis and are often compared with vibration and motor current.18,19 Thus, the degradation of a gearbox is followed by a gradual increase in sound upon aging; it can be used to determine the health of the gearbox by employing machine learning techniques. Even most medical diagnosis and analysis tools are subsonic and ultrasonic, 20 and the structural analysis of buildings is also done using sound signals. 21 Although the vibration and sound signals accurately detect faults, early detection is necessary for some instances, which can be done using acoustic emission signals.
Acoustic Emission (AE) is the rapid release of energy on a system surface generated as propagating waves. These transient waves are released during any deformation on the tooth’s surface. The AE signals are emitted at a range of phenomena at every working condition and significantly affect the AE signals acquired, which can help detect faults in any system early. 22 Ahn et al. 23 and Hou et al. 24 have proved that the signals are much more responsive to early detection of faults than vibration signals, even at bearing fault analysis. According to Van Hecke et al., 25 the AE signals behave with instant changes in their dynamic responses compared to vibration signals. Such signals have higher sensitivity towards the location of faults and are immune to mechanical background noises. As the AE signals are predefined with acoustic features, extracting the features is unnecessary, making it easy for fault detection even at lower speeds.
Condition monitoring and fault detection involve data acquisition, data processing, and data classification to predict the health of the gearbox. The critical part of the data-driven degradation investigation is to extract the representative features from the acquired raw signal data. Praveenkumar et al. 26 classified the static features such as mean root square, Kurtosis, and acoustic features are extracted from the vibration signal and gearbox faults using machine learning algorithms. Kumar et al. 27 determined that the effectiveness of the input features can be improved by using multi-feature fusion techniques, which reduces the declassification of faults. Similarly, several different algorithms can be used for fault detection of any component to avoid future damage to that component and its associated systems. Various algorithms are proven efficient and compatible with different signals and their features.
One of the most efficient algorithms is the Support Vector Machine (SVM), which classifies the data by creating hyperplanes between the classes. The SVM algorithm can be trained at the highest and lowest speed conditions to define a range of suitable gearboxes. 28 The internal analysis methods can be varied by using different kernels, and the RBF kernel is claimed to be best suited with SVM for time-domain signals when compared with other kernels like linear, proximal, polynomial, and sigmoid. 29 The decision tree and SVM algorithm can be used for multi-component analysis and prove to have increased fault classification accuracy using RBF classification. 30 Apart from SVM, Artificial Neural Networks (ANN) are being used these days, which have better classification accuracy as the algorithm uses multiple neurons to learn from the unprocessed data. The rotating components, such as the rolling element bearings, can be diagnosed by Artificial Neural Network and are best suited for fault detection of rotating machines and online monitoring. 31 The ANN algorithms are adopted as they can minimize the error in fault detection by updating the weight matrix on every iterative training, and a further improvement in classification can be made by employing multi-sensor fusion techniques. 32 Any rotating component, such as a centrifugal pump, can be diagnosed with faults using back-propagation, where the faults are categorized into seven types and are analyzed using Adaptive Resonance Theory (ART). 33 Grossberg reported that ART is more efficient than SVM, but unsupervised learning techniques are employed to work with anonymous data, which use unlabeled data. 34 With complex unlabeled high-dimensional data, it is nearly impossible to fully train a model using ANN because of the vanishing gradient issues during back-propagation, which deep learning networks can easily overcome. 35
These deep learning methods have their apparent differences from the traditional ANN, where deep learning can mine the hidden correlation among samples of the pre-identified input data. Deep learning algorithms are used to interpret the faults of unstructured input data and have recently been used in detecting COVID-19. 36 The deep learning algorithm uses the X-ray images of COVID-19-infected patients to classify them as positive or negative. 37 This perceptive learning of the algorithms is achievable because of the adaptive connectivity between the network’s neurons. A comparison of different deep learning approaches (ANN, Deep Neural Network, Convolutional Neural Network, and Deep Convolutional Neural Network techniques) and a residual deep learning technique for fault diagnosis of rotating machinery has proven better efficiency. It concludes that the efficiency can be improved by using large layers of neurons to increase the classification accuracy. 38 The deep learning algorithms are even more accurate using digital data such as motor current as the input. 39 Any highly unstructured data with high complexity are utilized by multi-stage learning methods such as stacked auto-encoder (SAE), which adaptively learns features that capture discriminative information from the input signal signals in an unsupervised way and achieve better classification of faults with given input. 40 With the reported instances, it is evident that the deep learning networks can be modeled as an extensive series of interconnected layers of neurons that exhibit excellent convergence, and accuracy is compelling.
Despite significant advancements in fault diagnosis of rotational mechanical systems using vibration signals, there remains a gap in exploring and utilizing less reliable signals, such as acoustic emission (AE) and sound signals, for fault detection. While vibration signals have been extensively studied and proven effective, the potential of AE and sound signals in detecting faults in such systems has not been adequately explored. This research addresses the gap in using less reliable signals, such as AE and sound signals, for fault detection in rotational mechanical systems. This research primarily emphasizes comprehensively evaluating the performance of deep learning approaches for fault diagnosis using vibration, sound, and AE signals. By analyzing and comparing various deep learning algorithms, including supervised deep feed forward neural networks, unsupervised deep feed forward neural networks, autoencoders, and stacked autoencoders, the most suitable algorithms for the accurate classification of gearbox conditions are identified.
Furthermore, the study seeks to optimize these neural networks to improve fault classification results. This research aims to advance fault diagnosis techniques by leveraging deep learning and alternative signal sources, thereby enhancing the reliability and efficiency of condition monitoring in rotational mechanical systems. The findings of this study are expected to contribute significantly to the development of robust fault diagnosis methodologies, leading to improved maintenance strategies, reduced downtime, and enhanced operational safety across various industrial sectors.
Experimental methodology
A comprehensive methodology (Figure 1) is proposed for evaluating the performance of deep learning approaches in the domain of fault diagnosis for rotational mechanical systems utilizing vibration, sound, and acoustic emission signals. The experimental setup involves controlled fault induction in the mechanical systems, followed by systematic data acquisition of vibration, sound, and acoustic emission signals. Feature extraction techniques are employed to derive relevant information from the acquired signals, and subsequent feature classification is performed. The study explores both supervised and unsupervised learning paradigms, incorporating autoencoder and stacked autoencoder architectures for enhanced representation learning. To assess the efficacy of the models, results validation is conducted, and the selection of the best fault prediction model is determined based on rigorous evaluation criteria. This methodology aims to provide a robust framework for evaluating the effectiveness of deep learning approaches in diagnosing faults in rotational mechanical systems through multi-modal signal analysis. Methodology.
Experimental setup
Noticeable changes in the system output, efficiency loss, temperature, aggressive vibrations, sounds, etc., always detect the fault of a mechanical system. One of the primary and quantifiable parameters must be chosen for adequate identification. The parameters that have been selected are vibration, sound, and AE signals. These signals can be acquired from the gearbox with the help of advanced data science technologies.
41
The experimental setup consists of the following types of sensors and data acquisition system. • 3 – Phase induction motor. • 4 – speed synchronous gearbox. • Eddy current dynamometer • Piezoelectric triaxial accelerometer. • Physical acoustics AE sensor. • Physical acoustics data acquisition system. • 40PH free-field array microphone. • Vib pilot 8-channel data acquisition system.
The experimental setup consists of three 3-phase induction motors, a 4-speed gearbox, and a dynamometer. The induction motor acts as the power source to replace a combustion vehicle’s engine, and the motor’s speed is controlled by delta drive. The output of the gearbox is coupled with an eddy current dynamometer. A vehicle’s dynamic working conditions are simulated with a dynamometer’s help to induce load onto the gearbox. The motor and dynamometer are coupled to the gearbox with a flexible coupling and are fixed rigidly to a stationary table, as shown in Figure 2. The operating speed of the gearbox was maintained between 0 and 1440 rpm, but for safety concerns, the maximum speed of the motor was held at 1000 rpm. The following faults were induced to gear
3
and bearing
7
with reference to existing literature. The gear teeth was induced with adhesion type wear by machine grinding operation, and a bearing crack was introduced on the outer race of a bearing using Electric Discharge Machining (EDM). Experimental setup.
A triaxial accelerometer and AE sensors were rigidly mounted on the top surface of the gearbox above the bearing housing, and a 40PH free-field array microphone was fixed near the bearing housing of the gearbox. The accelerometer and microphone sensors are connected to an 8-channel Data Acquisition System (DAQ) to acquire the vibration and sound signals. The AE sensors are connected to the acoustic data acquisition system to receive respective AE signals from the gearbox.
Experimental procedure
Experimental conditions.
Acquired statistical features.
Acquired Acoustics features.
Experimental conditions and data points.
Deep learning approach
The neural networks are built with several layers of artificial neurons, which are the data processing units of the algorithms. These artificial neurons perform similar tasks to biological neurons and are interconnected to the nearby layers of neurons. The neural network mimics the neural system of a human brain, where the networks analyze the input data. However, high-dimensional input data require advanced neural network techniques such as deep learning with multiple layers of neural network that can perform high-dimensional recognition of data samples with a complex neural structure. 42 Unlike traditional machine learning algorithms, which need the user to identify the features, deep learning algorithms identify the features without user intervention. 43 Deep learning models excel in gearbox fault detection by automatically learning complex patterns, handling non-linearities, and adapting to variable conditions, offering robust and efficient real-time monitoring compared to traditional methods. The solutions of the deep learning algorithms are end-to-end analysis, while the traditional algorithms depend on a series of conclusions. The deep learning algorithms utilize the maximum of any unstructured data by eliminating separate feature engineering. Even without data labeling, the algorithms produce accurate classification with high-dimension data. 44 The deep learning algorithms such as Supervised Deep Feed Forward Neural Network, Unsupervised Deep Feed Forward Neural Network, auto-encoder, and Stacked auto-encoder are proposed in this work, which is modeled to accommodate vibration, sound, and AE signals for improved fault classification.
Deep feed forward neural network
A Feed Forward Neural Network (FFNN) is a straightforward algorithm that utilizes the information in the input data.
45
It consists of input, hidden, and output layers of neurons where neurons of one layer are interconnected to neurons of the next layer, as shown in Figure 3.
46
The number of neurons in the input layer is equal to the number of input features, and the input layer processes the data, passes it to the next layer of the algorithm, and eventually reaches the output layer. The number of output layers is equal to the number of feature classes, and each neuron consists of an activation function and a set of hyper-parameters, such as weight and bias values that need to be defined and tuned for analyzing the data upon sequential training and testing as shown in equation (1). These hyper-parameters help in understanding the input data for fault diagnosis Feed forward neural network.
46

The results of the activation function can be continuous or discrete, so a bias variable is used to categorize the output into classes, either true or false. This method is utilized by iterative training and testing to define the hyper-parameters. The reliable activation functions that produce higher classification accuracy are as follows.
SIGMOID function
The sigmoid function is a logistic function that results in a continuously S-shaped curve. The sigmoid curve runs between 0 and 1. This function can be utilized by biasing the results either 1 or 0, making the output binary. Equation (2) represents the function of the sigmoid function
RELU function
The RELU function is the rectified linear unit function, which outputs 1 for positive and 0 for negative values. The function has no complex mathematics and so takes less time for computation. The function is sparsely activated, which has more predictability and less chance of over-fitting or noise, as shown in equation (3)
Tanh function
The tanh function, as shown in equation (4), is a non-linear function that results in a continuous curve falling between the range [−1 1]. The curve is saturated before −1 and after 1, making it suitable for classification. The gradient is much stronger when compared to sigmoid but also has a vanishing gradient problem
A deep Feed Forward Neural Network (DFFNN) algorithm has more than one hidden layer. The algorithm works on the same concept but involves complex relationships within and between the layers of neurons. This enables the use of low-dimensional data for improved daily detection. 47
These algorithms can be either supervised or unsupervised. The supervised DFFNN uses the known data to train and update the hyper-parameters, which involve user intervention in the learning process as the data has to be predefined for learning. The predefined data consists of data samples mapped to their respective condition classes. In contrast, the unsupervised DFFNN learns upon the unknown data where the data points are unmapped. This self-learning algorithm can be utilized in various automation processes by reducing its neural complexities. The base plot of this algorithm was improvised to many different algorithms by employing various functions, optimizer, stack flow, etc. 48
Architecture of DFFNN
The DFFNN is tested and compared in two forms: Supervised and Unsupervised DFFNN. The supervised DFFNN is programmed with more than one layer, where each layer consists of two parts: Activation and Dropout. A scheduled learning rate and a compatible optimizer can be employed in experimentation to increase classification accuracy. The supervised DFFNN experiments with various library functions to perfect the algorithm to maximize the classification accuracy as much as possible.
Dense layer
The neural networks can represent any complex data that has more non-linearity. Since the complex structure of the algorithm can reduce the efficiency of the training process with overlapping and vanishing gradients, the dense layer was introduced to gain a performance boost. The dense layer is a deeply connected layer with an element-wise activation function. The dense block connects every layer of the algorithm, reducing the vanishing gradient issue and the features being reused and reducing the number of parameters. 49 The number of neurons in the layer defines the shape of the output of the dense layer. The layers accept 2D input and give a 2D output. The dense layer supports a set of arguments used to enhance efficiency, including units, activation functions, initializers, and bias vectors. 50 The activation function selected for this project was “tanh” as it was experimented with to achieve higher accuracy for this data type. The tanh function is an asymptote that can be standardized using a bias vector. 51 The final output layer is activated with the SoftMax function. Each dense block gave an output of 8 units as input to the dropout block.
Dropout block
The complex multilayer algorithm results in over-fitting data in the training process. The over-fitting phenomenon will reduce the efficiency of the learning process. It will result in lower classification accuracy and a higher training period, which can be avoided using a dropout block. The dropout block is a regularization method that reduces over-fitting by avoiding complex co-adaptations on training data. It drops out one unit of hidden and visible layers of a neural network, reducing the complexity of the neural network. This dropout also simulates sparse activation, enabling the network to learn in a sparse method like auto-encoders. 52
Auto-encoder and stacked auto-encoders
The different combinations and process flow of data have led to the development of many algorithms, one of them being auto-encoders. The auto-encoder algorithm is an unsupervised algorithm that receives the input data, encodes them into a compressed version, and decodes them to reconstruct the input, as represented in Figure 4. Auto-encoder.
53

The auto-encoder is efficient when the regression of the output is nearly the same as the input. The auto-encoders are defined to recreate the information to understand the given data, hence better-updating hyper-parameters. The end-to-end structure of the algorithms with proper stacking is encoding and decoding layers, as shown in Figure 5,
54
which can produce better classification accuracy with high-dimensional data and is best suited for highly varying dynamic analysis of a mechanical system. Stacked auto-encoder.
54

Mathematically, the input vector is represented and mapped to hidden layers through a deterministic mapping
The auto-encoder can be represented in two parts: the encoder, defined as y = f(x), and the decoder,
The “a” is an activation function, W is the weight matrix, and b is a bias variable. The encoder compresses the input data to a lower-level data known as the code. In contrast, the decoder consists of the output layers that are inversely mapped to obtain z, as shown in equation (9)
L is the loss function, and the x and z are reconstructed as vectors or probabilities. The loss can further be decreased by stacking the auto-encoder. The stacked auto-encoder (SAE) is a deep learning algorithm that uses stacked layers of the auto-encoder. This system involves layer-wise training where the output of one auto-encoder is the input to the next autoencoder. The algorithm efficiently learns the data pattern and can further precisely tune the parameter, leading to better classification. 55
Among these highlighted activation functions, the tanh and sigmoid are similar, but the tanh is bounded between (−1, 1), whereas the sigmoid is (0, 1), preventing more significant gradient values. The tanh function is centered around the value 0, whereas the sigmoid is 0.5 and so making. Moreover, since the tanh derivative is 1, the W and b in equation (1) are more expensive and quickly updated by the next layer. Hence, tanh is chosen as the primary activation function in the algorithms. The DFFNN is enabled only with the tanh function in every neuron layer. The auto-encoder is also enabled with the tanh function for activation for both encoding and decoding operations, as mentioned in equations (8) and (9). As mentioned above, these algorithms are tested and analyzed to give better results for the performance evaluation of a mechanical gearbox using the acquired data.
Evaluation of algorithm
The algorithm performance assessment was essential to optimize their ability to handle input data efficiently for enhanced fault classification in gearbox systems. The proposed machine learning algorithms will analyze the extracted statistical features for fault detection of the gearbox. The algorithms had to be evaluated under three evaluation conditions: individual data, gear-wise data, and speed-wise data, referred to as condition 1, condition 2, and condition 3, respectively. These conditions help analyze the working efficiency and complexity of the algorithms by which the algorithms can be optimized to use the given gearbox dataset. The hidden relation between the number of layers, steps per iteration, learning rate, weight decay, etc., can be analyzed and set to an optimum value.
Classification accuracy of supervised DFFNN.
It was evident that the algorithm could precisely update with hyper-parameters with increased data samples under condition 3, providing more accuracy than SVM and Decision Tree algorithms. 56 The supervised DFFNN (S-DFFNN) algorithm also suffered high computation time under condition 3 and over-fitting of fault classes due to the increased complexity of the user-defined function in the algorithm. The automation of fault detection of any rotational mechanical system must employ a reduced complex neural network for faster classification involving available input data without human intervention. 57
Accuracy of unsupervised DFFNN.
Accuracy of auto-encoder.
Accuracy of stacked auto-encoder.
Results and discussions
Based on the evaluation, the vibration signals perform best, resulting in higher classification accuracy. Figure 6(a) and (b) represents the average accuracy and time of the proposed algorithms using vibration signals. It is clear from the table that the accuracy of the algorithms increases with the number of data points in the vibration signals. The SAE algorithm produces the highest average accuracy of 99.95 % speed-wise combined. The computation time taken was 253 s for 250 iterations. The SAE is capable of achieving 100 % accuracy with high-dimensional data. The increase in the number of data points achieved better training results in higher accuracy when compared with individual and gear-wise combined data. This could be attributed to the non-linear transformations, data representation, deep architecture, adaptability to data size, regularization, and effective optimization.
58
Accuracy using vibration signal. Here, (a) compares different classifier and their accuracy, (b) compares different classifier and their compilation time for vibration signal.
The sound signal data were not best explored as much as the vibration signal in previous research.27,59 This work was primarily focused on optimizing the algorithms to achieve the highest classification accuracy with sound and AE signals. The average accuracy and time of the algorithms using sound signals is shown in Figure 7(a) and (b). Despite the stacked auto-encoder being the most robust classification algorithm, it can be depicted that the unsupervised DFFNN achieves the highest classification accuracy of 98.68 % with combined sound signal data. This is due to the coding and decoding process of the SAE algorithm, where the algorithm loses the information in the sound signal during the decoding process, resulting in slightly lesser accuracy than the unsupervised DFFNN. However, the unsupervised DFFNN requires a computation time of 600 s, while the SAE takes 262 s under 250 iterations with combined speed-wise data. Accuracy using sound signal. Here, (a) compares different classifier and their accuracy, (b) compares different classifier and their compilation time for sound signal.
The difference in performance between the unsupervised DFFNN and the SAE when classifying sound signal data can be explained by the trade-off between feature representation and information loss during decoding. The DFFNN excels in feature learning and representation but requires more computation time. In contrast, while achieving high accuracy, the SAE may lose some information during the encoding and decoding process but does so with greater computational efficiency. The choice between these models depends on the application’s specific requirements, including the balance between accuracy and computational resources.
The AE features are directly acquired from the AE-DAQ system, so the loss of information in the data is lesser due to the avoidance of the feature extraction technique. Though the supervised algorithms suffered over-fitting with AE signal data, the unsupervised algorithms were primarily built to reduce over-fitting by regularizing the input. Figure 8(a) and (b) illustrates the average accuracy and time of algorithms with AE signal data. It can be inferred that the SAE algorithm can effectively analyze the AE signal data and achieve an average classification accuracy of 96.48 % with speed-wise combined AE signal data. With a sample size of 48,000 sample points, the speed-wise combined data guides the algorithm to achieve the highest classification accuracy with 248 s of computation time under 250 iterations. Accuracy using AE signal. Here, (a) compares different classifier and their accuracy, (b) compares different classifier and their compilation time for AE signal.
The success of the SAE algorithm when applied to AE signal data is consistent with the findings related to other signal types.60–62 The key factors contributing to this success include direct data acquisition, the unsupervised learning advantage, regularization techniques, and access to a substantial volume of data, enabling the SAE to capture relevant information and achieve high classification accuracy effectively. 63 These factors, as outlined by Manikandan and Duraisamy, 64 indicate that SEA is a viable option for fault classification jobs over a wide range of signal types.
Based on the evaluation, it can be proved that the SAE provides higher classification accuracy using vibration and AE signals. The unsupervised DFFNN achieves slightly better classification accuracy than the SAE with sound signals because DFFNNs, with their more profound architecture, are better at capturing the intricacies of complex sound data. However, SAEs excel in computational efficiency due to their design, resulting in significantly shorter processing times. This trade-off is significant as it highlights the choice between higher accuracy (DFFNN) and faster real-time decision-making (SAE) in fault detection systems, with the preference depending on specific application requirements. Hence, the SAE algorithm can be used to automate fault diagnosis.
Conclusion
In this paper, the performance of the proposed deep algorithms for fault detection of the gearbox was optimized and evaluated. The vibration, sound, and acoustic emission signals are acquired from the gears and bearings of the gearbox. According to the literature and previous research, the properties of each signal data were studied to construct different neural networking algorithms. The following conclusions were drawn from this research. • The supervised DFFNN can completely utilize the information in vibration signals. The sound and acoustic emission signals are not entirely utilized, resulting in less classification accuracy. • The unsupervised algorithms are optimized and compared with the classification results using vibration signals to use reliable information on the acoustic emission and sound signals. • The unsupervised neural networks provide an enhanced complex network that enables the network to train efficiently by regularizing the network and reducing the vanishing gradient phenomenon. • The auto-encoder and stacked auto-encoder improve classification by utilizing the information in the acoustic emission signals, producing improved classification accuracy compared to supervised and unsupervised DFFNN. • The stacked auto-encoder is a network with multiple layers of encoder and decoder with suitable optimizer and regularizer, reducing the computation time by 40 % and the memory consumption.
Even though the algorithms excel in their performance, they can be even more optimized to boost the performance, which is aimed to be continued with future research. With this excellent performance of the auto-encoder and SAE, which takes a lesser computation time while using vibration, sound, and acoustic emission signals, the algorithms can be utilized in an automated fault detection system. The real-time monitoring of any rotating mechanical system under dynamic conditions is possible and can be automated to preclude supervision.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
