Abstract
The high-speed train gearbox is one of the key components of the train system, and its working state is related to passengers’ safety. The gearbox shell is the protective shell for gears under harsh and complex service environment. Instantaneous impact of hard material during failure can lead to the rapid damage accumulation and fracture of the shell material. In this article, the accumulation of tensile damage of the high-speed train gearbox shell material has been studied. For the short tensile process leading to significant amount of damage, a real-time and non-destructive detection method has been used to monitor the tensile damage progression and predict the residual life of the material. An automatic optimization algorithm has been proposed to deal with data quality problems such as fluctuations, imbalances, and large intervals. In addition, an automatic life prediction optimization model for material tensile process has been established. The life prediction errors are controlled within 50 s, and the majority of errors are less than 20 s. For the accelerated test, in respect to real situation, a 6 h time slot is designed for disaster control such as passenger evacuation and train failure prevention.
Keywords
Introduction
It is of great importance to study structural health monitoring and prognosis for in-service gearbox of high-speed train. The key material for the high-speed train gearbox shell is high-strength aluminum alloy, such as A356. Although some models have been developed to simulate the tensile damage accumulation, few models have been proposed to investigate the tensile life of high-strength aluminum alloys. During service, the high-speed train gearbox is operated in a harsh environment, where static and dynamic loads promote both tensile damage and fatigue damage. Compared to the fatigue damage process, the tensile damage process is a relatively short term, but may lead to a fatal disaster.
In recent years, many studies have focused on analyzing tensile damage of in-service materials. Qin et al. 1 investigated the plastic damage induced by thermal stresses in a 2024 aluminum alloy plate and established an axisymmetric mathematical model. They showed that the plastic damage for a top hat beam is controlled by the yield time and that it occurs in the top layer of the plate. Darras et al. 2 presented a study on the damage evolution in 5083 marine-grade aluminum alloy deformed at different strain rates. The isotropic damage values were predicted using the hypotheses of strain equivalence and strain energy equivalence, as well as a recently developed energy-based model. Shui et al. 3 developed an experimental method for evaluating material damage due to plastic deformation in AZ31 magnesium–aluminum alloy, based on a nonlinear ultrasonic technique. They showed that the acoustic nonlinearity parameter (ANP) increases monotonically with the level of plasticity (or the tensile stress). He et al. 4 adopted a forming limit stress–based diagram (FLSD) to predict the fracture limit of a sheet aluminum alloy (AA) 5052-O1. The damage evolution was investigated, and the parameters of the Gurson–Tvergaard–Needleman (GTN) model were identified by quantifying the void fraction at three damage stages. Zhu et al. 5 conducted dynamic tensile tests on aluminum alloy (AA) 6061T6 using a high-speed servo-hydraulic machine at intermediate strain rates to validate the testing technique and to investigate the strain-rate effect on the material’s stress–strain behavior and failure mode. They found that the yield strength, ultimate strength, and failure strain were dependent on the strain rate. Presently, a large number of articles have focused on studying the tensile damage accumulation based on material properties. The disadvantages of this kind of research are the destructive and the non-real-time nature. Hence, there is a need to develop an acoustic emission (AE) method to analyze the tensile damage non-destructively in situ.
In order to effectively analyze the accumulation of the tensile damage during service, non-destructive testing (NDT) methods need to be utilized. AE is a typical NDT technique, which is capable of detecting material damage.6,7 Monitoring AE signals in situ can give significant information about the microscopic damage mechanisms involved. 8 It has been shown that the AE technique is sensitive and reliable enough for the detection of material damage of in-service structures,9,10 but building a relationship between the AE signal and the accumulated tensile damage is necessary. Lomov et al. 11 reported results of AE recording for three-dimensional (3D) non-crimp orthogonal woven (3DNCOW) carbon/epoxy composites and correlated the AE to the damage morphology observed in the specimen’s cross sections at characteristic stages of the damage development. Máthis et al. 12 developed an experimental set-up to perform slow strain rate tensile (SSRT) tests on tubular 304L stainless steel (SS) specimens in supercritical water (SCW) environment. The set-up enabled in situ monitoring of AE and electrochemical potential during the SSRT test. The AE response was correlated to changes in the electrochemical potential.
In this article, the short tensile process and resulting damage in high-strength aluminum alloy A356 were studied in real time by a non-destructive detection method, and the residual life of the material was predicted. As the data of the tensile process are imbalanced, that is, the number of data points between the safety stage and alert stage are few, an automatic optimization algorithm has been proposed to deal with this problem. Finally, an automatic life prediction optimization model for the in-service material has been established.
Materials and methods
Tensile test
In all, 16 tensile specimens, geometry given in Figure 1, were wire electro-discharge machined (EDM) from a high-speed train gearbox shell, manufactured from high-strength aluminum alloy A356.

Tensile specimen specifications.
An 810 Material Test System (MTS) was used to carry out the tensile tests up to failure. Once the tensile specimen was mounted in the MTS load frame, Vaseline was used to attach the sensor to the specimen surface to ensure a good connection for enhanced sensitivity. The AE signals were recorded and processed throughout the tensile test using PCI-2 supplied by American PCA company, as shown in Figure 2.

AE instrument.
The tensile test data comprised two parts: the stress–strain data and the AE monitoring data. The stress–strain data were used to obtain actual life of the material, the AE data were used to perform life predictions based on the performance degradation of the tensile damage progress. The two types of data were correlated by the experimental time.
AE technology
AE is defined as a phenomenon of rapid release of energy and generation of a transient elastic wave from a localized source of the material. 13 The AE signal is generated by a change in the defects of the material by the external conditions. It can detect the generation and expansion of micro-scale small cracks, and it can directly reflect characteristics of internal fracture of metallic materials. Parameter analysis and waveform analysis are the two main methods to study AE signals, and the typical parameters are the count, amplitude, duration, and rise time, 14 as shown in Figure 3.

Typical parameters of AE signal.
Tensile test loading condition was 0.4 mm/min. Table 1 shows the parameter settings of AE system.
Acoustic emission system parameter settings.
PDT: peak definition time; HDT: hit definition time; HLT: Hit lockout time.
AE signals of high-strength aluminum alloy tensile damage process
AE data were collected during tensile test. Energy, count, amplitude, rise time, peak frequency, and duration were selected as characteristic parameters to study the tensile damage. Figure 4 shows representative experimental curves of sample 1.

The AE signal features of thermocouple 1: (a) counts, (b) energy, and (c) amplitude.
The transition point from elastic stage to yield stage is the failure point of the tensile process. 13 Hence, the tensile process can be divided into two stages according to the tensile curve, that is, the elastic region is the safe stage and plastic deformation is the alert stage. Accordingly, the AE data in safe stage are short due to the short elastic nature of the material. It has been shown in Ai et al. 13 that the AE signal boundary between elastic stage and plastic stage is not clear, and the AE signals of different samples have the same tendency but significantly different values. Therefore, it is difficult to establish a consistent model with the raw data from the AE equipment. The statistically different dispersion of features such as voids in the material is the cause of the above phenomenon. Besides, Figure 4 shows that tensile damage is a relatively short-termed process and that the volatility of the AE data is relatively high. Therefore, a new method for dealing with the AE signal and predicting the residual life of the material’s tensile response is needed.
Results and discussion
Initial life prediction model
Analysis of the characterization parameter
The AE signal data collected during the tensile tests were built into characteristic parameter set series, namely, C(t) for time t
where
where
In theory, the correlation coefficient series for the same sample should be larger at beginning and fluctuate with each new data acquired. Finally, after a large amount of data has been collected, the correlation coefficients should gradually stabilize. The count, energy, and amplitude have the same tendency, while the rise time reflects the response speed. Furthermore, a given parameter of AE signal would initially vary significantly with rise time and subsequently dampen. Figure 5 shows the results of the correlation coefficients between the AE signals.

The results of the correlation coefficients between AE signals.
It can be seen that the correlation coefficients of rise time versus peak frequency are not consistent with the other curves, which is governed by the characteristics of the peak frequency signal, as shown in Figure 7. The count, energy, duration, and amplitude versus rise time correlation coefficients are very high and have a value of nearly ~1. The curves of rise time versus count, rise time versus energy, rise time versus duration, and rise time versus amplitude show early fluctuation, decreasing to a steady-state value. The demarcation points of the tensile process from elastic stage to yield stage are in oscillation decline phase of the curves. However, near each demarcation point, the correlation coefficient has a vibration, which is mainly due to the fast nature of the tensile test. Hence, the time from elastic stage to yield stage is at a relatively shorter time. In addition, with the dispersion of the casting material, the median values of the above correlation coefficients were chosen in order to reduce the fluctuation of the correlation coefficients in this study.
Compared with the average value of the time series, the median value is relatively more concentrated; hence, it was selected for the correlation coefficient of the median time series, Crisemedian
The Crisemedian of a selected tensile test is shown in Figure 6.

The Crisemedian of tensile test.
It can be seen that the Crisemedian time series is initially volatile tending to a more steady value with time. The transition point of elastic stage to yield stage was at the oscillatory decreased stage, and the values of the samples at the demarcation point are relatively closed. Therefore, if the curve of the fluctuation stage before the oscillatory decreased stage can be removed, it is possible to characterize the tensile process with the AE parameters. Figure 7 shows the peak frequency of an AE signal.

The peak frequency of AE signals.
It can be seen from Figure 7 that the peak frequency signal has a relatively high value just before the transition point and exhibits two relatively concentrated values: the high value and low value of different sizes. So, the peak frequency signal of the first time can be considered as the trigger signal for damage identification and life prediction during tension. According to the 16 tests, the values of the peak frequency are all higher than 155, which can be used to judge the high value of peak frequency.
To establish life prediction model
The values of Crisemedian at the transition point of the 16 tests are concentrated. CTpoint has been defined as the criterion for tensile damage, which is a statistical result of Crisemedian at the transition point. So, CTpoint is a value with a certain distribution and reflects the distribution of the Crisemedian value at the transition point 15 of the 16 tensile test data that have been used for calculating the CTpoint and its probability distribution,

The detailed algorithm flow of the initial life prediction model.
The exponential curve fitting uses the form of f(x) = a*exp (b*x), where a and b are the fitting function parameters. After the peak frequency has been triggered successfully, the data 10 s before the transition point were used for the curve fitting. Then, the fitting curve can be compared with the original Crisemedian time series.
Figure 9 shows the prediction result and the real value, and the dotted lines correspond to the transition points of the prediction result and the real value. More prediction result and fitting information are shown in Table 2.

The comparison of prediction result and real-time series Crisemedian.
The prediction result of initial tensile life prediction model.
It can be seen from Table 2 that the prediction results obtained by conventional exponential curve fitting is not ideal. Most errors of the samples are within 30 s, some prediction results are after the transition point, and some results of the samples have a large deviation. Careful analysis revealed that these problems are mainly caused by three reasons: first, the sample data near the transition point fluctuates and decreases, which creates errors in the fitting result; second, the data of some samples for fitting are so little to the inaccuracy of the result; finally, because of the dispersion of the material, the values of Crisemedian time series near the transition point are overall either higher or lower than the CTpoint value, which creates the large deviations. It is therefore necessary to come up with a targeted solution for improving the accuracy of life prediction model.
Automatic optimization for the life prediction model
Automatic optimization algorithm for few and imbalanced data
Based on the life prediction results of the tensile damage and the analysis of the reasons of the larger deviations, an adaptive optimization method of existing models has been used as follows.
First, for the oscillation problem in the decreased process of the sample data, the smoothing method has been used to avoid increasing adjacent elements and reducing the prediction accuracy of the life prediction model. The moving average method is used to smooth the sample data which are required to be fitted by exponential fitting, and the result will be used as the input for the subsequent interpolation and standard deviation compensation.
Second, for the problem of few samples and the imbalanced data, when the adjacent AE signal time interval is greater than 3 s, the interpolation method has been used for increasing the sample size. The time series trend of Crisemedian near the demarcation point is oscillatory downward. Therefore, in the relatively small interval of two adjacent elements of the Crisemedian time series, it can be considered as a linear relationship. So, the linear interpolation method can be used for interpolation, and the difference interval is 1 s:
if
to run with the interpolation method to increasing the sample size
end
Third, it can be seen from Table 3 that the normal distribution variance of the material failure criteria CTpoint is very low, where
If
% when certain sample value are overall higher or lower than sample mean value
If
else
end
end
The results of automatic optimization life prediction model.
Fourth, after a tensile process, the new failure data are used to modify the failure criteria of the characteristic parameters as well as to modify the normal probability distribution of the original failure criteria.
The detailed algorithm flow of automatic optimization life prediction model for the gearbox shell tensile damage is shown in Figure 10.

The detailed algorithm flow of automatic optimization life prediction model for gearbox shell tensile damage process.
Automatic optimization result and analysis
The results of automatic optimization life prediction model are shown in Table 3.
It can be seen from Table 3 and Figure 11 that the predictive accuracy of automatic optimization life prediction model significantly improved, which is due to the smoothing, interpolation, and overall compensation method for the data. Some results of certain samples have not improved, which is because the automatic optimization method proposed in this study is not in all cases to be adaptively optimized and can only optimize situations such as large oscillation of two adjacent elements, lager time intervals of two adjacent elements, and characterization parameters needing overall compensation.

The comparison of life prediction model before and after the automatic optimization.
It can be seen from the result of the automatic optimization life prediction model that the life prediction errors are controlled within 50 s, and most of the test data error is less than 20 s. According to the acceleration test design relationship, 20 s error roughly corresponds to 1400 km or 6 h in real operation situation. There is also the prediction result which is after the presence of the real time of the demarcation point, but compared with the whole process of tensile damage, the material is in the plastic stage of the tensile damage process in the lag time of prediction and have not yet entered the fracture stage. So, it will not bring the final burst failure and accident disaster.
Conclusion
In this article, the tensile damage accumulation of a high-speed train gearbox shell material has been studied. For the short tensile process leading to significant damage, a real-time and non-destructive detection method has been used to monitor the tensile damage accumulation and to predict the residual life of the material. For cases where the data of the tensile process are not balanced and the volume of the data at the demarcation point from safety stage to alert stage is low, an automatic algorithm has been proposed to deal with the problem. The algorithm deals with the data fluctuations, imbalances, and large intervals by smooth, interpolation, compensation, and other methods. Based on the above techniques, the automatic optimization life prediction model for material tensile process has been established. The life prediction errors are controlled within 50 s, and the majority of errors are less than 20 s. The test is accelerated, and when compared with the real operation situation, there are about 6 h left for evacuation of passengers and handling failures of the high-speed train.
Footnotes
Handling Editor: Wenbing Zhao
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The financial support for this study was provided by the National Natural Science Foundation of China (Grant No. 61273205), the Fundamental Research Funds for the Central Universities of China (Grant No. FRF-SD-12-028A), and the 111 Project (Grant No. B12012).
