Non-parametric load extrapolation based on load extension for semi-axle of wheel loader

Abstract

Load extrapolation is the paramount step in compiling load spectra of the mechanical components. To avoid the limitation of the stationary processes in parametric extrapolation methods, the non-parametric extrapolation method is widely investigated in recent years. However, the accuracy of kernel density estimation of the large load cycles in existing non-parametric methods should be still improved. Aiming at this issue, a non-parametric rain-flow extrapolation method based on load extension is presented. In this method, combining with non-parametric extrapolation model and Markov chain Monte Carlo method, the measured load–time histories for semi-axle in different operating sections are extended and extrapolated. The extrapolated load spectra curves and the relative damage ratio are compared with the results obtained by the existing rain-flow extrapolation methods. The results show that the proposed method can realize the reasonable extrapolation of load–time histories.

Keywords

Non-parametric load extrapolation load extension load spectra kernel density estimation wheel loader

Introduction

Engineering vehicles such as mining truck, wheel loader, excavator, and pump truck are widely used for material exploitation and industrial construction. These vehicles always undertake heavy tasks and work in harsh environments. Particularly, the wheel loader will experience load change frequently during the operation of spading and loading. Hence, fatigue failure of the components of wheel loader often occurs.¹ Fatigue life test and prediction based on the load spectra are topics that have received continuous attention. Since the load measurement for wheel loader is very time consuming and of high cost, only short-term load–time histories (L-THs) can be obtained. However, the short-term L-THs do not contain the full-load characteristics in the whole service life. In order to use the measured L-THs to the fatigue test, load extrapolation is indispensable in the process of load spectra compilation.

The rain-flow counting method² is used to convert the measured L-THs into load cycles in rain-flow domain before load extrapolation. Recently, there are a variety of extrapolation methods for load cycles, such as parametric method and non-parametric method.³ For the parametric rain-flow extrapolation (PRFE) method, probability distributions of load range and mean should be known.^4,5 Considering the limitation of unimodal distributions in describing load statistical characteristics accurately, multi-modal distributions were applied to load extrapolation.^6,7 Along with the wide use of non-parametric density estimation methods, a non-parametric rain-flow extrapolation (NPRFE) method based on kernel density estimation (KDE) was put forward.⁸ To extrapolate large load cycles, extreme value theory was applied and a limiting rain-flow extrapolation (LRFE) method was proposed.^9,10 Furthermore, an NPRFE method which involves the modification of extrapolated load values was studied.¹¹ In the modification process, Weibull distribution was used to fit load spectra curves. Besides, an evaluation criterion based on multi-criteria decision-making technology was provided to select the most appropriate model from the existing KDE models for NPRFE.¹²

Compared with parametric models in the PRFE and LRFE, KDE model may be more suitable for the extrapolation of complex and random loads. It is data-driven as it does not need to make a hypothesis distribution for load samples and then the probability density of estimation point is obtained. However, the accuracy of KDE will be decreased if the load samples are not enough.¹³

In order to extrapolate the measured L-THs with different amplitudes reasonably, NPRFE method based on load extension is proposed. First, considering the choice of kernel function, adaptive bandwidth matrix, and bandwidth calculation method, NPRFE model based on full bandwidth matrix (FBM) is built. Then, Markov chain Monte Carlo (MCMC) method is applied to extend the measured L-THs to obtain the optimal probability density by KDE. Based on this optimal probability density, the measured L-THs can be extrapolated with the anticipative fold. Furthermore, the measured L-THs for the semi-axle of a wheel loader are extrapolated. The validity of proposed method is evaluated through the extrapolated load spectra curves and relative damage ratio (RDR).

NPRFE modeling

The density estimation of the measured L-THs based on KDE is a critical step in NPRFE. Suppose that f is the probability density of d-dimensional random vector $x$ and $x_{1}, x_{2}, \dots, x_{n}$ denote load samples from $x$ , $x_{i} = (x_{i 1}, x_{i 2}, \dots, x_{id})^{T}$ . Then the multivariate KDE of $f (x)$ can be calculated by¹⁴

f (x) = \frac{1}{n} \sum_{i = 1}^{n} K_{H} (x - x_{i})

(1)

where $K_{H} (x) = {| H |}^{- 1 / 2} K (H^{- 1 / 2} x)$ and $H$ is a $d \times d$ bandwidth matrix. When the kernel function $K (x)$ and bandwidth matrix $H$ are determined, the density of random vector $x$ can be estimated by equation (1).

Selection of kernel function

The standard probability distribution functions are often selected as kernel functions. The shapes and expressions of univariate kernel functions, which are commonly used, are shown in Figure 1 and Table 1. They express the influence extent of data near the density estimation point. To select an appropriate kernel function, a mixed normal density function is estimated by different kernel functions with adaptive bandwidth calculated by L-stage direct plug-in (LDPI) method.¹⁴ Obviously, the KDE results (see Figure 2) are almost the same. Here, Gaussian kernel is selected in KDE model for load extrapolation. The expression of bivariate Gaussian kernel is

K (x) = \frac{1}{2 π} e^{- \frac{1}{2} x^{T} x}

(2)

With equations (1) and (2), the KDE of $f (x)$ based on Gaussian kernel can be calculated by

\begin{array}{l} \hat{f} (x; H) = \frac{1}{n} \sum_{i = 1}^{n} {| H |}^{- 1 / 2} K [H^{- 1 / 2} (x - x_{i})] \\ = \frac{{| H |}^{- 1 / 2}}{2 π n} \sum_{i = 1}^{n} \exp [- \frac{1}{2} {(x - x_{i})}^{T} H^{- 1} (x - x_{i})] \end{array}

(3)

Figure 1.

Shapes of different univariate kernel functions.

Table 1.

Expressions of different univariate kernel functions.

Kernel function	$K (x)$
Rectangular	1/2
Triangular	$1 - \| x \|$
Epanechnikov	$\frac{3}{4} (1 - x^{2})$
Gaussian	$\frac{1}{\sqrt{2 π}} e^{- \frac{1}{2} x^{2}}$
Biweight	$\frac{15}{16} (1 - x^{2})^{2}$
Triweight	$\frac{35}{32} (1 - x^{2})^{3}$
Logistic	$\frac{1}{e^{x} + 2 + e^{- x}}$

Figure 2.

KDE by different kernel functions with adaptive bandwidth values.

In order to study the influence of different bandwidth values on KDE accuracy, the above mixed normal density function is estimated based on Gaussian kernel and different bandwidth values. As shown in Figure 3, the estimated densities are close to the real values only when bandwidth is adaptive and reasonable; otherwise, it would result in a great error. In fact, the selection of bandwidth value cannot guarantee that the deviation and variance of KDE decrease at the same time. If the bandwidth value is too small, the data near estimation point will get a large probability, which will lead to small deviation but large variance. While the bandwidth value is too large, a wide range of data will have an effect on estimation point, which will lead to small variance but large deviation.⁹ Since a bandwidth that is too large or too small will result in the decrease of KDE precision, the bandwidth is a key parameter in KDE model.

Figure 3.

KDE by Gaussian kernel with different bandwidth values.

Determination of adaptive bandwidth matrix

The bandwidth matrix in bivariate KDE model can be selected from the following three forms:¹⁵

Single bandwidth matrix (SBM): $S_{2} = (\begin{matrix} h^{2} & 0 \\ 0 & h^{2} \end{matrix}), h > 0$

Double bandwidth matrix (DBM): $D_{2} = (\begin{matrix} h_{1}^{2} & 0 \\ 0 & h_{2}^{2} \end{matrix}), h_{1}, h_{2} > 0$

FBM: $F_{2} = (\begin{matrix} h_{1}^{2} & h_{12} \\ h_{12} & h_{2}^{2} \end{matrix}), h_{1}, h_{2} > 0, | h_{12} | < h_{1} \cdot h_{2}$

Comparing with $S_{2}$ and $D_{2}$ , $F_{2}$ is more flexible and can guarantee the smoothness in any direction. Therefore, $F_{2}$ is selected as the form of bandwidth matrix in KDE model. The unknown parameters $h_{1}$ and $h_{2}$ in $F_{2}$ can be calculated by the calculating method of bandwidth, and $h_{12}$ is determined by¹⁵

h_{12} = w_{12} h_{1} h_{2}

(4)

Λ = (\begin{matrix} ρ_{11} & ρ_{12} \\ ρ_{12} & ρ_{22} \end{matrix})

(5)

w_{12} = \frac{ρ_{12}}{{(ρ_{11} ρ_{12})}^{1 / 2}}

(6)

where $Λ$ and $w_{12}$ are the covariance and the correlation coefficient of load samples, respectively.

To guarantee the KDE precision of the measured L-THs, bandwidth values should be different in the sparse and dense areas. Thus, an adaptive factor $η_{i}$ ¹⁶ is introduced

η_{i} = {\frac{f (x_{i})}{\frac{1}{n} \sum_{i = 1}^{n} \ln [f (x_{i})]}}^{ε}

(7)

where $ε$ is the sensitivity coefficient ranging from 0 to 1. Here, it is set to 0.5 in KDE model according to the recommendations in WAFO.¹⁶ Besides, $x_{i}$ denotes the load which has an effect on the KDE of estimation point. It varies at different points, which will lead to the change of $η_{i}$ . The adaptive bandwidth matrix is¹⁶

H_{2 i} = η_{i}^{2} (\begin{matrix} h_{1}^{2} & w_{12} h_{1} h_{2} \\ w_{12} h_{1} h_{2} & h_{2}^{2} \end{matrix})

(8)

With equations (3) and (8), the adaptive KDE model is given by

\hat{f} (x; H) = \frac{{| H_{2 i} |}^{- 1 / 2}}{2 π n} \sum_{i = 1}^{n} \exp [- \frac{1}{2} {(x - x_{i})}^{T} H_{2 i}^{- 1} (x - x_{i})]

(9)

Comparisons of the calculating methods of bandwidth

To select the optimal calculating method of the bandwidth values $h_{1}$ and $h_{2}$ , least squares cross-validation (LSCV) method,^17,18 biased cross-validation (BCV) method,¹⁹ rule-of-thumb (ROT) method,²⁰ and LDPI method¹⁴ are compared. Four bivariate density functions $F_{1}, F_{2}, F_{3}, and F_{4}$ ^21–24 are provided to analyze the performance of the above four calculating methods.

First, samples with size N (N = 20, 50, 100, 200, and 500) are generated randomly from the above four bivariate density functions. Then the densities are estimated by KDE with 100 repetitions. The results are shown in Table 2. It is observed that the mean values of root-mean-square error (RMSE) in ROT are larger than those in other calculating methods. To compare the calculation results of the above methods visually, the box plots of $\log_{e} (RMSE)$ for the above four density functions with sample size $N = 500$ are drawn. For each density function in Figure 4, the median and interquartile range values of $\log_{e} (RMSE)$ in LDPI are almost the same as those in LSCV and BCV. However, there are many outliers caused by LSCV and BCV. Thus, ROT will result in large deviation. LSCV and BCV will result in large variance. LDPI may be a good choice for calculating bandwidth.

Table 2.

Mean values and standard deviation values of RMSE in parentheses based on 100 repetitions for four bivariate density functions.

F	N	LSCV	BCV	ROT	LDPI
F ₁	20	0.0728 (0.0986)	0.0560 (0.0378)	0.0709 (0.0078)	0.0490 (0.0207)
	50	0.0476 (0.0385)	0.0462 (0.0338)	0.0640 (0.0050)	0.0354 (0.0102)
	100	0.0424 (0.0424)	0.0318 (0.0147)	0.0595 (0.0031)	0.0287 (0.0067)
	200	0.0291 (0.0225)	0.0250 (0.0084)	0.0552 (0.0026)	0.0245 (0.0053)
	500	0.0243 (0.0159)	0.0187 (0.0069)	0.0482 (0.0017)	0.0183 (0.0027)
F ₂	20	0.0417 (0.0633)	0.0326 (0.0224)	0.0373 (0.0042)	0.0233 (0.0082)
	50	0.0251 (0.0232)	0.0213 (0.0125)	0.0362 (0.0025)	0.0175 (0.0038)
	100	0.0203 (0.0176)	0.0163 (0.0064)	0.0335 (0.0019)	0.0146 (0.0020)
	200	0.0160 (0.0107)	0.0144 (0.0061)	0.0311 (0.0014)	0.0123 (0.0018)
	500	0.0111 (0.0067)	0.0096 (0.0024)	0.0278 (0.0008)	0.0094 (0.0014)
F ₃	20	0.0995 (0.1108)	0.0602 (0.0484)	0.0805 (0.0090)	0.0474 (0.0095)
	50	0.0533(0.0458)	0.0479 (0.0281)	0.0756 (0.0057)	0.0380 (0.0065)
	100	0.0412 (0.0240)	0.0394 (0.0218)	0.0714 (0.0043)	0.0314 (0.0048)
	200	0.0317 (0.0170)	0.0295 (0.0128)	0.0680 (0.0026)	0.0256 (0.0035)
	500	0.0214 (0.0086)	0.0207 (0.0060)	0.0629 (0.0017)	0.0193 (0.0026)
F ₄	20	0.0346 (0.0148)	0.0312 (0.0090)	0.0455 (0.0057)	0.0311 (0.0056)
	50	0.0294 (0.0111)	0.0262 (0.0050)	0.0434 (0.0035)	0.0280 (0.0037)
	100	0.0254 (0.0066)	0.0239 (0.0035)	0.0422 (0.0023)	0.0264 (0.0022)
	200	0.0228 (0.0047)	0.0205 (0.0025)	0.0405 (0.0018)	0.0238 (0.0017)
	500	0.0180 (0.0028)	0.0173 (0.0027)	0.0384 (0.0011)	0.0206 (0.0010)

RMSE: root-mean-square error; LSCV: least squares cross-validation; BCV: biased cross-validation; ROT: rule-of-thumb; LDPI: L-stage direct plug-in.

Figure 4.

Box plots of $\log_{e} (RMSE)$ for four density functions with sample size N = 500.

In addition, the comparison of computation time of four calculating methods is shown in Table 3. It shows that LDPI and ROT are less time consuming than LSCV and BCV. Combining the KDE accuracy and the computational efficiency, LDPI method is selected to calculate the parameters $h_{1}$ and $h_{2}$ .

Table 3.

Comparison of computation time of four calculating methods.

	LSCV	BCV	ROT	LDPI
Time (s)	1251.337	2385.452	19.693	22.420

LSCV: least squares cross-validation; BCV: biased cross-validation; ROT: rule-of-thumb; LDPI: L-stage direct plug-in.

NPRFE method based on load extension

The KDE precision and load extrapolation accuracy are influenced by load samples as well as the NPRFE model. Only when the load cycles are enough, the KDE precision can be guaranteed. Compared with medium and small load cycles, large load cycles will lead to a large amount of fatigue damage although it does not occur frequently (see Figure 5). Thus, MCMC simulation is used to extend the measured L-THs to improve the KDE precision and extrapolation accuracy of load cycles.

Figure 5.

Damage contributions of different load cycles.

Brief overview of MCMC simulation

For arbitrary $n \in T$ , the discrete-time series ${X_{n}, n = 0, 1, 2, \dots}$ is called a Markov chain; if for all $i_{0}, i_{1}, i_{2}, \dots, i_{n} \in I$ , the following is true²⁵

\begin{array}{l} P {X_{n} = i_{n} | X_{0} = i_{0}, X_{1} = i_{1}, \dots, X_{n - 1} = i_{n - 1}} \\ = P {X_{n} = i_{n} | X_{n - 1} = i_{n - 1}} = p_{i j n} \end{array}

(10)

where $p_{ijn}$ is called one-step state transition probability, which is the condition probability from state $i_{n - 1}$ at time $n - 1$ to state $i_{n}$ at time n. The Markov chain can be called as homogenous Markov chain if it does not depend on time unit, which implies that

\begin{matrix} P {X_{n} = i_{n} | X_{0} = i_{0}, X_{1} = i_{1}, \dots, X_{n - 1} = i_{n - 1}} \\ = P {X_{n} = i_{n} | X_{n - 1} = i_{n - 1}} = p_{ij} \end{matrix}

(11)

Thus, the homogeneous one-step state transition probability $p_{ij} = p_{ijn}$ , and $p_{ij}$ satisfies the following conditions: $0 \leq p_{ij} \leq 1$ and $\sum_{j = 1} p_{ij} = 1, i = 1, 2, \dots, n$ . Moreover, the one-step transition probability matrix P is

P = (\begin{matrix} p_{11} & p_{12} & \dots & p_{1 n} \\ p_{21} & p_{22} & \dots & p_{2 n} \\ \dots & \dots & \dots & \dots \\ p_{n 1} & p_{n 2} & \dots & p_{nn} \end{matrix})

(12)

Regarding the measured L-THs for semi-axle, the Markov chain model can be created by extracting turning points²⁶

{X_{t}} = {X_{t_{1}}, X_{t_{2}}, X_{t_{3}}, X_{t_{4}}, \dots} = {l_{1}, L_{1}, l_{2}, L_{2}, \dots}

(13)

where $l_{1}, l_{2}, \dots$ represent the local minimum points, while $L_{1}, L_{2}, \dots$ represent the local maximum points.

According to the load cycles counting, Markov matrix F is obtained, and it can be converted into the one-step transition probability matrix by²⁵

P (i, j) = \frac{F (i, j)}{\sum_{l = 1}^{n} F (i, l)}

(14)

Then the stationary distribution of Markov chain model is created by calculating the one-step transition probability, and the extension of random load samples based on Monte Carlo is obtained.

Extrapolation of the extended load

The steps of NPRFE method based on load extension are shown as follows.

Load extension. The turning points of the measured L-THs are extracted. Then, new turning points are generated through the MCMC simulation process until the new samples meet the demands of extrapolation.

Load preprocessing. The new turning points are converted into load cycles in rain-flow domain by rain-flow counting method. Considering that the small load cycles have little effect on fatigue damage, the load cycles below a certain threshold should be filtered. For example, the load cycles with a range less than 10% or 50% of the maximum load cycle can be eliminated,^27,28 or the 50% of the material endurance limit can be set as the filtering threshold.²⁹ The 10% of the maximum load cycle is selected as the threshold to remove small load cycles.

Density estimation. The density of load cycles in rain-flow domain is estimated according to NPRFE model created at section “NPRFE modeling”, and the optimal probability density of load samples can be obtained.

Rain-flow matrix (RFM) extrapolation. According to the optimal probability density, Monte Carlo method is used to generate new load cycles randomly, and the RFM is extrapolated with certain folds.

The flowchart of NPRFE method based on load extension is shown in Figure 6.

Figure 6.

Flowchart of NPRFE method based on load extension.

Case study

This article takes wheel loader as an example to verify the proposed NPRFE method. The measured L-THs for semi-axle in spading section and back section with full load are selected (see Figures 7 and 8). Through rain-flow counting and small load cycles filtering methods, rain-flow matrices (RFMs) in two operation sections are obtained (see Figure 9).

Figure 7.

L-THs for semi-axle in spading section.

Figure 8.

L-THs for semi-axle in back section with full load.

Figure 9.

RFMs of the measured L-THs: (a) spading section and (b) back section with full load.

Direct NPRFE for L-THs

To demonstrate the performance of the FBM-based NPRFE model, the densities of the measured RFMs are estimated by direct NPRFE method which only uses the FBM-based NPRFE model without load extension. Then the measured load cycles are extrapolated with 10 folds and the results are shown in Figure 10. Compared with RFMs shown in Figure 9, the extrapolated RFMs include a lot of new load cycles. However, it is difficult to evaluate whether they are reasonable or not.

Figure 10.

RFMs extrapolated by direct NPRFE with 10 folds: (a) spading section and (b) back section with full load.

Here, load spectra curve is applied to describe the effect of extrapolation. At the same time, the measured load cycles are also extrapolated by the existing SBM-based NPRFE model and “mileage extrapolation” method which means that the measured load cycles are only duplicated for several folds. The comparisons of different extrapolation curves in two sections are shown in Figure 11. Compared with the SBM-based NPRFE curve, the FBM-based NPRFE curve is more close to mileage extrapolation curve in medium and small load cycles. Thus, it can be deduced that the FBM-based NPRFE model is more reasonable and accurate than the SBM-based NPRFE model.

Figure 11.

Load cycles extrapolated by different models with 10 folds: (a) spading section and (b) back section with full load.

Considering that the LRFE method can obtain a good extrapolation effect in large load cycles, the direct NPRFE method should be compared with it, especially in large load cycles. The load spectra curves extrapolated by LRFE method and direct NPRFE method with 10 folds are shown in Figures 12. For both the sections, the extrapolation of the measured load cycles by direct NPRFE method is repeated 10 times, and 10 different load spectra curves are obtained. According to the part of the large amplitude in load spectra curves, almost all of the amplitude values extrapolated by direct NPRFE are larger than those extrapolated by LRFE. Therefore, the direct NPRFE method is not very perfect for extrapolating large load cycles.

Figure 12.

Load cycles extrapolated by LRFE and direct NPRFE with 10 folds: (a) spading section and (b) back section with full load.

NPRFE method based on load extension for L-THs

Aiming at improving the extrapolation effect of large load cycles, the NPRFE method based on load extension is applied and verified. When more load samples are extended, then higher accuracy of KDE can be guaranteed. However, the extrapolated results are perfect when the measured load samples are extended with 100 folds in this case. Extension more than 100 folds is time consuming and meaningless. Considering computation time, the extended fold is set to 100 here.

After extension, load preprocessing, density estimation, and extrapolation with 10 folds for the measured L-THs for the semi-axle in spading section and back section with full load, the RFMs are obtained and shown in Figure 13.

Figure 13.

RFMs extrapolated by NPRFE based on load extension with 10 folds: (a) spading section and (b) back section with full load.

Similarly, the LRFE method is used as reference. The load spectra curves extrapolated by LRFE method and NPRFE method based on load extension with 10 folds are shown in Figure 14 (the NPRFE method based on load extension is repeated 10 times). According to the part of the large amplitude in load spectra curves, all of the amplitude values obtained by proposed NPRFE method are similar to LRFE results. Compared with the results shown in Figure 12, the extrapolation performance for large load cycles is obviously improved.

Figure 14.

Load cycles extrapolated by LRFE and NPRFE based on load extension with 10 folds: (a) spading section and (b) back section with full load.

To verify the validity of extrapolation effect of this new method further, extrapolation with 100 folds is also simulated (see Figure 15). Compared with direct NPRFE, NPRFE method based on load extension is more reasonable because its extrapolation results are more close to LRFE results in large load cycles.

Figure 15.

Load cycles extrapolated by different methods with 100 folds: (a) spading section and (b) back section with full load.

Similar to the load spectra curve, the damage is another evaluation indicator for the validity of proposed NPRFE method. Nominal stress method is commonly applied for fatigue life calculation of mechanical components. For the measured L-THs, the nominal damage is calculated. Basquin’s equation of Waller curve is

N_{i} = τ \cdot S_{i}^{- γ}

(15)

where $N_{i}$ is the number of load cycles of fatigue failure under a certain load amplitude. $S_{i}$ is a certain load amplitude, $τ$ is related to the materials of components, and $γ$ is damage index which reflects the types of components. The damage D caused by load cycles with different amplitudes is obtained according to Palmgren–Miner rule

D = \sum_{i} \frac{n_{i}}{N_{i}} = \frac{1}{τ} \cdot \sum_{i} n_{i} \cdot S_{i}^{γ}

(16)

where $n_{i}$ is the frequency of $S_{i}$ . Here, RDR C (equation (17)) is used to compare the performance of different rain-flow extrapolation methods. It can be calculated by

C = D_{extra} / D_{measure} = \sum_{j} n_{j} \cdot S_{j}^{γ} / \sum_{i} n_{i} \cdot S_{i}^{γ}

(17)

where $γ$ is generally determined by experience. For the components with smooth surface, $γ = 7$ .³⁰ The damage index of the semi-axle of wheel loader is also set to seven here.

L-THs for semi-axle are extrapolated by LRFE, direct NPRFE, and NPRFE method based on load extension with fold x (x = 10, 20, …, 100), and then the RDR values caused by these extrapolation load cycles are acquired. Figure 16 shows that the RDR values calculated by NPRFE method based on load extension are more close to LRFE results than those calculated by direct NPRFE. Thus, NPRFE method based on load extension can realize a reasonable extrapolation.

Figure 16.

RDR values calculated by different extrapolation methods: (a) spading section and (b) back section with full load.

Discussion

As previously stated, both the extrapolation model and the size of the measured L-THs are critical in the process of NPRFE. To achieve the reasonable extrapolation of the measured L-THs, these two factors are taken into account in this article.

LRFE method has the advantage in extrapolating large load cycles. In this method, large load cycles’ extrapolation is based on peak over threshold model. Generalized Pareto distribution is used to fit the large load cycles above a threshold, and then the probability distribution is obtained. Through probability multiplied by total frequency, the extrapolation load cycles are obtained. When the threshold and parameter estimation method in that model are the same, the load spectra curve after extrapolation will not change.

NPRFE method based on load extension does not need to assume that the load samples obey a certain distribution. In this method, MCMC method is used in load extension, and Monte Carlo method is used in RFM extrapolation to generate new load data randomly. When the NPRFE method based on load extension is repeated 10 times, the load spectra curves are diverse (see Figure 14). It reflects the randomness of load samples. Compared with LRFE method, NPRFE method based on load extension can obtain a reasonable extrapolation, and the superiority of data-driven property of KDE is retained. Thus, NPRFE method based on load extension may be better than LRFE method for the extrapolation of random and complex L-THs.

The L-THs analyzed here are obtained from the load measurement for the semi-axle of a wheel loader. In the process of measurement, 100 data points are gathered per second, and the total time of each section is 750 s. Thus, 75001 data points were gathered and used for analysis. They satisfy the demands of MCMC simulation, and the final extrapolation effect is perfect. However, if the load samples are too small and cannot reflect their distribution characteristics, the extrapolated results are unreasonable even though MCMC simulation is done in the process of extrapolation. If the load samples are large enough and meet the demands of KDE, MCMC simulation seems to be unnecessary for load extrapolation. Therefore, further research may be emphasized on the load extrapolation considering the influence of the size of original samples.

Conclusion

Load extrapolation is the crucial step in compiling the long-term load spectra of mechanical components for fatigue life prediction and bench test loading. NPRFE method based on KDE was gradually investigated in recent years as its extrapolation process is data-driven. However, the estimation accuracy of this method in large load cycles is limited and still needs to be improved. To overcome this problem, this study proposed an NPRFE method based on load extension.

The measured L-THs for the semi-axle of a wheel loader in spading section and back section with full load were used as an example to verify the reasonability and effectiveness of the proposed method. The comparison results between the FBM-based NPRFE curve and the SBM-based NPRFE curve show that NPRFE model greatly influences KDE precision and extrapolation accuracy. The comparison results between direct NPRFE and NPRFE method based on load extension show that the size of load samples has a great effect on the extrapolation of the measured L-THs. Based on load extension, the proposed method, which takes NPRFE model and the size of load samples into account, can realize a reasonable extrapolation for large load cycles as well as other cycles. Because any parametric distribution is not applied for fitting load samples in the proposed NPRFE method, it can decrease the subjectivity and prejudice. Besides, as NPRFE method based on load extension is not restricted by sample distribution, it may be flexibly used in the extrapolation of random and complex loads.

Footnotes

Academic Editor: Filippo Berto

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (grant nos 51375202 and 51265020) and the Graduate Innovation Fund of Jilin University (project 2016074).

References

Wang

. Multi-criteria decision-making method-based approach to determine a proper level for extrapolation of Rainflow matrix. Proc IMechE, Part C: J Mechanical Engineering Science 2012; 26: 1547–1554.

Grubisic

Determination of load spectra for design and testing. Int J Vehicle Des 1994; 15: 8–26.

Wang

Chen

. A review of the extrapolation method in load spectrum compiling. Stroj Vestn: J Mech E 2016; 62: 60–75.

Fajdiga

Jurejevčič

Kernc

Reliability prediction in early phases of product design. J Eng Des 1996; 7: 107–128.

Rodzewicz

Determination and extrapolation of the glider load spectra. Aircr Eng Aerosp Tec 2008; 80: 487–496.

Nagode

Fajdiga

A general multi-modal probability density function suitable for the rainflow ranges of stationary random processes. Int J Fatigue 1998; 20: 211–223.

Nagode

Klemenc

Fajdiga

Parametric modelling and scatter prediction of rainflow matrices. Int J Fatigue 2001; 23: 525–532.

Dressler

Gründer

Hack

. Extrapolation of rainflow matrices. SAE technical paper 1996-960569, 1996.

Johannesson

Thomas

JJ.

Extrapolation of rainflow matrices. Extremes 2001; 4: 241–262.

10.

Johannesson

Extrapolation of load histories and spectra. Fatigue Fract Eng M 2006; 29: 209–217.

11.

Socie

Pompetzki

MA.

Modeling variability in service loading spectra. J ASTM Int 2004; 1: 1–12.

12.

Wang

Liu

Zeng

. Selection method for kernel function in nonparametric extrapolation based on multicriteria decision-making technology. Math Probl Eng 2013; 2013: 391273 (11 pp.).

13.

Research on non-parametric extrapolation method in compiling load spectra of wheel loader. Master’s Thesis, Jilin University, Changchun, China, 2016.

14.

Wand

Jones

MC.

Kernel smoothing. New York: CRC Press, 1994, pp.71, 91.

15.

Wand

Jones

MC.

Comparison of smoothing parameterizations in bivariate kernel density estimation. J Am Stat Assoc 1993; 88: 520–528.

16.

WAFO Group. WAFO—a MATLAB toolbox for analysis of random waves and loads. Lund: Lund Institute of Technology, 2000.

17.

Rudemo

Empirical choice of histograms and kernel density estimators. Scand J Stat 1982; 9: 65–78.

18.

Bowman

AW.

An alternative method of cross-validation for the smoothing of density estimates. Biometrika 1984; 71: 353–360.

19.

Scott

Terrell

GR.

Biased and unbiased cross-validation in density estimation. J Am Stat Assoc 1987; 82: 1131–1146.

20.

Cao

Cuevas

Manteiga

WG.

A comparative study of several smoothing methods in density estimation. Comput Stat Data An 1994; 17: 153–176.

21.

Zougab

Adjabi

Kokonendji

CC.

Bayesian estimation of adaptive bandwidth matrices in multivariate kernel density estimation. Comput Stat Data An 2014; 75: 28–38.

22.

Duong

Hazelton

ML.

Plug-in bandwidth matrices for bivariate kernel density estimation. J Nonparametr Stat 2003; 15: 17–30.

23.

Duong

Hazelton

ML.

Cross-validation bandwidth matrices for multivariate kernel density estimation. Scand J Stat 2005; 32: 485–506.

24.

Chacón

Duong

Multivariate plug-in bandwidth selection with unconstrained pilot bandwidth matrices. Test 2010; 19: 375–398.

25.

Kijima

Markov processes for stochastic modeling. New York: CRC Press, 1997, pp.55, 56.

26.

Johannesson

Rainflow cycles for switching processes with Markov structure. Probab Eng Inform Sc 1998; 12: 143–175.

27.

Wang

. Determination of the minimum sample size for the transmission load of a wheel loader based on multi-criteria decision-making technology. J Terramechanics 2012; 49: 147–160.

28.

Schön

Spectrum fatigue loading of composite bolted joints—small cycle elimination. Int J Fatigue 2006; 28: 73–78.

29.

Heuler

Seeger

A criterion for omission of variable amplitude loading histories. Int J Fatigue 1986; 8: 225–230.

30.

Johannesson

Michael

(eds). Guide to load analysis for durability in vehicle engineering. Chichester: John Wiley & Sons, 2013, p.63.