Abstract
In the design and condition assessment of bridges, the extreme vehicle load effects are necessary to be taken into consideration, which may occur during the service period of bridges. In order to obtain an accurate extrapolation of the extreme value based on limited duration, threshold selection is a critical step in the peak-over-threshold method. Overly high threshold results in little information to be used and excessively low threshold leads to large bias in parameters estimation of generalized Pareto distribution. To investigate this issue, 417 days of strain data acquired from the long-term structural health monitoring system of Taiping Lake Bridge in China are employed in this article. According to the tail distribution of the strain data induced by vehicle loads, four homothetic distributions are chosen as its parent distribution, from which lots of random samples are generated by the Monte Carlo method. For each parent distribution, the 100-yearly extreme values at different thresholds are estimated and compared with the theoretical value based on those samples. Then a simple and empirical threshold selection method is proposed and applied to estimate the weekly extreme strain due to vehicle loads on the Taiping Lake Bridge. Results show that the estimate on the basis of the threshold obtained by the proposed method is closer to the measured result than the commonly used methods. The proposed method can be an effective threshold selection tool for the extreme value estimation of vehicle load effect in future engineering practice.
Keywords
Introduction
Safety assessment of bridges plays an important role in bridge management, for which the load-carrying capacity of bridges and the associated uncertainties are needed to be evaluated accurately. A large volume of research work has been conducted on the assessment of load-carrying capacity of bridges and it is possible to evaluate the load-carrying capacity reasonably.1,2 Vehicle load effect of bridges, one of the great sources of uncertainty, is mainly studied in this article. Vehicle load effect mainly refers to stress/strain, internal force (such as bending moments, shear force), and deformation induced by vehicles passing through bridges. Obviously, vehicle load effect due to light vehicle loads is not in our concern. On the contrary, the extreme events caused by heavy vehicles, several vehicles meeting and overtaking, which endanger the serviceability and safety of bridges, are of greatest interest.
Extreme value (EV) theory indicates that EV estimation is only related to the tail of the probabilistic distribution.3,4 Techniques and models have been developed to describe these tails, by which the probabilities of extreme events can be estimated on the basis of historical data. Among them, the peak-over-threshold (POT) method is one of the most widely used methods. It avoids the problem of wasting data, which is a common problem of the block maxima method, using a generalized Pareto distribution (GPD). 5 However, how to select an appropriate threshold is also an inevitable problem, on which the EV estimation depends. If the threshold is too low, the tail satisfies the convergence criterion less, which will result in large bias and incorrect result. On the other hand, if the threshold is too high, little data above the threshold will lead to high variance and unreliable result.6,7 Thus, choosing a proper threshold implies a balance between the bias and the variance.
The current methods for threshold selection mainly consist of graphic methods and computational methods. 8 Graphic methods mainly include mean residual life (MRL) plot, Hill plot, threshold stability plot, and so on. The MRL plot, as one of the most popular used methods, has been broadly used in EV estimation of many fields such as vehicle load effect, precipitation, climate, and finance.9–14 In this method, the threshold is selected using the plot of the mean of exceedances versus threshold. Hill plot is suitable for long-tailed distributions, which mean the shape parameter of GPD is positive. In this method, the threshold is determined by plotting the Hill estimator for a range of value of the number of upper order statistics. Computational methods are suggested to choose the optimal threshold by minimizing bias-variance of GPD model or its shape parameter based on a bootstrap procedure. For example, Caers et al. 15 presented a guide to threshold selection using the finite sample mean square error (MSE) of an estimated tail parameter as a criterion. Moreover, there are some other approaches developed to address the issue. DuMouchel 16 suggested using the data above tenth percentile to estimate the EV. However, theoretical examples given in this article show that these conventional methods are not suited for the EV estimation of vehicle load effect by the POT method.
In order to investigate the threshold selection method for vehicle load effect on bridge, 417 days of strain data acquired from the structural health monitoring system installed on the Taiping Lake Bridge in China are employed in this article. The strain due to vehicle loads can be obtained by the analytical modal decomposition method.17,18 To simulate the tail distribution of vehicle load effect, four mixed distributions are chosen as the parent distributions, from which a large number of samples are produced by the Monte Carlo method. For each parent distribution, the 100-yearly EVs are estimated at different thresholds. By Comparing the estimates at different thresholds and corresponding theoretical values, an empirical threshold selection method is proposed to study the vehicle load effect.
Threshold estimation methods
Three commonly used methods are introduced in this section, including the MRL plot, Hill plot, and minimum MSE method, which will be compared with the proposed threshold selection method in this article.
The MRL plot
Davison and Smith 19 introduced MRL plot to determine the threshold using the expectation of the GPD excesses
where u denotes the threshold, ξ denotes the shape parameter, and σu denotes the scale parameter corresponding to threshold u. The condition of ξ < 1 is defined to guarantee the existence of the expectation. Equation (1) shows that the expectation of excesses is linear in u with gradient ξ/(1 – ξ) and intercept σu/(1 – ξ). For a set of samples (X1, X2,…, Xn), the empirical estimate of the mean of excesses can be obtained as follows
where Nu denotes the number of exceedances over u. The set data of {u, en(u)} represent the MRL plot. Generally, the value of u above which the plot is an approximately straight line can be selected as the optimal threshold.
Hill estimator
Let
Clearly, the Hill estimator is function of these extreme random variables {X(1,n), X(2,n),…, X(k,n)} which depends on the chosen threshold. A Hill plot is constructed by the Hill estimator of a range of k values versus the k value or the threshold. The value of Xk,n above which the Hill estimator tends to be stable can be chosen as the optimal threshold.
MSE
The MSE can be evaluated on any tail characteristics θ such as the shape parameter of GPD, an extreme quantile, or a return period. The optimal threshold can be obtained by evaluating the MSE for a range of thresholds. The bootstrap is chosen as the technic to analyze estimation bias and variance of the tail characteristics θ at each threshold:
Select a range of high thresholds (u1, u2,…, uj).
For each threshold Select the exceedances over the threshold ui, that is, For each bootstrap loop: draw a resample Calculate the MSE of
The optimal threshold is selected when MSE of
Since threshold selection greatly influences the parameter estimation of a GPD, the shape parameter ξ is chosen as the tail characteristic θ in this article.
Threshold estimation based on the EV extrapolation of strain data
Simulation of strain peaks distribution induced by vehicle loads
It is demonstrated that the EV estimation corresponds to tail data. 3 To ascertain the probability distribution of the tail data of vehicle load effect, the strain data of 417 days of a long cable-stayed bridge in China is taken as the real example. The strain due to vehicle loads can be obtained by decomposing the measured strain through analytical model decomposition method. Figure 1 shows the strain time history due to vehicle loads on 27 December 2015. Each “jump” in the figure represents the strain time history induced by a vehicle passing through the bridge. The maximum value of each “jump” is defined as strain peak, which is assumed to be independent.21,22 According to the suggestion in DuMouchel, 16 the upper 10% of these strain peaks, which is larger than 18.1 με, are collected. It is indicated that mixed distributions of normal distribution, lognormal distribution, and Weibull distribution can fit vehicle load and its effect well.23–24 Since the histogram of collected data shows three peaks, three mixed distributions (a mixed distribution of three normal distributions, a mixed distribution of one lognormal and two normal distributions, a mixed distribution of one Weibull and two normal distributions) are chosen to fit these collected data, as shown in Figure 2. The Kolmogorov–Smirnov test for goodness-of-fit is used. Results show that all the three models coincide with the tail data well, and the third model, that is, a mixed distribution of one Weibull and two normal distributions, is the best. The same conclusion can be drawn from the strain peaks collected from other four measurement points on the bridge. In order to investigate the influence of threshold on EV estimation in POT method, four homothetic parent distributions are given to simulate the tail distribution of vehicle load effect as follows:
A mixed distribution of three normal distributions;
A mixed distribution of one lognormal and two normal distributions;
A mixed distribution of one Weibull and two normal distributions;
A mixed distribution of one Weibull and two normal distributions with different parameters.

Strain peaks induced by vehicle loads of Taiping Lake Bridge (27 December 2015).

Strain peaks over 18.1 με fitted by the three mixed distributions. (a) The mixed distribution of three normal distributions—p1 = 0.3665, µN1 = 18.56, σN1 = 0.94; p2 = 0.4708, µN2 = 22.62, σN2 = 2.09; p3 = 0.1627, µ N 3 = 32.21, σN3 = 5.45. (b) The mixed distribution of one lognormal distribution and two normal distributions—p1 = 0.4055, µL1 = 2.98, σL1 = 0.052; p2 = 0.4388, µN2 = 22.82, σN2 = 2.12; p3 = 0.1558, µN3 = 32.56, σN3 = 5.38. (c) The mixed distribution of one Weibull distribution and two normal distributions—p1 = 0.7593, µN1 = 18.1, σW1 = 5.62, ζW1 = 0.9796; p2 = 0.0953, µN2 = 19.93, σN2 = 0.71; p3 = 0.1454, µN3 = 21.98, σN3 = 1.34.
Probability models of the third and fourth parent distributions are the same, while the parameters are different. Parameters of the four parent distributions are listed in Table 1.
Parameters of four parent distributions.
p1, p2, p3 denote the weighting coefficients; μN, σN denote the mean value and standard deviation of Normal distribution, respectively; μL, σL denote the mean value and standard deviation of Lognormal distribution, respectively; σw, ξw, μw denote the scale parameter, shape parameter, and position parameter of Weibull distribution, respectively.
With reference to the measured data of Taiping Lake Bridge, 100 strain peaks, which are in the top 10%, are supposed to be produced every day. From each parent distribution, 100 times for different data length (such as a day, a month, 6 months, a year, 10 years and 50 years) are randomly sampled by Monte Carlo method, which means 100 groups, respectively, with the sample size of 100, 3000, 18,000, 36,000, 360,000, and 1,800,000 are sampled from each parent distribution.
Threshold estimation of samples generated from the fourth parent, which is apparently the closest to the optimal model shown in Figure 2(c), is analyzed in detail, while the other three examples are presented in Appendix 1.
The effect of threshold chosen on EV estimation
A series of values chosen as thresholds are listed in Table 2. The mean number of exceedances over each chosen threshold is calculated and also shown in the table. The exceedances are fitted by GPD models. Parameters of GPD can be estimated by the maximum likelihood. 15 The fitting residual can be obtained as follows
The number of exceedances over different thresholds (the fourth parent distribution).
which is used as a goodness-of-fit function for GPD. F(xi,N) denotes the empirical cumulative probability at value of xi,N, and
Fitting residual of GPD model (the fourth parent distribution).
GPD: generalized Pareto distribution.
Where “–” denotes that data are too little to be used, same denotation is adopted in the following tables.
where Tp, λ, and G(x) denote the forecast period (Tp = 100), the number of exceedances per year, and fitted GPD, respectively. Characteristic value of the expectation of each 100-yealy EV distributions is calculated. The mean value, the standard deviation, and 95% confidence interval (CI) of the characteristic values are listed in Table 4. Since the fourth parent distribution is known, the theoretical characteristic value can be obtained, which is 56.82.
Estimates of 100-yearly EV (theoretical value E = 56.82).
EV: extreme value; CI: confidence interval.
Where μ, σ denote the mean value and standard deviation of characteristic values, respectively; same denotations are adopted in following tables.
From Table 4, it can be found that estimates of EV depend on the threshold and sample size. Obviously, larger sample size will bring better estimates, while choosing an appropriate threshold involves a trade-off between the bias and the variance. In order to select the optimal threshold, the MSE criterion is applied as
where μ, E, and σ 2 denote the mean value, the theoretical value, and the variance of characteristic value, respectively. The threshold achieves optimal when MSE reaches the minimum. Moreover, it is assumed that relative error less than 10% is within acceptance range. For the fourth parent distribution, the 95% CI of characteristic value in the interval [51.14, 62.50] is allowed.
Analyzing Tables 2–4, some conclusions can be drawn as follows: the estimate of EV based on a GPD is larger than the theoretical value generally. It is not advisable to estimate the EV based on 1 day’s data. Satisfied results cannot be obtained based on data of 1 month or 6 months or 1 year either, since 95% CI of the characteristic value exceeds the acceptable range. In all of the above cases, better estimate can be achieved with a larger threshold. Based on 10 years’ data, satisfied results can be obtained when the threshold is in the interval [34, 38], and the best estimate is achieved with the threshold of 38. When the sampling time is as long as 50 years and the threshold is within the interval [34, 40], the estimates are good which achieve the best with the threshold of 40. Moreover, better results of estimate cannot be guaranteed by better fitting results of GPD model. The variation of estimates with threshold of samples produced from the other three parent distributions is similar to that of the fourth.
Threshold estimation
For each group of samples, the thresholds are estimated by the MRL plot, Hill plot, and MSE method. The mean values of thresholds of the samples with the same sampling length and the optimal threshold are listed in Table 5.
Estimated thresholds of the samples produced from the fourth parent distribution.
MRL: mean residual life; MSE: mean square error.
As shown in Table 5, thresholds estimated by the three methods increase with the increasing of sample size. It can also be found that the thresholds estimated by the three methods are smaller than the optimal thresholds generally. Similarly, the threshold estimation results of the other three parent distributions also are not very good, which mean that the commonly used methods are not suitable for investigating the vehicle load effect. In addition, a significant drawback of the graphic methods is that the MRL plots and Hill plots of samples generated from the same parent distribution show great instability. Especially, for the third and fourth parent distribution, more than 80% of the MRL plots and Hill plots are difficult to determine the “turning point,” that is, the threshold.
Based on the estimated EV at different thresholds of four parent distributions, an empirical threshold estimation technic based on the relationship between the characteristic value and the threshold is proposed in this article. The steps are as follows: draw the plot of characteristic value versus the threshold and observe the relationship between them. (1) If the sample size is large enough, the characteristic value tends to be stable with the increasing of threshold. In this case, the starting point of stable region can be selected as the optimal threshold. (2) If the sample size is small, the variation magnitude of characteristic value remains constant or even increases with the increasing of threshold, and it does not tend to be stable. In this case, the threshold should be as large as possible under the condition that shape parameter of GPD can be correctly estimated without large fluctuations or obvious errors.
Case study—Taiping Lake Bridge
The Taiping Lake Bridge is a pre-stressed concrete cable-stayed bridge with single cable plane and single tower, locating in Huang Mountain scenic spot of Anhui Province in China. 25 The bridge has a length of 380 m (span combination: 190 m + 190 m), as shown in Figure 3. The bridge carries four lanes with a width of 14 m. The health monitoring system was installed on the bridge in 2014 to collect information for bridge safety assessment. Measurement points are selected at several locations of the box girder. Strain gauges are installed at bottom plate of the main girder with a sampling frequency of 50 Hz. These strain gauges are arranged along the longitudinal direction of the bridge. By comparing the strain of different measuring points, the strain data of the most dangerous section under vehicle loads is studied which locate at about 121 m from the centerline of tower.

The Taiping Lake Bridge (Huang Mountain, China).
Taking 417 days’ strain data (2015.4–2016.7) of the bridge as the real example, all strain peaks induced by vehicle loads are extracted as samples. Several thresholds are selected, and then the corresponding 100-yearly EV distributions are estimated based on GPD. The plot of expectation of the EV versus threshold is shown in Figure 4. From this figure, it can be seen that when the threshold is greater than 41, the expectation varies slowly. Thus, it is suggested to choose 41 as the optimal threshold. To evaluate the optimal threshold, the measured data of 417 days is divided into two parts, while data of the first 67 days are used to estimate weekly EV, and the remaining data of 350 days, that is, 50 weeks, are used to validate the estimate. The estimated and measured weekly EV distributions are shown in Figure 5. It can be found that the estimate coincides with the measured result well. However, the EV estimated by GPD is a little bit larger than measured value. Moreover, the MRL plot and Hill plot of these strain peaks are shown in Figure 6, and the thresholds estimated by the three common methods are listed in Table 6. The weekly extreme strain distribution at each threshold is estimated. Corresponding expectations are also shown in table, which indicates that the estimate by the proposed technic is better than the common methods.

Expectation of 100-yearly EV versus the threshold.

Estimated and measured weekly extreme strain distribution.

Threshold estimation of graphic methods: (a) MRL plot and (b) Hill plot.
Threshold estimated by the common methods (mean of measured weekly extreme strain: 40.89 με).
MRL: mean residual life; MSE: mean square error.
Conclusion
The present paper focuses on the threshold selection in POT method to estimate the EV of vehicle load effect on bridge. 417 days of strain data of a long cable-stayed bridge located in China are employed to study this issue. According to the tail distribution of strain data due to vehicle loads, four homothetic distributions are chosen as the parent distributions. On the basis of samples produced by Monte Carlo method from each parent distribution, characteristic values at different thresholds are estimated and compared with the corresponding theoretical value. Then an empirical method for the threshold estimation based on the relationship between the characteristic value and threshold is proposed. Finally, the proposed method is compared with the commonly used methods and verified based on the strain data due to vehicle loads on Taiping Lake Bridge. The conclusions are drawn as follows:
The distribution of strain peaks induced by vehicle loads is a multi-peak distribution, and the tail distribution can be described by a mixed distribution of one Weibull and two normal distributions.
The characteristic value mainly depends on threshold. With the increase of the threshold, the bias of characteristic value decreases first and then tends to be stable, while the variance decreases at first and then increases gradually. It means that an optimal threshold involves a trade-off between the bias and the variance.
The characteristic value is closely related to the sample size. If the bias does not tend to be stabilized with the increasing of threshold for the shortage of samples, the EV cannot be estimated accurately. In that case, the threshold should be as large as possible.
The estimated EV based on GPD is greater than the theoretical value generally.
On the basis of the estimated EV of Taiping Lake Bridge, it is demonstrated that compared with the commonly used methods, the proposed method is more suitable to estimate the threshold of vehicle load effect.
Footnotes
Appendix 1
Acknowledgements
We acknowledge the support received from the Anhui Provincial Natural Science Foundation (1708085ME101) and the China Postdoctoral Science Foundation (2015M581982).
Handling Editor: Kenneth Loh
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
