Abstract
The power law model produces both temperature varying and unreliable estimates for its parameters. Threshold stresses have been suggested as a solution. The power law and Wilshire models are modified to include this stress and estimation and error decomposition methods applied to assess its importance in representing failure times. A statistically significant and temperature-dependent threshold stress was identified in two low-alloy steels. This threshold stress was closer to the operating stress in the Wilshire model. The inclusion of this stress reduced interpolation errors, but this improvement was greater in the Wilshire model. The Wilshire model increased the random component of these errors at all temperatures in one material, but only at some temperatures for the other.
Introduction
Low-alloy ferritic steels such as 1Cr–0.5Mo, 1Cr–1Mo–0.25V, 2.25Cr–1Mo and 0.5Cr–1Mo–1V are extensively used for high temperature applications in the power generation and petro-chemical industries. This is because they all have good creep strength together with good resistance to oxidation and hydrogen embrittlement at the elevated temperatures required within these sectors. The main applications for these steels are in components such as turbine rotors and steam pipes/headers. Typically, the components used within the power generation sector operate at a temperature and stress of around 823 K and 50 MPa, respectively. The minimum creep rate,
However, these power law-based approaches have several known weaknesses that are now well documented and include the derivation of unrealistic values for the activation energy and stress exponent (which also appears to vary with both stress and temperature) when these models are applied to creep data on materials used for high temperature applications. The development of the Wilshire equations [7] was driven by a desire to overcome this problem of an unrealistic and varying stress exponent. A fundamental feature of this new approach was the normalisation of the applied stress through use of the ultimate tensile strength. What was new about this stress normalisation was that the approach constrained failure times to zero when the test stress equalled the tensile strength and infinity when stress equalled zero. This was done using a ‘Weibull’ type expression of the form:
It has been suggested that observing unrealistic n values of more than 5 is the result of not accounting for threshold stress. Williams and Wilshire [12], for example, suggested that creep occurs not under the full applied stress, but under a reduced effective stress (σ − σ0), so that the power law model is more correctly written as
If it is the effective stress that is important, then the Wilshire equation should also be rewritten as
The presence of a threshold stress is not as well established for creep failure as it is for fatigue, but authors have tried to explain its existence in different ways depending on the material under investigation. For low-alloy ferritic steels, microstructural changes tend to develop after prolonged service at high temperature and this mainly consists of the coarsening of carbides, compositional and morphological changes in the carbides, and increases in the inter-particle spacing formation of new carbides. Research by Senior [16], Singh and Banerjee [17,18] and Cheruvu [19] have revealed that such microstructural changes only lead to a degradation in creep strength after long exposure at lower stress levels (and so by implication also at lower temperatures) – given the long exposure required for such coarsening. As a result of this, at low stresses (long-term rupture strengths) virgin and service exposed materials were hardly distinguishable, suggesting the presence of a threshold stress induced by these microstructural changes. A threshold stress has also been observed by researchers working on Nickel-based super alloys. For example, Benz et al. [20] observed, in their study of Alloy 617, the presence of a threshold stress at temperatures 1023 K (and less). Such temperatures directly correlate to the γ/precipitates, with these precipitates providing an obstacle to continued dislocation motion which result in the presence of a threshold stress. In essence the γ/precipitates provide a barrier to the movement of dislocations which at lower temperatures and thermal energies require a minimum stress for dislocations to overcome such barriers to further movement.
In addition to using a threshold stress and or the Wilshire equation to stabilise the stress exponent, another approach is to take the more pragmatic view that all these models are imperfect descriptors of creep and so should only apply over a very limited range of test conditions. An estimation procedure based on localised data points will then provide good fits to the data and it will then be possible to trace out the variation in a models parameters with test conditions to identify changing creep mechanisms. If this parameter variation is then well defined with respect to test conditions it can also be used for predictive purposes. The LOESS technique proposed by Cleveland [21] is one such procedure as it performs a regression on data points in a moving range around the transformed stress suggested by the creep model that is required for linearisation, where the values in the moving range are weighted according to their distance from this value.
This paper therefore has several aims. First, the paper moves away from the existing approach (e.g. [14]) of measuring σo experimentally, towards new methods for estimating threshold stresses directly from the observed failure times by making use of the power law and Wilshire creep models – suitably re-expressed to include a threshold stress. Second, the paper tests the statistical significance of a threshold stress within the power law model because the use of such stresses is not well proven or established in the creep literature (unlike for fatigue). Third, the LOESS estimation technique is assessed as a means of dealing with imperfections remaining within the creep model. LOESS is applied to both the power law and Wilshire threshold models to see if the model structure affects the values for the estimated threshold stress. All assessments are made using the measures of creep model misspecification outlined in Appendix 1 of the paper. With regards to these misspecification tests, the main advancement contained in this paper is the decomposition of Holdsworth's Z parameter into random and systematic components because a high Z value does not necessarily mean a poorly performing creep model if most of Z is made up of random errors. The paper then uses data on 1Cr–1Mo–0.25V rotor steel and 2.25Cr–1Mo steel to illustrate these modifications and estimation procedures.
Public domain data sets
This paper makes uses of some extensive creep data sets that are currently in the public domain. NIMS Creep Data Sheet 3B &50A, published by the Japanese National Institute for Materials Science (NIMS) [22,23] provides extensive rupture data for 12 batches of 2.25Cr–1Mo steel where each batch has a different chemical composition that underwent one of four different heat treatments, details of which are given in [22]. Specimens for the tensile and creep rupture tests were taken radially from the ring-shaped samples which were removed from the virgin material. Each test specimen had a diameter of 6 mm with a gauge length of 30 mm. These specimens were tested at constant load over a wide range of conditions: 333–22 MPa and 723–923 K. In addition to minimum creep rate and time to failure measurements, values are also given of the times to attain various strains – 0.005, 0.01, 0.02 and 0.05 over a selected range of the above-mentioned test conditions. Also reported are the values of the 0.2% proof stress and the ultimate tensile strength determined from high strain tensile tests carried out at the creep temperatures for each batch of steel investigated. Failure times are averaged over all batches at each temperature, and these values are displayed in Figure 1(a).

NIMS creep data sheets 9B & 50A, published by the Japanese National Institute for Materials Science (NIMS) [24,23], includes information on nine batches of tempered bainitic 1Cr–1Mo–0.25V steel, where each batch corresponded to a different heat treatment and a different chemical composition, details of which are given in [24]. Specimens for the tensile and creep rupture tests were taken radially from the ring-shaped samples which were removed from the virgin turbine rotors. Each test specimen had a diameter of 10 mm with a gauge length of 50 mm. These specimens were tested at constant load over a wide range of conditions: 412–47 MPa and 723–948 K. In addition to minimum creep rate and time to failure measurements, values are also given of the times to attain various strains – 0.005, 0.01, 0.02 and 0.05 over a selected range of the above-mentioned test conditions. Also reported are the values of the 0.2% proof stress and the ultimate tensile strength determined from high strain tensile tests carried out at the creep temperatures for each batch of steel investigated. Again, the failure times are averaged over all batches at each temperature, and these values are displayed in Figure 1(b).
In addition to these data sheets, NIMS Creep Data Sheet No. M-6 [25] is a detailed metallographic atlas, containing additional information on hardness and microstructural changes observed over time in a selection of test specimens.
Methodology
Based on Equation (3), the threshold-power law and Monkman-Grant description of time to failure at any stress and temperature combination is given by
If this threshold power law model is correctly specified, the parameters B and m should be constants with respect to stress and temperature. The constancy of B and m in Equation (5a) can be tested statistically by carrying out the following regression over all temperatures
The threshold Wilshire model can be semi-linearised with respect to its unknown parameters using logs
In either of these models, the joint significance of the threshold stresses can be tested using the F test given by Equation [A13] in Appendix 2. The ability of Equations (5) and (7) to adequately describe failure times can be assessed using the RESET test given in Appendix 1. It can also be assessed using the performance statistics discussed in more detail in Appendix 1.
Another approach to dealing with any observed variation in m and u is to accept that these models are an imperfect explanation of minimum creep rates and failure times. That is, the model is only a realistic explanation of creep over a reduced range of test conditions. One way to implement this is to have an estimation procedure that only considers test conditions local to the test condition of interest. Once repeated for all test conditions of interest, this estimation procedure will then reveal the nature of the variation in u and m with the test conditions and so could reveal more information on changing creep mechanisms. One such estimation procedure is the locally weighted scatterplot smoothing (LOESS) curve first proposed by Cleveland [21]. Here data points around a chosen test condition are weighted with points further away from this chosen test condition being given less weight in a standard least squares estimation (i.e. weighted least squares are used). A zero weight is then used for conditions beyond a selected band width. This process is then repeated for all stresses and temperatures revealing different values for the model parameters B, m, kF and u at each test condition. These can then be used to make interpolations and/or extrapolations. To achieve this, the power threshold and Wilshire models are rewritten as
Results and discussion
Non-linear least squares estimation of the threshold power law model
2.25Cr–1Mo steel
Equation (9) shows the results of estimating Equation (5b) using the non-linear search procedure described in the methodology section above
where the square brackets contain the t values associated with the null hypothesis that the parameter value above it are zero, and the numbers in curly brackets the p-value associated with this t test (and so give the probability of this null hypothesis being true). There are a number of observations around Equation (9) that indicate this representation of creep is reasonable. First, a positive threshold stress is observed at all temperatures except at 923 K. The joint statistical significance of these threshold stresses was tested using the F test given by Equation (A13) in Appendix 2. This F value comes out at 7.4, which is statistically significant even at the 1% significance level. Thus, while the threshold stress may not be present at all temperatures, there are threshold stresses that are strongly dependent on temperature – being highest at the lowest temperatures. Second, the R2 value for this model is 93.24%, which is quite reasonable given the stochastic nature of creep.
However, there are also some features of this model that suggest it is mis-specified or at least an imperfect descriptor of the observed failure times, First, the estimated activation energy of around 178 kJ mol−1 is a little below the activation energy for self-diffusion quoted by Whittaker and Wilshire [27] (of around 230 kJ mol−1 for this material). Second, the estimation of Equation (6) yielded
and the p-values in parenthesis clearly show that the stress exponent m and intercept term in Equation (9) are not really constant with respect to all temperatures. The value for m at 823 K is −4.99, while at 773 K it is −4.99 + 1.17 and this difference is statistically significant at the 5% significance level. The value for the intercept (α) at 823 K is 11.95, while at 773 K it is 11.95-4.78 and this difference is again statistically significant at the 5% significance level. As well as this, m at 873 and 923 K are statistically different from the value at 823 K. Equation (10) suggests then that m varies between −3 and −5 –values that are consistent with dislocation climb. The lack of constancy in m is visualised in Figure 2(a).
Dependency of the temperature adjusted average failure times (calculated using Qc = 178 kJ mol−1 in the threshold power law model) for (a) 2.25Cr–1Mo and (b) 1Cr–1Mo–0.25V steels.
In addition, when the squared predictions obtained from Equation (9) are added as another variable to Equation (9), the resulting RESET test gave a p-value of 0.003 for the null hypothesis of no misspecification, thus suggesting this threshold power law model is statistically mis-specified at the 1% significance level. This misspecification can be seen in Figure 2(a), which plots tF exp[178(1000/RT)] against (σ−σ0,v) revealing data points that define a curve rather than a line.
Interpolative performance of the threshold power law model when applied to two different low-alloy steels.
Z is defined by Equation (A5), MPSE through Equation (A3). RMPSE is the square root of MPSE. Both MPSE and RMPSE are in %. UM, UR and UD are defined by Equation (A8) of Appendix 1 and are in % (decomposition of MPSE).
1Cr–1Mo–0.25V steel
Equation (11) shows the results of estimating the threshold power law model for 1Cr–1Mo–0.25V steel
Again, there are several observations around Equation (11) that indicate this representation of creep is reasonable for this material. First, a positive threshold stress is observed at all temperatures except above 898 K. The F test for the joint significance of these threshold stresses comes out at 6.3, which again is statistically significant even at the 1% significance level. Thus, once again there are threshold stresses that are strongly dependent on temperature. Compared to 2.25Cr–1Mo steel, these threshold stresses are much larger, however. Second, the R2 value for this model is reasonably high at 86.2%. Thirdly, the estimation of Equation (6) yielded
and the p-values in parenthesis clearly show that the stress exponent m and intercept term in Equation (11) are constant with respect to all temperatures at the 10% significance level (but at the 5% significance level, m may be different at 873 K compared to 823 K). The value for m at 823 K is −3.44, while at 873 K it is −3.44 + 0.88 and this difference is not statistically significant at the 10% significance level. The value for the intercept (α) at 823 K is 14.54 and at the 10% significance level the intercept is no different at any other temperature. Equation (12) suggests then that m is around −3.44 – a value that is consistent with dislocation climb. The near constancy in m is visualised in Figure 2(b). Fourthly, when the squared predictions obtained from this equation are added as another variable to Equation (11) the resulting RESET test gave a p-value of 0.053 for the null hypothesis of no misspecification, Thus suggesting this threshold power law model is not mis-specified using a 5% significance level (but it is at the 1% significance level).
However, there are also some features of this model that suggest it is mis-specified or at least an imperfect descriptor of the observed failure times, First, the estimated activation energy of around 122 kJ mol−1 is well below the activation energy for self-diffusion quoted by Scharning and Wilshire [8] (of around 300 kJ mol−1 for this material). Second, the bottom half of Table 1 gives further evidence for this material that the threshold power law model is far from an ideal description of the creep failure times. At all temperatures, the Z statistic is unacceptable high (>4) at 773–873 K. At these temperatures the MPSE is high with values between 8% and 60%. The random component of this MPSE remains very low at 948, 898 and 723 K.
Possible explanations for varying stress exponent
The reason for a smaller slope at high temperatures could be down to the fact that some specimens tested at these temperatures all remained exposed for the longest periods of time because they were conducted at the lowest stresses within the test matrix. An inspection of some of the images of the failed test specimens in creep data sheet 3B [22] at these temperatures and long exposures revealed the presence of an oxidised surface that could weaken its load bearing strength. Further, the NIMS Metallographic Atlas [25] reveals that the received bainitic regions degrades to Ferrite and Molybdenum carbide particles, with very coarse carbides along grain boundaries. Both of these phenomena would result in faster creep rates and creep lives that are substantially shorter in tests of long duration at 873 K and above, than would be expected by direct extrapolation of the best fit lines seen in Figure 2(a,b) at the lower temperatures That is, it would result in the bending of the curve seen in the figures at the long exposure present at these high temperatures and lower stresses. The second of these two explanations would then be consistent with the presence of a threshold stress below 873 K – with carbide coarsening at the high temperatures removing the need for a minimum stress required for dislocations to move.
Explanation for the presence of threshold stresses
The presence of a threshold stress and its dependence on temperature could reflect the fact that for this data set, the longest exposed test specimens are at the highest two temperatures (due to the low stresses), where as noted above, noticeable carbide coarsening takes place. This will weaken the barriers to dislocation movement and so remove the presence of a minimum or threshold stress required for dislocation movement. Consequently, the estimated threshold stresses are at 873 and 923 K either zero or very close to it. Alternatively, the presence of large threshold stresses at the lowest temperatures may be the result of lower thermal energies making creep more stress-dependent – i.e. requiring a minimum stress before barriers to dislocation movement can be overcome. This minimum is larger the lower the thermal energy present – the lower is the temperature.
LOESS estimation of the threshold power law model
2.25Cr–1Mo steel
Figure 3(a,b) shows some results from using this LOESS procedure to estimate the parameters of Equation (8a) for 2.25Cr–1Mo data. Both B and m follow very similar patterns showing a very strong dependency on test conditions. At temperatures of 823 K and above and with stresses less than around 150 MPa, ln(B) increases with stress and m decreases with stress. Between a stress of around 150 and 250 MPa both these parameters are broadly constant at temperatures below 823 K and for higher stresses ln(B) starts to increase and m decrease with stress.
Dependency of (a) ln(B) and (b) m in Equation (8a) on test conditions for 2.25Cr–1Mo steel.
The value for Qc was estimated at 306 kJ mol−1, again rather on the large side for this material, and the threshold stresses were estimated at
Temperature-adjusted failure times as a function of stress together with LOESS interpolations obtained using Equation (8a) and parameter values in Figures 3 and 6(a,b), for (a) 2.25Cr–1Mo and (b) 1Cr–1Mo–0.25V–steels. Interpolative performance of the threshold power law model estimated using the LOESS technique when applied to two different low-alloy steels. Z is defined by Equation (A5), MPSE through Equation (A3). RMPSE is the square root of MPSE. Both MPSE and RMPSE are in %. UM, UR and UD are defined by Equation (A8) of Appendix 1 and are in % (decomposition of MPSE).

This is all visualised in Figure 5(a). The solid curves show the interpolations of time to failure at each stress and consistent with the statistics in Table 2, the interpolations look poorest at 948 K.
Failure times as a function of stress together with this LOESS interpolations obtained using Equation (8a) for (a) 2.25Cr–1Mo and (b) 1Cr–1Mo–0.25V steels.
1Cr–1Mo–0.25V steel
Figure 6(a,b) shows some results from using this LOESS procedure on 1Cr–1Mo–0.25V steel. Both B and m are strongly dependent on test conditions but in a more straightforward way than for 2.25Cr–1Mo steel. Up to around 325 MPa, ln(B) increases with stress and m decreases with stress. At temperatures of 823 K and above with stress less than around 150 MPa ln(B) increases with stress and m decreases with stress. Temperature then shifts the dependency in an almost parallel fashion. Above 325 MPa and for temperatures below 798 K these parameters remain constant with respect to stress.
Dependency of (a) ln(B) and (b) m in Equation (8a) on test conditions for 1Cr–1Mo–0.25V steel.
The value for Qc was estimated at 133 kJ mol−1, again unrealistically small for this material, and the threshold stresses were estimated at
LOESS estimation of the threshold Wilshire model
2.25Cr–1Mo steel
Figure 7(a,b) contain the values for ln(kF) and 1/u of Equation (8b) found at each test condition using LOESS. Both ln(kF) and 1/u show a strong dependency on test conditions. At temperatures of 823 K and above with stress levels less than around 125 MPa, ln(kF) decreases with stress and 1/u increases with stress. Then at higher stresses and lower temperatures, ln(kF) is broadly constant with respect to stress, while 1/u shows a slight tendency to decline with stress.
Dependency of (a) ln(kF) and (b) 1/u in Equations (8b) on test conditions for 2.25Cr–1Mo steel.
The value for Qc was estimated at 190 kJ mol−1, which is closer to the value of 230 kJ mol−1 reported by Wilshire and Whittaker [27] than the value obtained above using LOESS or non-linear least squares on the threshold power law model. The threshold stresses were estimated as
Figure 8(a) then plots the interpolated time-adjusted failure times obtained by substituting into Equation (8b) the values for ln(kF) and 1/u shown in Figure 7(a,b) together with the Qc and threshold stress values shown above. Compared to Figure 4(a), there appears to be more scatter around the model's interpolated values, especially at higher stresses, and the fit is not so good at 823 K in the LOESS estimated threshold Wilshire model. In the threshold power law model deviations from a straightish line occurs only once, while in the threshold Wilshire model this happens twice. This is further seen in the statistics shown in the top half of Table 3. Compared to the threshold power law model estimated via LOESS (Table 2), the Z values are about the same or a little higher depending on the temperature – especially at 823 K where the LOESS estimated threshold Wilshire model is far inferior in performance. All the Z values are also the same or much lower than the critical value of 4. The same picture holds for the MSPE. The LOESS estimated threshold Wilshire model has more random interpolative error at all of the temperatures except at 823 K compared to the LOESS estimated threshold power law model.
Temperature-adjusted failure times as a function of stress together with the LOESS interpolations obtained using Equations (8b) for (a) 2.25Cr–1Mo and (b) 1Cr–1Mo–0.25V steels. Interpolative performance of the threshold Wilshire model estimated using the LOESS technique when applied to two different low-alloy steels. Z is defined by Equation (A5), MPSE through Equation (A3). RMPSE is the square root of MPSE. Both MPSE and RMPSE are in %. UM, UR and UD are defined by Equation (A8) of Appendix 1 and are in % (decomposition of MPSE).
This is all visualised in Figure 5(a). The dashed curves show the interpolated time to failures at each stress for the threshold Wilshire model. Consistent with the statistics in Tables 2 and 3, the threshold Wilshire interpolations look better than the threshold power law model at 873 K but worse at 823 K.
1Cr–1Mo–0.25V steel
Figure 9(a,b) contains the values for ln(kF) and 1/u found at each test condition. Both ln(kF) and 1/u show a very strong dependency on test conditions. At temperatures of 848 K and above and with stresses less than around 250 MPa, ln(kF) decreases with stress and 1/u increases with stress. Then at higher stresses and lower temperatures ln(kF) and 1/u are broadly constant with respect to stress.
Dependency of (a) ln(kF) and (b) 1/u in Equations (8b) on test conditions for 1Cr–1Mo–0.25V steel.
The value for Qc was estimated at 245 kJ mol−1, again rather on the small side for this material, and the threshold stresses were estimated at
This is all visualised in Figure 5(b). The dashed curves show the interpolated time to failure at each stress for the threshold Wilshire model. Consistent with the statistics in Tables 2 and 3 the threshold Wilshire interpolations look better than the threshold power law model at all temperatures except 923 and 898 K.
Both the threshold power law and Wilshire models gave unreasonable values for the activation energy irrespective of whether it was estimated via LOESS or non-linear least squares. Despite this shortcoming, estimation via LOESS resulted in improved interpolative performance. The improved interpolations come about because interpolations are based on restricted observations around the test condition of interest when using LOESS. But the values for Qc still suggest the models are miss specified. These unrealistic Qc values likely come from forcing Qc to be a constant over all test conditions in the models used in the paper. The possibility of an activation energy being dependent on test conditions would fit well with the LOESS methodology and warrants future research.
Conclusion
This paper presented an investigation into the significance of a threshold stress within a power law and the Wilshire models. For both materials that were studied in this paper, a statistically significant threshold stress was found and that this stress varied with temperature. The choice of creep model had a much smaller impact on the estimated threshold stress compared to the estimation technique – LOESS vs non-linear least squares. The values for the threshold stress at each temperature in the LOESS estimated Wilshire model were much closer to the typical operating stress for these materials.
The use of Z together with its decomposition overcomes the problem of using Z to assess interpolative performance (large Z values are not necessarily bad if all the interpolation errors are random).
When the LOESS estimation procedure was applied to the threshold power law model an improved interpolative capability was observed when compared to that model estimated via non-linear least squares. The scatter in the temperature-adjusted failure times ware dramatically reduced yielding a smooth well defined but non-linear relationship with stress.
When the LOESS estimation procedure was applied to a threshold-modified Wilshire model an improved interpolative capability (as measured by Z and MPSE) compared to the threshold power law model was observed at nearly all temperatures for both for 2.25Cr–1Mo and 1Cr–1Mo–0.25V steel. In terms of how much of this MPSE was random in nature, a more mixed picture emerged. For 2.25Cr–1Mo steel, the LOESS estimated threshold Wilshire model produced more random interpolation errors at all but one temperature, while for 1Cr–1Mo–0.25V steel the LOESS estimated threshold power law model performed better in around two thirds of the temperatures studied.
The estimated activation energy was still unreasonable in – especially for 1Cr–1Mo–0.25V steel – coming out at just 190 kJ mol−1. This may be an indication that the creep models used are mis-specified.
The implications of including a threshold stress for extrapolative capability remain to be assessed. In particular, the stability of the estimated threshold stress when using failure times of up to 10,000 h or less remains to be quantified and should form part of any future research efforts.
Footnotes
Disclosure statement
No potential conflict of interest was reported by the author(s).
Data availability statement
All data used in this publication are in the public domain [22–
].
Appendices
Appendix 1
The following statistics will be used to assess how well each creep model explains the experimental creep failure times. No extrapolation will be undertaken, and all predictions are in the form of an interpolations over all data pints used for model estimation.
The S A-RLT test for model adequacy
The starting point for more recent measures of creep model specification is the interpolation or prediction error. Holdsworth et al. [28] termed this error the residual log time
The decomposition of MPSE as a measure of model adequacy
However, the above interpretation of what is efficacious, assumes that the residual variation picked up by se (and thus Z) is all systematic in nature and so the result of a poorly fitting creep model. This will not always be the case. Granger and Newbold [30] have shown that
This suggests that a high value for Z would not be an indication of a creep model making large systematic prediction errors, provided β = 1. Rather, it would be due to a large value for sv. In this extreme situation, all the variation being picked up by Z is purely random in nature and reflects the stochastic nature of the creep testing of the material under investigation – which for some materials can result in substantial scatter. The size of this random variation is pre-determined, and no creep model can reduce it. Instead, it is the result of things like microstructural variation in test samples, accuracy of test equipment, etc. At the other extreme, a large value for Z would be an indicator of a poorly performing creep model if
Another issue with Z is that it does not pick up a mis-specified creep model that fails to predict the log failure times even on the average. This can be seen by noting that the MPSE can also be worked out as
The RESET test as a measure of model adequacy
Another test for model misspecification is that proposed by Ramsey [31]. This so-called RESET test is based on looking for non-linearities in the transformed creep data that should, if correctly specified, only contain linear variation. While the nature of any non-linearity in the event of misspecification will not be known (unless the correct creep model is known), it can be empirically approximated using polynomials (as fits to data can always be made better by adding high and higher ordered polynomials – indeed if there are 5 data points, a curve can be put through all 5 data points using a fourth-order polynomial). To avoid such over fitting of the data a low-order polynomial is best used – such as a quadratic. So, the RESET test then takes the form of a standard t test that γ = 0 in
Appendix 2
LOESS estimation
Many non-parametric estimation procedures exist in the literature including cubic splines together with varying Kernel estimation procedures. An interesting approach that enables the physical structure of the creep model to be maintained, is a LOESS curve first proposed by Cleveland [21]. In this model, it is assumed that the relationship between the log time to failure and temperature and stress is well represented by the chosen creep model. Then instead of approximating any non-linear relationship between, say,
This is achieved through the following sequential steps:
Create starting values for Choose a value for the ‘smoothing parameter’ k that must be between 0 and 1 (the larger the number, the smoother will be the parametric curve). Select a data point on the constructed plot and label these x–y values x* and y*. From this, identify the Nk nearest data points on this plot to this selected point using the distance measure
Select the Nk smallest distance values (after rounding up to the nearest whole number) and scale them to take on a value between 0 and 1 – by diving each value for distance by the largest distance value in this subset of data. Call these values d*. Select all those data points having the smallest Nk values for d* and with these data points derive regression weights defined by the tri cube weight function
Carry out a weighted least squares by regressing the product yw on w and the product xw (with no constant or intercept term) for this subset of data points. Use the resulting best fit line to predict the value for y corresponding to x*, together with the squared residual (difference between the actual and predicted value for y squared). This prediction is the point on the parametric curve corresponding to x*. Repeat steps 1–6 for all other values for x. Calculate the sum of all the squared residuals – RSS. Then use a non-linear search algorithm to find values for
There are several suggestions in the literature as to the best choice for the smoothing parameter k used in the above steps. One class of method chooses the smoothing parameter value to minimise a criterion that incorporates both the tightness of the fit and the model complexity. Such a criterion can usually be written as a function of the RSS and a penalty function designed to decrease with increasing smoothness of the fit. An example of such a criteria is the generalised cross-validation (GSV) developed by Craven and Wahba [32]
An F test for the presence of a threshold stress
The following F statistic can be used to statistically test for the presence of a threshold stress
