Different percentages of false-positive results obtained using five methods for the calculation of reference change values based on simulated normal and ln-normal distributions of data

Abstract

Background

Reference change values provide objective tools to assess the significance of a change in two consecutive results for a biomarker from an individual. The reference change value calculation is based on the assumption that within-subject biological variation has random fluctuation around a homeostatic set point that follows a normal (Gaussian) distribution. This set point (or baseline in steady-state) should be estimated from a set of previous samples, but, in practice, decisions based on reference change value are often based on only two consecutive results. The original reference change value was based on standard deviations according to the assumption of normality, but was soon changed to coefficients of variation (CV) in the formula (reference change value = ± Z ċ 2^½ ċ CV). Z is being dependent on the desired probability of significance, which also defines the percentages of false-positive results. The aim of this study was to investigate false-positive results using five different published methods for calculation of reference change value.

Methods

The five reference change value methods were examined using normally and ln-normally distributed simulated data.

Results

One method performed best in approaching the theoretical false-positive percentages on normally distributed data and another method performed best on ln-normally distributed data. The commonly used reference change value method based on two results (without use of estimated set point) performed worst both on normally distributed and ln-normally distributed data.

Conclusions

The optimal choice of method to calculate reference change value limits requires knowledge of the distribution of data (normal or ln-normal) and, if possible, knowledge of the homeostatic set point.

Keywords

Critical difference false-positive results ln-normal distribution normal (Gaussian) distribution reference change value

Introduction

Reference change values (RCVs), also known as critical differences, are tools to assist with objective analysis of differences in the results of two consecutive measurements of a biomarker in an individual. The basis for this aid to interpretation is that, for a difference in serial results to be significant, this must be greater than the inherent variation. The most commonly cited and applied formula for calculating RCV limits is RCV = ± [Z ċ 2^½ ċ (CV_A²+ CV_I²)^½], where CV_A is the analytical coefficient of variation (CV) and CV_I is the within-subject biological variation. Often, the formula is given as RCV = ± [Z ċ 2^½ ċ CV_T], where CV_T is the total CV, CV_T = (CV_A²+ CV_I²)^½. Z is the number of standard deviations appropriate to the probability desired for detecting differences. Usually, 95% probability (P < 0.05 when 5.0% are false-positives) is regarded as significant, and 99% probability (P < 0.01 when 1.0% are false-positives) is highly significant. Thus, in general, 1.65 and 2.33 are the appropriate Z-scores to use for a unidirectional change, i.e. when increases or decreases in concentration are being considered (one-tailed). However, before applying the RCV calculation for interpretation of the difference in two results, the assumptions inherent in the calculation of RCV should be considered. The RCV calculation is based on the assumption that CV_I represents random fluctuation around a homeostatic set point and follows a normal (Gaussian) distribution. This set point (or baseline in steady-state) ideally should be estimated from more than one sample, but in practice, RCV calculations often are based on only two consecutive results. In consequence, the percentage of false-positive results will differ from the theoretical. This fact was observed in 2015, when we studied methods to calculated limits for significant uni- and bidirectional differences in two or more serial results of a biomarker based on a computer simulation model.^1,2 During this work, we observed that the percentages of false-positive results from simulated data-sets were greater than those calculated theoretically for an increase in concentration and less for a decrease in concentration.

Harris and Yasaka,³ described the first application of RCV where for two consecutive results, X1 and X2, the significant limits for a change in concentration are: X2 − X1 = ± Z ċ 2^½ ċ SD_T, where SD_T is the total standard deviation (SD) of analytical (SD_A) plus within-subject biological variation (SD_I), [SD_T = (SD_A²+ SD_I²)^½]. Shortly after, Costongs et al.⁴ expanded the concept to alternate between SD and CV in calculation of RCV. In making this change, an important factor has turned out to be the value used to change the difference in results (in the assay units) into a per cent difference, as differences in the choice of this value give different percentage changes for the same difference in assay units. Fraser and Harris⁵ redefined RCV through using CV instead of SD as: RCV = ± [Z ċ 2^½ ċ (CV_A²+ CV_I²)^½] and included a table and a formula where the number of specimens required to assess the homeostatic set point at different levels of analytical imprecision were specified. This approach uses the set point (mean of baseline results) to convert the difference in results to a percentage change. However, the common use of RCV is based on only two results and comparison of the difference to the first result (i.e. the first result is an estimate of the set point, albeit rather uncertain). In this process, one important assumption for RCV is altered in that CV is considered to be the constant rather than SD and, in consequence, the original RCV calculation method³ should be modified. Jones⁶ pointed out the problem with the assumption of constant SD and revised the RCV calculations with the assumption of constant CV. Consequently, Jones postulated new RCV limits based on a formula detailed without rigorous mathematical explanation. We recently discovered, using the same assumptions, that our calculated limits for significant changes in two results based on a computer simulation model were in perfect agreement with the mathematically calculated RCV limits from Jones.^1,2 Thus, it seemed that the assumptions applied (e.g. constant CV or constant SD) for calculation and application of the different RCV methods are very important.

The RCV calculations also assume that CV_I shows a normal (Gaussian) distribution. However, it has become more and more apparent that, for many measurands, within-subject variations are more appropriately described as ln-normal distributions. A data-set is ln-normally distributed if the natural logarithms of that data-set are normally distributed. For example, Fokkema et al.⁷ found that the CV_I distribution of brain natriuretic peptides showed a right-skewed distribution and developed an RCV calculation method by addressing the skewness of the distribution. Therefore, Fokkema et al.⁷ stated more correct formulae for RCV calculations when a data-set is ln-normally distributed.

The aim of this study was to investigate the performance of five published calculation methods for RCV applied to normally and ln-normally distributed simulated data appropriate to apparently healthy individuals. The performance of the RCV methods was assessed by simulations and the percentages of false-positive results for each method calculated and then compared to the theoretical percentages. The RCV performance was investigated by varying both CV_T and percentages of theoretical results (i.e. use of different Z-scores). Based on the simulation results, an optimal choice of RCV method is possible, depending on whether the distributions of the data are normal or ln-normal or unknown and whether the homeostatic set point is known or unknown.

Materials and methods

All data for the simulations were generated using Microsoft Excel version 2010.

RCV calculation methods

For two consecutive results, X1 and X2, five methods for calculating RCV were studied. Four methods use the formula [(X2 − X1)/Y] = RCV for a change in concentration, where the value of Y is dependent on the method.

RCV method ‘Common’⁸

([X2 − X1]/Y) = ([X2 − X1]/X1) = RCV = ± Z ċ 2^½ ċ CV_T are the limits for a difference in concentration. In this method, Y = X1 and is the first result and the only estimate of the mean of the individual’s homeostatic set point.

RCV method ‘Sölétormos’⁹

([X2 − X1]/Y) = (X2 − X1)/(0.5 ċ [X1 + X2]) = RCV = ± Z ċ 2^½ ċ CV_T are the limits for a difference in concentration. In this method, Y = 0.5 ċ (X1 + X2), and the mean is an estimate of the individual’s homeostatic set point.

RCV method ‘Fraser’⁵

([X2 − X1]/Y) = RCV = ± Z ċ 2^½ ċ CV_T are the limits for a difference in concentration. Y is the individual’s homeostatic set point and is estimated as a mean of further serial results. The number of specimens required for a confident estimate of the homeostatic set point is n = (Z ċ CV_T/D)², where n is the number of specimens required and D is the desired percentage closeness to the homeostatic set point.

RCV method ‘Jones’⁶

[(X2 − X1)/Y] = [(X2 − X1)/X1] = RCV = {#x02212; (CV_T)²± [(CV_T)⁴ – 2 ċ (CV_T)² ċ ((CV_T)² – 1/Z²)]^½}/((CV_T)²–1/Z²) are the limits for a difference in concentration. Y = X1 is the first result and is the only estimate of the mean of the individual’s homeostatic set point.

RCV method ‘Fokkema’⁷

X2/X1 = exp(±Z ċ 2^½ ċ σ) are the limits for a difference in concentration where σ = [ln((CV_T)²+ 1)]^½. X1 is the first result and is the only estimate of the mean of the individual’s homeostatic set point.

These five described RCV calculations were used on normally and ln-normally distributed data generated from simulations. The RCV limits were calculated using Z = 1.65 and 2.33 corresponding to theoretical unidirectional percentage of false-positive results of 5.00% and 1.00%, respectively. The RCV were calculated for CV_T = 5.0%, 10.0%, 15.0% and 20.0% (representative of most often encountered values in everyday practice calculations of RCV). It should be noted that CV_T for ln- normally and CV_T of the underlying normally distributed data are not the same. The CV_T for ln-normal distributions is calculated using CV_T = [exp(σ²) − 1]^½, where σ² is the variance of the underlying normal distribution.⁷ A total of 10,000 simulated normally and ln-normally distributed results, as would be found in apparently healthy individuals, were generated for each CV_T. The method to estimate percentages of false-positive results has been described in detail in a previous publication.¹

Results

The percentages of false-positive results using the five RCV calculation methods on simulated normally distributed data are listed in Table 1. For each RCV method, the percentage of false-positive results for an increase and decrease in concentration and for varying CV_T related to the theoretical false-positive values is documented. Similarly, the percentages of false-positive results for the same five RCV calculation methods on simulated ln-normally distributed data are listed in Table 2.

Table 1.

Estimation of false-positive results when unidirectional RCV^a were calculated using five different methods on simulated concentration data as from apparently healthy individuals with varying coefficients of variation and theoretical false-positive results.

	CV_T^b= 5.0%		CV_T^b= 10.0%		CV_T^b= 15.0%		CV_T^b= 20.0%
Method	Up^c (%)	Down^d (%)	Up^c (%)	Down^d (%)	Up^c (%)	Down^d (%)	Up^c (%)	Down^d (%)
	Theoretical percentage of false-positive results: 5.00
‘Common’	6.00	4.14	7.24	3.23	8.29	2.55	9.32	1.98
‘Sölétormos’	4.84	5.03	5.04	5.26	5.17	5.42	5.27	5.51
‘Fraser’	4.85	4.98	4.94	5.08	4.92	5.04	4.90	5.02
‘Jones’	4.90	5.03	4.91	5.17	4.95	5.15	4.74	5.09
‘Fokkema’	4.97	5.09	5.17	5.37	5.41	5.70	5.80	6.01
	Theoretical percentage of false-positive results: 1.00
‘Common’	1.56	0.64	2.26	0.35	3.12	0.20	4.23	0.12
‘Sölétormos’	0.96	1.02	0.99	1.12	1.13	1.22	1.28	1.42
‘Fraser’	0.95	1.02	0.95	1.02	0.95	1.02	0.95	1.02
‘Jones’	0.96	1.02	0.94	1.01	0.92	1.07	1.00	0.91
‘Fokkema’	0.97	1.04	1.07	1.20	1.31	1.37	1.62	1.73

Note: All data show a normal (Gaussian) distribution.

Reference change values.

CV_T, total coefficient of variation.

Increased concentration.

Decreased concentration.

Table 2.

Estimation of false-positive results when unidirectional RCV^a were calculated using five different methods on ln-normally distributed simulated concentration data as from apparently healthy individuals with varying coefficients of variation and theoretical false-positive results.

	CV_T^b= 5.0%		CV_T^b= 10.0%		CV_T^b= 15.0%		CV_T^b= 20.0%
Method	Up^c (%)	Down^d (%)	Up^c (%)	Down^d (%)	Up^c (%)	Down^d (%)	Up^c (%)	Down^d (%)
	Theoretical percentage of false-positive results: 5.00
‘Common’	5.96	4.12	6.96	3.14	7.79	2.01	8.69	1.26
‘Sölétormos’	4.82	4.97	4.81	4.96	4.56	4.77	4.32	4.55
‘Fraser’	4.92	4.95	5.01	5.03	4.95	5.04	4.97	4.95
‘Jones’	4.85	4.98	4.61	4.82	4.30	4.53	3.90	4.17
‘Fokkema’	4.93	5.02	4.93	5.02	4.93	5.02	4.93	5.04
	Theoretical percentage of false-positive results: 1.00
‘Common’	1.47	0.65	2.15	0.28	2.72	0.08	3.50	0.01
‘Sölétormos’	0.93	0.99	0.88	0.97	0.79	0.89	0.68	0.80
‘Fraser’	0.93	1.02	0.94	1.06	1.06	1.18	1.20	1.26
‘Jones’	0.92	0.99	0.79	0.89	0.62	0.76	0.43	0.58
‘Fokkema’	0.96	1.02	0.96	1.02	0.96	1.02	0.96	1.02

Note: All data show a ln-normal (ln-Gaussian) distribution.

Reference change values.

CV_T, total coefficient of variation.

Increased concentration.

Decreased concentration.

Discussion

From Tables 1 and 2, it is clear that simulations are excellent approaches to test whether different RCV methods generate comparable false-positive results. One (‘Fraser’) of the five methods performed best in accordance with the theoretical false-positive percentages (maximum differences: 0.15%) on normally distributed data, and similarly, one (‘Fokkema’) method performed best (maximum differences: 0.07%) on ln-normally distributed data (Table 3). This is in accordance with the assumptions of the distributions of data and the concepts of the RCV designs. In contrast, one of the used RCV methods, (‘Common’) which is based on two results, performed worst on both normally (maximum differences: 4.32%, Table 3) and ln-normally distributed data (maximum differences: 3.69%, Table 3). Please note that the theoretical differences presented in the tables may be less observable for measurands reported with insufficient number of significant figures.¹⁰

Table 3.

Estimated maximal difference from the expected theoretical percentages of false-positive results (5.00 and 1.00) using five reference change value (RCV) calculation methods (unidirectional), when the data display a normal (Gaussian) or ln-normal (ln-Gaussian) distribution, and the homeostatic set-point is known or unknown, and, the total coefficient of variation, CV_T ≤ 20%.

Normal distribution	ln-normal distribution	Set-point known	RCV method	Max. (%) difference from theoretical 5.00% false-positive results	Max. (%) difference from theoretical 1.00% false-positive results
Yes	No	Yes	‘Fraser’	0.15	0.05
Yes	No	No	‘Jones’ ‘Sölétormos’ ‘Fokkema’ ‘Common’	0.26 0.51 1.01 4.32	0.07 0.42 0.73 3.23
No	Yes	Yes	‘Fraser’	0.05	0.26
No	Yes	No	‘Fokkema’ ‘Sölétormos’ ‘Jones’ ‘Common’	0.07 0.68 1.10 3.69	0.04 0.32 0.57 2.50

Normal (Gaussian) distribution, Table 1

RCV method ‘Common’

The percentage of false-positive results was always greater than theoretical for increased concentration: i.e. using this RCV ‘Common’ method will always generate more false-positive results than expected. For example, the percentages of false-positive results were increased from 6.00 to 9.32% for an expected theoretical false-positive result of 5.00%, when CV_T was increased from 5.00 to 20.00%. In contrast, the ‘Common’ method generated fewer false-positive results than theoretical when the concentration decreased. For example, the percentages of false-positive results were decreased from 4.14 to 1.98% for an expected theoretical 5.00%, when CV_T was increased from 5.00 to 20.00%, respectively. This distortion of false-positive results is a consequence of changing the calculation formula from constant SD to constant CV and using the first result as a rough estimate of the homeostatic set point. In other words, a difference of two results (X2 − X1) is transformed from a linear scale to a new scale when the difference is related to the first result (i.e. [X2 − X1]/X1 = X2/X1 − 1). For increased concentrations, this new scale (X2/X1 − 1) is greater than a linear scale (and is more like a logarithmic function because X2/X1 has a logarithmic scale). Consequently, for increased concentrations, the ‘Common’ RCV method produces, relatively, more false-positive results when CV_T is increased, and less for decreased concentrations. In our comparison of the tested five RCV methods, the ‘Common’ method performed worst, and in consequence, is not optimal as a method to calculate significant limits for differences in concentration of a biomarker in serial results from an individual.

RCV method ‘Sölétormos’

Compared with ‘Common’, the ‘Sölétormos’ method had better performance (max differences: 0.51%, Table 3). The ‘Sölétormos’ RCV method uses the mean of two results and thereby compensates for extreme results and leads to a better estimate of the homeostatic set point.

RCV method ‘Fraser’

Of the five RCV methods ‘Fraser’ method performed best, i.e. the percentages of false-positive results were very close to the theoretical values (max differences: 0.15%, Table 3). On the other hand, this good performance is dependent on a reliable estimate of the homeostatic set point. Therefore, associated with the description of the RCV method, a formula was stated with the number of specimens required to ensure that the mean result is, e.g. ± 5.00% of the individual’s homeostatic set point.⁵ The ‘Fraser’ method is based on the original RCV concept from Harris and Yasaka,³ i.e. a calculation of a significant difference of two consecutive results (X1 and X2). When RCV is calculated using CV_T instead of original SD_T, this difference should be compared with the homeostatic set point (Y), i.e. RCV = (X2 − X1)/Y. In this process, the SD_T value is still a constant, mathematically, and the good performance results of the simulations from ‘Fraser’ method (Table 3), thus indicate that the original RCV concept from Harris and Yasaka³ was correct.

RCV method ‘Jones’

The ‘Jones’ RCV method performed nearly as well as the ‘Fraser’ method (max differences: 0.26%, Table 3). The strength of the ‘Jones’ method is that knowledge of homeostatic set point is not required. ‘Jones’ method applies the same procedure as the ‘Common’ method by using the first result (X1) as an estimate of the homeostatic set point, [(X2–X1/X1]), but compensate the distortion of false-positive results by extending the significant limits for an increase in concentration, and, lowering for a decrease in concentration (see section RCV method ‘Common’ above). Thus, the ‘Jones’ method has greater significant limits for increased concentrations compared with both ‘Common’ and ‘Fraser’ RCV methods. Consequently, the use of ‘Jones’ and ‘Fraser’ RCV method (both with comparable good performances, Table 3) on two serial results from an individual could lead to different interpretations. Therefore, if the homeostatic set point from an individual is available, it should be applied in the ‘Fraser’ RCV method, and if the set point is unknown the ‘Jones’ method is recommended. Jones⁶ did not fully document the mathematical explanations and calculations for the RCV limits but these are now stated here. A mathematical formula for RCV can be generated by changing the original RCV calculation from constant SD to constant CV using the following: from X2 − X1 = Z ċ 2^½ ċ SD_T to (X2 − X1)/X1 = RCV = Z ċ ([SD1/X1]²+ [SD2/X1]²)^½, where SD1 is the SD for the first result and SD2 is the SD for the second result. From this, equation can be derived X2 = (1 + RCV) ċ X1 and SD1/X1 = CV_T and SD2 = CV_T ċ X2 (CV_T is considered as a constant) and these are substituted into the equation which is derived as: RCV = Z ċ ([CV_T]² + [1 + RCV]² ċ [CV_T]²)^½. The latter equation can be considered as a quadratic with two solutions, one for an increase and one for a decrease in concentration. Jones realized the RCV calculation becomes ‘a little more complex’ and documented calculated RCV limits for limited CV_T values in a Table. For practical use, we have created Figure 1 for further CV_T values, where all the associated RCV limits can be read directly.

Figure 1.

Illustration of bi-directional RCV limits for two consecutive measurements in accordance with ‘Jones’ for normal (Gaussian) distributions.⁶ (a) is a larger version of a component of (b). RCV% up is the limit for a significant increase and RCV% down is the significant limit for a decrease in concentration as a function of CV_T%_. The RCV limits can be read for 90%, 95% or 99% probability.

RCV method ‘Fokkema’

The ‘Fokkema’ RCV method is designed for ln-normal distributed data and produced more false-positive results than theoretical expected for both increased and decreased differences in concentration. Notably, the Fokkema method worked better with smaller CV_T than with higher. This is expected as ln transformation of data sets with a narrow variation has less effect on the shape of the distribution.

ln-normal (ln-Gaussian) distribution, Table 2

RCV method ‘Common’

As with normally distributed data, the ‘Common’ method also performed worst on ln-normal distributed data (max differences: 3.69%, Table 3). Again, the ‘Common’ method produced too many false-positive results for an increase in concentration and too few false-positive results for a decrease in concentration. Compared with the other RCV methods, the ‘Common’ method is not optimal.

RCV method ‘Sölétormos’

‘Sölétormos’ method performed comparably on both normally distributed data (max differences: 0.51%, Table 3) and on ln-normally distributed data (max differences: 0.68%, Table 3).

RCV method ‘Fraser’

Also, the ‘Fraser’ method performed comparably well on both normally (max differences: 0.15%, Table 3) and ln-normally distributed data (max differences: 0.26%, Table 3).

RCV method ‘Jones’

The ‘Jones’ method is specifically designed for normally distributed data and produced fewer false-positive results than the theoretically expected (max differences: 1.10%, Table 3).

RCV method ‘Fokkema’

Uniquely, the ‘Fokkema’ RCV method is specifically designed for ln-normally distributed data and performed best giving false-positive results comparable with the theoretical percentages (max differences: 0.07%, Table 3). Another advantage of the ‘Fokkema’ methods is that knowledge of homeostatic set point is not required.

In conclusion, the optimal choice of method to calculate RCV limits requires knowledge of the distribution of data (normal or ln-normal), and if possible, knowledge of the homeostatic set point. In Table 3, a guide is given to use the optimal RCV method for available data on an individual. It is important to note that the likely most often used RCV method, i.e. the ‘Common’ method, performed worst and is not optimal in any situation.

Footnotes

Acknowledgements

We would like to thank Merete Frejstrup Pedersen for her assistance in generating the figure.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

Ethical approval

Not applicable.

Guarantor

FL.

Contributorship

FL and PHP designed and generated the computer simulations. FL wrote the first draft of the manuscript. All authors contributed to the discussions and reviewed and edited the manuscript.

References

Lund

Hyltoft Petersen

Fraser

. Calculation of limits for significant unidirectional changes in two or more serial results of a biomarker based on a computer simulation model. Ann Clin Biochem 2015; 52: 237–244.

Lund

Hyltoft Petersen

Fraser

. Calculation of limits for significant bidirectional changes in two or more serial results of a biomarker based on a computer simulation model. Ann Clin Biochem 2015; 52: 434–440.

Harris

Yasaka

. On the calculation of a “reference change” for comparing two consecutive measurements. Clin Chem 1983; 29: 25–30.

Costongs

Janson

Bas

. Short-term and long-term intra-individual variations and critical differences of clinical chemical laboratory parameters. J Clin Chem Clin Biochem 1985; 23: 7–16.

Fraser

Harris

. Generation and application of data on biological variation in clinical chemistry. Crit Rev Clin Lab Sci 1989; 27: 409–437.

Jones

GRD

. Critical difference calculation revised: inclusion of variation in standard deviation with analyte concentration. Ann Clin Biochem 2009; 46: 517–519.

Fokkema

Hermann

Muskiet

FAJ

. Reference change values for brain natriuretic peptides revisited. Clin Chem 2006; 52: 1602–1603.

Fraser

. Biological variation: from principles to practice, Washington, DC: AACC Press, 2001.

Sölétormos

Hyltoft Petersen

Dombernowsky

. Progression criteria for cancer antigen 15.3 and carcinoembryonic antigen in metastatic breast cancer compared by computer simulation of data. Clin Chem 2000; 46: 939–949.

10.

Jones

GRD

. Effect of the reporting-interval size on critical difference estimation: beyond “2.77”. Clin Chem 2006; 52: 880–885.

Different percentages of false-positive results obtained using five methods for the calculation of reference change values based on simulated normal and ln-normal distributions of data

Abstract

Background

Methods

Results

Conclusions

Keywords

Introduction

Materials and methods

RCV calculation methods

RCV method ‘Common’ 8

RCV method ‘Sölétormos’ 9

RCV method ‘Fraser’ 5

RCV method ‘Jones’ 6

RCV method ‘Fokkema’ 7

Results

Discussion

Normal (Gaussian) distribution, Table 1

RCV method ‘Common’

RCV method ‘Sölétormos’

RCV method ‘Fraser’

RCV method ‘Jones’

RCV method ‘Fokkema’

ln-normal (ln-Gaussian) distribution, Table 2

RCV method ‘Common’

RCV method ‘Sölétormos’

RCV method ‘Fraser’

RCV method ‘Jones’

RCV method ‘Fokkema’

Footnotes

Acknowledgements

Declaration of conflicting interests

Funding

Ethical approval

Guarantor

Contributorship

References

RCV method ‘Common’⁸

RCV method ‘Sölétormos’⁹

RCV method ‘Fraser’⁵

RCV method ‘Jones’⁶

RCV method ‘Fokkema’⁷