Abstract
There is a growing debate with regards to the appropriate methods of analysis of growth trajectories and their association with prospective dependent outcomes. Using the example of childhood growth and adult BP, we conducted an extensive simulation study to explore four two-stage and two joint modelling methods, and compared their bias and coverage in estimation of the (unconditional) association between birth length and later BP, and the association between growth rate and later BP (conditional on birth length). We show that the two-stage method of using multilevel models to estimate growth parameters and relating these to outcome gives unbiased estimates of the conditional associations between growth and outcome. Using simulations, we demonstrate that the simple methods resulted in bias in the presence of measurement error, as did the two-stage multilevel method when looking at the total (unconditional) association of birth length with outcome. The two joint modelling methods gave unbiased results, but using the re-inflated residuals led to undercoverage of the confidence intervals. We conclude that either joint modelling or the simpler two-stage multilevel approach can be used to estimate conditional associations between growth and later outcomes, but that only joint modelling is unbiased with nominal coverage for unconditional associations.
1 Introduction
Increasingly in epidemiology and medical research, there is interest in the relationship between both baseline level of, and change in, an exposure and a future outcome. Examples include changes in biomarkers in relation to disease incidence or progression (such as prostate specific antigen (PSA) in relation to progression of prostate cancer); 1 changes in physiological variables in relation to disease outcomes (e.g. changes in renal function and subsequent arterial stiffness); 2 and associations between physical and cognitive changes in later life. 3 One change hypothesis that has been widely explored is the association between birth size, childhood growth and adult outcomes such as blood pressure (BP),4–8 and it is this example we focus on here – however, the methods are applicable to associations between linear change in any continuous exposure and a distal outcome.
We consider the example of a study aiming to investigate the association between birth length and linear growth rate during childhood on adult blood pressure (BP). We are interested in the total effect of birth length (B) on BP, and in the effect of linear growth rate (G) on BP, conditioning on birth length (Figure 1). Additional complexities that may occur in real data, such as non-linear growth, and an interaction between birth length and growth rate in their effect on the outcome are not considered. Studies relating birth length and growth to BP typically collect repeated data on height (length measured at birth and on at least one further occasion during childhood), and a later (adult) measure of BP. In our example, we assume that length is measured within 2 weeks of birth, and then height is measured at approximately 2.5, 5, 7.5 and 10 years of age.
A common approach is first to summarise the repeated measures of height, and then relate these summaries to the subsequent outcome. One simple method for carrying out the summary stage is to use the observed birth length and the latest childhood measure of height (or birth length and change between birth length and final height) as exposures; 9 an alternative is to regress height on age within each individual to estimate their birth length and growth rate. 10 More complex methods include using multilevel or other repeated measures methods to model the trajectories of height, and extracting summaries of these trajectories such as birth length and growth.11,12 Irrespective of the method used to summarise changes in exposure, linear regression models are then used to relate the summaries of change in height to the adult systolic BP, meaning that this stage can be carried out using standard statistical software. All the two-stage approaches share the potential problem that uncertainty in the estimates of birth length and growth are not taken into account in the confidence intervals for their associations with BP, meaning that standard errors may be underestimated.
This problem can be viewed in a measurement error framework, where the regression of blood pressure on birth height and growth is biased by the measurement error/intra-individual variation in height. Measurement error/intra-individual variation in the exposure (height) would tend to attenuate the coefficients between BP and height towards the null in the simple methods. It has been noted in the measurement-error literature that the two-stage multilevel method (also termed the regression calibration, or ‘RC’ method) provides consistent conditional effect estimates when the model relating exposures to outcome is linear, 13 and that the individual regression method results in biased estimates of the conditional effects. However, more research is needed into the performance of all two-stage methods in estimating unconditional effects, and into their relative bias and coverage under different conditions.
In contrast to these two-stage approaches, the joint modelling approach aims to model birth length, growth and BP simultaneously, often using a bivariate growth model, 14 or in a structural equation modelling framework. 15 Joint modelling approaches are becoming more widely implemented in mainstream statistical software. However, joint models are more complex than the two-stage approaches outlined earlier, and it is not known whether any bias or under-coverage in the two-stage methods is large enough to warrant this extra modelling complexity.
In this simulation study, we compare the bias and coverage under different study conditions of six methods to estimate the association between birth length, linear growth and later BP. The two-stage methods we examine are: (1) a simple approach, which uses the observed birth length, and the difference between birth length and latest height measure (here, at approximately 10 years), divided by the difference between the ages at the two measurements, as an estimate of growth rate; (2) an individual regression (OLS) approach which estimates an individual’s birth length and growth rate from the parameters of the model regressing that individual’s height measures on age; (3) a multilevel growth model (MLM) where estimates of the individual random effects (shrunken residuals (or (4) inflated residuals) are used as estimates of birth length and growth rate. These are compared to two versions of a joint modelling approach: (5) a bivariate MLM, in which child growth and the adult outcome are modelled simultaneously and re-inflated residuals used to estimate the association between growth and outcome and (6) a structural equation model (SEM) which provides an alternative parameterisation of the bivariate model. In this study, we compare the bias and efficiency of using joint modelling and two-stage methods under different assumptions about the between- and within-individual variances of birth length and growth rate, and the relationship between them.
2 Methods
2.1 Simulation study design
We simulated a study where length is measured close to birth and at approximate ages of 2.5, 5, 7.5 and 10 years (I = 5 measurement occasions per individual). The exact ages (Age ij ) at which height was measured for individual j at occasion i were drawn from a multivariate normal distribution with mean ages of 0, 2.5, 5, 7.5 and 10 years, and no covariance between occasions. The standard deviations of age at measurement at occasion i (σage i ) were 0.5, with the exception of age 0 (σage 0) which is 0.025 (approximately 95% of individuals had measurements within ± 2 weeks of zero). A single draw from the age distribution was used for all simulations, and age of measurement was checked to ensure that it was monotonically increasing across measurement occasions for each individual. The number of individuals in the study (J) was set to 1000, to illustrate a medium/large cohort study, and all individuals had all five measurements of height.
We simulated height for individual j at occasion i (H
ij
) from age for that individual at that occasion (Age
ij
) using a random intercept and random linear slope growth model
We then simulated a linear association between BP (
2.2 Two-stage methods
Two-stage methods of analysis attempt to (1) summarise the growth process, i.e. birth length and growth rate, and the covariance between these (where the method considers this explicitly) and (2) use these summaries to investigate the associations of birth length and growth rate with the outcome of interest (BP). We begin by describing the second stage as this is common to all two-stage methods. We then outline the first stage, which differs for each approach.
The second stage starts by assuming that we have estimates of the birth length (
2.3 The first stage
The first stage attempts to summarise the growth trajectory for individual j by estimating the true birth length (Bj) and linear growth rate (Gj) for each individual. The three two-stage methods considered here (the simple approach, the individual regression approach, and a multilevel model (MLM) for growth) differ in how they estimate birth length Bj and growth rate Gj.
2.3.1 The simple approach
The simplest approach summarises the growth trajectory using the first observed measurement as an estimate of birth length, and the difference between the latest height measure (here, height at age of approximately 10 years) and birth length, divided by the elapsed time between measurements, as an estimate of growth rate. This idea is equivalent to that suggested by Lucas et al.,
8
with the exception that they suggest adjusting for final height, which they recognise as a simple reparameterisation. This method assumes that the first observed height measure (H1
j
) is the best estimate of birth length. Justifiability of this assumption would depend on the timing of this first measurement. This simple method also assumes that the later measures of height provide no further information about birth length. Similarly, the estimate of growth rate is characterised by the difference between height at the first (H1
j
) and last measurement occasion (HI
j
), and assumes that intermediate measurements are uninformative.
2.3.2 The individual regression (OLS) approach
An alternative approach is to fit J separate linear regression models (one for each individual) of height for individual j at time i (H
ij
) on age of that individual at each time-point (Age
ij
), via ordinary least squares (OLS).
2.3.3 MLM approach
A further, more parsimonious, method would be to specify a MLM with random intercepts and slopes. This method simultaneously models the growth trajectories of all individuals, assuming that birth length and growth rate are normally distributed around the population mean birth length (
2.3.4 MLM re-inflation approach
In order to mitigate the potential problem of under-estimation of the variances, shrunken residuals,
2.4 Joint modelling methods
2.4.1 Bivariate MLM approach
In the bivariate model, the growth trajectory and adult BP models are estimated simultaneously.14,20 This model can be thought of as a combination of a two-level model for height, and a single level model for BP. The specification of the growth model is identical to the MLM presented previously. Additionally, a single level variance component model is estimated for BP and replaces the need to separately estimate equations (3) and (4).
In a similar way to the two-stage approaches, but here using estimated instead of measured BP, these three estimates are then substituted into equations (3) and (4) and used to estimate the parameters of interest
There are alternative methods which could be used to directly estimate
In order to take into account the uncertainty in the residuals without relying on normality assumptions, a non-parametric bootstrap with replacement, with 1000 replicates, was used. The standard deviation of the 1000 bootstrap replicates was used as an estimate of the standard error. Normal theory confidence intervals were constructed using the observed point estimates and bootstrap standard errors, and percentile based confidence intervals were also calculated. This bootstrap estimation was only carried out for the baseline experimental scenario (see further in the text).
2.4.2 Structural equation model
Using a structural equation modelling (SEM) framework, the bivariate outcome model can be reformulated, and framed in terms of measurement and structural models.
The measurement model (13) describes a model of linear dependence of the height H
ij
of an individual j at a given Age
ij
on two latent factors for birth length (B
j
SEM) and growth rate (G
j
SEM). The relationship is described by loadings λ0i and λ1i (where λ0i = 1 and λ1i = Age
ij
).
2.5 Experimental scenarios
Five different experimental scenarios were chosen to explore the performance of different methods (in terms of bias and coverage) under different assumptions about the magnitude of variation in birth length, growth rate, correlation between birth length and growth rate, measurement error in the growth model and measurement error in the BP model. The number of replications for each of the scenarios outlined below was 1000.
Assuming a non-factorial design, standard deviations were set at σu0 = 2.5, σu1 = 0.5, σe
h
= 2.0 and σeBP = 10, and correlation ρu01 = 0.1. All other residual correlations were set equal to 0.
σu0 (standard deviation of birth size, cm): 1.5, 2.0, (2.5), 3.0, 3.5. σu1 (standard deviation of growth rate, cm.y−1): 0.2, 0.3, 0.4, (0.5), 0.6, 0.7, 0.8, 0.9, 1.0. σ
e
hi
(residual standard deviation in growth model, cm): 0.1, 1, (2), 3, 4, 5. ρu01 (correlation of birth size and growth rate): −0.6, −0.4, −0.2, −0.1, 0, (0.1), 0.2, 0.4, 0.6. σeb (residual standard deviation in BP, mmHg): 8, 9, (10), 11, 12.
Values in parentheses are simulation defaults held constant in the other experimental scenarios, and the baseline experimental scenario held all parameters at these values.
Model parameters were fixed for all simulations at:
2.6 Summary statistics of interest
The statistics of primary interest are the estimated association between birth length and BP (
The expected unconditional association between birth length and BP is given by:
2.7 Simulation implementation
The data were generated using Stata 12.1. The first stage of the simple and individualised regression approaches were also conducted in Stata 12.1, 23 whereas the multilevel and bivariate MLMs were fitted in MLwIN 2.25 24 via the runmlwin 25 Stata command. The second stage of all two-stage methods was conducted in Stata 12.1. The structural equation model was fitted in Mplus 26 via R using the MplusAutomation package. 27 Full details of the syntax used to fit the models are listed in Appendix 1.
3 Results
Results for scenarios 1 (varying A schematic representation of causal associations between birth length, growth rate and BP. Performance of the simple and OLS methods displayed as relative bias, and nominal 95% coverage plotted as functions of σu0 (standard deviation of birth length) and σu1 (standard deviation of growth rate) for the effect of birth length on BP, and the effect of growth rate on BP conditional on birth length. Performance of the multilevel model (MLM(IGLS)) and MLM with re-inflated residuals (MLM(IGLS) Inflated) methods displayed as relative bias, and nominal 95% coverage plotted as functions of σu0 (standard deviation of birth length) and σu1 (standard deviation of growth rate) for the effect of birth length on BP, and the effect of growth rate on BP conditional on birth length. Performance of the structural equation model (SEM) and bivariate growth model with re-inflated residuals (BVM (IGLS) Inf) methods displayed as relative bias, and nominal 95% coverage plotted as functions of σu0 (standard deviation of birth length) and σu1 (standard deviation of growth rate) for the effect of birth length on BP, and the effect of growth rate on BP conditional on birth length.



If a model were to perfectly recover the estimates of interest, we would expect 0% relative bias which did not fluctuate with changes in the experimental values. Similarly, we would also expect nominal coverage to be approximately 95% across the range of experimental values used.
3.1 Two-stage simple and OLS methods
For the association between birth length and BP, both the simple and OLS methods are biased towards the null, and the simple method demonstrates more bias than the OLS method. The simple method has 18% nominal coverage rising to 60% as
The conditional growth rate/BP association shows less relative bias than the birth length/BP association for both methods. The simple method is modestly biased towards the null, with the bias reducing as
A similar pattern of results (OLS more favourable than simple methods) can be seen for scenarios 3 (varying
3.2 Two-stage MLMs
Whilst the same MLM is estimated for both two-stage analyses, the difference in results between using shrunken or re-inflated residuals is substantial. For the association between birth length and BP, the use of shrunken residuals results in bias away from the null, with increasing bias as
The converse is demonstrated for the effect of growth rate. As expected, shrunken residuals are unbiased for changes in
A similar pattern of results (inflation unbiased for birth length, shrunken residuals unbiased for growth rate) is seen for scenarios 3 (
3.3 Joint models of growth and disease
Estimates from the bivariate MLM are unbiased (less than 1% relative bias) with respect to estimates of the birth length–BP association and the growth rate–BP association, as expected. The SEM method demonstrates unbiased results for the BP birth length association, and a small but persistent bias of 1% towards the null for the growth rate BP association in scenario 1, and a larger bias (3.5%) towards the null when
A similar pattern of results is seen for scenarios 3 (
4 Discussion
We have shown algebraically that the two-stage process using a MLM to estimate growth parameters and then relating these to the distal outcome in a second stage will give unbiased estimates of the conditional associations between both growth parameters and outcome. Our simulations confirmed this, and also showed that using the same process to estimate the unconditional association between birth length and outcome leads to bias. We have shown that the two-stage bivariate MLM (with re-inflation) is unbiased in all the scenarios investigated, although with under-coverage of confidence intervals. We have also demonstrated that SEM produces a small bias towards the null in the estimation of the association between growth rate and BP (due to the different distribution of individual ages at birth compared to the other time-points). The simple and OLS two-stage methods result in biased estimates of the association between BP and birth length, and less biased estimates of the association between BP and growth rate, conditional on birth length.
The simple two-stage method illustrated substantial bias in the presence of measurement error, and underperformed in comparison to the OLS method with regards to estimates of the effect of birth length on BP. The OLS method demonstrated considerable bias with regards to the estimates of birth length on BP and nominal coverage was also poor. The MLM two-stage approach with inflated residuals demonstrated the least bias with regards to the association of birth length with BP, and considerably outperformed the use of shrunken residuals, which was nearly as biased (although in the opposite direction) as the OLS method. Therefore, the two-stage method of choice when investigating the effects of birth length on BP would be the MLM with inflated residuals. However, the process of re-inflating residuals is not unique, and the transformation does not necessarily preserve the relationship between the empirical residual and the outcome of interest. Using the lower triangular matrices of the Cholesky decomposition during re-inflation results in inflated residuals for birth length which are a simple linear transformation of the uninflated residuals. However, using the upper triangular Cholesky decomposition results in inflated residuals for birth length which are a linear combination of the uninflated residuals for growth rate and those for birth length and so does not result in unbiased associations.
In terms of the association of BP with growth rate (conditional on birth length), the most biased two-stage method was the MLM with inflated residuals, which biases results towards the null. The bias occurs because the association between growth rate and BP is distorted during the inflation process, which is a complex transformation that depends on the shrunken residuals of both birth length and growth rate. The simple method also illustrates biases towards the null, but to a lesser extent than those with inflated residuals. The OLS method illustrates biases worse than the simple method under some circumstances, despite the intuitive incorporation of all relevant data. However, the two-stage method using the MLM with shrunken residuals led to unbiased results, since the consistent shrinkage of both birth length and growth rate residuals preserve the association between BP and growth rate conditional on birth length. Nominal coverage is preserved at expected levels by the inflated standard errors due to the reduced residual variance.
The bivariate growth model, which simultaneously generates growth and BP residuals, which are in turn re-inflated, leads to unbiased results in all scenarios. However, this method results in under-coverage of 5% in scenarios 1 and 2. The unbiased result and less than optimal coverage needs to be balanced against the minor bias yielded by SEM method (in this example, due to unbalanced data), and the full propagation of uncertainty and correct 95% nominal coverage, or the computationally intensive nature of the non-parametric bootstrap, which fully incorporates the uncertainty from the growth model.
Approaches for tackling this problem have been suggested, in the context of the measurement error. It has been noted that using parameters from a linear MLM (the ‘RC’ method) results in unbiased conditional effect estimates when the model relating exposures to outcome is linear. 13 The same paper noted that the individual regression method resulted in biased estimates of the conditional effects. However, for non-linear models for the outcome (e.g. a logistic model for a binary distal outcome), this method remains biased (although with reduced bias compared to the individual regression method), and alternatives have been proposed. 28 Given the difficulties in estimating joint models with a binary outcome, more research is needed into the size of the bias when using the two-stage multilevel approach in the non-linear case, and the ease of implementation of alternatives.
5 Future work
Whilst this simulation study highlights how variation in specific parts of the data-generating process affect the estimation of the effect of either birth length or growth rate on BP, we have not explored a full factorial experimental design, therefore combinations of unfavourable scenarios may result in unacceptable bias and poor coverage. Additionally, we have only explored these effects when the number of observations is the same in each individual, and not explored the consequences of imbalance and missing data for the simple and OLS methods. We briefly explored the consequences of equalising the size of the effects between growth parameters and BP, and found similar associations, i.e. biases in birth length and BP association were greater than biases in the growth rate and BP associations (results available from authors upon request). Furthermore, we did not vary the direction and magnitude of the association between growth parameters and BP and therefore we are unable to explore the potential reversal paradox described by Tu and colleagues. 16 Similarly, we did not investigate the effect of population size or frequency of measurement in relation to the observed biases, and the stability of the six methods with small numbers of individuals and/or infrequent measurements may be different to that seen here.
We did not examine the effect of violating model assumptions. In particular, whilst growth (and change in other anthropometric variables) is often linear over short periods of time, non-linear models will generally be required for examining change over longer periods. More complex data-generating processes could also be considered, for example by creating an interaction between birth length and growth rate in their association with the outcome. SEM or path analyses could be used to examine the mediation of the association between birth length and BP by growth rate.
6 Conclusions
The joint modelling approach which takes into account and incorporates the variation of growth process into the estimation of effects on subsequent outcomes is clearly the preferred method, giving unbiased estimates of both the conditional and unconditional associations of birth length and growth with BP. Given the requirement for specialist software and the greater technical difficulty in fitting the joint models, this option may not be viable for all researchers without specialist training. An alternative would be to use the two-stage MLM approach to estimate conditional associations, where an experienced analyst can derive the residuals, and less-experienced researchers can use them as exposures in standard regression models. Where measurement error in the repeated outcome is low, this approach may result in little bias even for unconditional associations.
This simulation study could change the interpretation of previously reported null findings, as many of the methods commonly used result in biases towards the null and poor nominal coverage. Thus, reanalysis with more suitable methods may reduce both the type I error rate and the heterogeneity in the current literature.
Footnotes
Acknowledgement
We would like to thank Professor Harvey Goldstein for helpful discussions of this topic.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: AS, ADACS, JH, MSG, FS and KT were supported by the Medical Research Council [grant number G1000726]. FS was also supported by the ESRC [grant number RES-576-25-0032]. CM-W is supported by a Fellowship from the Medical Research Council (MR/J011932). ADACS, CM-W and KT work in a Unit that receives funding from the UK Medical Research Council and the University of Bristol (MC_UU_12013/5).
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
