Sage Journals: Discover world-class research

Abstract

In psychological research, longitudinal study designs are often used to examine the effects of a naturally observed predictor (i.e., treatment) on an outcome over time. But causal inference of longitudinal data in the presence of time-varying confounding is notoriously challenging. In this tutorial, we introduce g-estimation, a well-established estimation strategy from the causal inference literature. G-estimation is a powerful analytic tool designed to handle time-varying confounding variables affected by treatment. We offer step-by-step guidance on implementing the g-estimation method using standard parametric regression functions familiar to psychological researchers and commonly available in statistical software. To facilitate hands-on usage, we provide software code at each step using the open-source statistical software R. All the R code presented in this tutorial are publicly available online.

Keywords

causal inference doubly robust estimation posttreatment confounding propensity scores treatment-dependent confounding treatment-induced confounding

Longitudinal studies are often used to investigate causal questions. In psychological research, many naturally observed predictors of interest (referred to hereafter as “treatments”) are time-varying. For example, researchers might be interested in examining how consuming coffee at various times of the day influences sleep quality at night, how repeatedly being ostracized at work affects job performance, or how social media use over time can contribute to depression. Questions such as these share the commonality that the treatments change over time. As discussed in depth in the existing public health and medical sciences literature, a central challenge of causal inferences about time-varying treatments is “time-varying confounding” (Daniel et al., 2013; Hernán & Robins, 2008; Keogh et al., 2017; Mansournia et al., 2017). It is well recognized that routine methods for handling confounding are inadequate under such settings and lead to incorrect causal conclusions (Bray et al., 2006; Moodie & Stephens, 2010; Rosenbaum, 1984b).¹

How should researchers handle time-varying confounding? G-estimation offers a solution. G-estimation is a powerful statistical tool designed to deal with time-varying confounding (Robins, 1997; Vansteelandt & Joffe, 2014; Vansteelandt & Sjölander, 2016). Despite being a well-established and widely implemented analytic technique in public health and medical sciences, it has only recently been introduced to the psychological literature (Loh & Ren, 2023). In this tutorial, we provide concrete guidance on g-estimation to make this method accessible to psychological researchers. We provide software code at each step using the open-source statistical software R (R Core Team, 2021) to facilitate hands-on usage. All the steps implemented in this tutorial are publicly available online.² Additional references for interested readers are provided throughout the tutorial.

A Brief Introduction of G-Estimation

G-estimation is a well-established causal inferential-based technique for evaluating the effects of a time-varying treatment in the presence of time-varying or treatment-dependent confounding. It is among a broad class of so-called g-methods (where the “g” stands for “generalized”) developed by James Robins, with deep roots in causal inference research, and is widely used in biostatistics, epidemiology, and medical sciences to assess time-varying treatment effects in longitudinal data (for introductions to g-methods, see, e.g., Hernán and Robins, 2020, Part III; Naimi et al., 2016).

Three desirable features motivate the use of g-estimation. First, no models for the (distribution of the) covariates are required, so covariates can be either continuous or noncontinuous and be either time-invariant or time-varying (and thus possibly affected by treatment). Second, unbiased estimation requires correctly specifying a model for the treatment or the outcome but not necessarily both. Finally, it is relatively easy to implement using standard parametric regression functions; these are familiar to psychological researchers and commonly available in statistical software. We exploit these advantageous features to present a tutorial on g-estimation.

Establishing the Causal Effects of Interest

Minimal illustrating example

To illustrate the steps of the g-estimation method, we use a hypothetical example investigating coffee consumption on mood over the course of a day. Researchers may posit fatigue as a confounder of the causal effect of coffee consumption because fatigue affects the decision to consume coffee, and fatigue is likely to be correlated with mood because of hidden common causes. Crucially, fatigue fluctuates over the course of a day, making it a time-varying confounder that should be recorded at each time point.

Now, suppose that researchers prompted participants to take a short survey at three time points during the day: 10 a.m. (t = 1), 1 p.m. (t = 2), and 4 p.m. (t = 3). At each time point t, participants reported whether they were drinking coffee (binary treatment X) and their mood (continuous outcome Y) and fatigue (time-varying confounder L) during the past 3 hr since the previous survey at t−1. Therefore, coffee drinking (treatment) recorded at each survey t was causally and temporally preceded by mood (outcome) and fatigue (confounder) during the previous survey at t−1.

The underlying data-generating process for this example is readily visualized using a “causal diagram” (C. Glymour, 2001; Greenland et al., 1999; Pearl, 2009), such as the one depicted in Figure 1.³ Subscripts index the time t that a variable is recorded.⁴ The treatment (coffee), outcome (mood), and time-varying confounder (fatigue) at each time t is denoted by $X_{t}$ , $Y_{t}$ , and $L_{t}$ , respectively.⁵ Note that within each survey, $L_{t}$ and $Y_{t}$ are assumed to causally influence $X_{t}$ (e.g., fatigue and mood before 10 a.m. can influence coffee consumption at 10 a.m.). Critically, $L_{t}$ and $Y_{t}$ are permitted to be affected by the previous treatment $X_{t - 1}$ (e.g., fatigue and mood after 10 a.m. can be affected by coffee consumption at 10 a.m.). The node U represents a set of unmeasured time-invariant common causes of the confounder and outcome that induces a correlation between $L_{t}, t = 1, 2$ and $Y_{s}, s = 1, 2, 3$ . For example, the autocorrelation between repeated measurements of mood can be induced by an underlying factor U. For simplicity, we denote measured baseline time-invariant covariates collectively by C.

Fig. 1.

Causal diagram for example with three time points. A circular node denotes the (set of) unmeasured variable(s) U. For visual clarity, arrows emanating from U are drawn in gray, and the causal paths corresponding to the treatment effects are drawn as dashed arrows. Descriptions of the treatment effects are provided in the main text. Note that the treatment at Time 3 ( $X_{3}$ ) is excluded from the causal diagram and analysis because—as is common in most longitudinal designs—only the outcome at the last time point ( $Y_{3}$ ) is used to allow time for the causal effect of treatment on the outcome to transpire. For visual simplicity, edges emanating from the baseline time-invariant covariate(s) C to the time-varying variables are reduced and truncated in the causal diagram. This causal diagram using the DAGitty environment (Textor et al., 2017) is available on http://dagitty.net/mGp5szK.

Defining the causal effects of interest

In this example, we focus on three causal effects of interest: the Lag 1 effect of $X_{1}$ on $Y_{2}$ , the Lag 1 effect of $X_{2}$ on $Y_{3}$ , and the Lag 2 effect of $X_{1}$ on $Y_{3}$ . The definitions of these causal effects are codified by a linear structural nested mean model (SNMM; Robins, 1994, 1997; Robins et al., 2000; Vansteelandt & Joffe, 2014). A linear SNMM describes the conditional average difference in potential outcomes from comparing treatment sequences in which the treatment at each time is hypothetically manipulated rather than being allowed to progress naturally over time, as in the observed data.⁶ In this tutorial, we assume a simple additive SNMM in which the parameters encode causal effects that do not depend on (i.e., are not moderated or modified by) any other variable.⁷

Continuing our example, our focus is on the causal parameters: (a) $ψ_{21}$ that encodes the Lag 1 effect of $X_{1}$ on $Y_{2}$ , (b) $ψ_{32}$ that encodes the lag 1 effect of $X_{2}$ on $Y_{3}$ , and (c) $ψ_{31}$ that encodes the Lag 2 (total) effect of $X_{1}$ on $Y_{3}$ that does not intersect $X_{2}$ . To be precise, $ψ_{31}$ encodes the conditional average effect on $Y_{3}$ from manipulating $X_{1}$ from 1 (treated) to 0 (control) while holding the later treatment $X_{2}$ fixed at a reference level 0 (control) for all participants. Conceptually, $ψ_{31}$ can be interpreted as the effect of treatment at Time 1 on the outcome at Time 3, assuming that treatment did not occur at Time 2. Continuing our hypothetical example, does drinking coffee at 10 a.m. influence mood at 4 p.m., assuming no coffee is consumed at 1 p.m.? Concretely, using standard path-tracing rules (Pearl, 2009; Wright, 1934), it is the combined effect of $X_{1}$ on $Y_{3}$ along the causal paths $X_{1} \to L_{2} \to Y_{3}$ , $X_{1} \to Y_{2} \to Y_{3}$ , and $X_{1} \to Y_{3}$ in the causal diagram in Figure 1. Broken arrows indicate the causal paths corresponding to these treatment effects in the causal diagram above.

For illustration, the true parameter values are

We generate a data set and display the first few rows of the data set below⁸:

Selecting pretreatment covariates for confounding adjustment

Before attempting to estimate each causal effect of interest, researchers must first consider putative confounding variables. This warrants selecting a set of pretreatment covariates that suffice to eliminate all noncausal associations when adjusted or statistically controlled for (Imbens & Rubin, 2015; Rosenbaum, 1984a). Extensive discussions of pretreatment confounding adjustment for causal inference have been presented elsewhere (Hernán & Robins, 2020; Morgan & Winship, 2015; Pearl, 2009; Steiner et al., 2010).

In our motivating example, an adjustment set that is sufficient to block all noncausal paths with $X_{1}$ and either $Y_{2}$ or $Y_{3}$ as their end points is $C_{1} = {C, L_{1}, Y_{1}}$ . Likewise, the effect of $X_{2}$ on $Y_{3}$ is unconfounded given the adjustment set $C_{2} = {C, L_{1}, Y_{1}, X_{1}, L_{2}, Y_{2}}$ . It is crucial to note that the covariates in each adjustment set temporally precede a focal treatment. For example, $C_{2}$ contains pretreatment covariates relative to $X_{2}$ , notwithstanding $L_{2}$ and $Y_{2}$ occurring after $X_{1}$ . This assumption of “no unmeasured confounding” implies that when there is no causal effect of treatment $X_{t}$ on outcome $Y_{s}, s > t$ , it can then be assumed that $X_{t}$ and $Y_{s}$ would be conditionally independent given the selected covariates in $C_{t}$ . Within the causal diagram in Figure 1, this assumption is represented by the absence of unmeasured common causes of treatment ( $X_{1}$ or $X_{2}$ ) and outcome ( $Y_{1}$ , $Y_{2}$ , or $Y_{3}$ ) and the absence of arrows between U and treatment ( $X_{1}$ or $X_{2}$ ). In this article, we assume that all confounders of the effect of each treatment $X_{t}$ are known and precisely measured, allowing us to estimate the causal effects of interest consistently.⁹ Because this unconfoundedness assumption can be difficult to defend in practice, we describe a sensitivity analysis to unmeasured confounding in a later section.

A simplified g-estimation procedure

For ease of understanding, we first present a simplified version of the g-estimation procedure. It uses only regression models for the outcome given treatment and the pretreatment confounders. In a later section, we describe an enhanced version of g-estimation that is robust to biases from certain forms of incorrectly specified models and likely to be better applied in practice. Estimation proceeds by focusing on each outcome measurement in turn and sequentially estimating the treatment effects on that outcome. In particular, estimation follows a sequence starting with the latest treatment and working backward in time to the earliest treatment.

We briefly explain the motivation for such a reverse temporal ordering. The outcome $Y_{3}$ can be considered conceptually as the result of accumulating the effects of treatments $X_{1}$ and $X_{2}$ under the posited causal effects $ψ_{31}$ and $ψ_{32}$ , respectively. However, it is impossible to unbiasedly estimate $ψ_{31}$ in the presence of the time-varying confounders $L_{2}$ and $Y_{2}$ . This precludes estimating both causal effects of $X_{1}$ and $X_{2}$ on $Y_{3}$ using a single regression model. As we demonstrate next, in a multiple linear regression of $Y_{3}$ on both treatments $(X_{1}, X_{2})$ , while controlling for the covariates $(C, L_{1}, Y_{1}, L_{2}, Y_{2})$ , the coefficient estimator of $X_{1}$ will be biased for $ψ_{31}$ —even when no unmeasured confounding holds—because of time-varying confounding.¹⁰ Now consider the following thought experiment. Suppose instead that $X_{2}$ can be assumed to have no causal effect on $Y_{3}$ . Then the effect of $X_{1}$ on $Y_{3}$ (that does not intersect $X_{2}$ ) can be estimated by appropriately adjusting for only pretreatment instances of all variables before $X_{1}$ and simply not adjusting for any variable causally affected by $X_{1}$ .

To achieve this, rather than consider the observed outcome $Y_{3}$ , we seek to unveil the counterfactual outcome absent the later treatment $X_{2}$ . To simulate this counterfactual outcome, we will “peel off” or “blip down” (Robins, 1997; Vansteelandt & Joffe, 2014) the treatment effect of $X_{2}$ by subtracting $ψ_{32} X_{2}$ from $Y_{3}$ for each individual. Removing the structural variation in $Y_{3}$ caused by the “blip” of treatment $X_{2}$ renders a counterfactual quantity devoid of the causal effect exerted by $X_{2}$ (on average). Conceptually, this step can be viewed as removing the arrow from $X_{2}$ on $Y_{3}$ in the causal diagram so that now no causal paths from $X_{1}$ on $Y_{3}$ intersect $X_{2}$ . The resulting counterfactual quantity, denoted by $R_{3, - x 2}$ , represents a transformed outcome whose remaining structural covariation with $X_{1}$ (conditional on the pretreatment variables and assuming no unmeasured confounding) can be attributed to the causal effect of $X_{1}$ .¹¹ We can then use $R_{3, - x 2}$ in place of $Y_{3}$ to estimate the effect of $X_{1}$ on $Y_{3}$ (not via $X_{2}$ ). Crucially, only pretreatment confounders (e.g., $C, L_{1}, Y_{1}$ ) are adjusted for while excluding all posttreatment variables (e.g., $L_{2}, Y_{2}, X_{2}$ ). Hence, this motivates first estimating the effect of $X_{2}$ on $Y_{3}$ and then estimating the effect of $X_{1}$ on $Y_{3}$ .

Now we describe each step of the estimation procedure below:

1. First, we estimate the Lag 1 effect of $X_{2}$ on $Y_{3}$ , parametrized by $ψ_{32}$ . Fit a linear regression model for $Y_{3}$ on $X_{2}$ and include the pretreatment covariates in $C_{2}$ . The g-estimator of $ψ_{32}$ is the ordinary least squares (OLS) estimator of the coefficient of $X_{2}$ .

2. We can thus use the estimate of $ψ_{32}$ , denoted by ${\hat{ψ}}_{32}$ , from the previous step to remove the causal effect of $X_{2}$ on the outcome $Y_{3}$ . Subtract the Lag 1 effect of $X_{2}$ from $Y_{3}$ as

R_{3, - x 2} = Y_{3} - {\hat{ψ}}_{32} X_{2} .

Now let’s inspect the first few rows of the data set to ensure the new counterfactual quantity has been successfully created.

We can now use $R_{3, - x 2}$ to estimate the Lag 2 effect of $X_{1}$ on $Y_{3}$ , which is defined as excluding any causal effect via $X_{2}$ .

3. Fit a linear regression model for $R_{3, - x 2}$ on $X_{1}$ and include the pretreatment covariates in $C_{1}$ . The g-estimator of $ψ_{31}$ is the OLS estimator of the coefficient of X₁.

Finally, we estimate the Lag 1 effect of $X_{1}$ on $Y_{2}$ , parametrized by $ψ_{21}$ . Fit a linear regression model for $Y_{2}$ on $X_{1}$ and include the covariates in $C_{1}$ . Note that this is the same outcome model for $R_{3, - x 2}$ as in the previous step. The g-estimator of $ψ_{21}$ is the OLS estimator of the coefficient of $X_{1}$ .

Having carried out all the steps above, we can now inspect the estimates of the three causal effects below. The estimates were almost identical to the true values, with differences only at the second decimal place.

Finally, note that if we merely used the coefficient of $X_{1}$ in the regression model for $Y_{3}$ (in Step 1) as an estimator of $ψ_{31}$ , it would be biased, as shown below:

G-Estimation Using Lavaan

We illustrated the steps of a simplified g-estimation procedure using the linear regression functionality (lm) in R. We now describe how to carry out the above procedure using the R package lavaan (Rosseel, 2012). There are two critical advantages conferred by using lavaan for g-estimation. First, regression models for outcomes at multiple times can be evaluated simultaneously within the same lavaan model. This renders a more compact and coherent presentation of the estimation procedure. Second, equality constraints on the (regression) model parameters can be easily implemented during estimation. We provide examples of why such constraints are advantageous later.

1. First, we estimate both Lag 1 effects $ψ_{21}$ and $ψ_{32}$ simultaneously. We do this by fitting both linear regression models for $Y_{3}$ and $Y_{2}$ (from Steps 1 and 4 above) in the same joint model.

The g-estimators of $ψ_{21}$ and $ψ_{32}$ are then the estimators of the coefficients of $X_{1}$ on $Y_{2}$ and of $X_{2}$ on $Y_{3}$ , respectively.

2. Similar to Step 2 above, remove the Lag 1 effect of $X_{2}$ from $Y_{3}$ as

R_{3, - x 2} = Y_{3} - {\hat{ψ}}_{32} X_{2} .

3. Similar to Step 3 above, fit a model for $R_{3, - x 2}$ on $X_{1}$ and include the covariates in $C_{1}$ . The g-estimator of $ψ_{31}$ is the estimator of the coefficient of $X_{1}$ .

Standard errors and confidence intervals can be estimated using a nonparametric percentile bootstrap procedure (Efron & Tibshirani, 1994) that randomly resamples individuals with replacement and then repeats all the above steps for each bootstrap sample. We advise against merely using the standard errors for $ψ_{31}$ from the fitted model in Step 3 alone because they do not adequately capture the variability in the estimates ${\hat{ψ}}_{32}$ used to determine the transformed outcome in Step 2 and may lead to biased statistical inferences.

Benefits of using constraints in lavaan

A major advantage of using lavaan for g-estimation is the ease with which parameter constraints can be introduced in the lavaan model syntax. We describe two examples relevant to g-estimation. Suppose researchers postulate a constant Lag 1 (i.e., stationary or time-invariant) effect that is equal across all treatment time points (i.e., $ψ_{32} = ψ_{21}$ ). Then they merely need to add a simple line of code psi21 == psi32 to the model syntax in model_lag1 to ensure that the estimates of $ψ_{32}$ and $ψ_{21}$ are constrained to be exactly equal to each other. Alternatively, they may specify a model with the same Lag 1 effect by using the same label for the coefficients of $X_{1}$ on $Y_{2}$ and $X_{2}$ on $Y_{3}$ .

A fundamental yet empirically untestable assumption for valid causal inference is the absence of unmeasured treatment-outcome confounding. This assumption can be challenging to justify defensibly in practice, thus warranting a sensitivity analysis. Because unmeasured confounding can manifest in correlated treatment and outcome residual errors, we propose a sensitivity analysis for unmeasured confounding. We briefly summarize the steps. First, we fix the residual correlations at a given value. Next, we estimate the treatment effects under this fixed constraint. Both steps are then repeated using different fixed values of the residual correlations. This lets us systematically investigate how the effect estimates change depending on the given strength of unmeasured confounding.¹² Stronger correlations indicate more severe violations of the unconfoundedness assumptions, whereas a zero correlation corresponds to the assumption of no unmeasured confounding.

The correlations can be readily parametrized in lavaan as follows:

A sensitivity analysis then entails fixing the residual correlations to different (nonzero) values in turn (e.g., by replacing rho in the model syntax above by a value between –1 and 1) and then carrying out the g-estimation procedure for each fixed value to gauge how different the effect estimates can be.¹³

Doubly Robust G-Estimation

In the simplified version presented in the preceding sections, estimation was carried out using only outcome models given treatment and the pretreatment confounders. A crucial shortcoming of such an approach is that unbiased estimation is predicated on correctly specifying the statistical relations between the outcome and the covariates.¹⁴ Hence, the estimators can be (severely) biased because of incorrectly specified outcome models. To illustrate the biases, we generated a new data set with the same values of the treatment effects but with outcomes that depended on nonlinear functions of the covariates. Using the simplified estimation procedure (i.e., with outcome models that depended linearly on main effects of the covariates) thus yielded biased estimates, as shown below:

To offer protection from this shortcoming, the estimation procedure can be supplemented with a model for the distribution of treatment to endow the g-estimators with a doubly robust property (Hernán & Robins, 2020; Kang & Schafer, 2007; Robins, 2000; Vansteelandt & Daniel, 2014). The g-estimators are unbiased when both the treatment model and the outcome model are correctly specified and consistent when either model is correctly specified (in addition to a correctly specified SNMM and unconfoundedness being satisfied). No knowledge of which model is correctly specified is required. For example (as we have done here), suppose that the outcome was generated, unbeknownst to the researcher, based on nonlinear or complex functions of the covariates. But an incorrectly specified outcome model with only main effects of the covariates is assumed for estimation. A correctly specified treatment model can ensure that the probability of the empirical estimate getting closer to the true value converges to 1 as sample size increases, thus reducing the reliance on correctly specifying the outcome model for valid statistical inference (Vansteelandt & Joffe, 2014).¹⁵ We now describe how to carry out the doubly robust g-estimation procedure:

1. Fit a model for $X_{1}$ given the confounders in $C_{1}$ and a model for $X_{2}$ given the confounders in $C_{2}$ . Because treatment is binary in our simulated data set, we use a logistic regression model for each treatment.¹⁶ This model for the treatment given the covariates is commonly termed a “propensity score” (PS; Rosenbaum & Rubin, 1983) model.

2. Calculate the predicted treatment or PS given the fitted models above.

Now let’s inspect the data set’s first few rows to ensure the predicted treatments have been successfully created.

3. Carry out the steps in the previous section but now include the predicted treatment or PS as an additional predictor in each outcome model. For example, the model for estimating the Lag 1 effects is:

Likewise, the model for the Lag 2 effect is

In this article, we presented a simple example of g-estimation as described in Hernán and Robins (2020, Section 21.3) and Vansteelandt and Sjölander (2016). We focused on leveraging lavaan (Rosseel, 2012) to implement g-estimation. In doing so, we hope to make g-estimation more accessible to psychology researchers who are already familiar with the features and functionalities of the widely used lavaan platform. Readers interested in alternative g-estimation procedures designed to deal with a broad variety of research settings and using other software packages or platforms as further points of investigation are referred to Picciotto and Neophytou (2016), Sterne and Tilling (2002), Tompsett et al. (2022), and Wodtke (2018); for closely related approaches, although framed slightly differently, see Ertefaie et al. (2021) and Simoneau et al. (2018). For example, g-estimation has been extended to noncontinuous outcomes, such as binary outcomes (Dukes & Vansteelandt, 2018) and time-to-event data with censored outcomes (Seaman et al., 2021; Vansteelandt and Sjölander, 2016).

Summary

Time-varying (or treatment-dependent) confounding poses unique challenges to valid causal inferences using longitudinal data. G-estimation of linear SNMMs provides a well-established solution to this problem. In this tutorial, using a linear and simple SNMM, we demonstrated how to implement g-estimation using standard parametric regression functions familiar to psychological researchers. The resulting g-estimators have a double robustness property that ensures valid inferences even under certain model misspecifications. In summary, g-estimation is flexible, practical, robust, and relatively straightforward. We encourage psychology researchers to employ g-estimation when testing the causal effects of a time-varying treatment in longitudinal studies.

Footnotes

Transparency

Action Editor: Yasemin Kisbu-Sakarya

Editor: David A. Sbarra

Author Contribution(s)

Wen Wei Loh: Conceptualization; Formal analysis; Investigation; Methodology; Software; Writing – original draft; Writing – review & editing.

Dongning Ren: Conceptualization; Formal analysis; Investigation; Methodology; Writing – original draft; Writing – review & editing.

ORCID iD

Wen Wei Loh

Notes

References

Acharya

Blackwell

Sen

(2016). Explaining causal findings without bias: Detecting and assessing direct effects. American Political Science Review, 110(3), 512–529. https://doi.org/10.1017/S0003055416000216

Barendse

Rosseel

(2020). Multilevel modeling in the ‘wide format’ approach with discrete data: A solution for small cluster sizes. Structural Equation Modeling: A Multidisciplinary Journal, 27(5), 696–721. https://doi.org/10.1080/10705511.2019.1689366

Bray

B. C.

Almirall

Zimmerman

R. S.

Lynam

Murphy

S. A.

(2006). Assessing the total effect of time-varying predictors in prevention research. Prevention Science, 7(1), 1–17. https://doi.org/10.1007/s11121-005-0023-0

Daniel

R. M.

Cousens

De Stavola

Kenward

M. G.

Sterne

J. A. C.

(2013). Methods for dealing with time-dependent confounding. Statistics in Medicine, 32(9), 1584–1618. https://doi.org/10.1002/sim.5686

Dowle

Srinivasan

(2021). Data.table: Extension of ‘data.frame’ [R package version 1.14.2]. https://CRAN.R-project.org/package=data.table

Dukes

Vansteelandt

(2018). A note on g-estimation of causal risk ratios. American Journal of Epidemiology, 187(5), 1079–1084. https://doi.org/10.1093/aje/kwx347

Efron

Tibshirani

R. J.

(1994). An introduction to the bootstrap. Chapman & Hall / CRC. https://doi.org/10.1007/978-1-4899-4541-9

Elwert

(2013). Graphical causal models. In Morgan

S. L.

(Ed.), Handbook of causal analysis for social research (pp. 245–273). Springer Netherlands. https://doi.org/10.1007/978-94-007-6094-3_13

Ertefaie

McKay

J. R.

Oslin

Strawderman

R. L.

(2021). Robust q-learning. Journal of the American Statistical Association, 116(533), 368–381. https://doi.org/10.1080/01621459.2020.1753522

10.

Funk

M. J.

Westreich

Wiesen

Stürmer

Brookhart

M. A.

Davidian

(2011). Doubly robust estimation of causal effects. American Journal of Epidemiology, 173(7), 761–767. https://doi.org/10.1093/aje/kwq439

11.

Ghosh

Tan

(2022). Doubly robust semiparametric inference using regularized calibrated estimation with high-dimensional data. Bernoulli, 28(3), 1675–1703. https://doi.org/10.3150/21-BEJ1378

12.

Glymour

(2001). The mind’s arrows: Bayes nets and graphical causal models in psychology. The MIT Press. https://doi.org/10.7551/mitpress/4638.001.0001

13.

Glymour

M. M.

(2006). Using causal diagrams to understand common problems in social epidemiology. In Oakes

J. M.

Kaufman

J. S.

(Eds.), Methods in social epidemiology (pp. 393–428). Jossey-Bass/Wiley.

14.

Glynn

A. N.

Quinn

K. M.

(2010). An introduction to the augmented inverse propensity weighted estimator. Political Analysis, 18(1), 36–56. https://doi.org/10.1093/pan/mpp036

15.

Greenland

Pearl

Robins

J. M.

(1999). Causal diagrams for epidemiologic research. Epidemiology, 10(1), 37–48.

16.

Grosz

M. P.

Rohrer

J. M.

Thoemmes

(2020). The taboo against explicit causal inference in nonexperimental psychology. Perspectives on Psychological Science, 15(5), 1243–1255. https://doi.org/10.1177/1745691620921521.

17.

Harring

J. R.

McNeish

D. M.

Hancock

G. R.

(2017). Using phantom variables in structural equation modeling to assess model sensitivity to external misspecification. Psychological Methods, 22(4), 616–631. https://doi.org/10.1037/met0000103

18.

Hernán

M. A.

Robins

J. M.

(2008). Estimation of the causal effects of time-varying exposures. In Fitzmaurice

Davidian

Verbeke

Molenberghs

(Eds.), Longitudinal data analysis (pp. 567–614). Chapman & Hall / CRC.

19.

Hernán

M. A.

Robins

J. M.

(2020). Causal inference: What if. Chapman & Hall / CRC.

20.

Imai

Keele

Tingley

(2010). A general approach to causal mediation analysis. Psychological Methods, 15(4), 309–334. https://doi.org/10.1037/a0020761

21.

Imbens

G. W.

Rubin

D. B.

(2015). Causal inference in statistics, social, and biomedical sciences. Cambridge University Press.

22.

Kang

J. D.

Schafer

J. L.

(2007). Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science, 22(4), 523–539.

23.

Keogh

R. H.

Daniel

R. M.

VanderWeele

T. J.

Vansteelandt

(2017). Analysis of longitudinal studies with repeated outcome measures: Adjusting for time-dependent confounding using conventional methods. American Journal of Epidemiology, 187(5), 1085–1092. https://doi.org/10.1093/aje/kwx311

24.

Kisbu-Sakarya

MacKinnon

D. P.

Valente

M. J.

Çetinkaya

(2020). Causal mediation analysis in the presence of post-treatment confounding variables: A Monte Carlo simulation study. Frontiers in Psychology, 11, Article 2067. https://doi.org/10.3389/fpsyg.2020.02067

25.

Kurz

C. F.

(2022). Augmented inverse probability weighting and the double robustness property. Medical Decision Making, 42(2), 156–167. https://doi.org/10.1177/0272989X211027181

26.

Liu

Kuramoto

S. J.

Stuart

E. A.

(2013). An introduction to sensitivity analysis for unobserved confounding in nonexperimental prevention research. Prevention Science, 14(6), 570–580.

27.

Loh

W. W.

Moerkerke

Loeys

Poppe

Crombez

Vansteelandt

(2020). Estimation of controlled direct effects in longitudinal mediation analyses with latent variables in randomised studies. Multivariate Behavioral Research, 55(5), 763–785. https://doi.org/10.1080/00273171.2019.1681251

28.

Loh

W. W.

Ren

(2023). Estimating time-varying treatment effects in longitudinal studies. Psychological Methods. Advance online publication. https://doi.org/10.1037/met0000574

29.

Mansournia

M. A.

Etminan

Danaei

Kaufman

J. S.

Collins

(2017). Handling time varying confounding in observational research. The BMJ, 359, Article j4587. https://doi.org/10.1136/bmj.j4587

30.

Moodie

E. E. M.

Stephens

D. A.

(2010). Using directed acyclic graphs to detect limitations of traditional regression in longitudinal studies. International Journal of Public Health, 55(6), 701–703. https://doi.org/10.1007/s00038-010-0184-x

31.

Morgan

S. L.

Winship

(2015). Counterfactuals and causal inference. Cambridge University Press.

32.

Naimi

A. I.

Cole

S. R.

Kennedy

E. H.

(2016). An introduction to g methods. International Journal of Epidemiology, 46(2), 756–762. https://doi.org/10.1093/ije/dyw323

33.

Naimi

A. I.

Mishler

A. E.

Kennedy

E. H.

(2021). Challenges in obtaining valid causal effect estimates with machine learning algorithms. American Journal of Epidemiology. Advance online publication. https://doi.org/10.1093/aje/kwab201

34.

Pearl

(2009). Causality: Models, reasoning and inference (2nd ed.). Cambridge University Press. https://doi.org/10.1017/CBO9780511803161

35.

Pearl

Glymour

Jewell

N. P.

(2016). Causal inference in statistics: A primer. John Wiley & Sons.

36.

Picciotto

Neophytou

A. M.

(2016). G-estimation of structural nested models: Recent applications in two subfields of epidemiology. Current Epidemiology Reports, 3(3), 242–251. https://doi.org/10.1007/s40471-016-0081-9

37.

R Core Team. (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/

38.

Robins

J. M.

(1994). Correcting for non-compliance in randomized trials using structural nested mean models. Communications in Statistics: Theory and Methods, 23(8), 2379–2412. https://doi.org/10.1080/03610929408831393

39.

Robins

J. M.

(1997). Causal inference from complex longitudinal data. In Berkane

(Ed.), Latent variable modeling and applications to causality (pp. 69–117). Springer.

40.

Robins

J. M.

(2000). Robust estimation in sequentially ignorable missing data and causal inference models. Proceedings of the American Statistical Association, 1999, 6–10.

41.

Robins

J. M.

Rotnitzky

Scharfstein

D. O.

(2000). Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. In Halloran

M. E.

Berry

(Eds.), Statistical models in epidemiology, the environment, and clinical trials. The IMA volumes in mathematics and its applications (Vol. 116, pp. 1–94). Springer.

42.

Robins

J. M.

Sued

Lei-Gomez

Rotnitzky

(2007). Comment: Performance of double-robust estimators when “inverse probability” weights are highly variable. Statistical Science, 22(4), 544–559. https://doi.org/10.1214/07-STS227D

43.

Rohrer

J. M.

(2018). Thinking clearly about correlations and causation: Graphical causal models for observational data. Advances in Methods and Practices in Psychological Science, 1(1), 27–42. https://doi.org/10.1177/2515245917745629

44.

Rosenbaum

P. R.

(1984a). From association to causation in observational studies: The role of tests of strongly ignorable treatment assignment. Journal of the American Statistical Association, 79(385), 41–48. https://doi.org/10.1080/01621459.1984.10477060

45.

Rosenbaum

P. R.

(1984b). The consequences of adjustment for a concomitant variable that has been affected by the treatment. Journal of the Royal Statistical Society A: General, 147(5), 656–666. https://doi.org/10.2307/2981697

46.

Rosenbaum

P. R.

Rubin

D. B.

(1983). The central role of the propensity score in observational studies for causal effects. Biometrika, 70(1), 41–55.

47.

Rosseel

(2012). Lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48(2), 1–36. https://doi.org/10.18637/jss.v048.i02

48.

Rubin

D. B.

(1986). Comment: Which ifs have causal answers. Journal of the American Statistical Association, 81(396), 961–962.

49.

Seaman

S. R.

Keogh

R. H.

Dukes

Vansteelandt

(2021). Using generalized linear models to implement g-estimation for survival data with time-varying confounding. Statistics in Medicine, 40(16), 3779–3790. https://doi.org/10.1002/sim.8997

50.

Simoneau

Moodie

E. E. M.

Platt

R. W.

Chakraborty

(2018). Non-regular inference for dynamic weighted ordinary least squares: Understanding the impact of solid food intake in infancy on childhood weight. Biostatistics, 19(2), 233–246. https://doi.org/10.1093/biostatistics/kxx035

51.

Steiner

P. M.

Cook

T. D.

Shadish

W. R.

Clark

M. H.

(2010). The importance of covariate selection in controlling for selection bias in observational studies. Psychological Methods, 15(3), 250–267. https://doi.org/10.1037/a0018719

52.

Sterne

J. A. C.

Tilling

(2002). G-estimation of causal effects, allowing for time-varying confounding. The Stata Journal, 2(2), 164–182. https://doi.org/10.1177/1536867X0200200205

53.

Textor

van der Zander

Gilthorpe

M. S.

Liśkiewicz

Ellison

G. T.

(2017). Robust causal inference using directed acyclic graphs: The R package ‘dagitty.’ International Journal of Epidemiology, 45(6), 1887–1894. https://doi.org/10.1093/ije/dyw341

54.

Tompsett

Vansteelandt

Dukes

De Stavola

(2022). Gesttools: General purpose g-estimation for end of study or time-varying outcomes [R package version 1.3.0]. https://CRAN.R-project.org/package=gesttools

55.

VanderWeele

T. J.

(2019). Principles of confounder selection. European Journal of Epidemiology, 34(3), 211–219. https://doi.org/10.1007/s10654-019-00494-6

56.

Vansteelandt

Daniel

R. M.

(2014). On regression adjustment for the propensity score. Statistics in Medicine, 33(23), 4053–4072. https://doi.org/10.1002/sim.6207

57.

Vansteelandt

Joffe

(2014). Structural nested models and G-estimation: The partially realized promise. Statistical Science, 29(4), 707–731. https://doi.org/10.1214/14-STS493

58.

Vansteelandt

Sjölander

(2016). Revisiting G-estimation of the effect of a time-varying exposure subject to time-varying confounding. Epidemiologic Methods, 5(1), 37–56. https://doi.org/10.1515/em-2015-0005

59.

Wickham

Vaughan

Girlich

(2023). Tidyr: Tidy messy data. https://tidyr.tidyverse.org; https://github.com/tidyverse/tidyr

60.

Wodtke

G. T.

(2018). Regression-based adjustment for time-varying confounders. Sociological Methods & Research, 49(4), 906–946. https://doi.org/10.1177/0049124118769087

61.

Wright

(1934). The method of path coefficients. The Annals of Mathematical Statistics, 5(3), 161–215. https://doi.org/10.1214/aoms/1177732676

A Tutorial on Causal Inference in Longitudinal Data With Time-Varying Confounding Using G-Estimation

Abstract

Keywords

A Brief Introduction of G-Estimation

Establishing the Causal Effects of Interest

Minimal illustrating example

Defining the causal effects of interest

Selecting pretreatment covariates for confounding adjustment

A simplified g-estimation procedure

G-Estimation Using Lavaan

Benefits of using constraints in lavaan

Doubly Robust G-Estimation

Summary

Footnotes

Transparency

ORCID iD

Notes

References