Abstract
Many studies estimate the impact of exposure to some quasiexperimental policy or event using a panel event study design. These models, as a generalized extension of “difference-in-differences” designs or two-way fixed-effects models, allow for dynamic leads and lags to the event of interest to be estimated, while also controlling for fixed factors (often) by area and time. In this article, we discuss the setup of the panel event study design in a range of situations and lay out several practical considerations for its estimation. We describe a command,
1 Introduction
Recent developments in quasiexperimental methods have brought increasing attention to panel event study models. When one uses data covering a panel of observations (such as states) over time, the design seeks to estimate the impact of some event that occurs, or “switches on” in certain units and certain time periods. 1 These models seek to use as counterfactuals the areas in which the policy or event does not occur or has not yet occurred. By considering the variation in outcomes around the adoption of the event compared with a baseline reference period, one can estimate both event leads and lags, which allows for a clear visual representation of the event’s causal impact provided that key identifying assumptions are met.
These methods have been borne out of older difference-in-differences (DD) designs, or two-way fixed-effects models. These models often seek to examine the impact of natural experiments, where events are assigned to certain units due to some process beyond the control of the analyst but owing to environmental or political factors (among others), and thus, generally do not assume that assignment is random. Indeed, as we lay out at more length in the following section, the key assumption underlying consistent estimation in event study models is that the occurrence of the event in a particular area is not systematically related to the changes in levels that would have occurred in the future in the absence of the event.
These models are widely used in empirical analyses in a range of contexts, having been applied to (among many others themes) automotive plant closures and opioid overdoses (Venkataramani et al. 2020), family planning access and childhood economic circumstance (Bailey, Malkova, and McLaren Forthcoming), healthcare reform and ambulatory care usage (Dimitrovová, Perelman, and Serrano-Alarcón 2020), and university reform and intergenerational mobility (Suhonen and Karhunen 2019). These cases suggest use across a range of fields, including social sciences, medicine and public health, and additional reviews of their frequency of use in several economic journals are provided in Abraham and Sun (2018); Roth (2019). A burgeoning literature has laid out several identification requirements in this setting (Freyaldenhoven, Hansen, and Shapiro 2019; Borusyak and Jaravel 2018; Abraham and Sun 2018; Athey and Imbens Forthcoming; Schmidheiny and Siegloch 2019). These methods can be used, with some restrictions, both in cases where events occur at the same time period in each unit and in cases where the adoption of events is staggered. Indeed, Athey and Imbens (Forthcoming) refer to these as “staggered adoption designs”, although here we follow the more common nomenclature of panel event studies. 2 Additionally, these methods are related to a much broader literature on staggered adoption of policies and the estimation of a single-coefficient model (de Chaisemartin and D’Haultfoeuille 2019; Callaway and Sant’Anna 2018; Goodman-Bacon 2018). While we briefly discuss these models in the methods section, our principal interest is on full panel-event study specifications that come with their own considerations.
In this article, we discuss these panel-event study models and practical issues related to their estimation and to inference in these settings. We also present the
2 Methods
2.1 Estimation
Consider a panel covering a group, indexed as g and time periods t. We are interested in estimating the impact of the passage of an event that may occur at different times in different groups. We will denote as Event g a variable recording the time period t in which the event is adopted in group g. Denoting the outcome of interest as ygt , we can write the panel event study specification as 4
Here µg and λt are group and time fixed effects, X gt are (optionally) time-varying controls, and εgt is an unobserved error term. In (1), leads and lags to the event of interest are defined as follows:
Leads and lags are thus binary variables indicating that the given group was a given number of periods away from the event of interest in the respective time period. J and K leads and lags are included, respectively, and, as indicated in (2) and (5), final leads and lags “accumulate” leads or lags beyond J and K periods. A single lead or lag variable is omitted to capture the baseline difference between groups where the event does and does not occur. In (1), as standard, this baseline omitted case is the first lead (one period prior to the reform), where j = 1.
A stylized example of such a setting is provided in table 1. We consider four groups forming a balanced panel of years from 2000–2009. The Event g variable occurs at different times in different groups and, in the case of one group, does not occur. Here both four leads and four lags are included, such that J = K = 4. Lead and Lag 4 (exclusively) are switched on for periods in which the “Time to event” exceeds 4 leads or lags, respectively.
A stylized example
Groups in which the event never occurs (such as Group C in table 1) act as pure controls. These units have 0s in all lead and lag terms and act as the counterfactual on which the estimation of impacts is based. Differences between these pure control groups and groups which adopt the event of interest are anchored at 0 in the omitted base period, that is, the first lead in (1). Hence, leads and lags capture the difference between treated and control groups, compared with the prevailing difference in the omitted base period. Unbiased estimation of postevent treatment effects thus relies fundamentally on the so called parallel trends assumption. In the absence of treatment, it is assumed that treated and control groups would have maintained similar differences as in the baseline period. Thus, these models have been demonstrated to be underidentified, or identified only up to a linear trend, when all units adopt treatment at some point in time (Schmidheiny and Siegloch 2019; Borusyak and Jaravel 2018). Schmidheiny and Siegloch (2019) show that in this case, it is necessary to bin leads and lags beyond certain maximum lead (J) and lag (K) periods.
The panel event study is an extension of the standard two-way fixed-effects (sometimes called DD) model, where a single “Post event” indicator is included for all periods posterior to the occurrence of the event in treated groups. This is simply
where following the notation from (2)–(5), Post event
gt
=
A developing literature, including articles by de Chaisemartin and D’Haultfoeuille (2019), Callaway and Sant’Anna (2018), and Goodman-Bacon (2018), point to challenges in interpreting the estimated
2.2 Inference
A standard inference concern where policies are assigned by some group such as a state, and outcomes are followed over time within these groups, is related to potential serial-correlation in the outcome variable over time (Bertrand, Duflo, and Mullainathan 2004). While the derivations from Bertrand, Duflo, and Mullainathan (2004) are based on single-coefficient models of the form of (6), the crux of the concern relates to high serial correlation in the outcome variable of interest, and relatively little change in the independent variables of interest. This setting is replicated in event study models described in (1)–(5). It is thus fundamental to account for this within-cluster correlation when conducting inference in such models.
The standard solution is to allow for within-cluster auto-correlation by using a cluster–robust variance–covariance estimator (CRVE) to estimate standard errors and CIs on regression parameters. Such an estimator is provided as standard in Stata by specifying the
In practice, knowing how many clusters is “too few” depends on several factors. While rules of thumb such as the rule of 42 are laid out in Angrist and Pischke (2009), who suggest that standard clustering provides a good approximation if G ≥ 42 clusters, the performance of these methods under simulation has been shown to depend also on the relative size of clusters (MacKinnon and Webb 2017). A range of results surveyed in Cameron and Miller (2015) leads to their suggestion that if one is analyzing data with fewer than 50 clusters in a group-year panel (such as the case with panel event studies), alternative inference methods should be considered.
In this case where the quasiexperimental setup is based on fewer than about 50 clusters, the wild cluster bootstrap has been documented to be a successful resamplingbased method to account for autocorrelation in variables underlying panel event studies, even in cases with fewer clusters (see, for example, Cameron, Gelbach, and Miller [2008]; Cameron and Miller [2015]; Roodman et al. [2019]). This has been efficiently implemented in Stata as described in Roodman et al. (2019) and programmed for Stata as
3 The eventdd command
3.1 Syntax
Panel event studies can be implemented in Stata using the following command syntax:
The required depvar should specify the dependent variable of interest, and then indepvars should specify (where relevant) the optional controls, including fixed effects to be included in the panel event study model (1) but not including leads and lags, that should be entered in the regression.
3.2 Options
This also allows for the use of alternative labels for graph axes. By default, standard graphical output will be provided.
3.3 Stored results
Note that methods related to event study models such as that described by Rambachan and Roth (2020) rely on access to point estimates and standard errors of lead and lag terms, which are available through the matrices returned here.
3.4 Postestimation commands
Several postestimation commands are available after using the
Unless otherwise requested, these postestimation commands conduct F tests of the joint significance of parameters. However, wild clustered bootstrap versions of the joint tests can be conducted with the following options:
4 Examples based on an empirical application
We now provide several illustrations of the performance of
The specification of the baseline two-way fixed-effect DD style model of female suicide on no-fault divorce reforms used is
This is the analogue of (6) applied to this case in particular. Here
4.1 Estimation of the panel event study
To estimate a panel event study specification corresponding to the no-fault divorce reform, one first creates the standardized version of the time-to-reform variable, presuming such a variable is not already available in the data. In this case in particular, the creation of the variable in Stata simply requires subtracting the reform period, Event
s
, called Event
g
in section 2 (and
Note that as expected, missing values are generated for states in which the reform is not adopted at any point in this period and that act as pure controls in the panel event study. Below, you can see how the data are set for the first 10 observations, documenting the relationship between the absolute time period (
The second step is to estimate the event study, as per (1)–(5). In this example, the general form of the event study model, including all leads and lags available, is
where as above
The
The command stores all event leads, their lower bound, the point estimate, and their upper bound. For example, if we wish to visualize the estimates on the full set of leads, as well as their upper and lower CIs, we can simply examine the returned
Because we do not specify the estimation method in the
In the same way, we can estimate the results efficiently absorbing multiple levels of fixed effects via the
The standard command output consists of the regression output (all the above output, including the warning, comes directly from the regression estimated by
This can be simply assessed postestimation using one of the postestimation commands designed for use with
Similar such postestimation commands exist to test the joint significance of the postimplementation coefficients (

Event study example based on no-fault divorce reforms. N
This “fully saturated” model where all possible leads and lags are plotted is the default output in the
The output in this case is displayed in figure 2a. A special case of plotting limited leads/lags consists of the case in which one wishes to show only coefficients and CIs for which all states have a lead and lag term. We refer to this as a balanced plot, which can be produced quite simply using the
Given that we now restrict to only certain states based on their period of adoption (as well as nonadopting states), the lead and lag estimates will differ from those from the fully saturated model discussed previously. In the output of the above command, we observe that the estimation sample consists only of 507 observations for adopting states with balance in the indicated leads/lags, as well as states that do not adopt (versus 1,617 observations in the full-sample specification). The corresponding event study plot is presented in figure 2c, where we note that the considerable change in estimation sample (chosen simply for expositional reasons) produces quite different results.
An alternative way to work with the imbalance in standardized time periods is to stipulate that all periods beyond some specified values should be accumulated into final lead and lag points, as indicated in (2) and (5). This is implemented with the
Because these endpoints have a different interpretation to additional leads and lags, acting as an estimate of long-term impacts of the event for all periods beyond intermediate leads/lags, by default the endpoint estimates will be plotted in an alternative color. This behavior can be controlled fully using the
Finally, as discussed in section 2, the reference period for any estimated panel event study will be assumed to be the period immediately prior to the occurrence of the event in each state, unless otherwise indicated. This can be simply changed via the

Event study plots for no-fault divorce reforms: Output with alternative estimation options.N
4.2 Inference options
The previous subsection describes several alternative estimation procedures that are potentially of relevance in the estimation of a panel event study design. However, as discussed in section 2 of this article, there are several inference considerations that must be weighed when implementing a panel event study model. Until now, the command has always been implemented with
However, the

Visualizing alternative inference procedures for event study models
Finally, note that as standard,

Default event study plots with alternative CIs
4.3 Altering standard appearance

Alternative visualization options for event study CIs
These graph types can be fully controlled using suboptions within the

Event study plots no-fault divorce reforms: Appearance options
5 Conclusions
The panel event study is an increasingly frequently used tool in the applied analysts’ toolbox. It allows for the clear presentation of estimated impacts in quasiexperimental (observational) contexts, when one wishes to consider the impact of some event that occurs at (potentially) different times in different geographical areas. What’s more, while the discussion and examples provided in this article are structured around geographical clustering of events (such as the application of divorce reforms studied in Stevenson and Wolfers [2006]) and applied to demonstrate other two-way fixed-effects methods (Goodman-Bacon 2018), this setting can similarly be applied where there is the temporal arrival of some event of interest in other dimensions, such as by age or other demographic groups.
In this article, we discussed a growing literature laying out panel event study designs and introduced a flexible command,
While
7 Programs and supplemental materials
Supplemental Material, sj-zip-1-stj-10.1177_1536867X211063144 - Implementing the panel event study
Supplemental Material, sj-zip-1-stj-10.1177_1536867X211063144 for Implementing the panel event study by Damian Clarke and Kathya Tapia-Schythe in The Stata Journal
Footnotes
6 Acknowledgments
We are grateful to an anonymous referee for very useful suggestions related to command syntax and structure. The authors acknowledge the financial support of the Universidad de Santiago de Chile and the Millennium Nucleus for the Study of the Life Course and Vulnerability, funded by the Ministry of Economics of the Government of Chile.
7 Programs and supplemental materials
To install a snapshot of the corresponding software files as they existed at the time of publication of this article, type
