Abstract
In recent years, research on the incumbency effect using a regression discontinuity design has flourished. Although the regression discontinuity design has allowed scholars to examine the incumbency effect in various electoral settings, previous studies have not measured what has traditionally been defined as the incumbency (dis)advantage. In this paper, I bring together methods from previous research, provide a consistent exposition thereof, and highlight some of the challenges of estimation and interpretation by applying these methods to election data from 10 different electoral settings.
Introduction
In previous research, scholars have sought to overcome the methodological problems of measuring the effect of incumbency status on election outcomes (e.g., Ansolabehere, Snyder and Stewart, 2000; Erikson, 1971; Gelman and King, 1990; Lee, 2008; Levitt, 1994). However, the study of incumbency effects is complex because factors that affect incumbency status are also correlated with election outcomes. Regression discontinuity (RD) design offers a solution to some of the methodological problems associated with measuring the incumbency effect (Lee, 2008). If close elections are at least partly determined by chance, an RD design can be utilized to estimate the effect of winning an election on the outcome of the subsequent election.
Many recent studies have used RD design to estimate the incumbency effect in various electoral settings. The results of these studies are consistent with those of previous literature on the incumbency advantage in the United States (e.g., Ansolabehere and Snyder, 2002; Cox and Katz, 1996; Erikson, 1971; Gelman and King, 1990; Levitt and Wolfram, 1997), and report a similar size of incumbency advantage as that in the US (e.g., Lee, 2008; Erikson and Titiunik, 2015; Fowler, 2016). Outside of the US, researchers have also found an incumbency advantage in developed democracies (Ade, Freier and Odendahl, 2014; Eggers and Spirling, 2017; Hainmueller and Kern, 2008; Horiuchi and Leigh, 2009; Kendall and Rekkas, 2012; Redmond and Regan, 2015), while a growing body of literature has found no advantage or even a disadvantage of incumbency, particularly in developing democracies (Ariga, 2015; Ariga et al., 2016; Klašnja, 2015; Klašnja and Titiunik, 2017; Lee, 2016; Macdonald, 2014; Uppal, 2009).
Previous studies have adopted two approaches to measuring the incumbency effect using an RD design, the first of which is to estimate the incumbency effect at the party level (Ade, Freier and Odendahl, 2014; Ariga et al., 2016; Eggers and Spirling, 2017; Klašnja, 2015; Klašnja and Titiunik, 2017; Macdonald, 2014). Party-level RD analysis measures the effect of winning an election on a party’s vote share or probability of winning in the next election but to do so, it is necessary to define a “reference party.” In the US, for example, the Democratic or Republican parties could be used as such. The outcome variable is then defined as the Democratic (or Republican) share of the two-party vote or a dummy variable for the party’s victory. The running variable is defined as the Democratic (or Republican) share of the two-party vote in the previous election minus 0.5 (Lee, 2008). The treatment variable, winning an election, is defined as a dummy variable that takes the value 1 if the running variable exceeds 0.
In a multi-party system, the reference party is typically the most competitive party, which fields a large number of candidates across many elections: for example, the Liberal Democratic party in Japan (Ariga et al., 2016). Alternatively, a reference party can be defined as a “generic” incumbent party (Klašnja, 2015; Klašnja and Titiunik, 2017). For instance, Klašnja and Titiunik (2017) define an incumbent party for each municipality in Brazil as the party that wins an election at time t − 1. They measure the effect of winning an election at time t on the election outcome at time t + 1 for these incumbent parties.
However, the limitation of party-level analysis is that the RD design does not consider the possibility of a reference party being hurt by the incumbency status of other parties as much as it is helped by having an incumbent. In previous studies, the treatment variable has typically been coded as 1 if the reference party wins an election, and 0 otherwise. Therefore, the control group contains units in which a reference party faces an incumbent from other parties. As previous studies demonstrate (Erikson and Titiunik, 2015; Fowler and Hall, 2014), if almost all incumbents seek reelection and the incumbency status has, on average, the same effect for all parties, the party-level RD estimate would count the personal incumbency advantage twice. In contrast, in countries where only a small fraction of incumbents seek reelection, the RD estimate would underestimate the size of the personal incumbency (dis)advantage, because a large number of units in the treatment group contain open races.
The second approach is to analyze the incumbency effect at the candidate level (Ariga, 2015; De Magalhaes, 2015; Linden, 2004; Redmond and Regan, 2015; Uppal, 2009; Lee, 2016), one advantage of which is the applicability of this approach to countries with unstable party systems, where party switching is pervasive and/or candidates often run as independents. However, this approach can be problematic as we cannot observe the outcome for candidates who do not run in the next election. While some studies simply exclude such candidates from the sample (e.g., Uppal, 2009; Lee, 2016), doing so would result in a biased estimate if candidates’ retirement decisions are affected by their incumbency status and the expected outcome of the future election.
One potential solution to this issue is to measure the effect of winning an election on the probability of winning the subsequent election unconditional on running (Ariga et al., 2016; De Magalhaes, 2015; Redmond and Regan, 2015). In other words, the outcome variable is defined as a dummy variable that takes the value “1” if a candidate runs in and wins the next election, and “0” otherwise. However, this analysis is problematic in treating all candidates who do not rerun as losers. Therefore, the unconditional RD estimand does not inform us whether the estimated effect is, in fact, caused by a candidate’s incumbency status or simply because incumbents are more (or less) likely to rerun than losers.
In this paper, I bring together methods from previous applied papers in economics and political science, provide a consistent exposition thereof, and highlight some of the challenges of estimation and interpretation by applying these methods to election data from 10 different electoral settings. 1 First, for the party-level analysis, I discuss how to measure the personal incumbency effect using winning a close election as an instrument for incumbency status. Second, to address the sample selection issue of the candidate-level RD design, following Lee (2009) and Anagol and Fujiwara (2016), I estimate bounds on the average treatment effect of winning, conditional on running in an election. The results suggest that the estimated effects of incumbency critically depend on how the dependent variable is defined. Since the RD design estimates the effect of winning an election on the outcome of the subsequent election, it is important to treat candidates’ rerunning decisions carefully.
Party-level analysis
I first discuss how to measure the incumbency effect at the party level by formulating the problem in the potential outcome framework, also called the Rubin Causal Model (e.g., Imbens and Rubin 2015). Let Wd be a dummy variable indicating whether a reference party won in district d at time t. Let VMd be the running variable, the vote margin of the reference party in district d at time t. Since the RD design estimates the effect of winning a close election, all results are conditional on VMd = 0. For simplicity, I suppress this condition.
The treatment of interest, denoted by Id, is defined as the reference party’s incumbency status in district d at time t + 1, which is “1” if an incumbent from the reference party runs at t − 1, if the incumbent running at t + 1 is from another party, and “0” otherwise. The reference party (or non-reference party) will be assigned to treatment if it wins an election at time t. However, it will receive treatment—that is, have an incumbent running in the election at time t + 1—only if the incumbent runs again. Following the Rubin Causal Model’s convention, I will call incumbents who run at t + 1 compliers (C) and those who do not noncompliers (N). Note that the incumbency status Id depends on the election outcome at time t, Wd. If all winners at time t run at t + 1, we would have Id = 2Wd – 1. In other words, if the reference party wins at time t + 1, its incumbent will run at time t + 1, and if it loses it will face an incumbent from another party. Otherwise, in some districts where a reference party has won (lost) at t, Id can take a value of 1 (–1) or 0, depending on whether or not the incumbent decides to rerun.
Previous studies (e.g., Lee 2008) have focused on the effect of winning an election on the election outcome at t + 1, rather than the effect of having an incumbent in the election. If the usual assumptions of RD design are valid, the intention-to-treat (ITT) analysis can estimate the causal relationship between winning an election at t and the election outcome at t + 1. 2 However, as Fowler and Hall (2014) notes, the ITT analysis can estimate neither the personal nor the partisan incumbency effect.
To see this, let Id(0) and Id(1) denote the potential treatment a district would receive if assigned to Wd = 0 and Wd = 1, respectively. For each district, we can only observe one of the two potential treatments, Id = WdId(1) + (1 − Wd)Id(0). Since the incumbent party may not field a candidate in an election, there are four possible values of (Id(1), Id(0)): (1,−1), (1,0), (0,−1), and (0,0). 3 I assume that noncompliance is one-sided: A party that lost at time t cannot have an incumbent at t + 1. I relax this assumption in Online Appendix C. We can classify each unit into one of the four groups based on the potential treatment:
Let Yd be the outcome of interest. The realized outcome, Yd(Wd,Id(Wd)), which depends on Id as well as Wd, can take one of the following four values:
We can write the ITT effect as
Under certain assumptions, we can use the assignment, Wd, as our instrument for the treatment, Id, and estimate the causal effect of personal incumbency on the election outcome using a fuzzy RD design (see also Fowler, 2016). To do so, we must first assume that the effect of incumbency is the same for the reference party and other parties.
In other words, the reference party is hurt by the incumbency status of other parties as much as it is helped by having its own incumbent running. This assumption is routinely made in the literature on the incumbency advantage in the USA (e.g., Ansolabehere and Snyder, 2002; Erikson and Titiunik, 2015; Levitt and Wolfram, 1997). It is plausible in countries like the USA, which has a stable two-party system, but the incumbency status may have heterogeneous effects for different parties in other countries. For instance, if incumbents have advantages over other candidates because of what they can do for their constituents in office, incumbents from minor parties may have a smaller advantage because their ability to do what their constituents want is limited. To estimate the personal incumbency effect using a fuzzy RD design, the following additional assumption must be made regarding Gd = NN.
Assumption 1b states that either (a) in districts where no incumbent would run, winning itself has no effect on the election outcome or (b) the proportion of districts where no incumbent from any party would run is negligible. Note that the second part of this assumption does not imply that there will be no open race, because Gd denotes latent groups. The first part of the assumption, which has been characterized in previous literature (e.g., Erikson and Titiunik, 2015; Fowler, 2016) as a no “partisan” incumbency effect assumption, comprises an exclusion restriction because it rules out a direct effect of assignment on the election outcome. This is a key identifying assumption for a fuzzy RD design. It should be noted that assumption 1a implicitly presupposes that the partisan incumbency effect is also negligible for other latent groups, because party incumbency changes whenever the reference party wins or loses.
The exclusion restriction is a strong assumption and its applicability should be carefully examined in different electoral settings. Previous studies on US elections demonstrate that the advantage of having a retiring incumbent is negligible (Broockman, 2009; Butler and Butler, 2006; Fowler, 2016), thus suggesting no partisan incumbency advantage. However, the partisan incumbency effect may be non-zero in other electoral contexts. Kendall and Rekkas (2012) suggest that the partisan incumbency has a zero or negative effect in the Canadian parliamentary elections. In Brazilian mayoral elections, parties are hurt by having a term-limited incumbent (Klašnja and Titiunik, 2017), suggesting a “partisan incumbency disadvantage.” Similarly, Fowler and Hall (2014) suggest that the partisan incumbency effect can be negative in term-limited contexts, while Eggers (2017) provides a theoretical reason for the partisan incumbency disadvantage in the presence of term limits. If we allow the existence of a partisan incumbency effect, an RD design alone, fuzzy or sharp, cannot provide an unbiased estimate for either a partisan or a personal incumbency effect. See Online Appendix D for details.
If assumptions 1a and 1b hold, the incumbency effect consists entirely of a personal incumbency effect. However, the ITT estimate would almost always either exaggerate or depreciate the personal incumbency effect. To observe this, we can rewrite the ITT effect as
where
In the remaining section, I demonstrate how the fuzzy RD estimates for incumbency effects can diverge from the ITT estimate. Table 1 presents the percentage of districts where the winners of time t run again at time t + 1. The first two columns show the percentages for the full sample, while the remaining two columns show the percentages for the RD sample. The return rate of incumbents is the highest in the US House elections, where more than 80 percent of incumbents seek reelection. In contrast, in Indian Lok Sabha elections (national parliamentary elections), fewer than 40 percent of winners at time t reran at time t + 1. Figure 1 shows the RD estimate of µ in equation (3). 5
Percent of constituents where incumbents run for reelection.

Effect of winning an election on the incumbency status of the reference party.
Figure 2 presents the estimated effects of incumbency on election outcomes. The dependent variable for Figure 2(a) is a dummy variable indicating whether the reference party wins at time t + 1. In Figure 2(b), the dependent variable is the reference party’s vote margin divided by 2. The ITT estimates demonstrate that winning an election has a positive effect on the outcome of the next election in the US, Canada, and the United Kingdom, consistent with previous studies (e.g., Ansolabehere and Snyder, 2002; Eggers and Spirling, 2017; Kendall and Rekkas, 2012). The fuzzy RD estimate diverges substantially from the ITT estimate in the US House elections: the fuzzy RD estimate is approximately 10 percent smaller than the ITT estimate. The difference, however, is much smaller in Canadian and UK elections, because the first stage RD estimates are close to 1 (1.125 and 1.132 for Canada and the UK, respectively). As equation (3) demonstrates, the two estimates will be similar if µ is close to 1.

RD estimates of the incumbency effect (party-level analysis).
In contrast, where the ITT estimate is negative, the first stage RD estimate tends to be smaller than 1, leading to the underestimation of the size of the personal incumbency effect. As shown in Figure 2, the estimated effects of incumbency on the probability of winning and the vote margin are negative in South Korea and India. The estimates are statistically significant, except in the Indian Lok Sabha when a dummy for winning is used as a dependent variable, in which case the estimate is only marginally significant. In all of these elections, the size of the ITT estimate is smaller than the size of the fuzzy RD estimate, although the difference is not as great as in the US House elections.
Finally, the ITT estimate is not statistically different from 0 in Japanese House elections and Romanian mayoral elections. 6 The results suggest that the incumbency effects in these countries are likely to be zero unless the partisan and personal incumbency effects have different signs. If the incumbency effects, personal and partisan, are 0, the fuzzy RD estimate would not differ from the ITT estimate, because both are zero.
To summarize, the analyses in this section demonstrate that the fuzzy RD estimate diverges from the ITT estimate if (a) the incumbency effect is not 0 and (b) the estimated effect of winning on µ is close to 0 or 2. Among the cases analyzed in this section, the US House elections and, to a lesser extent, the South Korean and Indian legislative elections, fit these two criteria. If the exclusion restriction assumption holds, a sharp RD design would lead to over- or underestimation of the personal incumbency effect. Moreover, the ITT analysis tends to underestimate the size of the incumbency disadvantage because fewer than half of the incumbents seek reelection; however, more data are required to ascertain whether this pattern can be generalized to countries where incumbency has a negative effect on election outcomes.
Candidate-level analysis
I now turn to the candidate-level analysis. As previously mentioned, the unconditional RD analysis measures the effect of winning an election on electoral success in the subsequent election, treating all candidates who do not rerun as losers. Therefore, it does not tell us whether the effect is driven by the incumbency or by the fact that losing candidates find it more difficult to rerun. Unless all winners and runners-up of close elections decide to rerun, or the candidacy is unrelated to the future election outcome, the unconditional RD estimate would differ from the conditional RD estimate, which measures the effect of candidates’ incumbency status on election outcomes.
Table 2 presents the percentage of candidates rerunning in the next election. 7 As shown in the table, winners are more likely than losers to run in the subsequent election. 8 For instance, in Canada, a developed Western democracy, incumbents are 2.5–3.7 times more likely than runners-up to rerun in the next election. The difference in rerunning rate between incumbents and losers tends to be smaller in developing democracies, but winners are approximately 15–30 percent more likely to rerun.
Percent of rerunning candidates.
Although it is straightforward to estimate the unconditional RD estimate, estimating the conditional effect of winning is more complicated, because candidates’ decisions to rerun are affected by incumbency status, as well as by the prospect of future election. Furthermore, the candidate’s performance in the next election (i.e., how the candidate would do if he or she ran) is not observed. Following Lee (2009) and Anagol and Fujiwara (2016), I estimate bounds on the effect of winning an election on the probability of winning the next election conditional on running.
9
Let Wi be a dummy variable for our treatment, indicating whether candidate i won an election at time t. For each unit, let Ri(0) and Ri(1) be the potential outcome variables, indicating whether a candidate runs at time t + 1 when Wi = 0 and Wi = 1, respectively. Similarly, let Yi(0) and Yi(1) be our potential outcome variables. We can only observe
Let VMi be the vote margin of candidate i at time t, the running variable. The unconditional RD analysis estimates
To estimate the bounds, I first classify each candidate into one of the following four latent groups (Gi): “always-takers (AT),” those who always run; “never-takers (NT),” those who never run; “compliers (CO),” those who run only if they have won the previous election; and “defiers (DF),” those who run only if they have lost the previous election. I assume that there are no defiers: if a candidate who lost an election decides to rerun, she would also rerun had she won the election.
Under this assumption, we can write the conditional effect of winning, the incumbency (dis)advantage, as follows (for simplicity, I suppress the conditioning on VMi = 0)
where (i) is
We can obtain the upper bound by assuming
Figure 3 presents the results. 11 The RD estimate of the unconditional effect of winning is positive and statistically significant for Canada but negative in all other countries, although the negative effect is statistically significant only for Indian State Assembly elections. Interestingly, in countries where the unconditional RD estimate is not positive, it is outside, or very close to, the upper bound. As Figure 3 shows, the estimates are located on the right side of the bounds except for Canada. Note that the upper bound is calculated under the very restrictive assumption that a complier who lost at time t would never win the next election had she chosen to run. Therefore, the conditional RD effect of winning is likely to be strictly smaller than the upper bound. Note also that the estimates for the lower bound are negative and statistically significant in Brazil, India, and Korea, suggesting that the unconditional estimate is likely to underestimate the size of the incumbency effect in these countries.

Conditional/unconditional effect of winning (candidate-level analysis).
In Figure 4, I repeat the analysis of (b) in Figure 3, assuming that the probability that a complier who lost at time t would win at time t + 1 is only one-half the probability that an incumbent who reran would win at time t + 1, i.e.,

RD estimates of the incumbency effect (candidate-level analysis).
As expected, the figure shows that the unconditional RD estimate exceeds the conditional RD estimate. To ascertain why the unconditional RD will underestimate the size of the incumbency disadvantage, let θ denote
To summarize, the results suggest that the unconditional RD analysis may lead to overestimation of the incumbency advantage and underestimation of the size of the incumbency disadvantage. However, it should be noted that while we can estimate the bounds on the conditional incumbency effect, they may fail to provide conclusive evidence of an incumbency effect. Among the cases analyzed in this section, in three of four elections where the estimates for the lower bound were negative and statistically significant, the estimates for the upper bound failed to reach statistical significance.
Conclusion
In this paper, I have discussed how one can (or cannot) estimate the incumbency effect using an RD design. At the party level, I demonstrated that, based on some assumptions, we can recover the personal incumbency effect using a fuzzy RD design. At the candidate level, following Anagol and Fujiwara (2016) and Lee (2009), I provided bounds on the conditional effect of winning, the personal incumbency effect. The results suggest that it is important to recognize what RD analysis can and cannot estimate. As some recent studies suggest (Erikson and Titiunik, 2015; Fowler and Hall, 2014; Sekhon and Titiunik, 2012), a quasi- or natural experimental design like an RD design does not always recover the quantity that is of theoretical interest. Furthermore, the validity of an RD design to estimate the incumbency effect relies on some key assumptions that should be carefully examined in each electoral setting.
It should be emphasized, however, that I do not claim that the previous RD estimates are of little theoretical importance. They tell us that winning parties or candidates tend to win or lose in the next election, which may be due to what incumbents do in office (e.g., constituency service or rent-seeking in the case of incumbency disadvantage) or parties’ candidate nomination processes that affect whether incumbents can easily seek reelection. For instance, suppose that major parties in a country are very successful at weeding out unpopular incumbents. It would then be plausible that the party-level RD analysis may find a zero or positive effect of winning even when there is, on average, a personal incumbency disadvantage. In contrast, it is also possible that parties in a country tend to nominate incumbents who are loyal to the parties’ bosses but not to the voters. In this case, we may find that winning has an adverse effect on parties when, in fact, these incumbents would enjoy a personal electoral advantage if allowed to run. Therefore, studying whether/how the personal incumbency effect diverges from the previous RD estimates would allow us to better understand the sources of incumbency (dis)advantage and shed light on the electoral politics of developed and developing democracies.
Supplemental Material
online_appendix_RAP – Supplemental material for Estimating Incumbency Effects Using Regression Discontinuity Design
Supplemental material, online_appendix_RAP for Estimating Incumbency Effects Using Regression Discontinuity Design by B. K. Song in Research & Politics
Footnotes
Acknowledgements
I thank the anonymous reviewers and the associate editor Andy Eggers for their helpful comments and constructive suggestions.
Declaration of conflicting interest
The authors declare that there is no conflict of interest.
Funding
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Supplemental materials
The replication files are available at https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/JSOWUR&version=DRAFT.
Notes
Carnegie Corporation of New York Grant
This publication was made possible (in part) by a grant from the Carnegie Corporation of New York. The statements made and views expressed are solely the responsibility of the author.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
