Abstract
I review a recent article published in this journal (Ademi and Kimya, 2023) which uses a continuous measure of democracy as the basis for a ‘fuzzy regression discontinuity design’ investigating the effect of democratization on party system polarization. I argue that their design does not qualify as a regression discontinuity design because continuous measures of democracy produced by social scientists are not used to assign treatments to country-years, and because the authors misuse the auxiliary variable recording regime transitions used in their analysis.
Introduction
In a recent article published in this journal, Ademi and Kimya (2023) argue that transitioning to democracy reduces party system polarization. They claim to identify a causal relationship on the basis of the research design they use. Specifically, they use a fuzzy regression discontinuity design, which under certain assumptions can identify causal effects. In this note I argue that Ademi and Kimya’s regression discontinuity design is invalid. Their conclusions regarding the effects of transitioning to democracy are therefore unsafe. Worse, because of errors in their treatment of a key variable, the quantities they report cannot even be interpreted as partial associations between transitioning to democracy and party system polarization.
I begin by restating Ademi and Kimya’s argument, and repeating some basic features of regression discontinuity designs. I then go on to argue that particular values of the V-Dem project’s continuous measure of electoral democracy cannot serve as a rule assigning treatments to units, and that for that reason any regression discontinuity designs based on the V-Dem project’s continuous measures of democracy are invalid. I consider additional complications created by the authors’ use of a variable measuring regime transitions and argue that this cannot serve as a measure of the treatment actually received, rather than the treatment assigned. These arguments relate to the regression discontinuity design. In a final substantive section, I suggest that the authors have misunderstood the V-Dem project’s regime transition variable and have not considered how long-term the effects of democratic transition can be.
What Ademi and Kimya argue
Ademi and Kimya argue that “having transitioned to democracy” reduces party system polarization. They give three reasons for this (4). First, parties in societies undergoing democratic transition are organizationally weak and organizationally weak parties find it difficult to sustain distinct ideological positions. Second, some democratic transitions involve intra-elite pacts which can reduce polarization. Third, transitions to democracy can involve institutional changes targeted at extreme parties (for example, bans on anti-democratic parties) and these institutional changes reduce polarization.
The key independent variable in this analysis is “having transitioned to democracy”. This is a dummy variable which has a value of one if the country is currently democratic and has transitioned to democracy in that year or at some earlier point in the period covered by the data. This variable has a value of zero if the country is not currently democratic, or if the country is currently democratic and has always been democratic during the period covered by their data. I describe the key independent variable as “having transitioned to democracy” rather than “democratic transition” because countries can continue to have a value of one for this variable for multiple elections after their transition.
This variable depends on two further variables: a measure of current levels of democracy in a country, and a measure of whether there was a transition. Current levels of democracy are measured using the V-Dem project’s
The key outcome variable, party system polarization, is measured as the dispersion of policy positions along the left-right dimension of political competition. Parties’ positions on the left-right dimension are taken from the Manifesto Project. Their main measure of dispersion is the unweighted standard deviation of party positions; in an appendix they present similar results using a vote-share weighted standard deviation. My argument applies to both versions of this measure.
The unit of analysis in their study is the election year. 1 In total, Ademi and Kimya study levels of polarization in 745 election years in 58 countries. The earliest election they study is the Swedish election of 1944; the most recent is the German election of 2021. The geographic and temporal scope of their research is limited by the countries included in the Manifesto Project data. The Manifesto Project data includes data for parties contesting both democratic and non-democratic elections. Ademi and Kimya give as examples of non-democratic elections the Turkish elections of 1983 and 1987, which followed a military coup.
Ademi and Kimya claim to be able to identify the causal effect of having transitioned to democracy on polarization. They describe their research design as a fuzzy regression discontinuity design. This involves comparing values of polarization in election years either side of a cut-off. Because they have data from multiple elections in each country, their study involves both comparisons across time within the same country and comparisons between countries.
What a regression discontinuity design involves
A regression discontinuity design is a research design used to identify the causal effect of some treatment from observational data with certain features. Per Cattaneo et al. (2019): “In the RD design, all units have a score, and a treatment is assigned to those units whose value of the score exceeds a known cutoff or threshold, and not assigned to those units whose value of the score is below the cutoff. The key feature of the design is that the probability of receiving the treatment changes abruptly at the known threshold” (3)
RD designs are of two kinds: sharp RD designs, where “all units with score equal to or greater than the cutoff actually receive the treatment, and all units with score below the cutoff fail to receive the treatment and instead receive the control condition” (ibid.), and fuzzy RD designs, where units with score equal to or greater than the cut-off may fail to receive the treatment because of imperfect compliance. In fuzzy RD designs, we distinguish between the treatment assigned and the treatment received. Units which were assigned the treatment and received it can be described as compliers. Units which never receive the treatment, whether or not they were assigned it, can be described as never takers. Units which were assigned the control condition but who take the treatment are defiers.
In Ademi and Kimya’s analysis, the V-Dem electoral democracy index,
Particular values of V-Dem measures do not assign units to treatment
A key feature of an RD design – whether sharp or fuzzy – is that values of the score, or running variable, are used to assign units to treatment. For example: a municipality gets an AKP mayor based on whether the AKP candidate’s vote total exceeds the largest other vote total (Meyersson, 2014); or government departments enrol individuals in income support schemes based on whether their income exceeds a threshold. Ademi and Kimya’s design does not have anything like this. There are no real world consequences of exceeding a particular value on the
Even if we did think that values of V-Dem scores assigned election years to treatments, we would still have to specify a cut-off where treatment assignation was discontinuous. In the example of government departments making decisions on enrollment in a benefits scheme, we can speak to bureaucrats in the department or examine written guidance to ascertain the cut-off. We cannot do that here. It is true that some members of the V-Dem team use a cut-off of 0.5 to classify regimes (Lührmann et al., 2018). However, not all members of the V-Dem team agree with this use of the data (Mechkova et al., 2017:4). Indeed, when other dichotomous measures of democracy are modelled as a function of the V-Dem project’s
Even if there were good grounds for using a cut-off of 0.5, the design would still be invalid because it fails to deal with measurement error in the data. The treatment assignment function cannot be discontinuous because the
Ultimately, a regression discontinuity design requires that “the assignment of the treatment follows a rule that is known (at least to the researcher) and hence empirically verifiable” (Cattaneo et al., 2019:2). There is no empirically verifiable rule here, only assertion. The mere desire on the part of social scientists to see a discontinuity at a particular value does not create an assignment rule. If Ademi and Kimya’s design were valid, there would be nothing to stop researchers from “discovering” new discontinuities at particular values of indices created by social scientists but which are not used as part of a rule to assign units to treatments.
“Regime transition” cannot serve as a treatment indicator
So far I have argued that Ademi and Kimya’s design cannot, on grounds of principle, serve as a regression discontinuity design. I have not, however, dealt with an additional part of their analysis, which concerns the V-Dem project’s “Instead of assuming that a… score of >0.5 determines the treatment receipt, we use a separate variable for regime transition from … V-Dem (
Unfortunately the quoted text is ambiguous in certain respects, creating the impression that particular values of the
V-Dem regime end event types and their codes.
The references to the V-Dem variable
This way of creating the treatment variable means that it is not possible to validate the assumptions of the RD design. In a fuzzy RD design it is necessary to determine the degree of compliance. This involves not just checking whether there are individuals who were assigned to the treatment but who did not receive it (“never takers”), but whether there were individuals who were assigned to the control group but took the treatment (“defiers”). Because Ademi and Kimya create their treatment variable on the basis of values of the score variable, there are no “defiers” by construction: if the value of the running variable is less than 0.5, the value of
Thus although the authors stress the advantages of using a separate variable “to identify the cases that experience democratic transition”, it turns out that particular values of the
Concerning sufficiency, there are multiple examples of democratizing regime transitions which do not lead to a positive value of demtrans. As shown in Table 1, the V-Dem records information on democratizing transitions under two different codes (9 and 11, but possibly also 8). Many of these democratic regime transitions occur at values of
I have drawn attention to specific codes used in the
These are cases where the value of the
Misuses of the regime transition variable
I have proceeded on the basis that the authors have used the
These comments all concern the technical correctness of the analysis, but there are also questions about the theoretical basis for considering a country to “have transitioned to democracy” many years after the original transition, particularly when information on transitions is partly determined by the scope of the data in the Manifesto Project. This means that countries with similar levels of democratic experience are treated differently depending on whether their transition to democracy was within the period covered by the Manifesto Project. Take as an example the comparison between Canada and Denmark. The value of the demtrans variable for Denmark in 2019 is equal to one. In some sense, this is correct: Denmark did (re-)transition to democracy following the end of German occupation. There is also some sense in carrying forward the values of this transition indicator: the effects of transition may not be felt just in the first post-transition election but in a number of elections following. However, it does not seem likely that the effects of transition would still be felt more than 70 years later. Even if parties were weakened by German occupation and felt a need to present a united front in the post-war period, these effects would not still be present by 2019. Conversely, Canada in 1945 is treated as never having undergone a democratic transition. This is because the earliest election for which there is Manifesto Project data is 1945, and because the value of
These problems show how the inclusion of an additional regime transition variable does not, in fact, “loosen the assumptions regarding the treatment” (5). The problems noted in the previous sector show why authors’ proposed design cannot be interpreted as a fuzzy regression discontinuity; the problems described in this section show that the coefficients cannot even be treated as partial associations.
Conclusion
A common structure in response letters is to identify a list of problems present in an analysis, suggest ways of correcting these problems, and show that the original finding collapses when these problems are corrected in the preferred way. Because I believe that the problems listed above are so fundamental, I am not able to follow this same structure. The authors’ conclusions regarding the impact of democratization on (unweighted) party system polarization may be correct: I do not know. I do believe strongly that the proposed use of a regression discontinuity design using V-Dem’s continuous measures of democracy misuses those measures, and that any similar proposed research designs should be strongly deprecated.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
