Sage Journals: Discover world-class research

Abstract

Protest is a low-intensity form of political conflict that can precipitate intrastate armed conflict. Data on protests should therefore be informative in systems that provide early warnings of armed conflict. However, since most protests do not escalate to armed conflict, we first need theory to inform our prediction models. We identify three theoretical explanations relating to protest-repression dynamics, political institutions and economic development as the basis for our models. Based on theory, we operationalize nine models and leverage the political Violence Early Warning System (ViEWS) to generate subnational forecasts for intrastate armed conflict in Africa. Results show that protest data substantially improves conflict incidence and onset predictions compared to baseline models that account for conflict history. Moreover, the results underline the centrality of theory for conflict forecasting: our theoretically informed protest models outperform naive models that treat all protests equally.

Keywords

armed conflict prediction protest

Introduction

Protest is a low-intensity form of political conflict that can precipitate intrastate armed conflict. Data on protests should therefore be informative in systems that provide early warnings of armed conflict. Conflict early-warning systems have become much more advanced over the past years, responding to a rapidly increasing interest from decisionmakers. Performant forecasting systems are important: they can facilitate early action to prevent violence, mitigate the consequences of armed conflict and increase public awareness.

High-quality protest data with live updating schedules are readily available (ACLED; Raleigh et al., 2010), but no studies have persuasively shown they can be useful to predict armed conflict. The main reason for this, we argue, is that most protests do not lead to armed conflict, so naively adding protest data to machine-learning models does not necessarily yield good results (see Hegre et al., 2019: for prediction of conflict incidence with protest data). To succeed in our forecasting task, we need theory to identify which protests have the potential to lead to armed conflict violence.¹ We show that simply entering protest data in a forecasting model without careful modeling of dynamics and context does not yield good predictive performance. We, therefore, build on pre-existing theory on how protests are related to armed conflict and identify three broad theoretical explanations as the basis for our forecasting models. The first underlines how protest-repression dynamics can pave the way for armed conflict. The second and third explanations focus on the political institutions and socio-economic conditions that make armed conflict more likely in the wake of protest.

We operationalize the theoretical explanations and leverage the tools in ViEWS (the political Violence Early-Warning System, Hegre et al., 2019) to evaluate whether protest models improve armed conflict onset and incidence predictions. Our empirical analysis consists of nine model specifications that capture various aspects of the theoretical arguments. We use random forest algorithms to generate predictions of state-based armed conflict at the subnational level for African countries. Models are trained on data from 1997–2016 and predict for 2017–2019.

The evidence shows that forecasting models with information on protest activity do not unequivocally improve armed conflict predictions compared to baseline models accounting for conflict history. In fact, naive protest models, which treat all protests equally, do worse than the baseline, especially for conflict onset. However, theoretically informed models that unpack protest-repression dynamics are better than both the baseline models and naive protest models. There is also ample evidence that the institutional and economic context matters for the relationship between protest and armed conflict. In contrast to previous prediction efforts with protest data, our approach improves both armed conflict onset and incidence forecasts at the subnational level. The results illustrate the importance of building armed conflict forecasting models on solid theoretical foundations.

Overall, the article contributes to peace and conflict research by marrying innovations in theory, data collection and methods for forecasting purposes. We also make several additional contributions. First, efforts to predict and forecast political violence have entered the peace research mainstream (Hegre et al., 2013; Beger, Dorff & Ward, 2016; Hegre et al., 2017; Witmer et al., 2017). Most forecasting models, however, rely on slow-moving ‘structural’ factors, such as income or political institutions, to generate forecasts. Consequently, the models can distinguish countries at risk of violence from those that are not, but they do not help identify where and when violence will break out. Identifying the location and timing of violence in high-risk countries is crucial for conflict mitigation and prevention. Our approach shows that predictive performance of both conflict onset and incidence at the local level can be improved by combining structural factors with event data (Chadefaux, 2014; Chiba & Gleditsch, 2017; Mueller & Rauh, 2018).

Second, although our primary objective is to maximize predictive performance, our approach provides insights into the usefulness of the theoretical arguments we adapt for forecasting purposes. Evaluating a model’s ability to improve prediction for unseen data can complement the ‘p-value’ framework of hypothesis testing (Ward, Greenhill & Bakke, 2010; Schrodt, 2014). Without claiming that we test theoretical arguments in any rigorous meaning of the word, our results show that theoretical explanations help improve the performance of a conflict early-warning system.

Literature review

Forecasting of armed conflict was high on the agenda in peace research in the 1960s and 1970s (e.g. Choucri, 1974). This agenda has seen a renaissance over the past ten years, along with a general surge of forecasting and machine-learning techniques in most scientific fields (see Hegre et al., 2017: for a review). The most well-known armed conflict forecasting models (Goldstone et al., 2010; Ward, Greenhill & Bakke, 2010; Hegre et al., 2013; Bowlsby et al., 2020) are set at the country-year level. They are mainly based on static variables such as income and population. A few models provide forecasts at finer geographic resolutions (Witmer et al., 2017; Hegre et al., 2019, 2021) and some at a more precise temporal scale (Ward et al., 2013; Ward & Beger, 2017; Blair & Sambanis, 2020). Although armed conflict is the prediction target that has received the most attention, relevant studies also seek to forecast coups or irregular leader changes (Bell, 2016a; Ward & Beger, 2017), unrest (Chenoweth & Ulfelder, 2017), or regime change (Morgan, Beger & Glynn, 2019).

Many studies forecast the onset of armed conflict, typically defined as the first year/month of violence in a country above a given threshold after a given number of years/months below the threshold. Other projects forecast the incidence of armed conflict, whether violence is above the threshold irrespective of violence levels in the periods just before. Modeling the onset of new conflict reveals new information but is also a more challenging task. Incidence models, moreover, allow forecasting termination of ongoing conflicts.

Models forecasting protests have been forwarded (Gurr & Lichbach, 1986; Cadena et al., 2015; Chenoweth & Ulfelder, 2017), but not many models use protests to forecast armed conflict. Some make use of event data to forecast conflict (Ward et al., 2013; Chiba & Gleditsch, 2017; Blair & Sambanis, 2020), supplementing the most used static variables. The event data going into these models include information on protests. However, the analyses do not permit distinguishing the importance of protests from other events, including historical events that we define as the dependent variable in our study. Wu & Gerber (e.g. 2018) and Ward et al. (e.g. 2013) use protest data but only in a within-country setting.

Predicting changes in conflict levels was the topic of a recent prediction competition (Hegre, Vesco & Colaresi, 2022). Several contributions suggested new predictors useful for predicting rapid changes (e.g. Hultman, Leis & Nilsson, 2022; Mueller & Rauh, 2022; Oswald & Ohrenhofer, 2022), but none of them tapped into protest data, despite the arguments reviewed below indicating strong links between protest and conflict escalation. As far as we know, only the ViEWS model (Hegre et al., 2019, 2021) makes use of the extensive protest data available in ACLED (Raleigh et al., 2010). That protest model, however, is overly simple, and we show below that it can be considerably improved. The work presented here adds significantly to these efforts by building theoretically founded prediction models.

From protest to armed conflict: Theoretical explanations

Protest can lead to armed conflict through a gradual process in which the tactics and mind-set of radical protesters turn violent or by affecting the opportunity and motivation for large-scale political violence by other armed groups (Tilly, 1978; Della Porta & Diani, 2020; Rød & Weidmann, 2022). Nevertheless, only a minority of protests precede armed conflict. Consequently, theoretical explanations identifying which protests are likely to lead to armed conflict violence are crucial for our prediction task. The theoretical explanations we discuss below help us distinguish protests that increase the risk of armed conflict from protests that do not. Before discussing them, we first define armed conflict and protest.

Our prediction target is state-based armed conflict, defined as the organized use of violence between at least two actors, one of which is the government. Our key predictor, protests, are public gatherings of a group of people opposing the government. Protests can be peaceful or involve a degree of violence, for example, property damage or clashes with government security forces. The main difference between protest and state-based armed conflict is that the latter involves systematic and organized use of armed force by the political opposition. In contrast, violence during protests is spontaneous and military organizations only play a peripheral, if any, role on the opposition side (see also Bartusevičius & Gleditsch, 2019).

Protest-repression dynamics

Protest-repression dynamics influence whether armed conflict emerges in the aftermath of protests. State repression scholars have investigated how dissent affects repression and how repression affects dissent (for a recent overview, see Demeritt, 2016). One key finding is that states respond to dissent with repression. Repressive measures take the form of civil liberties infringements (e.g. censorship, restrictions on freedom of speech or organization) or physical integrity violations (e.g. torture, imprisonment without a fair trial). However, the form and intensity of repression depend on the dissent: violent rioters and peaceful protesters elicit different government responses.

While research has uncovered a fairly robust relationship between dissent and repression (but see Ritter & Conrad, 2016), the impact of repression on dissent is more uncertain. There is empirical support for ‘almost every possible relationship between protest and repression’ (Carey, 2006: 1). Some scholars show that repression discourages dissent (White, 1993; Pierskalla, 2010), while others report that repression encourages it (Sutton, Butcher & Svensson, 2014; Hultquist, 2017). Others show that moderate-intensity repression fuels dissent while high-intensity repression deters dissent (Hibbs, 1973; Mueller, 1990). Moreover, others find that repression leads to the substitution of dissenters’ strategies (Moore, 1998) or has heterogeneous effects over time (Rasler, 1996; Finkel, 2015).

Despite the diverging findings, there are strong reasons to expect that violent, indiscriminate state repression of protests can lead to armed conflict. The theoretical mechanisms leading from protest to armed conflict violence focus on both rational and emotive processes. From a rationalist perspective, repression increases the costs of the protests (Lichbach, 1987). Even when benefits beyond those of participating in protests are high, a violent crackdown increases the uncertainty of success. Therefore, repression may convince protesters that the costs of protests are too high and that violent rebellion is the only way forward. Escalation from peaceful protests to armed conflict is likely to be gradual. Protesters expand their tactical repertoire (e.g. strikes, street demonstrations, property damage, attacks on security forces) by observing failures and successes in interaction with government actors. Similarly, the government adapts its approach depending on protesters’ tactics and the social actors engaged in protests. Della Porta & Diani (2020) show how such an adaptation process can lead to a spiral of violence and counter-violence that culminates in armed conflict.

Emotions also play an essential role. While fear of repression can demobilize citizens (Young et al., 2019), protesters are often aware of potential costs. For example, a Libyan during the Arab Spring remarked that ‘we knew that our attempts to try would be hard and bloody’ (Pearlman, 2013: 389). Emotions such as anger, revenge, contempt and distrust towards the government can lead people to join or support armed rebellion (McCullough, Kurzban & Tabak, 2013; Bartusevičius, van Leeuwen & Petersen, 2020). These emotions can be triggered by violent repression of peaceful protesters, which tends to be perceived as unfair and consequently undermines the moral legitimacy of the government (Wood, 2003; Kalyvas, 2006). They are powerful motivators of action and affect how people prioritize their interests and process information. For example, anger and revenge propel people to take larger risks and attack their opponents, increasing the risk of further violence.

Rational and affective frameworks also explain why the risk of armed conflict involving existing non-state armed groups increases during protest mobilization. When protests are large and sustained over time, they tie up resources and reduce the legitimacy of the government, which opens up opportunities for violence by existing armed challengers. When protests are violently repressed, armed groups can argue that peaceful tactics are doomed to fail. Protests can also increase the motivation for violence by armed non-state actors. Government challengers, such as protest movements and rebel groups, are in competition with each other over local resources and power. When protesters openly confront the government, they simultaneously sharpen the competition with other non-state groups, increasing incentives for these to also challenge the government.

In sum, rational and affective theoretical mechanisms link protest-repression dynamics to political violence. Our ability to predict armed conflict should improve by distinguishing between peaceful and violent protests and unpacking repressive government responses.

H1: Compared to empirical models with information on protests that ignore the level of violence, models with information on protest-repression dynamics improve predictions of subnational armed conflict onset and incidence.

Political institutions

Political institutions have important implications for protest and armed conflict. Domestic peace is more likely in democracies than in autocracies and anocracies (Hegre, 2014). Democracy facilitates peaceful resolution of conflicts by allowing the expression of discontent and regular free and fair elections, empowering citizens to affect their future without violence. These institutional guarantees reduce motivations for violent conflict by reducing commitment problems (Boix, 2003; Fearon, 2004; Acemoglu, 2006). Institutional avenues also make escalation from protest to armed conflict unlikely. In democracies, political leaders and citizens see protests as a regular political activity with a low risk of violence. Further, democratically elected bodies are more likely than autocratic leaders to respond to protesters, partly because protesters in democracies often are backed by civil society organizations that influence politics through institutional channels.

Autocratic leaders aim to pre-emptively quell emerging popular threats through structural repression, especially restrictions on civil rights and liberties. However, structural repression often fails, and autocracies are highly prone to armed conflict (Gleditsch & Ruggeri, 2010; Bartusevičius & Skaaning, 2018). When protest occurs in autocracies, it is likely a strong signal of potential violence. The fact that protest occurs despite the high risk of government repression indicates radical discontent with the autocratic regime. Moreover, the chances of an escalating spiral of violence are high when a lack of formal constraints on violent repression is combined with committed protesters, especially when organizational support for protesters is absent (Sutton, Butcher & Svensson, 2014; Ives & Lewis, 2020).

Some non-democratic regimes are more prone to violent protest trajectories than others. Anocracies, a regime type that allows the political opposition to organize and compete for office but lacks effective institutional mechanisms to incorporate the opposition, are vulnerable to both protest and armed conflict (Muller & Weede, 1990; Hegre et al., 2001; Cederman, Hug & Krebs, 2010; Fjelde, 2010; Robertson, 2010). Elections often trigger protest and violence in these regimes (Birch, Daxecker & Höglund, 2020). Regimes often engage in fraud and repression, such as ballot-box stuffing, exclusion of opposition candidates and voter intimidation to avoid losing elections (Hafner-Burton, Hyde & Jablonski, 2014; Rød, 2019). Fraudulent elections serve as focal points for protest by exposing large parts of the population to the same injustice at the same time, creating an ‘imagined community of millions of robbed voters’ (Kuntz & Thompson, 2009: 258). These dynamics often take place in countries where violent groups already exist. Protest during elections, especially in non-democracies, can therefore be a powerful signal of future armed violence.

In sum, compared with democracies, protest in autocracies is a stronger early-warning signal for armed conflict. Protests around elections should also improve predictive performance by signaling an increased risk of armed conflict, especially in non-democratic settings.

H2: Protest models that account for political institutions (democracy, civil liberties, recent and anticipated elections) improve predictions of subnational armed conflict onset and incidence compared to protest models that do not.

Economic development

We expect economic development to reduce the risk of escalation from protests to armed conflict. Economic development refers to how services, manufacturing, and human and financial capital are more critical economic input factors than agricultural land and natural resources. Four key findings underpin our expectation. First, armed conflict is much less likely in high-income countries, independently of protests (Fearon & Laitin, 2003; Boix, 2008; Collier, Hoeffler & Rohner, 2009). Second, to maintain a developed economy, education and pre-existing civil society organizations are typically necessary. These factors strengthen opposition groups, helping them to coordinate mobilization. In turn, the impact of protests increases, and, given a commitment to nonviolence among organizers, minimizes violence and rioting on the protester side, all of this reducing the risk of escalation into violence (Chenoweth & Stephan, 2011).

Third, aspects of economic development, such as dense economic networks that would be curtailed by widespread organized violence, or substantial importance of civil society organizations such as labor unions, help constrain the government from using heavy-handed repression (Hegre, Bernhard & Teorell, 2020). In developed economies, political conflict is unlikely to turn into armed conflict since states are more capable and economic incentives to use violence are weak (Fearon & Laitin, 2003; Boix, 2008; Collier, Hoeffler & Rohner, 2009; Hegre, 2015). Although development may not hinder governments in violent repression, protest dynamics are less likely to escalate into armed conflict in highly developed contexts.

Finally, economic development may insulate nonviolent opposition from opportunistic violence by third-party actors. Governments in developed countries are typically stronger militarily and have better organization and intelligence services (Kocher, 2004; Fearon & Laitin, 2003). As governments and protesters often have a joint interest in marginalizing violent groups, state capacity helps prevent violent groups from exploiting the legitimacy challenges caused by peaceful protests to forward their violent agendas in the wake of protest events. All in all, we expect socio-economic development to dampen the tendency for protests to escalate to armed conflict.

H3: Protest models that account for economic development improve predictions of subnational armed conflict onset and incidence compared to protest models that do not.

Research design

To optimize an armed conflict prediction model and examine our theoretical expectations, we develop models and evaluate their predicted performance for countries in Africa using the ViEWS early-warning system (Hegre et al., 2019, 2021). Following ViEWS, we specify the empirical models at the PRIO-GRID-month (pgm) level. The spatial units (grid cells) are quadratic cells with a spatial resolution of 0.5 x 0.5 decimal degrees corresponding to approximately 55 x 55 kilometers at the Equator (Tollefsen, Strand & Buhaug, 2012).

Our dependent variables are the monthly incidence and onset of state-based armed conflict. We use data from the UCDP Georeferenced Event Dataset (UCDP GED) v.20.1 (Croicu & Sundberg, 2015; Pettersson & Öberg, 2020) and aggregate the events coded as ‘state-based’ to the pgm level. We use a single-death threshold of violence. The incidence variable (ged_dummy_sb) is coded as 1 if state-based violence occurred, 0 if not. The onset variable (ged_dummy_sb_onset) is coded as 1 if state-based violence occurred and more than six months have passed since violence last occurred in the same cell, 0 if not.²

We rely on the Armed Conflict Location & Event Data Project (ACLED) to operationalize independent variables measuring protest (Raleigh et al., 2010). We aggregate counts of protests to the pgm level and normalize them by grid cell population. Normalization ensures that our protest indicators do not simply differentiate densely populated grid cells from sparsely populated ones. We distinguish between four categories of protest events: peaceful protests, protests with low-level intervention, protests with excessive force against protesters,

Figure 1.

Decay maps illustrating the recent history (as of December 2016) of the different types of armed conflict and protest events. Cells coloured in red observed events in late 2016, with green color events about a year earlier, and with purple color no events the past three years

and protests with violent behavior by demonstrators. These categories separate protest with and without violence, the intensity of violence and the actors involved. Peaceful protests are events with no violence by protesters, state security forces, or other armed actors. Protests with low-level intervention are when security forces or other armed groups intervene but do not use lethal force. Events with excessive use of force equal lethal interventions. The final category, riots, identifies events with violent protesters.³ Following our theoretical discussion, we expect models that distinguish violent from peaceful protests to improve predictions relative to models that do not. In addition to capturing local protest, we calculate the number of protests at the country level and normalize them by country population.⁴

We include variables that capture the temporal and spatial dynamics of protest. For the temporal dimension, we construct decay variables for the time since the last protest events with a halflife of six months and include a one-month lag of protest. Further, we capture the cumulative effect of protest by computing the total number of events in the past three months. For the spatial dimension, we calculate the sum of all protest events for first- and second-order neighbor grid cells. Finally, we construct variables that measure the distance to the nearest protest event during the last three months.

Does this setup allow us to build protest models for armed conflict prediction, or are we simply predicting conflict with different data on conflict? In our view, the protest models we build using ACLED data are distinct from models that capture conflict history based on data from UCDP. First, as discussed in the theoretical section, protest and state-based armed conflict are distinguishable at a conceptual level. Note also that protest repression is conceptually different from armed conflict. Repression occurs when state armed actors use violence against unarmed protesters, whereas armed conflict requires armed actors on both sides. Second, patterns in the data support our conceptual notion of armed conflict and protest as qualitatively different. Figure 1 shows the geographical distribution of armed conflict and protest events focusing on Ethiopia and its neighboring countries. There is some spatial overlap between conflict and protest. However, a closer look reveals that protests are concentrated in urban areas, such as Addis Ababa, while armed conflict is more prevalent in eastern Ethiopia. Moreover, while there is a considerable amount of peaceful protest and even riots in southwest Kenya, armed conflict is absent. Finally, we compare all protest models to a baseline armed conflict history model in the empirical section. If our protest models merely capture conflict history, we would expect a low predictive performance compared to our baseline. These points should eliminate concerns that we are merely predicting conflict using more extensive conflict history data.

To evaluate how protest interacts with contextual factors to affect armed conflict (E2 and E3), we include political institutions and economic development in our models. We measure political institutions at the country level using variables from the Varieties of Democracy (V-Dem) and Rulers, Elections and Irregular Governance (REIGN) datasets (Bell, 2016b; Coppedge et al., 2020).⁵ The variables measure level of electoral democracy, civil liberties, the number of months since the last elections took place, and whether an election is scheduled for the coming six months. Finally, we use data on economic development from WorldBank (2019) and Tollefsen, Strand & Buhaug (2012). The indicators measure GDP per capita, GDP per capita growth and unemployment at the country level. We measure the logged gross cell product per capita, infant mortality rate, urban area and agricultural area at the grid cell level.

Model specifications and comparisons

We define nine empirical models (sets of predictors) based on the theoretical explanations (E1, E2, E3) summarized in Table I. Our interest lies in predicting incidence and onset, and we estimate all the models for both outcomes. We compare our models to a hierarchy of benchmark models. For our purpose – to evaluate the value of our theoretically informed models – baseline models should satisfy the following criteria: they should have good predictive performance but also avoid the predictive-modeling analogies to omitted variable and post-treatment bias.⁶ In our context, the baseline model should include features that are correlated with both protest and armed conflict, but only if they are not the results of protest in itself. Our initial baseline model only includes information from the prediction outcome (Cranmer & Desmarais, 2017) – i.e. variables measuring conflict history. Research has shown that conflict history models are difficult to outperform (Ward, Greenhill & Bakke, 2010; Hegre et al., 2019), making this a tough baseline model to beat. However, most model comparisons we make are either between models in Table I that all include this baseline and different formulations of the protest models or between richer models compared to the same models without the protest features. We contend that this relatively parsimonious setup allows a robust but transparent demonstration of the value of the protest features for predictive modeling.⁷

The first set of models focuses on protest-repression dynamics (M1–M3). Here, we evaluate whether information on recent protest enhances predictive performance compared to the baseline model and whether distinguishing between protest with and without violence further improves armed conflict predictions (E1). We first specify a naive protest model (M1) that includes protest counts and the temporal and spatial transformations of protest information without considering the role of violence by security forces or protesters.⁸ We compare the predictive performance of this naive protest model with two models that differentiate between peaceful protests, protest with low-level intervention, and excessive violence by the government or violence by demonstrators. A local protest-repression dynamics model contains this information on the pgm level (M2). We also specify a national protest-repression dynamics model that includes protest variables aggregated to the country level (M3).

Evaluating how political institutions and protest may improve conflict predictions (E2), the second set of models includes measures of electoral democracy (M4), civil liberties (M5) and elections (M6), in addition to the protest variables. Models that include protest and political institution variables should outperform models with only protest or political institution variables. Moreover, information on recent and anticipated elections should

Table I.

Overview of model specifications

Long name	Description	Features	N Features	Empirical expectation
Baseline model (M0)	Conflict data at the subnational level	Lagged armed conflict fatalities in grid cell, dichotomous indicator of armed conflict events in neighbor grid cells, time since armed conflict events in grid cell (decay, 12 months halflife), dichotomous indicator of armed conflict event last month	4
Naive protest model (M1)	Protest data at the subnational level	M0 + Count of protest last month in grid cell, cumulative count of protests last three months in grid cell, time since protest in grid cell (decay, six months halflife), count of protest last month in neighbor and second-order neighbor grid cells, cumulative count of protests last three months in neighbor and second-order neighbor grid cells, time since protest in neighbor and second-order neighbor grid cells (decay, six months halflife) distance to the nearest protest event in the last three months	11
Local protest-repression dynamics model (M2)	Differentiation between violent and nonviolent protests at the subnational level	M0 + Same indicators as M1, but disaggregated to the following protest categories: 1) peaceful protests; 2) protests with low-level intervention; 3) protests with excessive force against protesters and; 4) protests with violent demonstrator behaviour	32	E1
National protest-repression dynamics model (M3)	Differentiation between violent and nonviolent protests at the subnational and national level	M0 + M2 + Count of protest last month in country, cumulative count of protests last three months in country, time since protest in country (decay, six months halflife) for each of the protest categories	44	E1
Political institutions I: Electoral democracy (M4)	Includes information on level of democracy	M0 + M3 + electoral democracy index (lagged by 12 m)	45	E2
Political institutions II: Civil liberties (M5)	Includes information on civil liberties	M0 + M4 + civil liberties (lagged by 12 m) index and subnational differences in civil liberties index (lagged by 12 m)	47	E2
Political institutions III: Elections (M6)	Includes information on recent and planned elections	M0 + M5 + months since last election and dichotomous indicator of anticipated election in next six months	49	E2
Political institutions IV: Unexpected protest specification (M7)	Includes the residuals and predicted values of protest	M0 + M3 + residuals and predicted values from negative binomial regression of protest	46	E2
Economic development I: Country level (M8)	Includes information on economic development at the national level	M0 + M3 + GDP per capita, GDP per capita growth, and total unemployment	47	E3
Economic development II: Subnational level (M9)	Includes information on economic development at the subnational and national level	M0 + M8 + gross cell product per capita, infant mortality rate in cell, urban area in cell, agricultural area in cell	51	E3

boost the predictive accuracy of models with protest variables compared to models with only protest or election variables. We also specify a model that uses ‘unexpected protests’ as the main protest predictor (M7). As discussed in the theoretical section, protests are more common and widely accepted as political behavior in democracies than in non-democracies. To capture this, we include two features that capture the extent to which governments are expected to respond with excessive violence as well as whether the observed amount of protests exceed what is expected given institutions and history of the country.⁹

The final set of models incorporates features related to economic development on the national and subnational levels (E3). The first economic development model includes variables measuring GDP per capita, growth and unemployment (M8), whereas the second also incorporates information on gross cell product per capita, local infant mortality rate and urban/agricultural area in grid cells. E3 stipulates that models combining information on protest and economic development outperform models with only protest or economic development variables.

Estimation

We use the random forest classifier (Breiman, 2001) to link the predictors to the outcome. Random forest models have been very successful for forecasting purposes, as they allow for very flexibly modeled relationships between features and the likelihood of the outcomes, including interactive relationships, work well with a large number of features, and have strong built-in procedures to avoid overfitting. Potential interactions between a set of independent variables do not need to be specified but are instead picked up by algorithm due to the recursive structure of building the trees. Interpreting random-forest model results can be challenging. We overcome this challenge by discussing ‘individual conditional expectation’ (ICE) and partial dependence plots. Since the algorithm is computationally intensive with a large number of trees, we ‘asymmetrically’ downsample, i.e. keep all conflict outcomes but sample 30% of all non-conflict observations. For more details, see Online appendix A-2.

We predict for all months from January 2017 to December 2019. We train separate models for different forecasting horizons and show results for predictions $s \in [3, 6, 12, 36]$ months into the future. Our training data start in January 1997 (ACLED start year) and up to s months before the prediction applies.

Evaluation

We train a set of models that incrementally add features as specified in Table I. This enables us to evaluate changes in predictive performance when adding variables as suggested by the theoretical arguments. To compare the out-of-sample predictive performance of each model, we report three evaluation metrics based on data of observed and predicted conflict events for the period between January 2017 and December 2019. The average precision score (AP) is particularly well suited when evaluating predictions for imbalanced classes, as is the case for armed conflict (Cranmer & Desmarais, 2017). By not considering the true negatives, the score rewards models that predict

Table II.

AP, AUROC and Brier scores for the baseline model (M0) for conflict incidence and onset in the period 2017–2019

Outcome	Step	AP	AUROC	Brier
Incidence	3	0.275	0.928	0.0050
Incidence	6	0.240	0.913	0.0051
Incidence	12	0.209	0.885	0.0053
Incidence	36	0.130	0.808	0.0053
Onset	3	0.207	0.843	0.0047
Onset	6	0.179	0.904	0.0046
Onset	12	0.172	0.871	0.0046
Onset	36	0.078	0.758	0.0049

conflicts rather than the absence of conflict. We also report the area under the receiver operator curve (AUROC) and Brier scores. Compared to AP and AUROC, which measure the ranking of observations, Brier scores measure the sharpness of predictions (how close they are to the actual relative frequency). The evaluation metrics are averaged over the conflict probabilities and over the months for each step.

To measure the effect of including additional features, we compute and compare the percentile bootstrapped confidence intervals by drawing 1,000 samples of prediction-actual pairs, calculating the AP scores for each sample, computing the difference between a baseline and more extensive model specification, and defining the 95% confidence intervals. We also extensively use visualization tools such as maps and PDP/ICE plots to analyze the predictions.

Results

Table II presents the performance of the baseline model (M0) for conflict incidence and onset in AP, AUROC and Brier scores. At time step 3 for incidence/onset, AP is 0.275/0.207, AUROC 0.928/0.843 and Brier 0.0050/0.0047. Although the benchmark model is simple, it is not an easy model to beat. The full ensemble model in Hegre et al. (2019: 166), for instance, has AP of 0.277 averaged across steps, for the same prediction problem – only about 30% better than the incidence baseline model we use here.

Evaluation: Incidence

Our hypotheses specify that including data on protest-repression dynamics on its own (E1) and together with data on political institutions (E2) and economic development (E3) systematically improves armed conflict prediction. Figure 2 reports the AP, AUROC and Brier

Figure 2.

Performance metrics for armed conflict incidence: AP, AUROC and Brier scores for M0, M1 and M2 at steps 3, 6, 12, 36

scores for the baseline model (M0), naive model (M1) and the protest-repression dynamics model (M2) for incidence of conflict.¹⁰ We show that results are at least as strong for the challenge of forecasting conflict onsets. The y-axis shows the performance on these metrics. Along the horizontal axis, we show results for the four steps forward (

s \in [3, 6, 12, 36]

) and the three metrics for each step.¹¹ The figure provides three insights. First, model performance drops as we attempt to predict further into the future. Second, we can see differences in the performance of models M0, M1 and M2. The figure shows that the baseline model performs marginally better than the naive protest model for all metrics.

Further, we see a clear and consistent difference in predictive performance between the protest-repression models and the baseline and the naive protest model across metrics and time. AP scores are 0.303 vs 0.275/0.269 (3 months) and 0.144 vs 0.129/0.123 (36 months). These results provide evidence in favor of E1: Models with information on recent protests incorporating protest-repression dynamics improve predictions compared to a naive protest model. The improvement is substantial: On average, across all possible thresholds, the proportion of positive predictions that are correct increases by 12%, from 26.9% to 30.3%.

Figures 3, 4 and 5 displays a more rigorous test of expectations E1–E3. Each plot shows the difference in bootstrapped AP scores on the y-axis and time points on the x-axis for the relevant model comparisons. The 95% confidence intervals are computed by drawing 1,000 bootstrapped samples of prediction-actual pairs, taking the difference in AP scores between models for each of these draws, and defining the percentile confidence intervals across 1,000 bootstrapped AP metrics.

Figure 3a compares the naive protest model M1 to the baseline M0. A positive difference in AP indicates that M1 outperforms M0. The plot shows no difference between M0 and M1, indicating that naively adding information on recent protest does not improve armed conflict predictions. Figures 3b and 3c display the same comparison for the protest-repression dynamics models M2 and M3 against M1 as a test of E1. The results strongly support E1: we see consistent increases in AP for the protest-repression dynamics models compared to the naive protest model.¹²

We specify two comparison models to test the impact of adding information on political institutions (E2) and economic development (E3). E2 and E3 suggest that models that include protest and political institutions/economic development variables should outperform models with only protest or political institutions/economic development. We, therefore, first compare the protest-repression dynamics model (M2) to each of the more extensive models (M4–M9). Second, we compare M4–M9 to models similar to each of these, except for omitting information on protest-repression dynamics. This second comparison helps isolate differences in predictive performance due to the interplay of protest and political institutions/economic development from

Figure 3.

Testing E1: Difference of bootstrapped AP scores for armed conflict incidence. See Table I for model details

Figure 4.

Testing E2: Difference of bootstrapped AP scores for armed conflict incidence. See Table I for model details

differences merely due to adding institutional and developmental features to the models. The results are displayed in Figure 4 (E2) and Figure 5 (E3).

In Figures 4a–4d, we see that adding features for electoral democracy (M4), civil liberties (M5) and elections (M6) to the protest-repression dynamics model (M2) systematically improve predictions. The unexpected protest model (M7), however, does not clearly improve predictive performance relative to M2. Nonetheless, the results overall indicate that political-institutional features can improve protest models. Further, the bottom row (Figures 4e–4h) shows that the performance of all these four models is much better than similar models where all protest features are removed. It

Figure 5.

Testing E3: Difference of bootstrapped AP scores for armed conflict incidence. See Table I for model details

is clear that data on protests improve performance relative to models that only contain the static institutional features, in support of E2.

In Figure 5, we plot similar results for empirical expectation E3. Overall, combining protest with economic development features improves predictive performance. The results are more consistent for the local economic development model (M9) than for the corresponding national-level development model (M8).

How do our model specifications change subnational armed conflict predictions?¹³ The map in Figure 6a plots the baseline predicted probabilities of conflict incidence for March 2017 for Ethiopia and its neighboring countries for the baseline model (M0) predicting three months into the future. Figure 6a also marks with black dots the actual locations of conflict events in that month. The baseline model shows probabilities higher than 80% in conflict-ridden areas, such as Somalia, but does not always get the exact locations right. Also, note that the model misses conflict events in the eastern part and at the southern borders of South Sudan and conflict events near Lamu in Kenya.

Figures 6b–6d map the difference in predicted probabilities for three of our protest models compared to the baseline.¹⁴ As Figure 6b shows, the local protest-repression model adjusts the predicted probability upwards for most of the locations – three grid cells with actual conflict receive clearly lower predicted probabilities than in the baseline model. Particularly in Somalia, the protest model yields higher predicted probabilities for three out of five false negatives. These improvements are even more clear in Figures 6c and 6d. The development model in particular strengthens predictions in Somalia and Kenya.

The reason for these improvements can be seen in the maps of the distribution of protest events in the region up to December 2016 (Figures 6e and 6f). The area around Mogadishu saw peaceful but also several repressed protest events. The most substantial contribution to false positives is observed in Somaliland, an area that saw a combination of peaceful protests and riots.

Our theoretical expectations suggest that adding information on protest-repression dynamics and economic development improves predictive performance. Our results have provided evidence in favor of these. The theory also suggests clear directionality of effects: protest features are expected to increase conflict probability. In Figure 7, we take a closer look at the direction of effects in the centered ICE/partial dependence (PD) plots based on M9 for predictions three months ahead.

PD plots provide us with information regarding the global relationship between an individual feature and the predicted outcome of interest when all other features are held constant; the ICE lines display instances separately.¹⁵ The y-axis shows the change in the predicted probability of armed conflict. The x-axis displays the values of a feature, here the decay of time since different protest categories (a value of 0 indicates no recent protest, a 1 that protest occurred last month). All plots in Figure 7 indicate that the predicted probability of state-based conflict increases with temporal proximity to all protest categories. The patterns in the figure are consistent with the theoretical arguments linking protest-repression dynamics to armed conflict.

Figure 6.

Plots a–d: Differences in predicted probability between baseline and protest models for March 2017 (step = 3). The black triangles indicate where an actual conflict event was observed (based on a window of six months). Plots e–f: Decay maps illustrating the recent history (as of December 2016) of protest. Cells colored in red observed events in late 2016, with green color events about a year earlier, and with purple color no events the past three years

Evaluation: Onset

How does information on protest affect armed conflict onset prediction? In Figure 8, we display key model comparisons for protest onset. Consistent with E1 and the incidence results, we see that the two protest-repression dynamics models (M2 and M3) outperform the naive protest model. In terms of improvement in average precision (AP), the theoretically informed protest models are even more important for onset than for incidence. In particular, when comparing the national

Figure 7.

Combination of centered ICE and partial dependence plots for predicted armed conflict incidence by local protest-repression dynamics features (Model M9, Step 3)

Figure 8.

Key results for E1, E2 and E3 for armed conflict onset

protest-repression model (M3) to M1 (Figure 8b), AP increases by about 0.08 for steps 3 and 6, in contrast to an increase of 0.05 for the corresponding incidence comparison (Figure 3c). They also clearly beat the baseline model in three out of four steps, see Figure A-10a in Online appendix A-6. Further, Figure 8c shows that the electoral democracy model (M4) improves AP scores compared to the local protest-dynamics model (M2). These results also hold for M5, M6 and M7, indicating strong empirical support for the expectation that the interaction between protest and political institutions shapes conflict onset risk. The evidence is weaker for E3, namely that economic development level accentuates the risk of conflict during protest. Figure 8d shows improvement for steps 6 and 36, but not for 3 and 12 when comparing M9 (economic development) to M2. The average change is negative at step 3. Results are more in favor of E3 when comparing M8 to M2, but we consider the evidence here tentative.¹⁶

In sum, the combined incidence and onset results suggest strong evidence in favor of E1. Theoretically informed protest models improve our ability to predict armed conflict. We also find considerable evidence for E2 (political institutions) and E3 (economic development), although results vary somewhat depending on model specification and outcome.

Conclusion

Forecasting has entered the mainstream in peace and conflict studies, and policymakers are increasingly interested in conflict early-warning systems. In this article, we have leveraged forecasting tools in the ViEWS system to assess whether data on protest improve early warning of armed conflict.

Our starting point was that protest – a low-intensity form of political conflict – can lead to civil war violence. Protest should therefore be informative for conflict early-warning systems, but no studies have so far convincingly leveraged protest data to improve conflict predictions. Since most protests do not lead to civil war, we turned to theoretical accounts of the dynamics and contexts that increase the potential for armed violence in the wake of protests. The three theoretical explanations we discussed, namely protest-repression dynamics, political institutions and economic development, then guided how we specified our forecasting models.

Our empirical analysis showed that the theoretically informed models of protest outperformed a baseline model that captured conflict history and a naive protest model that treated all protests equally. The evidence also indicates that models that combine information on protest dynamics and the political/economic context in which protests occur outperform models that incorporate these aspects separately. The results are at least as strong when forecasting the onset of conflict as incidence.

Our article shows that event data can be combined with contextual data to improve predictions of where and when armed conflict will occur at the subnational level. This evidence is significant: the development of tools and systems that can identify the location and timing of violence is needed to prevent and mitigate violence. Importantly, our approach underlines that theory has a vital role to play in armed conflict forecasting – atheoretical machine-learning models do not necessarily pick up such patterns without a guide for how to specify them. Theory-driven predictive modeling can be taken further than we have done here by specifying the directionality and functional form of effects and exploring when and where expectations hold. Research teams and decisionmakers need to jointly leverage theory, data and predictive methods to improve existing systems and develop potent new ones.

Footnotes

Replication data

All replication files for the empirical analysis in this article can be found at .

Acknowledgements

The authors would like to thank Mihai Croicu, Jim Dale, David and the entire ViEWS team for help with data, code and contributing to the analytical tools we use. We would like to thank participants at a PRIO Brownbag, the Conflict Research Society annual conference 2021, the digital AFK Methods workshop in September 2021, and three anonymous reviewers.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The research was funded by the European Research Council, project H2020-ERC-2015-AdG 694640 (ViEWS), Riksbankens Jubileumsfond (project M21-0002 Societies at Risk), MISTRA Geopolitics, and the Swedish Research Council grant nr. 2018-01222. The evaluations were performed on resources provided by the Swedish National Infrastructure for Computing (SNIC) at Uppsala Multidisciplinary Center for Advanced Computational Science (UPPMAX). See for more information on the project.

ORCID iDs

Espen Geelmuyden Rød

Håvard Hegre

Maxine Leis

Notes

References

Acemoglu

Daron

(2006) Economic Origins of Dictatorship and Democracy. Cambridge: Cambridge University Press.

Bartusevičius

Henrikas

Gleditsch

Kristian Skrede

(2019) A two-stage approach to civil conflict: Contested incompatibilities and armed violence. International Organization 73(1): 225–248.

Bartusevičius

Henrikas

Skaaning

Svend-Erik

(2018) Revisiting democratic civil peace: Electoral regimes and civil conflict. Journal of Peace Research 55(5): 625–640.

Bartusevičius

Henrikas

Leeuwen

Florian van

Petersen

Michael Bang

(2020) Political repression motivates antigovernment violence. Royal Society Open Science 10(6): 10.1098/rsos.221227.

Beger

Andreas

Dorff

Cassy L

Ward

Michael D

(2016) Irregular leadership changes in 2014: Forecasts using ensemble, split-population duration models. International Journal of Forecasting 32(1): 98–111.

Bell

Curtis

(2016a) Coup d’état and democracy. Comparative Political Studies 49(9): 1167–1200.

Bell

Curtis

(2016b) The rulers, elections, and irregular governance dataset (reign) (www.oefresearch.org).

Birch

Sarah

Daxecker

Ursula

Höglund

Kristine

(2020) Electoral violence: An introduction. Journal of Peace Research 57(1): 3–14.

Blair

Robert A.

Sambanis

Nicholas

(2020) Forecasting civil wars: Theory and structure in an age of big data and machine learning. Journal of Conflict Resolution 64(10): 1885–1915.

10.

Boix

Carles

(2003) Democracy and Redistribution. Cambridge: Cambridge University Press.

11.

Boix

Carles

(2008) Economic roots of civil wars and evolutions in the contemporary world. World Politics 60(3): 390–437.

12.

Bowlsby

Drew

Chenoweth

Erica

Hendrix

Cullen

Moyer

Jonathan D

(2020) The future is a moving target: Predicting political instability. British Journal of Political Science 50(4): 1405–1417.

13.

Breiman

Leo

(2001) Random forests. Machine learning 45(1): 5–32.

14.

Cadena

Jose

Korkmaz

Gizem

Kuhlman

Chris J

Marathe

Achla

Ramakrishnan

Naren

Vullikanti

Anil

(2015) Forecasting social unrest using activity cascades. PLOS ONE 10(6): 1–27 (https://doi.org/10.1371/journal.pone.0128879).

15.

Carey

Sabine

(2006) The dynamic relationship between protest and repression. Political Research Quarterly 59(1): 1–11.

16.

Cederman

Lars-Erik

Hug

Simon

Krebs

Lutz F

(2010) Democratization and civil war: Empirical evidence. Journal of Peace Research 47(4): 377–394.

17.

Cederman

Lars-Erik

Weidmann

Nils B

(2017) Predicting armed conflict: Time to adjust our expectations? Science 355(6324): 474–476.

18.

Chadefaux

Thomas

(2014) Early warning signals for war in the news. Journal of Peace Research 51(1): 5–18.

19.

Chenoweth

Erica

Stephan

Maria J

(2011) Why Civil Resistance Works: The Strategic Logic of Nonviolent Conflict. New York: Columbia University Press.

20.

Chenoweth

Erica

Ulfelder

Jay

(2017) Can structural conditions explain the onset of nonviolent uprisings? Journal of Conflict Resolution 61(2): 298–324 (https://doi.org/10.1177/0022002715576574).

21.

Chiba

Daina

Gleditsch

Kristian Skrede

(2017) The shape of things to come? expanding the inequality and grievance model for civil war forecasts with event data. Journal of Peace Research 54(2): 275–297 (https://doi.org/10.1177/0022343316684192).

22.

Choucri

Nazli

(1974) Forecasting in international relations: Problems and prospects. International Interactions 1(2): 63–86 (https://doi.org/10.1080/03050627408434390).

23.

Collier

Paul

Hoeffler

Anke

Rohner

Dominic

(2009) Beyond greed and grievance: feasibility and civil war. Oxford Economic Papers 61(1): 1–27.

24.

Coppedge

Michael

Gerring

John

Knutsen

Carl Henrik

Lindberg

Staffan I

Teorell

Jan

Altman

David

Bernhard

Michael

Steven Fish

Glynn

Adam

Hicken

Allen

Lührmann

Anna

Marquardt

Kyle L

Paxton

Pamela

McMann

Kelly

Pemstein

Daniel

Seim

Brigitte

Sigman

Rachel

Skaaning

Svend-Erik

Staton

Jeffrey

Wilson

Steven

Cornell

Agnes

Alizada

Nazifa

Gastaldi

Lisa

Gjerløw

Haakon

Hindle

Garry

Ilchenko

Nina

Maxwell

Laura

Mechkova

Valeriya

Medzihorsky

Juraj

Röomer

Johannes von

Sundström

Aksel

Tzelgov

Eitan

Wang

Yi-ting

Wig

Tore

Zilblatt

Daniel

(2020) V-Dem codebook v10.

25.

Cranmer

Skyler J

Desmarais

Bruce A

(2017) What can we learn from predictive modeling? Political Analysis 25(2): 145–166.

26.

Croicu

Mihai

Sundberg

Ralph

(2015) UCDP georeferenced event dataset codebook version 4.0. Journal of Peace Research 50(4): 523–532.

27.

Della Porta

Donatella

Diani

Mario

(2020) Social Movements: An Introduction. Hoboken, NJ: Wiley.

28.

Demeritt

Jacqueline HR

(2016) The strategic use of state repression and political violence. In: Thompson William R (ed.) The Oxford Encyclopedia of Empirical International Relations Theory. New York: Oxford University Press.

29.

Fearon

James D

(2004) Why do some civil wars last so much longer than others? Journal of Peace Research 41(3): 275–301.

30.

Fearon

James D

Laitin

David D

(2003) Ethnicity, insurgency, and civil war. American Political Science Review 97(1): 75–90.

31.

Finkel

Evgeny

(2015) The phoenix effect of state repression: Jewish resistance during the holocaust. American Political Science Review 109(2): 339–353.

32.

Fjelde

Hanne

(2010) Generals, dictators, and kings: Authoritarian regimes and civil conflict, 1973–2004. Conflict Management and Peace Science 27(3): 195–218.

33.

Gleditsch

Kristian Skrede

Ruggeri

Andrea

(2010) Political opportunity structures, democracy, and civil war. Journal of Peace Research 47(3): 299–310.

34.

Goldstone

Jack A

Bates

Robert H

Epstein

David L

Gurr

Ted Robert

Lustik

Michael B

Marshall

Monty G

Ulfelder

Jay

Woodward

Mark

(2010) A global model for forecasting political instability. American Journal of Political Science 54(1): 190–208.

35.

Gurr

Ted Robert

Lichbach

Mark Irving

(1986) Forecasting internal conflict: A competitive evaluation of empirical theories. Comparative Political Studies 19(1): 3–38 (https://doi.org/10.1177/0010414086019001001).

36.

Hafner-Burton

Emilie M

Hyde

Susan D

Jablonski

Ryan S

(2014) When do governments resort to election violence? British Journal of Political Science: 149–179.

37.

Hegre

Håvard

(2014) Democracy and armed conflict. Journal of Peace Research 51(2): 159–172 (https://doi.org/10.1177/0022343313512852).

38.

Hegre

Håvard

(2015) Civil conflict and development. In: Walle

Nicholas van de

Lancaster

Carol

(ed.) Oxford University Press Handbook on the Politics of Development. Oxford: Oxford University Press.

39.

Hegre

Håvard

Allansson

Marie

Basedau

Matthias

Colaresi

Mike

Croicu

Mihai Catalin

Fjelde

Hanne

Hoyles

Frederick

Hultman

Lisa

Högbladh

Stina

Jansen

Remco

Mouhleb

Naima

Muhammad

Sayeed Auwn

Nilsson

Desirée

Nygård

Håvard Mokleiv

Olafsdottir

Gudlaug

Petrova

Kristina

Randahl

David

Rød

Espen Geelmuyden

Schneider

Gerald

Uexkull

Nina von

Vestby

Jonas

(2019) Views: A political violence early warning system. Journal of Peace Research 56(2): 155–174 (https://doi.org/10.1177/0022343319823860).

40.

Hegre

Håvard

Bell

Curtis

Colaresi

Michael

Croicu

Mihai

Hoyles

Frederick

Jansen

Remco

Leis

Maxine Ria

Lindqvist-McGowan

Angelica

Randahl

David

Rød

Espen Geelmuyden

Vesco

Paola

(2021) Views₂₀₂₀: Revising and evaluating the views political violence early-warning system. Journal of Peace Research 58(3): 599–611 (https://doi.org/10.1177/0022343320962157).

41.

Hegre

Håvard

Bernhard

Michael

Teorell

Jan

(2020) Civil society and the democratic peace. Journal of Conflict Resolution 64(1): 32–62 (https://doi.org/10.1177/0022002719850620).

42.

Hegre

Håvard

Ellingsen

Tanja

Gates

Scott

Gleditsch

Nils Petter

(2001) Toward a democratic civil peace? Democracy, political change, and civil war, 1816–1992. American Political Science Review 95(1): 33–48.

43.

Hegre

Håvard

Karlsen

Joakim

Nygård

Håvard Mokleiv

Strand

Håvard

Urdal

Henrik

(2013) Predicting armed conflict 2010–2050. International Studies Quarterly 57(2): 250–270.

44.

Hegre

Håvard

Metternich

Nils W

Nygård

Håvard Mokleiv

Wucherpfennig

Julian

(2017) Introduction: Forecasting in peace research. Journal of Peace Research 54(2): 113–124 (https://doi.org/10.1177/0022343317691330).

45.

Hegre

Håvard

Vesco

Paola

Colaresi

Michael

(2022) Lessons from an escalation prediction competition. International Interactions 48(4): 521–554 (https://doi.org/10.1080/03050629.2022.2070745).

46.

Hibbs

Douglas A

(1973) Mass Political Violence. A Cross-National Causal Analysis. New York: Wiley.

47.

Hultman

Lisa

Leis

Maxine

Nilsson

Desirée

(2022) Employing local peacekeeping data to forecast changes in violence. International Interactions 48(4): 823–840 (https://doi.org/10.1080/03050629.2022.2055010).

48.

Hultquist

Philip

(2017) Is collective repression an effective counterinsurgency technique? unpacking the cyclical relationship between repression and civil conflict. Conflict Management and Peace Science 34(5): 507–525.

49.

Ives

Brandon

Lewis

Jacob S

(2020) From rallies to riots: Why some protests become violent. Journal of Conflict Resolution 64(5): 958–986.

50.

Kalyvas

Stathis N

(2006) The Logic of Violence in Civil War. Cambridge: Cambridge University Press.

51.

Kocher

Matthew Adam

(2004). Human Ecology and Civil War. Unpublished PhD dissertation, University of Chicago.

52.

Kuntz

Philipp

Thompson

Mark R

(2009) More than just the final straw: Stolen elections as revolutionary triggers. Comparative Politics 41(3): 253–272.

53.

Lichbach

Mark Irving

(1987) Deterrence or escalation? the puzzle of aggregate studies of repression and dissent. Journal of Conflict Resolution 31(2): 266–297.

54.

McCullough

Michael E

Kurzban

Robert

Tabak

Benjamin A

(2013) Cognitive systems for revenge and forgiveness. Behavioral and Brain Sciences 36(1): 1–15.

55.

Moore

Will H.

(1998) Repression and dissent: Substitution, context, and timing. American Journal of Political Science 42(3): 851–873.

56.

Morgan

Richard K

Beger

Andreas

& Adam Glynn (2019) Varieties of forecasts: Predicting adverse regime transitions. V-Dem Working Paper 2019:89.

57.

Mueller

Hannes

Rauh

Christopher

(2018) Reading between the lines: Prediction of political violence using newspaper text. American Political Science Review 112(2): 358–375.

58.

Mueller

Hannes

Rauh

Christopher

(2022) Using past violence and current news to predict changes in violence. International Interactions 48(4): 579–596 (https://doi.org/10.1080/03050629.2022.2063853).

59.

Mueller

John

(1990) The obsolescence of major war. Bulletin of Peace Proposals 21(3): 321–328.

60.

Muller

Edward N

Weede

Erich

(1990) Cross-national variations in political violence: A rational action approach. Journal of Conflict Resolution 34(4): 624–651.

61.

Oswald

Christian

Ohrenhofer

Daniel

(2022) Click, click boom: Using Wikipedia data to predict changes in battle-related deaths. International Interactions 48(4): 678–696 (https://doi.org/10.1080/03050629.2022.2061969).

62.

Pearlman

Wendy

(2013) Emotions and the microfoundations of the Arab uprisings. Perspectives on Politics 11(2): 387–409.

63.

Pettersson

Therese

Öberg

Magnus

(2020) Organized violence, 1989–2019. Journal of Peace Research 57(4): 597–613 (https://doi.org/10.1177/0022343320934986).

64.

Pierskalla

Jan Henryk

(2010) Protest, deterrence, and escalation: The strategic calculus of government repression. Journal of Conflict Resolution 54(1): 117–145.

65.

Raleigh

Clionadh

Hegre

Håvard

Karlsen

Joakim

Linke

Andrew

(2010) Introducing ACLED: An armed conflict location and event dataset. Journal of Peace Research 47(5): 651–660 (https://doi.org/10.1177/0022343310378914).

66.

Rasler

Karen

(1996) Concessions, repression, and political protest in the Iranian revolution. American Sociological Review 61(1): 132–152.

67.

Ritter

Emily Hencken

Conrad

Courtenay R

(2016) Preventing and responding to dissent: The observational challenges of explaining strategic repression. American Political Science Review 110(1): 85–99.

68.

Robertson

Graeme B

(2010) The politics of protest in hybrid regimes: Managing dissent in post-communist Russia. Cambridge University Press.

69.

Rød

Espen Geelmuyden

(2019) Fraud, grievances, and post-election protests in competitive authoritarian regimes. Electoral Studies 58: 12–20.

70.

Rød

Espen Geelmuyden

Weidmann

Nils B

(2022) From bad to worse? How protest escalates to armed conflict in autocracies. Political Geography 103(May): 102891.

71.

Schrodt

Philip A

(2014) Seven deadly sins of contemporary quantitative political analysis. Journal of Peace Research 51(2): 287–300.

72.

Sutton

Jonathan

Butcher

Charles R

Svensson

Isak

(2014) Explaining political jiu-jitsu: Institution-building and the outcomes of regime violence against unarmed protests. Journal of Peace Research 51(5): 559–573.

73.

Tilly

Charles

(1978) From Mobilization to Revolution. Reading, MA: Addison-Wesley.

74.

Tollefsen

Andreas Forø

Strand

Håvard

Buhaug

Halvard

(2012) Prio-grid: A unified spatial data structure. Journal of Peace Research 49(2): 363–374.

75.

Ward

Michael D

Beger

Andreas

(2017) Lessons from near real-time forecasting of irregular leadership changes. Journal of Peace Research 54(2): 141–156.

76.

Ward

Michael D

Greenhill

Brian D

Bakke

Kristin M

(2010) The perils of policy by p-value: Predicting civil conflicts. Journal of Peace Research 47(4): 363–375.

77.

Ward

Michael D

Metternich

Nils W

Dorff

Cassy L

Gallop

Max

Hollenbach

Florian M

Schultz

Anna

Weschle

Simon

(2013) Learning from the past and stepping into the future: Toward a new generation of conflict prediction. International Studies Review 15(4): 473–490.

78.

Weidmann

Nils B

Dorussen

Hans

Gleditsch

Kristian Skrede

(2010) The geography of the international system: The cshapes dataset. International Interactions 36(1): 86–106.

79.

White

Robert W

(1993) On measuring political violence: Northern Ireland, 1969 to 1980. American Sociological Review 58(4): 575–585.

80.

Witmer

Frank DW

Linke

Andrew M

O’Loughlin

John

Gettelman

Andrew

Laing

Arlene

(2017) Subnational violent conflict forecasts for sub-Saharan Africa, 2015–65, using climate-sensitive models. Journal of Peace Research 54(2): 175–192.

81.

Wood

Elisabeth Jean

(2003) Insurgent Collective Action and Civil War in El Salvador. Cambridge Studies in Comparative Politics. New York: Cambridge University Press.

82.

WorldBank (2019) World development indicators.

83.

Congyu

Gerber

Matthew S

(2018) Forecasting civil unrest using social media and protest participation theory. IEEE Transactions on Computational Social Systems 5(1): 82–94.

84.

Young

Lauren E

. (2019) The psychology of state repression: Fear and dissent decisions in Zimbabwe. American Political Science Review 113(1): 140–155.

Predicting armed conflict using protest data

Abstract

Keywords

Introduction

Literature review

From protest to armed conflict: Theoretical explanations

Protest-repression dynamics

Political institutions

Economic development

Research design

Model specifications and comparisons

Estimation

Evaluation

Results

Evaluation: Incidence

Evaluation: Onset

Conclusion

Footnotes

Replication data

Acknowledgements

Funding

ORCID iDs

Notes

References