Abstract
We investigate 833 reported estimates detailing the impact of external interventions aimed at alleviating civil war on conflict intensity. This investigation encompasses 34 studies conducted between 1996 and 2020, primarily focusing on African countries. While the average reported effect is both negative and statistically significant, our analysis reveals substantial divergence in the results. We apply meta-regression analysis to examine the sources of this heterogeneity. Our main findings are as follows. First, differences in data, intervention targets, conflict intensity measures, and the publication year of the primary studies account for the observed heterogeneity in reported estimates. Second, we find no evidence of publication selection bias that aligns with prior beliefs, theoretical expectations, or statistical significance. Third, after considering potential sources of heterogeneity and publication bias, the overall genuine effect of external intervention remains negative and statistically significant. This implies that external intervention efforts do mitigate conflict intensity, albeit with a small magnitude of the statistical significance coefficient. For policy purposes, external interventions are likely to have a modest effect.
Plain language summary
We looked at whether external interventions affect the intensity of conflicts predominantly in African countries for studies published from 1996 to 2020. It seems like external interventions usually reduce conflict intensity, but the results from different studies vary a lot. We studied 833 reported results from 34 studies to understand why. Overall, the average effect of these interventions is negative, meaning they do reduce conflict, but the results differ because of things like different data, targets of the interventions, and how conflict intensity is measured. We also found that there’s no publication selection bias in how studies are published. After considering all these factors, we still see a statistically significant negative effect, meaning external interventions do help reduce conflict intensity, but the impact is modest.
Introduction
Conflict and civil war are a major cause of development failure in low- and middle-income countries. Consequently, prioritizing conflict de-escalation and mitigation becomes imperative in the development efforts, aiming to foster progress and prosperity in the world’s developing countries. One suggested mechanism to mitigate civil war, especially its intensity, is external intervention by foreign powers. These intervenors may include a neighbouring country, great powers, or multilateral organizations such as the United Nations (UN). From the standpoint of external powers, mitigating the intensity of civil war also serves a non-altruistic objective, because lowering civil war intensity stems the tide of refugees and obviates the need for deeper and more costly humanitarian interventions in the future when conflict intensifies. The 20-year external intervention in the Afghan civil war serves as an illustrative example, beginning with substantial resources but eventually petered out into cheap talk (refer to Appendix A for the theoretical discussion). The ongoing Russian-Ukraine war and the recent internal conflict in Sudan further underscore the topical nature of external interventions. In these instances, the role of external powers in influencing conflict dynamics becomes even more pronounced and relevant.
We conduct a comprehensive meta-analysis of the existing empirical literature on external interventions in civil wars and their effect on conflict intensity, chiefly measured through battle deaths. We have documented the large numbers of empirical studies conducted until 2020 predominantly for African countries. This thorough review has identified and analysed 833 reported external intervention estimates on conflict intensity from 34 studies. Figure 1 visualizes the extent of disagreement both in terms of the direction of the effect size and their statistical significance. Slightly more than one-third of the estimates (35%) reported a negative and statistically significant impact on conflict intensity. In contrast, about one-fifth of the reported estimates find a positive and statistically significant impact. In the remaining 44%, the evidence is mixed, but statistically insignificant. Despite the voluminous literature exploring the associations between external intervention and conflict intensity, along with international relations policy efforts aimed at promoting peaceful conflict resolution, the empirical studies yield substantially divergent results. The first motivation of this study is to scrutinize whether external intervention actually reduces conflict intensity.

Sign and significance of external intervention impact on conflict intensity (N = 833).
Figure 2 portrays the development of the reported estimates overtime in individual empirical studies, organized by the year of publication in Google Scholar. Dixon’s seminal paper in 1996 initially indicated a negative impact of external interventions on conflict intensity. Subsequent publication in 2004 introduced evidence of both negative and positive impacts. Despite a wealth of studies in the last decade, substantial variations persist in the estimated parameters. A second motivation of this study is, therefore, to explore the reasons behind this consistent disagreement and divergence in estimates.

Reported external interventions impacts diverge over the period 1996 to 2020 (N = 833).
Driven by these dual objectives, this study conducts a meta-analysis, considering both the temporal variance of reported estimates (Figure 2) and the extent of disagreement in direction and statistical significance (Figure 1). Notably, no meta-analysis has been undertaken, to our knowledge, to probe the actual associations between external interventions and conflict intensity. Our use of meta-analysis methods enables us to amalgamate, synthesize, and probe the estimated parameters, surpassing the confines of an ordinary literature review. A meta-analysis, with its statistical power and regression analysis, surpasses traditional literature reviews by accurately discerning the true direction of empirical findings and unveiling systematic biases such as publication bias. Our overarching aim is to uncover the effectiveness of interventions in conflict mitigation, particularly exploring whether one-sided or general interventions prove more effective, and whether interventions are pro-government or not.
The paper is structured as follows. Section 2 offers a concise review of pertinent empirical literature. Section 3 outlines the steps taken to identify eligible studies, provides an overview of the evidence base, and elaborates on the construction of the meta-dataset. Moving to Section 4, a comprehensive discussion on aspects of estimation design that systematically influence reported estimates of conflict intensity linked to external interventions. Section 5 introduces the methodological approach of the meta-regression analysis (MRA), while Section 6 presents detailed results and discussion. Concluding remarks are provided in Section 7. Additionally, in Appendix A, we present theoretical perspectives on external interventions and conflict intensity are presented.
A Brief Review of Related Empirical Literature
The scholarly exploration of the link between external interventions and conflict intensity is complex, encompassing diverse methodologies that address different facets of intervention. Divergent views exist, with proponents asserting that external interventions can mitigate violence and foster peace, while skeptics question their efficacy. This section offers a concise overview of the key factors centering on the connection between external interventions and conflict intensity.
Numerous studies have explored the consequences of military interventions by external actors in conflict zones. Peacekeeping missions, initially effective in reducing violence and stabilizing situations, often hinge on factors such as mission mandates, troop deployment strategies, and coordination with local actors. Bara and Hultman (2020) assert that the aggregate military capability, encompassing training, equipment, and logistic support to address hostile situations, is a crucial determinant of success. However, the impact of military interventions is not unidimensional. Wood et al. (2012) argue that these interventions may inadvertently escalate violence, particularly when the intervening force is perceived as biased.
By contrast, diplomatic interventions are frequently advocated as alternatives to military involvement. While diplomacy can significantly contribute to conflict resolution, its effectiveness is contingent on factors such as the mediator’s credibility, inclusivity of negotiations, and the willingness of conflicting parties to engage constructively. In this context, Regan and Aydin (2006) emphasize the pivotal role of third-party diplomatic interventions in reducing informational asymmetries about capabilities and incentives. External actors facilitate dialog and consensus-building, helping parties find cooperative solutions to end civil wars through negotiated agreements. Other scholars argue that economic assistance to a party involved in a civil war can influence structural conditions that affect the likelihood of conflict prevailing. Regan and Meachum (2014) conclude that economic interventions can increase the chances of regaining stability by addressing political concerns that might otherwise fuel armed rebellion. However, their findings do not indicate a discernible impact from diplomatic interventions.
Identifying the specific types of intervention (economic, military, diplomatic, or peace-keeping missions) employed in the studies included in this assessment is often challenging. However, interveners typically set specific targets that determine whether an intervention is perceived as partisan or neutral. Some studies categorize economic and military interventions as partisan, while diplomatic and mission interventions are often considered neutral. Nevertheless, the lack of explicit specification for the specific target makes retroactively determining the nature of the intervention difficult. For example, interventions involving troops or equipment support, common in military interventions, can be classified as partisan if provided to one of the conflicting parties. Alternatively, they may be viewed as neutral, such as when troops are deployed to monitor a ceasefire or control a buffer zone (Sousa, 2014). Consequently, our empirical approach centers on conflict intensity associated with external interventions, and assesses the specific target (biased or neutral, with biased indicating pro- or anti-government), rather than focusing on the types of interventions.
The empirical literature exploring the relationship between external interventions and conflict intensity reveals a nuanced understanding of the varied outcomes associated with military, diplomatic, and economic interventions. While these interventions have the potential to play constructive roles, their success depends on a multitude of factors. Within scholarly literature, several other potential factors mediate and influence the empirically estimated effectiveness of external interventions. These encompass aspects of the study design used to estimate the relationship, including the choice of data, measures to gauge conflict intensity, the adopted estimation approach, and the specified control variables aimed at explaining the impact of external interventions on the dependent variable — conflict intensity.
Numerous studies examining conflict intensity rely on the monthly count of battle-related deaths as their primary dependent variable. This strategy aims to capture more precise temporal variations and evaluate the specific impact of interventions. For example, Hultman et al. (2014) and Haass and Ansorg (2018) have identified heightened conflict intensity associated with external interventions using this method. Conversely, certain studies, such as those by Di Salvatore (2019), Phayal (2019), and Phayal and Prins (2020), choose yearly counts of battle-related deaths. They argue that this approach offers a more comprehensive understanding, allowing sufficient time to assess the effectiveness of external interventions in reducing violence against civilians. In addition to these approaches, alternative measures are employed in the literature. These include assessing the net impact of interventions, such as the escalation or de-escalation of conflict without explicitly factoring in battle-death counts.
Another segment of the literature focuses on conflict duration, examining the time span from the initiation of a conflict to its resolution or cessation, notwithstanding the fact that precise measurements of civil war duration are notoriously challenging. This body of literature, associated with external interventions, centers on the temporal dimension of conflicts, investigating how long they endure (the length of time) and the factors influencing their duration. In our study, we specifically concentrate on conflict intensity, measured using battle-related deaths. This choice ensures homogeneity and comparability in the reported outcomes. Aggregating diverse outcomes of external interventions would be inappropriate, as emphasized by Demena and van Bergeijk (2017) and Demena et al. (2021).
Nevertheless, numerous studies have delved into the explanatory capacity of conflict duration as a metric of conflict intensity in the context of external intervention. This body of literature suggests that prolonged conflicts may witness heightened intensity, with external interventions influencing these dynamics. However, Kathman and Wood (2014) present somewhat ambiguous findings, indicating that longer wars may encourage more rebel violence in the post-conflict period while potentially reducing militia violence. Several studies also examine the impact of various other control variables on conflict intensity associated with external interventions. For instance, Kathman and Wood (2011) and Di Salvatore (2016) hypothesized that rising income per capita could reduce individuals’ willingness to participate in conflict, finding that destitute states are more prone to violent conflict. In a contrasting conclusion, a more recent study by Di Salvatore (2020), found the positive impact of purchasing power on conflict intensity.
Population is another commonly studied control variable. Haass and Ansorg (2018), using a fixed-effect specification, demonstrate that the likelihood of civilian targeting increases in highly populated countries. Additionally, the role of regime type in influencing conflict intensity is explored in various studies. As regimes become more democratic, conflicting parties may rely less on civil victimization, as democracies provide means to address political grievances through peaceful processes (Balcells & Kalyvas, 2014; Kathman & Wood, 2014). Moreover, scholars have investigated the relationship between ethnicity and violence intensity. Studies by Escribà-Folch (2010) and Di Salvatore (2016) indicate that higher levels of ethnic fractionalization are associated with an increased likelihood of civil wars. In diverse societies with multiple ethnic groups, competition and grievances among these groups may contribute to elevated conflict intensity. However, Escribà-Folch (2010) reveals a non-linear relationship between ethnic diversity and conflict intensity.
In our meta-analysis, we scrutinize not only real factors, such as the impact of control variables discussed above, which can shape the explanatory power of external interventions in conflict intensity, but also the influence of study design on estimating this relationship. Researchers commonly utilize time series data to explore the impact of external interventions on conflict intensity, often followed by employing panel datasets. Notably, there is an absence of cross-sectional data application in the literature, potentially minimizing biases related to time-invariant heterogeneity. The analysis of conflict-related data, such as intensity, duration, conflict locations, and casualties, predominantly relies on the Uppsala Conflict Data Project (UCDP).
Regarding study design, the empirical literature varies in the estimation methods employed. The predominant method involves (with or without zero-inflated) negative binomial regression, given the often count-based measure of conflict intensity in terms of battle deaths (e.g., Balcells & Kalyvas, 2014; Beardsley et al., 2019). This method accounts for the over-representation of zeros during active conflict periods without battle-related deaths. Due to the resultant over-dispersed dependent variable, researchers commonly opt for zero-inflated or conventional negative binomial regression. Some other studies employ non-linear regression techniques such as Probit, Logit, or Tobit (see, Dixon, 1996; Doyle & Sambanis, 2000; Kathman & Wood, 2011). A few researchers address endogeneity concerns by applying instrumental variable (IV) techniques to enhance the robustness of causal claims. This recognizes the simultaneous determination of conflict intensity and external interventions, recognizing that intense conflicts may attract more interventions, and vice versa. For instance, Sawyer et al. (2017) use donor GDP as an instrument for external rebel financial assistance in an IV probit model. Similarly, Carnegie and Mikulaschek (2020) employ the rotation of seats between African regions in the UN Security Council as an instrument for the deployment of UN peacekeepers, acknowledging the challenge in identifying precise IVs.
Identifying Eligible Studies and Overview of the Evidence Base
Identifying Eligible Studies
To identify relevant studies, we conducted an extensive search using the Google Scholar web engine, EconLit, and the Web of Science database. Additionally, we performed a manual search using the reference lists of identified recent studies. Employing a broad combination of keywords, such as “external intervention,”“external intervention + conflict intensity,”“peacekeeping + conflicts,” and “third-party intervention + peace process,” yielded an extensive literature set. For instance, the combination “external intervention + conflict intensity” yielded 18,400 potential studies. Initial screening based on titles, abstracts, and keywords led to the identification of 98 prospective studies. In cases of uncertainty, further examination was conducted based on the introduction and conclusions of the identified studies.
We selected studies that satisfy the following criteria for detailed review based on the full-text evaluation: English language empirical studies that report econometric results investigating the impact of external interventions on conflict intensity (measured as count of battle death or categorical change of the intensity). The studies must report at least sample size, coefficients, standard errors, or t-values. The imposition of these criteria yielded 37 empirical studies published until September 2020 for coding. Majority of the reasons for exclusion were that studies relied on literature review, historical analysis, descriptive studies, other outcome variables, or sanctions as external intervention.
All authors actively participated in constructing the dataset to maintain the highest scientific standard. The two authors independently conducted the search, eligibility assessment, and data extraction using a Microsoft Excel template. Any discrepancies between the two authors were thoroughly reviewed by the third author, and consensus was achieved for a disagreement rate of about 2.5%. Extensive data coding was conducted on various study characteristics, such as conflict intensity, external interventions, data specifications, and publication characteristics. This rigorous process aimed to minimize potential subjectivity bias and enhance the reliability and robustness of the findings.
Overview of the Evidence Base
Figure 3 presents the identified 37 empirical studies, reporting the cumulative number of studies published over the period 1996 to 2020. The evidence started in 1996 by Dixon evaluating the effectiveness of third-party intervention using data for the period 1984 to 1994. The development of the econometric literature was very limited in the 1990s and 2000s. Following this early work, the next study was only published in 2004 examining the impact of various UN interventions for the years 1994 to 1997. Between 2005 and 2010, only a handful of studies (7) was appeared examining the impact of different external interventions (such as peacekeeping, foreign support, non-UN mandate, international mediation) using various cross-country analyses predominantly for sub-Saharan African (SSA) countries. In the last decade (from 2011 to 2020), every additional year resulted in three or four studies. As a result, in that period, a sharp increase in the number of studies was recorded, generating 28 additional empirical studies. The increasing number of studies can be attributed to the expanding availability of data assessing external interventions on conflict intensity, particularly in SSA countries.

Cumulative number of studies per year published in the period 1996 up to 2020.
It is essential to describe some critical challenges encountered during the coding of reported estimates. A crucial aspect involves deciding between a best-set, average-set or all-set approach in constructing the meta-dataset. Following the recommendation of Stanley and Doucouliagos (2012) and Demena and van Bergeijk (2017), we opted to collect all reported estimates of the primary studies. Another challenge was identifying outliers, with some recent meta-analyses (e.g., Mebratie & van Bergeijk, 2013) considering large estimates exceeding 10 in absolute value as outliers and excluding them from the analysis. In contrast, Stanley and Doucouliagos (2012), suggested unusually large estimates could be attributed to coding errors.
Engaging two independent reviewers, we identified 19 estimates exceeding 10 in absolute value. After meticulous verification, these reported estimates appeared genuine. To impartially evaluate and concurrently pinpoint outliers in the reported coefficients and their estimated precisions, we employed the multivariate method proposed by Hadi (1994). Utilizing this approach, we identified about 4.6% of the reported estimates as outliers, including three studies completely excluded as outliers. This method has been identified as suitable for outliers in multivariate framework and is commonly applied in meta-analysis (Demena, 2021; Demena et al., 2022; Floridi et al., 2020). Applying this method yielded 833 estimates from 34 empirical studies, excluding 40 observations provided by 13 studies. In the Supplemental File, Table B1 presents the list of included studies
Of these studies, 25 are peer-reviewed journal articles, while the remaining nine are working papers or unpublished studies. Over two-thirds of the reported estimates focus solely on evaluating external interventions in Africa, while the rest combining cases from Africa and other regions, except for one study on the Bosnian war. The oldest study was published in 1996, the most recent in 2020 and the median study was published in 2015. This indicates a recent and rapid evolution of the topic, with half of the research published in the last 5 years. The minimum, median and maximum number of observations per study are 1, 18 and 153 coefficients, respectively. For a comprehensive overview, refer to Appendix B, Table B2.
Why Do Estimates of External Interventions on Conflict Intensity Vary? Potential Factors Explaining Heterogeneity
We have observed temporal variation over time in the reported estimates (Figure 2) and substantial differences both within and between studies (Figure 4). This section aims to uncover the underlying reasons for potential sources of heterogeneity across the reported estimates by examining variations in the design of the included empirical studies. We collect various potential moderator variables that reflect each study’s data characteristics, estimation characteristics, specification characteristics, and proxies employed for our variable of interests — external interventions, and conflict intensity (Table 1 gives the details of these selected potential explanatory variables). While we cannot account for all potential moderator variables explaining variations in the reported estimates, our selection is based on common design choices made by researchers of the included studies, aiming to capture the heterogeneity of the reported estimates within the literature.

Reported estimates vary widely both within and across the primary studies (N = 873).
Description and Summary Statistics of Collected Variables.
Data characteristics
The coding of the included studies involves constructing dummies for specific characteristics such as data type (panel or time series), the length or timespan taken by the individual studies, and the number of countries included. Time series data analyses are predominant (82.8%), followed by panel data sets (17.2%), with no application of cross-sectional data in the literature, minimizing the potential biases due to time-invariant heterogeneity (Gujarati & Porter, 2009). The mean logarithm values of the number of years of data and the observations per study are 2.945 and 7.863, respectively. About 66.7% of estimates originate from studies analyzing the impact of external inventions on conflict intensities in a single region (Africa, Europe or Asia). This is important as regional differences may influence the effectiveness of external interventions, considering that some regions experience more conflicts than others. For instance, Africa, with 67% of reported estimates, has encountered more conflicts and consequently attracts a greater number of interventions. In 2005 alone, 77% of UN peacekeeping operations were deployed in Africa, constituting 75% of the total budget of that year (Murshed, 2009).
Dependent variable (conflict intensity)
To capture a better temporal variation in conflict intensity as well as to provide a better effect of an intervention, most studies deploy the monthly counts of battel-related deaths as their dependent variable. As a result, about two-thirds (60.1%) of the reported estimates build from a monthly account of battel-related deaths. The second dependent variable is measured in terms of the yearly count of battel-related deaths. These studies (for instance, Di Salvatore, 2019; Phayal, 2019; Phayal & Prins, 2020) argue that measuring conflict intensities over the year yields a better understanding than monthly as it takes time to assess external interventions to curtail violence against civilians. Whereas the remaining estimates had used other measures such as the net impact of interventions (e.g., escalation of conflict, de-escalation of conflict or no change in battle deaths). Our analysis centers on examining the impact of external interventions or third-party involvements on conflict intensity measured in terms of battel-related death rather than assessing the success or failure of these interventions.
Independent variable (external interventions)
The international community’s interventions often categorized as economic, military, diplomatic or a combination, are implemented by States or international organizations like the UN on conflict-affected States (Sousa, 2014). While identifying the specific intervention types (i.e., if economic, military, diplomatic or mission) in the included studies may not often be possible, interveners generally have specific targets, such as supporting opposition or government or maintaining neutrality. We therefore first classify interventions as partisan/biased or neutral based on the studies, with biased interventions supporting either the government or the opposition. We considered that third-party intervention is partisan/biased, when supporting the opposition or the government, and neutral, when the target is not intended to change the balance of capabilities of the parties. In over 82.1% of estimates, interventions were considered partisan. Next, we classify, whether the type of partisan intervention is in supporting the government (current political system) or opposing group or analyzing neutral mediations. We found that the majority (80.0%) of the reported estimates from the primary studies involve intervention as pro-government, while the remaining assess anti-government or neutral mediations.
Estimation characteristics
Researchers in this literature adopted various estimation techniques to examine the relationship between external interventions and conflict intensity nexus. The most applied method is negative binomial regression (31.9%), with or without zero-inflation adjustment. Since conflict intensity is mostly measured in counts of battel deaths, there is a common issue of over-representation of zeros and over-dispersed dependent variable (Phayal & Prins, 2020). This occurs when months or years have active conflict but without any battle-related deaths, resulting in a choice of dependent variable coded as 0 battle deaths (Beardsley et al., 2019). In this case, the proportion of monthly or yearly counts of conflict without battel-related deaths is substantially higher than for the cases with nonzero battle deaths. Similarly, it is normal to observe that the variance of the dependent variable is greater than its mean, suggesting the outcome variable is highly over-dispersed. Given this nature, either zero-inflated regression or a conventional negative binominal regression is the preferred estimation approach. Another quarter of the reported estimates (27.6%) in primary studies employed non-linear regression techniques such as Probit, Logit, or Tobit. Approximately 15.7% of estimates addressed endogeneity concerns by using instrumental variable (IV) techniques, like two-stage least squares, as discussed in Section 2.
Specification characteristics
In terms of model specifications, various control variables have been considered. Dummies were included to account for factors such as natural resources, income level, country’s population, level of democracy or regime type, ethnic fractionalization, and conflict duration. Recent empirical studies have highlighted the significance of these variables in assessing the impact of external interventions on conflict intensity. Natural resources (Carnegie & Mikulaschek, 2020; Di Salvatore, 2020), income level (Kathman & Wood, 2014), country population (Bara & Hultman, 2020), democracy or regime type (Kathman & Wood, 2011), ethnic fractionalization (Balcells & Kalyvas, 2014), and conflict duration (Bara & Hultman, 2020; Beardsley et al., 2019; Haass & Ansorg, 2018) are among the key considerations. Additionally, year- and country-fixed effects accounted for 22.6% and 27.1% of the estimates, respectively. Fixed effects specification is preferred for country-level variations, providing better handling of temporal factors and other poorly measured static variables (Phayal & Prins, 2020).
Publication characteristics
We also accounted for the publication characteristics of the included studies. We attempt to control for potential aspects of the quality of the included studies looking at factors that are not reflected by the heterogeneity in empirical designs and data characteristics discussed above. We control for study quality to test if peer-reviewed publication is systemically linked with reported external intervention estimates. About half of the reported estimates are contributed by publications in peer-reviewed journals. Additional aspects of study quality considered encompass the number of citations in Google Scholar relative to the age of the study, the publication year of the study and the Scimago Journal ranking.
Meta-Regression Methodology
A standardized methodological approach is essential when dealing with estimates that vary in proxies for external interventions and conflict intensity outcomes. To ensure comparability across included empirical studies, we employ the partial correlation coefficient (PCC), a widely applied method in recent meta-analyses (Demena, 2021; Floridi et al., 2020, 2022). The PCC is computed as:
where
After deriving the PCC, our approach follows three stages. In the first stage, we compute both the simple overall average effect of external interventions and the weighted effect using inverse variance weights. In the second stage, we employ both graphical and statistical analyses. We begin with a graphical assessment to determine the authenticity of the overall effect derived in the first stage and to identify any potential selection bias. Funnel plots are employed to get an indication of the extent of publication bias. While this method relies on visual inspection, which can be subjective, it is complemented with formal statistical analyses.
To formally filter out potential publication bias and derive the genuine effect, we apply the following meta-regression model (MRM):
where
In a classical regression, the error term (
where
In the third stage, we incorporate the sources of heterogeneity outlined in Section 4 to elucidate the factors contributing to the diverse reported estimates, as evident in Figure 4, exhibiting substantial variations in reported estimates within and across primary studies. In this stage, we endeavor to explore the extent to which the estimates reported in each study are influenced by the nature of the empirical design and other study characteristics. To address heterogeneity, we enhance the FAT-PET in Equation 3, leading to the setup of a multivariate MRM as described in Equation 4.
All other parameters remain consistent with those reported in Equation 3. Here X denotes a vector encompassing the moderator variables outlined in Table 1 along with their summary statistics and descriptions.
Beyond within-study dependence, estimating both Equations 3 and 4 involves additional concerns. An important concern is the potential for between-study dependence, especially when multiple studies are authored by the same individuals, as in our case, rendering them statistically non-independent. To address this, we apply both the basic weighted clustered standard errors (CDA) model, and the mixed-effects multilevel (MEM) model (Demena 2024a). For the CDA, we mean study-level clustered standard errors, hence the reference to clustered data analysis. We also employ a fixed effect (FE) estimation that may also address the issue of individual within-variation (Demena & Afesorgbor, 2020). Additionally, one may argue that the number of primary studies seems to be limited, and thus we also employ the wild bootstrap clustering, tailored for cases with a small number of clusters, considering the perceived limitation in the number of primary studies (Demena, 2024b). For the MEM, we initially use Cochran’s Q-test to assess the presence of statistical dependency between studies, as highlighted in Table 3.
To capture the statistical dependency between studies while also controlling for within-study dependence, a multi-level model or hierarchical model is considered as most appropriate (Demena, 2015). Through the inclusion of a random individual effect for each study, this model allows to account both the within-study and between-study dependences and thus the reference to a multilevel random effect model (Demena, 2021). The latter is best designated as MEM controlling for both within-study and across-study variations when it includes the identified moderator variables (Afesorgbor et al., 2024; Demena, 2017). Accordingly, this is described as a two-level model, augmenting Equation 4 becomes:
Another empirical concern of estimating Equation 4 or 5 is the potential multicollinearity which can arise if many moderators are employed as presented in Table 1. Adhering to the MAER-Net guidelines (Havránek et al., 2020) and in line with contemporary trends in meta-analysis (Demena & Afesorgbor, 2020; Floridi et al., 2021, 2022), we employ the general-to-specific (G-to-S) approach. The procedure commences by including all potential moderator variables in the general specification and subsequently systematically eliminating the least significant variables one at a time until only the significant ones remain. Doing so, we exclude 13 variables that are statistically insignificant at least at the 10% level. The joint-test for the remaining nine moderator variables rejects the null hypothesis of a zero joint effect, (F9, 808) = 8.92 (p = 0.000). Conversely, in support of the excluded 13 moderator variables, the null hypothesis of a zero joint effect cannot be rejected, (F13, 808) = 0.52 (p = 0.9163). The latter indicates that the 13 moderator variables jointly appear to be statistically indistinguishable from zero, and thus do not contribute to the explanation of the heterogeneity.
Meta-Regression Results
Bivariate Meta-Regression Evidence
Prior to conducting the bivariate MRA, we compute both the simple and weighted average effects. Table 2 shows the overall impact of external interventions on conflict intensity.
Overall Reported Estimates of External Intervention on Conflict Intensity.
The arithmetic mean of the conflict intensity impact associated with external interventions.
Uses inverse variance as weight.
The unweighted simple average across studies reveals a statistically significant negative effect (−0.009) of external interventions on conflict intensity. To enhance precision, we also compute the weighted average effect using inverse variance. Despite a 22% reduction in magnitude, the weighted average effect aligns with the unweighted average, signifying a negative and statistical impact. The evidence, therefore, suggests that external interventions are likely to significantly diminish conflict intensity. However, following established guidelines for PCC effect magnitude (Doucouliagos, 2011), the practical impact of external interventions on reducing conflict intensity appears nearly negligible. While this initial summary provides insights, it serves as an indication. To offer a more comprehensive understanding, we must address potential publication bias and explore various drivers of heterogeneity. Accordingly, our next step involves addressing issues related to publication bias.
Testing for Publication Bias: Graphical Inspection
Funnel plot is a widely method to assess the issue of publication bias. It takes the form of a scatter diagram, where reported external interventions are plotted on the horizontal axis and their precision (typically the reciprocal of the standard error) on the vertical axis. In the absence of publication bias, a funnel plot exhibits symmetry, resembling an inverted funnel (Stanley, 2008). This symmetry arises because the most precise estimates are clustered around the underlying effect at the top of the funnel, while less precise estimates are more widely dispersed at the bottom of the graph (Stanley, 2005). Conversely, the presence of publication bias becomes apparent when examining an asymmetrical funnel plot. In this case, a tendency exists to selectively report statistically significant results in a given direction. This implies that certain estimates are either overrepresented or discarded to favor a particular expected sign or conclusion (Afesorgbor & Demena, 2022; Demena, 2021). There is also the potential publication bias in a seemingly symmetrical funnel plot when the graph appears to be hollowed and excessively wide. As highlighted by Stanley (2008), this occurs when statistically significant results are chosen irrespective of the direction of the effects, or when both statistically significant negative and positive findings receive equal reporting preference, resulting in the appearance of a symmetrical funnel.
Figure 5 gives the funnel plot. The vertical axis displays the logarithm of the inverse of the standard error and the horizontal axis the magnitude of the estimated external intervention effect. The plots appear to have a symmetrical funnel plot. The most precise estimates are close to around the weighted effect (−0.007), represented by the long and short dashed vertical line. Many imprecise reported estimates seem to be evenly distributed on both the right and left sides of the overall weighted average effect. In other words, there are estimated effects both larger and smaller than −0.007. This implies that the reported estimates of the impact of the external interventions exhibit a symmetrical distribution around the vertical axis (weighted average effect), without any noticeable hollowing or excessive width in the reported estimates. Therefore, the funnel plot suggests an absence of any publication selection bias. However, it’s important to note that this method of testing publication bias is based on visual inspection, making it subjective and less compelling. Hence, the application of the formal statistical method of MRA is required.

Funnel plot testing publication bias (N = 833).
Testing for Publication Bias: Statistical Analysis
Moving beyond the basic visual inspection of the funnel plot, we employ econometric testing to ascertain the presence of publication bias. Doing so, we introduce a more formal method for testing bias through regression-based FAT, as detailed in Section 5, Equation 3.
Table 3 reports the results of the FAT analysis using various estimation specifications. Across all estimations, the FAT does not indicate the existence of a statistically significant publication selection bias, despite all estimated coefficients being negative. Across the specified estimation techniques, publication bias ranges between −0.380 and −0.508. Following the perspective of Doucouliagos and Stanley (2013), irrespective of the applied econometric specification, the magnitude of the estimated publication bias in the research of external interventions and conflict intensity is considered irrelevant. Therefore, in alignment with the visual inspection of the funnel plot, the formal statical approach (FAT) consistently reports a clear absence of evidence for bias.
Publication Bias and Underlying Genuine Effect Test.
Note. Reported results are estimated from Equation 3 specification. All estimates use the inverse variance as weights and standard errors reported in parentheses are clustered at the study level. Panel 1 (CDA: clustered data analysis is estimated via the study level clustered robust standard errors; Panel 2 (FE) is fixed-effect estimation clustered at the study level; Panel 3 (MEM) is mixed-effects multilevel estimated through the restricted maximum likelihood; and Panel 4 (Wild bootstrapped) is regression bootstrapping the standard error (a non-standard cluster adjustment) reported with p-values. Test for between-study heterogeneity (Cochran’s Q-test) is 55521.34*** on 832 degrees of freedom with p-value less than .001 and I2statistics (variation in reported estimates attributable to heterogeneity) is 84.9%.
The Underlying Genuine Effect
Moving beyond the issue of publication selection bias, Table 3 also allows an examination of the presence of a genuine underlying effect. We apply the method of PET specified in Equation 3 that provides the overall average effect after accounting for publication selection bias. Consistent with the average effects presented in Table 2, all PET results across the estimation methods consistently point to a negative impact of external interventions, but statistically insignificant. This implies that external interventions are not associated with a significant reduction in conflict intensity.
Nonetheless, it is worth indicating that these findings represent average across all empirical methods. As presented in Figure 4 (depicted in the box plots), the reported estimates in the studies exhibit substantial variability both within and between studies. Empirically, the between studies heterogeneity is evident by Cochran’s Q-test, as reported in Table 3. Therefore, a multivariate MRA, as presented in the next section, becomes crucial, as our inferences may also depend on various potential sources of heterogeneity, including the quality of the primary studies, misspecification, research design or other characteristics. Moreover, this approach will further refine the examination of publication bias and the overall genuine underlying effect.
Multivariate Meta-Regression Evidence: Explaining Observed Sources of Heterogeneity
Table 4 gives the results of the reduced multivariate MRA estimating Equations 4 and 5 using the G-to-S modeling. Following this approach, outlined in Section 5, Column 1 presents the specific multivariate model with 9 variables that remain statistically significant, at least at the 10% level. To address within-study correlation, this specific model (Column 1) is subsequently re-estimated by applying the preferred MEM model (Column 4, Equation 5). For comparison and robustness checks, we also include secondary specifications (Equation 4) of CDA in Column 2, FE in Column 3, and wild bootstrap clustering in Column 6. Additionally, in Column 5 we replace partisan external intervention with pro-government and anti-government interventions for further analysis.
Explaining the Drivers of Heterogeneity in the Estimates Reported Across Studies.
Note. *, **, and *** stand for 10%, 5%, and 1% level of significance, respectively. Reported results are estimated from Equations 4 and 5 specifications. All estimates use the inverse variance as weights and standard errors reported in parentheses are clustered at the study level. Panel 1 is the specific model derived from the general specification (G-to-S approach); Panel 2 (CDA: clustered data analysis, also known as WLS) is estimated via the study level clustered robust standard errors; Panel 3 (FE) is fixed-effect estimation clustered at the study level; Panel 4 (MEM) is mixed-effects multilevel estimated through the restricted maximum likelihood; Panel 5 (MEM) replaced the external intervention partisan with pro-government and anti-government interventions; and Panel 6 (Wild bootstrapped) is regression bootstrapping the standard error (a non-standard cluster adjustment) reported with p-values.
Based on these estimations, we identify seven points highlighting observed sources of heterogeneity and variations in the underlying meta-effect. Conclusions are drawn primarily from our preferred specification (Column 4) and are categorized as follows: (i) highly consistent evidence if both sign and significance are supported by two secondary specifications (Columns 2 or 3, and 6); (ii) moderately consistent evidence if consistency exists with one of the secondary specifications (Column 2, 3, or 6); and (iii) weakly consistent if consistency is observed between two secondary specifications only (among Columns 2, 3, and 6) or results are supported by preferred specification only (Column 4).
First, we find highly consistent evidence that a single region examination tends to yield lower conflict intensity compared to a multi-region analysis. Our findings suggest that higher conflict intensity in primary studies that include countries from different regions (cross-region analysis) is likely to be upward biased due to aggregation bias in the aggregated data. bibr3 Therefore, we can infer that a single region analysis may be a better approach or good practice in future research, as it permits capturing facts more closely and identifying a more precise impact of external interventions.
Second, we find highly consistent evidence of systematic difference in terms of the outcome variable used to measure the extent of conflict intensity. Specifically, compared to other measures of conflict intensity, the impact of external intervention yields lower estimates when deaths per year is used as a proxy variable. Our results align with findings in the literature (e.g., Di Salvatore, 2019; Phayal, 2019; Phayal & Prins, 2020). These studies suggest that measuring conflict intensities over the year provides a better understanding than other measures, such as battle deaths per month. They argue that it takes time to assess the current and future potential for peacekeeping missions to curtail violence against civilians. Therefore, in future research, analyzing annual data would offer clearer insights into how interventions or observer missions are effective.
Third, concerning external interventions, we find moderately consistent evidence indicating that the type or target of intervention systematically affects the extent of conflict intensity. Specifically, we find that partisan intervention, as opposed to neutral intervention, is associated with de-escalation of the conflict. This aligns with the existing literature, which posits that neutral interventions tend to escalate conflict, while partisan interventions enhance the chance of victory for the supported party, thus being linked to reducing battlefield deaths (Hultman et al., 2014). Conversely, Sousa (2014) finds that partisan interventions escalate conflict intensity, but neutral interventions do not have any effect.
To provide a more nuanced assessment, we further explore whether partisan interventions is supporting the government (pro-government) or opposition group (anti-government). We find that if third-party intervention activities are pro-government, they significantly reduce battle deaths, whereas anti-government interventions enhance such deaths. This suggests that interventions supporting the government tend to have adequate funding and military deployment, while those supporting opposition sides do lend to “cheap talk” outcomes. These results confirm the previous findings in the literature, for instance, Haass and Ansorg (2018) and Sambanis et al. (2020), who observed that governments get support through peacekeeping operations as these foreign troops are more experienced, equipped with high-quality militaries, and are better able to deter violence from state and non-state actors. Additionally, these missions can create buffer zones within conflict areas, reach remote locations more effectively, and exert diplomatic pressure through peace agreements (Haass & Ansorg, 2018). Conversely, external interventions in the absence of popular local support may act as a catalyst for civil war, particularly when combined with other factors like ethnic or social fractionalization (Sambanis et al., 2020). These results have significant implications, suggesting that when partisan intervention supports the government, it is likely to de-escalate conflict intensity, while the opposite holds true for one-sided interventions in favor of rebels, in terms of battle deaths.
Fourth, regarding estimation characteristics, the application of IV stands out, as compared to other estimators, by yielding consistently negative and significant estimates in most cases (except for MEM models). Recent meta-analyses (e.g., Anwar & Afesorgbor, 2021) reported that IV specifications, controlling for endogeneity, measurement bias, unobserved country-fixed effects, and other potentially omitted variables are likely to yield more precise estimates in line with the theoretical predictions. This importance of IV aligns with those of Sawyer et al. (2017), who, using donor GDP as an instrument for external rebel financial assistance, discovered that rebels receiving this highly fungible support are less likely to see the end of the conflict. Additionally, our results show that studies controlling for year-fixed effects, on average, yield lower estimates than those that do not. While several recent meta-analyses suggest that dealing with unobservable time-varying effects can reduce potential bias in impact analysis, the results of these associations are weakly consistent across different specifications.
Fifth, the inclusion of natural resources in the estimated models presents a noteworthy consideration. We find weakly consistent systematic differences between studies that include the existence of natural resources in the empirical models and those that do not. This result aligns with existing work related to “greed and grievance” (e.g., Sambanis et al., 2020; Sousa, 2014). However, contrary findings have been documented by de Rouen and Sobek (2004) and Di Salvatore (2020).
Sixth, regarding publication characteristics, we find highly consistent evidence that the publication year (age of the study) systematically affects conflict intensity estimates. Recent studies tend to report an enhancing role of external interventions in conflict intensity, supporting the “research-cycle” hypothesis regarding novelty and fashion in academic research (Goldfarb, 1995). This hypothesis proposes seminal research often produces promising and significant impacts, while skeptical evidence follow-up with contrary findings of the empirical effect in question (Afesorgbor & Demena, 2022; van Bergeijk et al., 2019). Moreover, we find no systematic difference between estimates reported in peer-reviewed articles and working papers, as well as the journal’s impact factor. This finding suggests that publications in peer-reviewed journals (as opposed to working papers) do not tend to report selected evidence. This contrasts with the argument in Costa-Font et al. (2013), known as the “winner’s curse,” suggesting that peer-reviewed articles are associated with more selected evidence.
Finally, having considered these observed sources of variation, we offer a holistic view of the underlying implied effect of external interventions on conflict intensity. In comparison to the bivariate analyses, the inclusion of the observed moderator variables strongly impacts the significance and magnitude of the PET but has a weak impact on FAT. Regarding FAT, we observe weakly consistent publication bias with a magnitude of little selectivity, indicating the bias found in this literature is irrelevant. Concerning PET, there is now an underlying genuine effect between external interventions and conflict intensity. However, there are many underlying genuine heterogeneity effects than that are only related to a single PET (Demena & van Bergeijk, 2017). Therefore, we apply the “best practice” underlying overall genuine effect, exploring the findings presented in Table 4 (for a similar approach, see Afesorgbor & Demena, 2022; Floridi et al., 2020). We define the best practice conditional on a set of moderator variables, consisting of coefficient estimates derived from recent studies that focus on a single-region empirical model, analyze the outcome indicator as yearly battle-related deaths associated with partisan interventions (support of government), and the design being an IV approach that includes time-fixed effects.
The resulting best practice effect predicts a genuine effect conditional on the identified heterogeneity, which is −0.090 and statistically significant at the 5% level. Therefore, we observe that the corrected correlation coefficient derived from the best practice approach is substantially larger than the simple (weighted) average as well as the bivariate PET. Importantly, it is statistically significant after accounting for the observed sources of heterogeneity. However, following Doucouliagos (2011), the implied genuine overall effect corresponds to a small-sized correlation coefficient. Hence, by accounting for potential observed moderator variables and applying the best practice approach, the current evidence suggests that external intervention efforts, overall, prove advantageous in curbing or de-escalating conflict intensity. It is noteworthy, however, that the magnitude of the statistical significance coefficient is small. For policy purposes, external interventions are likely to have a modest effect. Any policy intervention involves some form of cost-benefit analysis, involving in this case, not just the scale of the desired outcome of the intervention but also the implementation costs linked to the intervention.
Conclusions
We present the first meta-analysis encompassing a comprehensive array of empirical studies spanning the 1996 to 2020 period. Our analysis delves into the impact of external interventions on conflict intensity, drawing on a database comprising 833 reported estimates from 34 distinct studies. While the aggregated reported effect emerges as negative and statistically significant, a noteworthy degree of variability characterizes these estimates. Approximately one-third of the reported estimates indicate a negative and statistically significant effect, offering a clear pattern. In stark contrast, a positive and statistically significant impact is observed in roughly one-fifth of the reported estimates. Meanwhile, the remaining 44% of estimates reveal evidence of statistical insignificance, whether negative or positive. Our meta-analysis aims to disentangle the diverse sources contributing to this variation, systematically examining the factors that exert systematic effects on the reported estimates. Furthermore, our work involves systematic exploration of the overall genuine effect underscored by the collective findings of this literature. Adhering strictly to the MRA reporting guidelines of MAER-Net, we present a nuanced and thorough analysis that sheds light on the intricate dynamics at work within the realm of external interventions and conflict intensity.
First, our analysis reveals a dependence of the reported estimations on the type of data utilized. Single-region data tends to yield lower estimations of conflict intensity compared to samples incorporating mixed/multi-region data. This suggests a potential upward bias in samples with aggregated multi-region data due to aggregation bias. Consequently, we advocate for a shift toward a single-region analysis in future research, positing it as a more prudent approach. This approach allows for a closer examination of the impact of external interventions.
Second, our findings indicate that estimation outcomes are influenced by both the target of interventions and the measure of conflict intensity. Notably, partisan interventions exhibit a trend toward de-escalation, contrasting with neutral interventions that seem to escalate conflict. This pattern suggests that partisan interventions contribute to the increased likelihood of victory for the supported party, resulting in a reduction of battle-related deaths. Additionally, pro-government third-party interventions demonstrate a capacity to diminish battle deaths, while the converse holds true for anti-government interventions. This implies that interventions supporting governments tend to benefit from ample funding and military deployment, while those supporting opposition factions may lend to the “cheap talk” interventions. Furthermore, our analysis indicates that the impact of external intervention on conflict intensity is more nuanced when annual battle deaths are considered, as opposed to using monthly data, which doesn’t systematically influence reported outcome significance. This suggests that the temporal dimension, specifically the time required for effects to materialize, plays a crucial role in mitigating conflict intensity. As a recommendation for future research, employing annual conflict intensity data is advised to gain clearer insights into the effectiveness of interventions or observers’ missions.
Third, our investigation uncovers a noteworthy influence of publication year on reported estimates, with recent studies indicating an amplifying role of external intervention in conflict intensity. This phenomenon may be attributed to the “research-cycle” hypothesis, wherein novel and trendy academic research tends to yield promising and significant impacts, followed by more skeptical evidence in subsequent studies. Alternatively, it might suggest a transformation over time, where initially well-funded and serious external interventions gradually devolve into mere “cheap talk.” While our findings do not provide compelling evidence, we do observe either weakly or moderately systematic variations related to estimation techniques, time-fixed effects, and the presence of natural resources.
Considering these various observed sources of variation, we present the overall implied effect of external intervention on conflict intensity, conditioned on the identified heterogeneity. The resulting best-practice effect predicts a genuine effect of −0.090, which is statistically significant at the 5% level. This suggests that external intervention activities make a statistically significant contribution to restraining or de-escalating conflict intensity, albeit with a practical effect of small magnitude. For policy purposes, therefore, external interventions are likely to have a modest effect. Any policy intervention involves some form of cost-benefit analysis, involving in this case, not just the scale of the desired outcome of the intervention but also the implementation costs linked to the intervention. Importantly, our analysis does not reveal any systematic evidence of publication selection bias in the literature on external interventions and conflict intensity.
Supplemental Material
sj-docx-1-sgo-10.1177_21582440241299976 – Supplemental material for A Meta-Analysis of the Impact of External Interventions on Conflict Intensity: Navigating Heterogeneity and Unveiling Genuine Effects
Supplemental material, sj-docx-1-sgo-10.1177_21582440241299976 for A Meta-Analysis of the Impact of External Interventions on Conflict Intensity: Navigating Heterogeneity and Unveiling Genuine Effects by Binyam Afewerk Demena, Amar Anwar and Syed Mansoob Murshed in SAGE Open
Footnotes
Acknowledgements
A preliminary version of this paper was presented at the 20th Jan Tinbergen European Peace Science Conference (2021) and at the Institute of Development and Policy Research, Addis Ababa University. Comments by participants are gratefully acknowledged. The author acknowledges the financial assistance of the Development Economics research group, International Institute of Social Studies, Erasmus University.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Data Availability Statement
The data supporting the findings of this study are available from the corresponding author upon reasonable request.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
