Abstract
The Truth and Reconciliation Commission (TRC) in Peru is usually cited as an example of how capture–recapture methods can help improve our understanding of mass violence from incomplete observed data. Using 25,000 documented death records, the TRC estimated a total of 69,000 killings, and that the Shining Path was the main perpetrator, in contrast with the raw data where the Peruvian State appears to be responsible for the most killings. One feature not often noticed is that the TRC applied an unusual indirect procedure, combining data on different perpetrators and lumping together missing perpetrator data in one group. I show that direct estimations with strict stratification by perpetrator and accounting for missing data do not support the results of the TRC’s indirect approach. I estimate a total of 48,000 killings, substantially lower than the TRC estimate, and the Peruvian State accounts for a significantly larger share than the Shining Path. Rather than an example of correcting biases in the observed data through capture–recapture methods, the TRC actually introduced further distortion.
Keywords
Introduction
The quantitative analysis conducted by the Truth and Reconciliation Commission (TRC) of Peru is arguably the most prominent application of capture–recapture methods (CRMs) for estimating deaths in armed conflicts. It is often cited as an example of its ability to correct biases in the observed data, as the TRC returns estimated victim number and perpetrator shares that diverge notably from the raw data. The TRC used three different sources on 25,000 documented killings, where the Peruvian State (hereafter, the State) appeared to be main perpetrator. Using CRMs, the TRC estimated 69,000 total killings, and surmised that the insurgent group the Shining Path (Sendero Luminoso in Spanish) accounted for the largest share. Their results have often been cited as evidence that CRMs provide a better understanding of the central patterns in violence than raw observed data. My reanalysis does not support this interpretation, and suggests that the TRC actually introduced further distortion rather than eliminating biases.
The devil is in the details: few have noticed that the TRC applied an unusual indirect procedure. The available data for Peru on the source, group, and geographic strata are too sparse for CRMs to be applied directly in all geographic strata. To address this challenge, the TRC used a nonstandard approach, combining data of different perpetrators and assigning missing perpetrator data to one perpetrator group. I show that a standard CRM application is possible for a subset of the data, and that this allows us to estimate the full data with more accuracy than the indirect application of CRMs. I estimate a total of 48,000 killings, substantially lower than the TRC estimate. Moreover, the State appears to account for a significantly larger share than the Shining Path. Rather than eliminating biases in the original data, the indirect CRM estimation by the TRC seems to have created a distortion in the data of documented deaths. In these direct estimates, the ratio of State-perpetrated to Shining-Path perpetrated deaths is roughly the same as the documented deaths.
Given the salience of the TRC analysis for Peru, this reanalysis has important lessons for both researchers and policymakers. Application of CRM is increasingly used to provide quantitative accounts of armed conflicts and human rights data. These analyses are often used to support proposed policies for reparation, institutional reforms, criminal prosecutions, as well as historical narratives. It is thus crucial to assess critically the possible limitations of this application and propose alternatives that can help overcome its problems.
Peru suffered a war of insurgency and counterinsurgency from 1980 until around 2000, involving the police and army, paramilitary forces and self-defense groups, on the one hand, and insurgent groups, mainly the Shining Path, on the other (Comisión de la Verdad y Reconciliación, (CVR), 2003). By 2000 most of the insurgent leaders were either captured or killed, and insurgency activity declined dramatically. In 2001 the government established a TRC to cast light on the facts of the war, and in particular the total number of killings and the responsible actors. Before the TRC, around 9000 killings were documented by human rights non-governmental organizations (NGOs) and the Ombudsman’s Office, and the State was responsible for more than 90% of these recorded deaths. The TRC changed this picture dramatically: it added around 15,000 additional documented killings, with the Shining Path as the main perpetrator of about 57% of these. The final TRC dataset contained about 25,000 killings (documented deaths and disappearances), classified according to three distinct sources of information (the TRC itself, the Ombudsman’s Office, and human rights NGOs), and three perpetrators: the State, the Shining Path, and “Other” perpetrators. The quantitative work for the TRC was undertaken by Ball et al. (2003), extending previous CRM applications to human rights in Guatemala (Ball, 2000) and Kosovo (Ball et al., 2002).
From the 25,000 documented killings, with 47% attributed to the State and 37% to the Shining Path, the TRC estimated approximately 69,000 total killings, with 30% carried out by the State and 46% by the Shining Path, that is, the opposite pattern of responsibilities to the documented raw data.
The TRC results were surprising and motivated considerable debate in Peru, because of the large total estimated number of killings, “greater than the number of human losses suffered by Peru in all of the foreign and civil wars that have occurred in its 182 years of independence” (CVR, 2003). The claim that the Shining Path was the main perpetrator of killings was also a surprising finding, and did not conform to common perceptions in Peru about the armed conflict.
These results also influenced views about how CRMs could be applied to human rights violations and violence. The reversal in the assigned proportions of responsibilities compared with the observed data was seen to bolster the claim that CRMs provided a more unbiased account of the true events. It was argued that analyses of the observed data were inaccurate and naïve accounts of an armed conflict (Price and Ball, 2015a), “misleading and biased”, 1 while CRMs allegedly corrected for census undercounts and could uncover true perpetrator and time patterns (Ball, 2012; Ball et al., 2003; Landman and Gohdes, 2013; Manrique-Vallier et al., 2013; Price and Ball, 2015a, 2015b). 2
While it is clearly warranted to caution against naïve views about observed non-random samples, it is also naïve to think that CRMs applied to biased samples will deliver unbiased estimates: “the [CRM] estimator can be biased, since we are assuming that the model which describes the observed data also describes the count of the unobserved individuals. We have no way to check this assumption” (Bishop et al., 1975: 254).
In practice, CRMs face several practical challenges, as exemplified by the indirect nonstandard application used by the TRC in Peru. Left with three samples that implied different pictures of the war and extremely sparse data for the Shining Path, Ball et al. (2003) asserted that a standard application of CRMs to this party was not possible. They sought to get around this by adding the sparse Shining Path data to dense data for the State to produce an estimate, and then compute an estimate for the Shining Path as the combination of the State added to the Shining Path minus the State. Essentially, the TRC authors realized that they could not obtain direct estimates everywhere for the Shining Path, and thus opted to avoid them everywhere. I will show, however, that in many geographical areas or strata, the data are sufficiently dense to apply standard CRMs. I computed direct estimates wherever possible and compared these with the TRC’s indirect estimates. A key finding is that the indirect method has a strong upward bias when estimating the Shining Path deaths. The direct estimates do not display the perpetrator reversal in the TRC estimates, and accentuate the primary role of the State. In other words, by applying CRMs in the normal manner to the strata for which it is possible to do so, we find that the striking TRC result appears to be an artifact of the unconventional approach devised to deal with the sparse data problem. This is an important result, given the extensive campaign to promote the TRC’s striking but incorrect findings, and how other data on documented deaths or events have been downplayed as a result.
The aim of this manuscript is not to challenge the validity of the CRM as a methodology for quantitative conflict analysis, but to provide new and improved CRM-based estimates for the case of Peru. However, these estimates also undermine claims made about the utility of CRMs, namely that they provide an inherently less biased account of events compared with observed raw data. I show that when the TRC analyses is looked at more closely, this claim is not supported, thus undermining the arguments often offered by capture–recapture advocates for the importance of the technique to quantitative conflict analyses.
This has more general implications beyond this particular study as CRMs are increasingly being used in several areas of the social sciences. 3 These methods work relatively well when data are sufficiently dense, but the proposed indirect application is not an adequate solution when data are sparse and can introduce distortions that are more severe than the alleged biases in the observed data. Researchers should be very cautious when the estimates suggest the opposite pattern to the observed data.
The raw TRC killings data
The TRC collected a sample on killings committed in Peru during the war from 1980 until 2000. Mobile teams were sent to pre-assigned areas with target numbers from each area. The data include retrospective information provided by victims’ relatives or friends, which were further reviewed, verified and qualified by TRC officials. What became evident in many interviews was that respondents did not know who the perpetrator was and thus did not report it. In some cases, TRC officials determined a perpetrator based on the “regional historical context” or known patterns of human rights violations. In addition to the data collected by the TRC itself, the TRC used data from two other sources: the Ombudsman’s Office, and human rights NGOs.
The TRC considered three perpetrators: the State, the Shining Path, and Other. The State includes the police and army units, and self-defense groups, which are State-sponsored citizens’ armed counter-insurgent groups, and paramilitary groups, that is, State-sponsored or protected armed units that do not legally belong to any official armed branch. Besides the Shining Path, other less powerful insurgent groups such as the Túpac Amaru Revolutionary Movement were treated as a third group called Other perpetrators. The TRC included in this group all cases where the perpetrator was unknown or undetermined.
Table 1 summarizes the available information by perpetrator and intersected sources. There is a notable difference between the TRC data and the other two sources. In the latter, the State is overwhelmingly the main perpetrator, while in the TRC data the proportions are more comparable and the Shining Path is the main perpetrator. In terms of source overlap, only observations for the State are fairly evenly distributed across all three sources. The TRC has only 33.6% exclusive observations for the State (3888 out of 11,564), but 94.9% exclusive observations for the Shining Path, 8768 out of 9243, leaving limited intersections with the other sources. For Other perpetrators, 3138 out of 3885 (80.8%) observations are from the TRC. 4 These samples suggest two very different pictures of the war in Peru, and calls for an examination of the statistical procedures.
Killings by source overlap and responsible party.
Note: C: TRC; D: Ombudsman’s Office; N: human rights NGOs.
The Truth Commission’s estimation
The TRC applied CRMs, creating “models of interdependence among the three lists by using constrained, hierarchical log-linear models” (Ball et al., 2003: 22) (Agresti, 2002; Bishop et al., 1975; Fienberg, 1972; Sanathanan, 1972; Zwane and van der Heijden, 2005):
where
The TRC argued that “two levels of stratification,” to geographic strata and the three perpetrators, was “not possible because of the sparseness of the data for reported deaths attributed to the PCP-Shining Path and other perpetrators” (Ball et al., 2003: 24). The TRC used a nonstandard approach, adding the number killings by the Shining Path to the killings of the State “[i]n order to reduce the sparseness and to take advantage of the dense information about deaths attributed to the State” (Ball et al., 2003: 24–25). The TRC exploited the fact that the existence of CRM estimates for the dense data for the State implies the existence of estimates for sums of the State with any other perpetrator group, as these sums also must be dense. Under this approach, the TRC created the following four combinations: the State, the State plus the Shining Path, the State plus Other, and All documented killings. The TRC then calculated the killings by the Shining Path by subtracting the State plus the Shining Path minus the State. Similarly, the estimate for Other was derived by subtracting the State plus Other minus Other.
The TRC partitioned the data into 58 geographic strata based on the political subdivisions of Peru, grouping larger units where the armed conflict was less intense and smaller units where it was more intense (Ball et al., 2003: 47). As this indirect method may produce negative estimates for the Shining Path and Other, the TRC excluded models with such results, and used the next lowest
Table 2 presents the selection that the TRC performed to build the estimation sample. The TRC excluded 2742 killings because of missing information on where the killing occurred. At the same time, the TRC included 2944 killings for which the perpetrator is unidentified, also missing data, in the Other group. Unfortunately, the TRC did not provide any discussion on sample selection, patterns of missingness in the data, and their possible impact on the estimates of killings by perpetrator. This selection choice essentially mostly excluded killings by the State (1998 killings), thereby changing the perpetrator’s observed responsibility. In the original sample, 46.8% of the killings were caused by the State and 37.4% by the Shining Path, but these percentages changed through selection to 43.6% for the State and 41.3% for the Shining Path.
Sample selection by the TRC.
After estimating total killings for the final 58 strata and three perpetrators, as well as their differences, the TRC aggregated the results into seven regions: 1. Ayacucho, 2. Centro, 3. Nor-Oriente, 4. Sur Andino, 5. Huancavelica, 6. Lima-Callao, and 7. Otras; and into total values for the whole country.
Table 3 shows the TRC results by perpetrator and the total number of killings, 69,280. 6 The Shining Path was responsible for 31,331 killings, while the State was responsible for 20,458 killings. The fifth column of this table shows the difference between the Shining Path and the State. This last statistic led the TRC to assert that the Shining Path “is responsible for a significantly greater number of deaths than agents of the State,” and to “reject the hypothesis that the number of deaths caused by one group is the same as the number caused by either one of the other two groups” (Ball et al., 2003: 7).
TRC’s estimates of total number of killings by responsible party and differences between killings attributed to the Shining Path and the State. Standard errors are in parentheses.
Direct estimation for a subset of strata
The TRC authors chose to use indirect estimation as they could do not standard estimations for the Shining Path in all strata. However, there are nine Shining Path strata for which we can use standard estimation for at least one log-linear model of the three sources. It is instructive to compare these standard estimates to the TRC results. These nine strata contain 3996 observations, which is 44% of the total for this party. Table 4 compares the standard estimations in this subset of strata to the TRC’s estimations.
Estimates for the Shining Path killings by method for nine strata that admit standard estimations.
AYA: Ayacucho, CEN: Centro, NOR: Nor-Oriente, SUR: Sur Andino, HUA: Huancavelica, Obs.: Observations, Direct est.: Direct estimation, Est.: Estimate, S.E.: Standard Error.
A standard estimation results in 6288 total killings for these strata, while the TRC’s estimate is much larger: 11,673 killings. 7
Table 5 reports the difference between the killings of the Shining Path and the State in these nine strata by method. The direct estimations stand in stark contrast to the result from the TRC’s indirect application of CRMs. Whereas the TRC’s result led to a reversal in the largest perpetrator compared with the observed data, the standard estimation results indicate that the State is responsible for a significantly larger number of killings than the Shining Path. A simulation study – detailed in Online Appendix C – comparing direct and indirect estimations indicates that the indirect estimates have larger and often substantially upward biases and larger standard errors than the direct estimation. Thus, the TRC approach is very likely to contain a large upward bias in these nine strata, and indeed all strata of the sample.
Differences between killings of the Shining Path and the State by method for nine strata that admit direct estimations.
Est.: Estimate, S.E.: Standard Error, Z: Z-statistic, p: p-value, C.I.: Confidence Interval.
In sum, analyses of both the actual data used by the TRC and simulations indicate that the indirect estimation does not eliminate but rather introduces biases for the sample subset. In the next sections I turn to estimates for the entire country beyond the strata that allow a conventional direct estimation as well as addressing how to deal with missing data problems in the unknown perpetrators category.
Direct estimation by geographic interpolation for the whole country
Having established that there exist strata that admit a direct estimation, I used them to estimate total killings in all strata by a geographic interpolation method known as kriging. I show below that the kriging results can be validated by alternative estimation methods that maintain a strict stratification by perpetrator or do not require mixing data of different perpetrators in any way, namely (a) a fixed effects estimation consisting of constructing groups of strata where direct estimation is possible, and (b) an alternative stratification in fewer dense strata where a direct estimation can be done.
Similar to the case of the Shining Path, there are 23 strata for Other perpetrators for which we can conduct a direct estimation, and these comprise 61% of the total number of observations. Geographic interpolation is obtained by first estimating the ratio of the estimated total population over the observed population for a subset of the strata that admits a direct estimation. Then, this ratio is calculated for each stratum as a weighted combination of the ratios in the strata of the first subset. Finally, this interpolated ratio is multiplied by the number of counts in each stratum. Online Appendix D details the kriging procedure.
Table 6 shows the estimated number of killings by perpetrator aggregated by region. The State is responsible for an estimated 20,516 killings, the Shining Path for 15,089 killings and Other for 12,444 killings, in percentages of 42.7%, 31.4%, and 25.9%, respectively. The Total column presents the sum by row, 48,049 killings; a number substantially lower than estimated by the TRC. Note also that the ratio of killings by the State and by the Shining Path is 20,516/15,089 = 1.36, which is very close to the corresponding ratio in the observed unselected data, 11,564/9243 = 1.25.
Direct estimation. Estimates for killings by all parties by region. Standard errors are in parentheses.
AYA: Ayacucho, CEN: Centro, NOR: Nor-Oriente, SUR: Sur Andino, HUA: Huancavelica, LIM: Lima-Callao, and OTR: Otras.
Table 7 reports the differences between the estimated number of killings by the Shining Path and the State. We can reject the hypothesis that the total difference is greater than or equal to zero at a high level of confidence. Thereby this test rejects or, more precisely, determines as very unlikely, that the number of killings by the Shining Path is equal to or larger than the number of killings by the State.
Direct estimation. Differences between the killings of the Shining Path and the State by region.
AYA: Ayacucho, CEN: Centro, NOR: Nor-Oriente, SUR: Sur Andino, HUA: Huancavelica, LIM : Lima-Callao, and OTR: Otras, Est.: Estimate, S.E.: Standard Error, z: z-statistic, p: p-value, C.I.: Confidence Interval.
I assessed the accuracy of the kriging geographic interpolation procedure in three ways. First, I performed a leave-one-out cross validation (see Online Appendix E), which suggested that kriging estimates are generally larger than direct estimates, yet smaller than the indirect TRC estimates. This reinforces the finding that the Shining Path carried out fewer killings than the State. Second, I compared kriging estimates to fixed effects estimates, based on groups of strata by geographical vicinity. Online Appendix F explains the fixed effects estimation and its results: 46,000 total killings and that the State is the main party responsible. Finally, I stratified the data into fewer strata so that a direct estimation was possible for all three perpetrators, see Online Appendix G. Once again, the estimation results of this stratification was a total number of killings of 46,000 with the State as the main perpetrator.
In sum, estimates for the whole country can be produced from the strata that allow a conventional direct estimation. A direct estimation that strictly stratifies by perpetrator results in 48,000 killings of which the State is the main party responsible.
Multiple imputation for unknown perpetrators
In this section I address the problem of missing perpetrator data by a multiple imputation defined in Rubin (1987). This consists of constructing simulated datasets and then computing averages over their corresponding estimates. I first estimated a multinomial logit for the probability of reporting a specific perpetrator depending on observed variables such as the victim’s gender and age, whether the victim is disappeared or dead, year of killing, and the source to which the event was reported. With these parameters’ estimates I simulated 20 samples obtained from imputing missing perpetrator data from Other to the three perpetrator groups. Online Appendix H describes in detail the construction of the samples by multiple imputation.
Table 8 shows the perpetrator composition in the sample used for the previous estimation and the averages of the 20 imputed samples. This is essentially a reassignment of perpetrator data: 2863 observations of Other are attributed to the State and the Shining Path, 1573 and 1290 observations, respectively. In these imputed samples, on average the State is responsible for 51% of the killings and the Shining Path for 47%, while identified Other are responsible for only 2% of the killings.
Total counts and percentage by responsible party for the original sample and samples of imputed missing data.
For each of the imputed samples, I carried out a kriging interpolation. For the Shining Path there are now 11 strata that allow for direct estimation, which contain 49% of the total number of observations, while for ‘Other’ (identified), there are on average 9 strata where direct estimation is possible, which contain 37% of the total number of observations.
Table 9 presents the results of the multiple imputation estimations by perpetrator aggregated by region. The estimated total number of killings, 47,849, changes only slightly from the total number estimated in the previous section, 48,049. The number of killings by the State increases from around 20,516 to 27,872, while the number of killings by the Shining Path increases from 15,089 to 18,341. Most killings that were previously contained in the group of Other, actually missing data, are now attributed to these two perpetrators. The composition of estimated killings results in 58.3% for the State, 38.3% for the Shining Path, and 3.4% for identified Other.
Multiple imputation estimation for killings of all parties by region. Standard errors are in parentheses.
AYA: Ayacucho, CEN: Centro, NOR: Nor-Oriente, SUR: Sur Andino, HUA: Huancavelica, LIM : Lima-Callao, and OTR: Otras.
Table 10 shows the difference between the killings of the Shining Path and the State for the multiple imputation estimations by region and total. The total negative difference widens from -5427 in Table 8 to -9531, and we can also reject the hypothesis that the difference between the total number of killings of the Shining Path and the State is greater or equal to zero is – that is, the State seems to be responsible for a greater number of killings than the Shining Path.
Multiple imputation estimation. Differences between killings of the Shining Path and the State by region.
AYA: Ayacucho, CEN: Centro, NOR: Nor-Oriente, SUR: Sur Andino, HUA: Huancavelica, LIM : Lima-Callao, and OTR: Otras, Est.: Estimate, S.E.: Standard Error, z: z-statistic, p: p-value, C.I.: Confidence Interval.
In conclusion, accounting for missing perpetrator data does not change the estimated number of total killings, but it does change the composition of perpetrators. The killings with missing perpetrator that were attributed to Other are attributed in a larger proportion to the State than to the Shining Path, adding to the previous findings where the former is responsible for more killings than the latter.
Conclusions
In this analysis I applied the same CRMs as the TRC of Peru. A key challenge for estimation was that the raw data here were very sparse for perpetrators other than the State. The TRC applied a nonstandard method that adds and then subtracts data of different perpetrators and does not account for missing perpetrator data. I have shown that the TRC’s indirect method had a sizable upward bias compared with a direct estimator. For the Peruvian data, the application of the indirect method reversed the pattern of responsibilities in the observed samples. However, this is an artifact of the indirect method rather than a result of removing bias.
As an alternative, I used a subset of strata where the standard estimation approach was possible, and used this to estimate the whole dataset by means of geographic interpolation. The figure that emerges for the armed conflict in Peru is illustrated in Table 11. A direct estimation, strictly stratified both by geography and perpetrator, extended by a multiple imputation of missing perpetrator data, sets the estimated total number of killings at around 48,000, of which 28,000 were committed by the State, 18,700 by the Shining Path, and 1300 by Other perpetrators. The proportion of responsibilities is thus 58.3% for the State, 38.3% for the Shining Path, and 3.4% for Other perpetrators. I find that it is very unlikely from a statistical point of view that the Shining Path has an equal or larger responsibility for killing than the State; the State seems responsible for a greater number of killings than the Shining Path.
Peru 1980–2000: Estimates and confidence interval limits of the total number of killings by responsible party (confidence level: 95%).
These findings imply a word of caution on the use of CRMs for quantitative analysis of violence, especially when the estimates seem to differ substantially from the patterns of the observed data. Rather than correcting for alleged biases in the observed data, the TRCs’ indirect application of CRMs introduced a distortion and a misunderstanding about the proportion of responsibilities in Peru. Direct application of CRMs produces estimates that are closer to the patterns shown in the observed data.
Supplemental Material
supplemental_material – Supplemental material for Capturing correctly: A reanalysis of the indirect capture–recapture methods in the Peruvian Truth and Reconciliation Commission
Supplemental material, supplemental_material for Capturing correctly: A reanalysis of the indirect capture–recapture methods in the Peruvian Truth and Reconciliation Commission by Silvio Rendon in Research & Politics
Footnotes
Acknowledgements
I would like to thank Kristian Skrede Gleditsch for great editing and thoughtful comments throughout the refereeing process for both this and the original paper. I thank two anonymous referees, and R. Alvarado, D. Cedano, D. Manrique-Vallier, K. Pollock, N. Quella, M. Spagat, and former TRC officials E. González-Cueva and J. Ciurlizza for providing the data for this research. I also thank participants of the seminars at the Central Bank of Peru and at Instituto de Matemática y Ciencias Afines in Lima, Peru. I started this research while I was an assistant professor at Stony Brook University, and continued the research as a personal project after I left that role. The views expressed in this paper are those of the author and do not represent the views of any organization that the author may currently be affiliated with. All remaining errors and omissions are solely mine.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental materials
The supplemental files are available at http://journals.sagepub.com/doi/suppl/10.1177/2053168018820375. The replication files are available at ![]()
Notes
Carnegie Corporation of New York Grant
This publication was made possible (in part) by a grant from the Carnegie Corporation of New York. The statements made and views expressed are solely the responsibility of the author.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
