In his discussion of evidentiary policies, Blackstone famously noted that ‘it is better that ten guilty persons escape than that one innocent suffer’ (Blackstone 1769). The conventional wisdom among lawyers, judges as well as academics holds that accepting this statement as a maxim necessitates the adoption of pro-defendant evidentiary rules. It is also commonly believed that costs associated with false convictions being greater than those associated with failures to punish offenders due to the presence of punishment costs provides a utilitarian rationale for Blackstonian principles. After formalizing Blackstone ratios (either as marginal rates of substitution or, alternatively, as the ratio between quantities of errors), I show these conventional views are incorrect. I then propose a simple modification of the Blackstone ratio, which shifts the focus from aggregate outcomes to consequences for individuals within the criminal justice system. This modification better aligns commonly held views about the Blackstone ratio with its actual implications and justifications.
‘[B]etter that ten guilty persons escape, than that one innocent suffer’.1 This is William Blackstone’s well-known statement which later provided the motivation for many commentaries on, and was also used to capture the rationale behind, many pro-defendant procedures and doctrines in criminal law. Moreover, similar statements in which different numbers have been substituted for in Blackstone’s statement have been made by others, some pre-dating Blackstone.2 Thus, today, Blackstone’s ratio is used to refer to a more general proposition about the asymmetry between the perceived costs, or social tolerability, of punishing innocent parties and failing to punish offenders, and embodies one of the first principles to which students of the law are exposed. It is therefore unsurprising that Blackstone’s ratio is far from being merely an academic curiosity, but it has also greatly influenced actual legal decision making, precedents, and therefore, the law. Given its great importance, many have attempted to construe the precise meaning of Blackstone’s ratio,3 and others have interpreted it as providing the basis for a variety of criminal procedural safeguards which call for different types of protections for criminal defendants.4 Yet others have suggested that the ratio’s prioritization of wrongful convictions over wrongful acquittals can be rationalized based on various characteristics of the criminal justice system.5
Here, I question whether Blackstone ratios, by which I mean ratios of the type Blackstone has referred to more generally, as opposed to the specific ratio of ten to one, can serve the function that they have been understood to serve. To do so, I provide a simple formalization of Blackstone’s ratio, which follows, in my opinion, the most natural economic interpretation of his original statement. Namely, I conceptualize Blackstone’s ratio as a lower bound on the marginal rate of substitution between the number of wrongful convictions and the number of wrongful acquittals associated with social preferences. I find this to be the most natural interpretation of Blackstone’s statement, because the statement specifies how two things ought to be traded-off, and this is precisely what is conveyed through the marginal rate of substitution. In fact, this appears to be the definition which has been implicitly adopted in some prior studies attempting to empirically assess Blackstone ratios most consistent with people’s preferences,6 but which has surprisingly evaded prior attempts at formalizing the Blackstone ratio (see, e.g., DeKay (1996)).
It is also worth noting that Blackstone made this statement when describing a principle that ought to constrain the nature of evidentiary rules. Thus, Blackstone’s ratio implies that we ought to implement procedures and policies in our criminal justice system such that the rate at which we actually trade-off the number of failures to convict guilty people and the number of wrongful convictions is no smaller than some ratio greater than (and, specifically, in Blackstone’s statement). This trade-off, in turn, is measured by the marginal rate of transformation, that is, the rate at which we can trade-off the two errors, given behavioral and technological constraints. I allow to be any number greater than , to incorporate the asymmetry inherent in Blackstone’s statement, and also to allow for the variation in views regarding the precise ratio noted previously.
I then question whether the Blackstone ratio thus defined can provide a meaningful criterion that can guide criminal justice policy. My analysis reveals two important findings contrary to conventional wisdom. First, an arbitrarily large Blackstone ratio can emerge even when convictions are made through a ‘more likely than not’ (MLTN) standard—a phrase often, but not always, used interchangeably with the ‘preponderance of the evidence’ standard7 —which is a weaker standard than the ‘beyond a reasonable doubt’ standard used in criminal trials. In fact, this standard is typically associated with a Blackstone ratio that is greater than one within standard law enforcement models. Thus, a Blackstone ratio that places greater emphasis on acquitting the innocent rather than convicting the guilty does not necessarily imply the high standards of proof adopted in criminal justice systems. It is worth noting that this analysis is purely positive in its nature, that is, it does not relate to a normative claim. It merely assesses the validity of statements made in the literature that implementing a large Blackstone ratio requires heightened standards of proof.
Then, I turn to a related normative issue. There is an impression created in the literature that punishment costs imply large Blackstone ratios. This claim is based on the observation that, given any crime rate, an additional wrongful conviction generates punishment costs, while an additional wrongful acquittal reduces them.8 Thus, wrongful convictions carry greater social costs than wrongful acquittals, and therefore optimal policies ought to reflect this asymmetry, which in turn provides a basis for a Blackstone ratio greater than . I show that this implication does not hold, either: the optimal conviction criterion that emerges in the standard enforcement model with costly punishments can require tolerating more than one additional false conviction to eliminate a failure to convict, that is, a Blackstone ratio smaller than one.
These results, taken together, reveal a lack of fit between what Blackstone’s ratio is believed to stand for and its implications when interpreted in the intuitive manner which I have described. Thus, although I believe that the most natural interpretation of the Blackstone ratio is that it is a constraint on the marginal rate of substitution, I next question whether an alternative definition of Blackstone’s ratio, one which has been suggested in the literature, may perform better. This alternative definition focuses on the ratio between the quantities of the two types of errors, as opposed to how they ought to be traded-off.9 Unsurprisingly, this definition leads to similar problems.
I then consider a simple modification of Blackstone’s statement which might mitigate the disconnect between what the statement describes and its purported implications. For this purpose, I propose a modified ratio, which I call the ratio. The ratio focuses on the trade-offs between the probabilities of erroneous outcomes, conditional on the defendant’s actual guilt, as opposed to trade-offs between the numbers of erroneous outcomes, as in Blackstone’s original statement. This is formalized by the marginal rate of substitution, , between two simple probabilities: convictioninnocence and non-convictionguilt. I show that the ratio performs better with respect to both issues described above. Specifically, (i) when punishment is costly, the optimal standard of proof is associated with an ratio greater than , and (ii) an ratio generates an evidentiary standard that is stricter than the MLTN standard as long as is greater than the ratio between the truly innocent to truly guilty parties, that is, the ratio between the compliance rate and the offense rate. Thus, I conclude by suggesting a reformulation of Blackstone’s statement that describes the ratio:
‘It is better that a guilty person more likely escape conviction, than an innocent more likely suffer’.
This statement, too, involves a trade-off. It sacrifices clarity for brevity. A more precise and lengthier statement that describes the ratio, and which departs from Blackstone’s original statement as little as possible, is the following:
‘It is better that a guilty person’s likelihood of conviction be reduced by , than an innocent person’s likelihood of conviction be increased by ’
Quite interestingly, what is responsible for the better performance of the ratio is its focus on individuals rather than aggregates. The classic Blackstone ratio focuses on the numbers of guilty and innocent convicted, whereas the ratio focuses on individual consequences (the probabilities of conviction, conditional on guilt and innocence). As a result, the trade-off that defines the ratio is unresponsive to changes in the aggregate offense rate, while the classic Blackstone ratio is. Because evidentiary standards like the MLTN and optimal standard of proof also have similar properties, the ratio can more naturally be related to them.
Before formalizing the points described above, it is worth describing how the this article builds on prior scholarship. While most prior scholarship focusing on the relationship between Blackstone ratios and evidentiary rules appears to be informal, in the mathematical sense, a notable exception is DeKay (1996).
As here, DeKay (1996) considers various mathematical definitions of the Blackstone ratio and questions how it relates to standards of proof. Unlike here, DeKay (1996) does not consider a definition of the Blackstone ratio which focuses on the marginal rate of substitution between the number of errors. Instead, it considers the ratio of error frequencies definition I consider in section 3, as well as the ratio of (relative) utilities from errors. However, DeKay’s utility based analysis is conceptually problematic, because it does not define utility functions in their ordinary economic sense, that is, as representations of rankings of possible outcomes, which in this case, corresponds to error pairs.10 Also, as noted in Kaplow (2011a), DeKay’s analysis implicitly takes behavior to be exogenous, and is thus incapable of incorporating deterrence effects, which are at the heart of most theoretical economic analyses of crime.
Another important difference between DeKay (1996) and this article is their focus. The former is concerned with highlighting the fact that standards of proof and Blackstone-like ratios are different concepts, which are affected by different considerations. Thus, it concludes that ‘many authors have incorrectly assumed that the standard of proof completely determines the ratio of erroneous acquittals to erroneous convictions’ (DeKay, 1996, p. 131). This difference between concepts is presumably apparent to most theoretical social scientists today. Indeed, the question posed here is whether high Blackstone ratios necessarily imply pro-defendant biases, and whether optimal standards of proof in the presence of costly sanctions imply large Blackstone ratios. Thus, it takes the type of difference that DeKay notes, as given, and conducts a systematic analysis of the relationship between the two concepts to dispel conventional wisdom regarding the same. It shows that no Blackstone ratio, regardless of how large it may be, can on its own justify pro-defendant evidentiary rules, and that costly sanctions are insufficient to provide a utilitarian rationale for large Blackstone ratios. Moreover, unlike any prior work that I am aware of, including DeKay (1996), this systematic analysis allows me to propose a simple modification to the Blackstone ratio to restore its usefulness.
My analysis is also closely related to the economics literature on standards of proof, and in particular Kaplow (2011a). The pioneering work on optimal standards of proof, namely Lando (2002), and Demougin and Fluet (2005, 2006, and 2008), conceive of standards of proof as a threshold evidence to which the evidence observed in court must be compared to render decisions, and investigate the properties of optimal standards of proof. Subsequently, many economics articles that incorporate additional functions and effects of evidentiary requirements adopted similar conceptions of the standard of proof.11 Notably, none of these articles explicitly questioned the relationship between Blackstone ratios and (optimal) standards of proof, as I do here.
The closest in this regard is Kaplow (2011a), which focuses on a trade-off introduced in Kaplow (2011a, 2011b, and 2011c) and Mungan (2011). In these articles, evidentiary standards may affect not only people’s decisions to refrain from crime, but also innocent parties’ decisions to engage in socially benign (or even desirable) behavior. Kaplow (2011a) notes that when these types of ‘chilling behavior’ are accounted for, the optimal standard of proof balances deterrence benefits and chilling costs. The article then explains the mismatches between optimal standards of proof and standards considered in the literature, including the MLTN standard, as well as objectives like achieving a targeted ratio of judicial errors. Kaplow also notes that the presence of costly sanctions may have an ambiguous effect on the optimal standard of proof.12 However, Kaplow (2011a) does not focus on how the optimal standard of proof or the MLTN standard compares to Blackstone ratios. Moreover, in its comparisons it focuses on mismatches between the standards analyzed (e.g., the optimal standard of proof vs. MLTN) as opposed to questioning whether principles reflected by one concept can appropriately rule out other principles or practices as I do in this article.
This latter exercise is useful for ascertaining whether conventional and doctrinal approaches can act as useful constraints on the rules or standards that can be adopted. This is because principles that guide legal practices are often not precise, and thus do note define, for instance, the specific evidentiary standard to be applied to achieve optimality. The latter task is, in fact, impracticable as noted in various criticisms of Kaplow (2011a).13 This is perhaps why legal decision makers often prefer to focus on more workable concepts that are easier to describe. The Blackstone ratio is perceived as providing this type of constraint. My analysis here highlights that its focus on the number of errors, as opposed to the probabilities of errors conditional on behavior, makes it ill-fit to serve the function it has been assumed to serve in the literature. Thus, slightly modifying it, as described above, may restore its function, when correctly interpreted, as a helpful and intuitive tool in discussing criminal justice policies.
Another point worth emphasizing is that the analysis here builds on prior studies (see, e.g., the survey in Polinsky and Shavell, 2007) which follow Becker (1968) in studying the deterrence effects associated with sanctions and other law enforcement policies. More specifically, following the prior literature on the optimal standard of proof noted above, deterrence effects are incorporated by allowing the offense rate to be responsive to the evidentiary standards adopted by the government. These effects play a central role in Section 2.2 in demonstrating that large Blackstone ratios need not imply strong pro-defendant protections; and they play a similar role in showing that punishment costs need not provide a rationale for large Blackstone ratios in Section 2.3.
Given the impact and prevalence of Blackstone’s ratio, it is important to clarify and formalize the points described above. This is done in the next three sections. I provide brief concluding remarks in the fifth section. An appendix in the end contains formalizations of some properties of evidentiary rules.
Blackstone’s ratio and implications
To formalize social preferences consistent with Blackstone’s statement, consider a social objective, , which is a function of the two objects of interest, namely the number of guilty persons who escape conviction, , and the number of innocent people who suffer, . Implicit in Blackstone’s statement is that , that is, both types of erroneous outcomes are socially undesirable. Thus, conceiving of as a social utility function, one can express the set of combinations of and which give rise to the same (attainable) social utility, , through a social indifference curve
Since, , it follows that the marginal rate of substitution () between and is:
If, for instance, this marginal rate of substitution is constant and equals , then this suggests that reducing by one and increasing by would leave social utility unchanged. On the other hand, the same change in the number of erroneous outcomes would imply an increase in social utility if , which corresponds to Blackstone’s statement: ‘[B]etter that ten guilty persons escape, than that one innocent suffer’.
This set-up does not require any specific functional form for . However, the simplest case, where the is constant, is obtained when , where the constant can be interpreted as Blackstone’s ratio. Figure 1, below, depicts this case through social indifference curves () in space, along with a ‘minimum error frontier’ labeled , that reflects the attainable pairs given evidentiary and behavioral constraints described in the next sub-sections. The point at which this frontier is tangent to a social indifference curve (e.g., point marked in Figure 1) is where is maximized, because .
Blackstonian indifference curves and the minimum error frontier.
It is worth noting that the marginal disutility from each erroneous outcome need not be constant, as depicted in Figure 1, and the set-up described above allows for this possibility. Thus, more generally, one can conceive of Blackstone’s ratio as placing a lower bound on the associated with around its maximizer, that is, , given the constraints formalized below. This section focuses on this interpretation of Blackstone’s ratio, and an alternative interpretation proposed in the literature is considered in Section 3.
Evidentiary standards and conviction probabilities
Blackstone made his statement as one which ought to guide principles governing evidentiary rules. Thus, to study the relationship between evidentiary rules and Blackstone ratios, I first describe how and are affected by evidentiary standards, first without employing any specialized assumptions, and subsequently specifying a relationship between these standards and crime rates to allow more specific observations. Generally, the number of innocent people who suffer punishment is given by
where is the number of people in the population; is the unconditional probability of innocence; denotes trial, such that is the probability of an innocent person standing trial; and denotes a conviction such that is the probability of conviction conditional on trial and innocence. Similarly,
where denotes guilt. Here is the probability a guilty person escapes punishment, either because they don’t stand trial or because they stand trial but are not convicted.
To abbreviate expressions, I adopt the following notation for these probabilities
In general, the evidentiary standards adopted in trials affect and , which may, in turn, have deterrence effects. To capture these effects, I adopt a modeling approach introduced in the prior literature on optimal standards of proof (see, e.g., Demougin and Fluet (2006), Mungan (2020b), and Fluet and Mungan (2025)). In this approach, and are modeled as functions of , that is, and ,14 as opposed to functions of the evidentiary standards chosen by the government. This approach can be adopted, since each evidentiary standard corresponds to a particular false conviction rate, . This eliminates the need to keep track of the evidentiary standard as a separate policy tool, and is convenient for purposes of identifying the relationship between evidentiary rules and outcomes of interest, by expressing them as a function of .
Intuitively, when the government adopts the strictest evidentiary standard, such that no convictions are ever obtained, it follows that . Similarly, when the government adopts the most liberal evidentiary standard, every defendant is convicted, and thus . Consistent with this intuition, as one relaxes the evidentiary standard, both and are increased. Thus, one can express as an increasing function of with
to keep track of how and co-move as one changes the evidentiary standard. A more detailed and mathematical explanation of this modeling approach can be found in the appendix, below, as well as in Section 3.1 of Fluet and Mungan (2025).
As the government changes evidentiary standards, tracked through changes in as explained above, this can have impacts on the behavior of individuals. For instance, when potential offenders notice that they are less likely to be convicted upon committing offenses, due to increased evidentiary standards, they may be more inclined to commit offenses. The precise impact of changes in evidentiary standards on the offense rate naturally depends on the behavioral responses of individuals, which can be modeled in different ways as explained in Sections 2.1 and 2.2., below. However, for current purposes it is sufficient to note the general relationship between evidentiary standards and the offense rate by letting . To simplify the analysis, I assume that and , that is, that there are compliers and offenders under all feasible policies. As will become clear below, these assumed conditions hold in standard models of crime and deterrence when some people derive large benefits from crime and others dislike committing them.
Next, note that the possible choices of imply a pair of and which can be compactly expressed using the notation in (5 ) as
Thus, the collection of and pairs form the set of erroneous outcomes achievable through the trial system under various evidentiary rules, for example, the minimum error frontier depicted in Figure 1. Therefore, one can potentially ‘transform’ one of the errors into some quantity of the other through changes in . I call the rate at which this can be accomplished the marginal rate of transformation between and , which—assuming —is given by the following:
Thus, measures the slope of the minimum error frontier in space as depicted in Figure 1, achieved when an evidentiary standard such that is used. Importantly, depends on deterrence effects, , because it measures the trade-off between aggregate concepts, namely the numbers of innocent and guilty convicted, which are directly impacted by deterrence. This property, which is caused by the Blackstone ratio being defined through references to and , plays a key role in causing the Blackstone ratio to serve as an inadequate tool in constraining evidentiary policies.
Specifically, implementing an evidentiary rule that leads to would be inconsistent with principles embodied by a Blackstone ratio of . Conversely, a policy leading to cannot be ruled out as being socially undesirable solely based on a Blackstone ratio of . Next, for any given Blackstone ratio , I question whether the ‘more likely than not standard’ (MLTN), denoted , is necessarily such that , that is, such that it can be dismissed as being socially undesirable solely based on Blackstone’s ratio.
The ‘more likely than not’ standard
I focus on the MLTN standard, because my objective is to relate the current analysis to prior legal scholarship and conventional wisdom that the Blackstone ratio is sufficient reason for adopting heightened protections for defendants. The analysis in this sub-section, summarized by Proposition 1 and Corollary 1, below, reveals that the MLTN standard cannot be ruled out by any arbitrarily large Blackstone ratio. Thus, higher standards of proof like ‘beyond a reasonable doubt’ are not necessarily implied by large Blackstone ratios.
In general, evidentiary standards can be thought of as mappings from any body of evidence which may be observed in court into convictions and acquittals at trial.15 The MLTN standard, in turn, would require a conviction only if the observed evidence is such that16
since only then would it be more likely that the defendant at trial has more likely than not committed crime. This, in turn, corresponds to a likelihood ratio test of the form:
To relate this rule to the probabilities of conviction and associated with it, a subtle but intuitive observation is needed. To describe this observation in a simple way, I assume that the likelihood ratios associated with the evidence is continuously distributed. Because this observation is relatively simple, I only intuitively describe it here, but the appendix, below, contains a formalization (in particular, see (29), below).
The observation is the following: When likelihood ratio tests are implemented, corresponds to the threshold likelihood ratio used, that is, when convictions occur if and only if evidence such that for some is observed. This is because represents the increase in the probability of correct convictions as one increases the probability of false convictions. This is achieved by letting marginal pieces of evidence, which previously would not suffice for a conviction, be deemed sufficient for a conviction. Because these marginal pieces of evidence are all such that , the ratio between the increase in versus is , and hence . Thus, an associated with the MLTN standard, denoted , assuming it is interior, is such that
As long as the evidence generation process is sufficiently informative,17 a standard that satisfies (11) exists, and in many cases it is unique (see, e.g., the examples used in the proof of Corollary 1). Thus, in what follows I assume existence and also the uniqueness of to simplify descriptions and derivations of results.18
A key insight is that the MLTN standard (captured by (11)) is not a function of the (marginal) deterrence effects of evidentiary standards (i.e., ), while the manner in which the quantity of the two errors can be traded-off is. More precisely, the MLTN standard only specifies (through (11)) how conditional probabilities of judgement errors, that is, and , ought to be traded-off, given the offense rate that is observed when a trial is taking place. In fact, it requires that, holding the offense rate constant, making evidentiary standards slightly weaker result in a reduction in the number of guilty people who avoid punishment (i.e., the left hand side of (12)) that equals the increase in the number of innocent people convicted (i.e., the right hand side of (12)).
However, changes in evidentiary standards around MLTN affect not only the conditional probabilities and , but also the offense rate, as long as . In particular, when making convictions easier leads to deterrence effects, i.e., , the reductions in and the increase in obtained through laxer evidentiary requirements are greater than the left hand and right hand sides of (12), respectively. This is reflected by the second terms which appear both in the numerator and the denominator of (8). These impacts through the crime rate naturally affect reductions in more than the increase in , as long as a guilty person is more likely to escape conviction than an innocent person is likely to be punished (i.e., as long as ). A sufficient condition for this result is that , that is, the guilty are more likely to avoid trials than the innocent are likely to stand trials. Thus, when this condition holds, the MLTN standard is associated with a marginal rate of transformation that is greater than . These observations are reflected in the proof of the next proposition which summarizes the relationship between Blackstone ratios and the MLTN standard.
The more likely than not standard may be associated with any arbitrarily large marginal rate of transformation. Thus, large Blackstone ratios, on their own, are insufficient to necessitate standards that are stronger than MLTN.
In particular, (I), as long asand the guilty are more likely to escape trials than the innocent are likely to stand trial, i.e.,. (II), ifis sufficiently large and.
. Thus, for any , it follows that there exists large enough such that if .▪
The proposition formalizes the claim that Blackstone ratios being greater than some arbitrary number is insufficient to rule out the MLTN standard as an adequate evidentiary standard to achieve Blackstonian social objectives under two conditions. One condition is relatively intuitive: the probability with which a guilty person is not brought to trial is times greater than the probability with which an innocent person is brought to trial. Presumably, this condition is quite plausible when the proportion of innocent people standing trial is small for a large range of Blackstone ratios (e.g., the condition holds for even when , if ).
The second condition requires deterrence to be enhanced when evidentiary standards are made slightly weaker than MLTN to make convictions easier. This condition may also appear intuitive to many, since making convictions easier ought to deter people from committing crimes, especially when they are more likely to be brought to trial upon committing crime than when they do not (i.e., ). However, in the law enforcement literature it has been noted that easier convictions can have counter-deterrent effects by increasing the likelihood of being wrongfully convicted and thereby reducing the expected benefits associated with refraining from crime, that is, reducing the opportunity cost of crime.19 Thus, I note below that even within such models, the impact of weakening evidentiary standards around MLTN is an increased level of deterrence, provided that crime commission is not the norm, that is, . Also, the assumptions imposed within these models allow me to identify more specific relationships between evidentiary standards and the crime rate, and thereby illustrate that arbitrarily large Blackstone ratios can be consistent with the MLTN standard.
Implications in models where wrongful convictions lower deterrence
In standard models where wrongful convictions lower deterrence,20 a person who commits crime faces an imprisonment sentence with probability whereas a person who does not commit crime is imprisoned with probability . People who commit crime receive benefit , which differs from person to person, and is the cumulative distribution of these benefits across people (who are assumed to form a continuum of measure for simplicity) with . Normalizing the cost of being punished to , a person’s expected net-benefit from crime is , whereas the person faces an expected cost of when they refrain from crime. Thus, people, who are assumed to be risk-neutral, commit crime if
and the crime rate is
It is worth briefly noting a couple additional properties of , which captures the evidence generation process, that becomes relevant in these models. First, whenever likelihood ratio tests are used, it follows that
Again, because these properties are intuitive, I relegate their formalization to the appendix, below, and only provide the intuition behind them. The first inequality follows from the fact that as more types of evidence are found sufficient for a conviction, both the guilty and the innocent face a greater risk of being convicted, and thus and move in the same direction (see (29) in the appendix). The second inequality follows from the fact that conviction probabilities are increased by allowing convictions based on types of evidence which are less and less indicative of guilt (i.e., are associated with a smaller likelihood ratio ) and thus is decreasing in (see (30) in the appendix). Given these observations, it follows that crime is minimized by an interior , as long as , that is, when the evidentiary process is sufficiently informative,21 and I assume this condition holds to abbreviate the analysis.
Building on these preliminary observations, note that when the evidentiary standard for convictions is MLTN, it follows that
where the second equality is obtained by using (11). Thus, if and vice versa.
Combining this observation with proposition 1 leads to the following corollary.
For any Blackstone ratio such that , there exists a population (i.e., a cumulative distribution function ) such that the marginal rate of transformation associated with the more likely than not standard exceeds the Blackstone ratio (i.e., ).
The expressed in (13) can be re-written by using (17) as follows:
where is the hazard rate associated with . Thus, if is a shifted exponential distribution with support , it follows that for all , and in particular for . Thus, if , it follows that , since . Next, note that when this condition holds and , it follows that
Therefore, , and therefore there exists large enough such that , when .▪
The corollary extends the results in Proposition 1. It shows that these results are not only possible in an abstract sense, but that they can easily emerge in settings in which the effect of evidentiary standards on deterrence is modeled through one of the most common approaches. It is also worth highlighting that both Proposition 1 and Corollary 1 focus on identifying intuitive sufficient conditions for ; none of the conditions reported are necessary. In particular, because the relationship between and can differ greatly based on the nature of the evidence generating process, results are derived by focusing on their upper bounds (of ) to produce a lower bound for . Thus, in future work, conditions weaker than can be obtained at the expense of using specific functional forms for .
Optimal evidentiary standards with costly punishment and implied Blackstone ratios
My previous analysis reveals that relatively permissive evidentiary standards can generate quite large Blackstone ratios. Next, I investigate a relationship that goes in the opposite direction. I question whether optimal evidentiary standards that emerge in models which incorporate costly punishments result in s consistent with large Blackstone ratios. To do so, I describe the standard model of law enforcement with costly punishment (see, e.g., Polinsky and Shavell, 2007) which incorporates erroneous judgements (see, e.g., Rizzolli and Saraceno, 2013).
This model follows the same behavioral assumptions as described via (14) and the surrounding text. Thus,
The welfare consequences of implementing , on the other hand, are evaluated by the following social welfare function
Next, I describe the components of this welfare function which have not yet been introduced.
In (19), denotes the per person social cost associated with punishment, which consists of the cost to the person being punished (normalized to ) plus the cost, or minus the net benefit, associated with punishment due to its effects on third parties or the government. When , the sanction is fully transferable, in the sense that the losses to the punished person generate benefits of equal magnitude elsewhere. The most well-known case of transferable sanctions is when the sanction is a monetary fine which is assumed to be collected from offenders without friction. The case of imprisonment is typically incorporated by assuming ; there are costs to the person imprisoned and costs associated with the administration of the punishment which are not offset by any benefits that third parties may obtain from the punishment. Although cases where are possible (e.g., a largely monetary sanction with a small imprisonment/community service component), it is my impression that they are not analyzed as frequently. Total punishment costs are (included in the integral term in (19)) plus , which represent the punishment costs of the guilty and the innocent, respectively.
The (average) harm from each crime is , and thus total criminal harm is . The final component is the benefit from crime, given by , where determines whether or not the benefits from crime enter the social welfare function. The case where corresponds to the conventional utilitarian social where function where the criminal’s benefit fully enters the social welfare function. On the other hand, the case where can arise under different circumstances. First, criminal benefits can be outright excluded from the welfare function (see, e.g., Stigler, 1970). Second, the harm from the crime can be interpreted as equaling (as opposed to just ) consisting of some harm that equals the benefit to the criminal plus some additional amount (see, e.g., Rizzolli and Saraceno, 2013). Finally, Curry and Doyle (2016) provide a framework wherein the benefits from crime can also be achieved through voluntary market exchanges, and show that maximizing welfare is then equivalent to minimizing the costs of crime, that is, the case of . Here, I let to capture these cases, and any other cases that may arise in between, to increase the generality of the analysis.
With this set-up, I question whether evidentiary standards that maximize the welfare function in (19) necessarily imply marginal rates of transformation greater than . The next proposition answers this question where the maximizer of (19), assumed to be a proper and interior maximum, is expressed as function of the harm from crime as .
The optimal evidentiary standard may be consistent only with Blackstone ratios smaller than , even with costly punishments. In fact, when the minimum achievable crime rate is less than half, (i) , and, as a result, (ii) holding all else equal, a large enough harm from crime implies .
It follows that and , and therefore per (8). Thus, as long as . Next, note that is characterized by
Thus, it follows that, . Therefore, . Thus, for large enough .▪
Proposition 2 reveals that costly punishments, alone, are insufficient to rationalize policies and procedures that are consistent with large Blackstone ratios. This is because the optimal evidentiary standard can require trading-off more than a single false conviction to reduce the number of criminals unpunished by one, contrary to preferences consistent with large Blackstone ratios.
The rationale behind this result is that the optimal evidentiary standard (unlike the MLTN standard) seeks to balance two competing objectives, namely achieving deterrence gains and reducing punishment costs. The former objective is best served by implementing a standard that maximizes the opportunity cost of crime commission. This is achieved when the marginal impact of the evidentiary standard on the conviction probability of a guilty person and an innocent person is equalized, i.e., . Thus, holding the crime rate constant, the deterrence maximizing standard trades-off innocent people who are punished against guilty people who escape punishment proportional to the size of each group, i.e., . Moreover, when deterrence is maximized, by definition the impact on the crime rate is zero, that is, . Thus, the deterrence maximizing evidentiary standard is associated with a marginal rate of transformation that is smaller than , as long as the minimum achievable crime rate is less than half. As a result, when deterrence becomes a relatively important goal, which occurs when harms from crime relative to other values are high, the marginal rate of transformation between and is less than .
An alternative Blackstone ratio definition
As noted in the introduction, I find the interpretation of Blackstone’s statement as specifying a lower bound for the between and to be the most natural one. However, prior commentary and scholarship suggests that some interpret Blackstone’s statement as providing a lower bound for , instead. (See, for example, the description of Blackstone’s ratio in Connolly (1987) reproduced in note 9, below.) Thus, in this section I briefly note that similar issues related to existing claims regarding the Blackstone ratio emerge under this alternative interpretation, as well. I follow the same approach as in the previous section, and first note that any requirement regarding the ratio can be satisfied by the MLTN standard.
The more likely than not standard may generate any arbitrarily large wrongful failure to punish to wrongful punishment ratio (i.e., ).
For any , a sufficient condition for is that
where . Next, note that for any , there exists a sufficiently informative evidence generating process (operationalized by the function ) such that (21) holds. As an example, suppose where represents the informativeness of the evidence generating process. Then, for close enough to ,22 one can express the MLTN standard as a function of as . It follows that , and therefore (21) holds.▪
Proposition 3 reports results analogous to those summarized by Proposition 1 and Corollary 1. However, its proof focuses on a different aspect of the ratio being analyzed. Specifically, because and depend on the probabilities conviction, conditional on trial, an evidentiary standard that generates a very small probability of conviction for the innocent in trial can lead to a very large ratio between them. The MLTN standard, on the other hand, is determined not by the levels of these probabilities (i.e., and ) directly, but by how they co-move (i.e., on ). Thus, as the proof of the proposition illustrates, the MLTN standard can yield a very small probability of conviction conditional on innocence (i.e., ) and therefore a very large ratio. This implies that any requirement regarding the size of , on its own, is insufficient for suggesting the need for standards of proof which are more demanding than MLTN.
This insight also explains why the Blackstone ratio, interpreted as a lower bound on , cannot be justified as a requirement based on a welfare analysis when there are costly sanctions within the model described in Section 2.3. This is because the optimal evidentiary standard is then also a function of how conditional conviction probabilities co-move as one adjusts evidentiary standards, as noted next.
The optimal evidentiary standard with costly sanctions may generate a wrongful release to wrongful conviction ratio (i.e., ) smaller than .
Note that if . This condition holds if
As noted in the proof of Proposition 2, , which implies that is independent of the criminal benefit distribution . Thus, there exists benefit distributions such that , in which case (22) holds for sufficiently large .▪
Proposition 4 shows that this alternative interpretation of the Blackstone ratio also does not provide a basis for the purported relationships between the Blackstone ratio and optimal evidentiary standards when there are costly sanctions.
Modified Blackstone ratio
Given the issues associated with the conceptions of Blackstone ratios considered in Sections 2 and 3, here I consider a modified Blackstone ratio, which focuses on the trade-off between the probability with which a person faces a conviction upon refraining from crime (i.e., ) and the probability with which one avoids conviction upon committing crime (), respectively. Thus, the modified marginal rate of substitution is given by:
where represents a social objective function analogous to the function explained in Section 2. The modified Blackstone ratio, or the ratio, can then be thought of as a lower bound to around the maximizers of .
Following arguments similar to those presented in Section 2, let
denote the pairs of and that can be obtained through different evidentiary rules. The marginal rate of transformation between these probabilities is then given by
Thus, for any modified Blackstone ratio of , it follows that evidentiary rules such that would be inconsistent with the maximization of . Similarly, policies such that cannot be ruled out as being socially undesirable solely based on the modified ratio of .
The expression in (25) and the analysis in the previous sections highlight the intuitive appeal of this modified ratio. The optimal evidentiary standard that emerges in the model presented in Section 2.3. As well as the MLTN standard previously described are both closely related to the manner in which probabilities of conviction, conditional on behavior, co-move, which corresponds to . The optimal evidentiary standard depends on this relationship because it trades off gains from deterrence against punishment costs, which both depend on this relationship as is clearly illustrated through (20) which characterizes the optimal standard. The MLTN standard, on the other hand, is related to this ratio, because a defendant on trial has more likely than not committed the crime if the evidence produced at trial is a stronger indicator of guilt than priors are an indicator of innocence, that is, the evidence is associated with a large enough likelihood ratio which corresponds to . This is captured by (11) which again clearly illustrates the relationship between and the MLTN standard. The relationship between the ratio and the evidentiary standards operationalized by and can thus be formalized as follows.
The optimal evidentiary standard with costly sanctions is only consistent with ratios greater than (i.e., for all ).
The marginal rate of transformation associated with the MLTN standard is smaller than the ratio whenever this ratio exceeds the compliance to offense ratio obtained under the MLTN standard (i.e., whenever ).
Proposition 5 formalizes the clear relationship between the ratio and the evidentiary standards previously discussed. First, it reveals that costly punishments, in fact, provide a rationale for principles that are conveyed through asymmetric ratios. Given costly punishments, a simple utilitarian welfare function always produces s which are consistent with ratios that are greater than . Second, it reveals that weak evidentiary requirements, like MLTN, are inconsistent with ratios which are larger than the compliance to offense ratios which would emerge under such standards. Thus, this result suggests that the ratio may produce a better fit between the principles that were presumably meant to be conveyed through Blackstone’s statement as well as the implications that are assumed to follow from it.
Conclusion
Blackstone’s simple statement has given rise to numerous commentaries and academic debates. From an economist’s perspective, his statement is most consistent with a description of how the quantity of the two types of judicial errors, wrongful convictions and failures to convict, ought to be traded-off. It has been taken for granted by many scholars that social preferences described by Blackstone are sufficient to warrant pro-defendant evidentiary rules. Others have suggested that the presence of punishment costs in the criminal justice system imply that the quantities of judicial errors ought to be traded-off asymmetrically, as suggested by Blackstone, from a simple cost-benefit perspective. Here, I have shown that neither claim is correct. This is because Blackstone’s statement focuses on a trade-off between the quantities of the two types of judicial error, which is not a direct determinant of evidentiary standards that are the subject of discussion. Instead, these standards are more directly related to how the probabilities with which the criminal justice system convicts individuals, conditional on their behavior, can be traded-off. Correcting this misunderstanding is important for focusing efforts and academic interest on the more relevant subjects of interest, namely individuals’ probabilities of conviction, as opposed to judicial errors in aggregate.
Appendix: Derivation of From Evidence Generation Processes
Evidence observed in court () is assumed to be a random variable whose distribution depends on whether the defendant is truly guilty or innocent. The evidence may be multi-dimensional, but the likelihood ratio associated with the evidence, is single dimensional. Thus, let for denote the probability density function associated with the production of any evidence which involves a likelihood ratio of conditional on the defendant’s type, and let denote the associated cumulative distribution function. An implication of the Neyman–Pearson lemma (Neyman and Pearson, 1933) is that threshold rules wherein a person is found guilty only if the evidence is associated with a likelihood ratio exceeding a threshold maximize the probability of a correct conviction, , given any targeted level of false conviction, . When these evidentiary rules are implemented, it follows that the probability of conviction given the trial of an innocent person is
Thus, one can express the critical likelihood ratio, as a function of the targeted level of wrongful conviction probability as
where is the inverse of .
Next, noting the relation , one can express the probability of correct conviction, as a function of , as:
Thus, it follows that
where the first equality is obtained by differentiating (28) and the second equality follows from the definition of . Thus, it follows that
Footnotes
Funding
The author received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
ORCID iD
Murat C Mungan
Notes
References
1.
AllenRLaudanL (2008) Deadly dilemmas. Texas Tech Law Review41: 65.
2.
AllenRSteinA (2013) Evidence, probability, and the burden of proof. Arizona Law Review55: 557.
3.
BeckerG (1968) Crime and punishment: An economic approach. Journal of Political Economy76: 169–217.
4.
BlackstoneW (1769) Commentaries on the Law of England 352. Oxford, England: Clarendon Press.
5.
BurtisMGelbachJKobayashiB (2017) Error costs, legal standards of proof, and statistical significance. Supreme Court Economic Review25: 1.
6.
ChengEPardoM (2015) Accuracy, optimality and the preponderance standard. Law, Probability and Risk14: 193.
7.
ConnollyT (1987) Decision theory, reasonable doubt, and the utility of erroneous acquittals. Law and Human Behavior11: 1987.
8.
CurryPDoyleM (2016) Integrating market alternatives into the economic theory of optimal deterrence. Economic Inquiry54: 1873.
9.
de KeijserJEde LangeEGvan WilsemJ (2014) Wrongful convictions and the Blackstone ratio: An empirical analysis of public attitudes. Punishment and Society16: 32.
10.
DemouginDFluetC (2005) Deterrence versus judicial error: A comparative view of standards of proof. Journal of Institutional and Theoretical Economics161: 193.
11.
DemouginDFluetC (2006) Preponderance of evidence. European Economic Review50: 963.
12.
DemouginDFluetC (2008) Rules of proof, courts, and incentives. The RAND Journal of Economics39: 20.
13.
EppsD (2015) The consequences of error in criminal justice. Harvard Law Review128: 1065.
14.
FallonR (2008) The core of an uneasy case for judicial review. Harvard Law Review121: 1693.
15.
FindleyK (2008) Toward a new paradigm of criminal justice: How the innocence movement merges crime control and due process. Texas Tech Law Review41: 133.
16.
FluetCMunganM (2022) Laws and norms with (un)observable actions european. European Economic Review145: 104129.
17.
FluetCMunganM (2025) Informational properties of liability regimes. Journal of Legal Studies54: 199.
18.
GaroupaNRizzolliM (2012) Wrongful convictions do lower deterrence. Journal of Institutional and Theoretical Economics168: 224.
19.
GuerraALuppiBParisiF (2019) Standards of proof and civil litigation: A game-theoretic analysis. The BE Journal of Theoretical Economics19: 20170005.
20.
KaplowL (1994) The value of accuracy in adjudication: An economic analysis. The Journal of Legal Studies23: 307.
21.
KaplowL (2011a) Burden of proof. Yale Law Journal121: 738.
22.
KaplowL (2011b) On the optimal burden of proof. Journal of Political Economy119: 1104.
23.
KaplowL (2011c) Optimal proof burdens, deterrence, and the chilling of desirable behavior. American Economic Review101: 277.
24.
KaplowLShavellS (1994) Accuracy in the determination of liability. The Journal of Law and Economics37: 1.
25.
LandoH (2002) When is the preponderance of the evidence standard optimal?. The Geneva Papers on Risk and Insurance-Issues and Practice27: 602.
26.
LandoH (2006) Does wrongful conviction lower deterrence. The Journal of Legal Studies35: 327.
27.
LandoH (2009) Prevention of crime and the optimal standard of proof in criminal law. Review of Law & Economics5: 33.
28.
LandoHMunganM (2018) The effect of type-1 error on deterrence. International Review of Law and Economics53: 1.
29.
LeubsdorfJ (2016) The surprising history of the preponderance standard of civil proof. Florida Law Reveview67: 1569.
30.
MialonSH (2023) Filing Selection and the Optimal Standard of Proof. Available at SSRN 4700539.
31.
MunganM (2011) A utilitarian justification for heightened standards of proof in criminal trials. Journal of Institutional and Theoretical Economics167: 352.
32.
MunganM (2020a) The optimal standard of proof with adjudication avoidance. Review of Law & Economics16: 20190019.
33.
MunganM (2020b) Justifications, excuses, and affirmative defenses. The Journal of Law, Economics, and Organization36: 343.
MunganMObidzinskiMOytanaY (2023) Accuracy and preferences for legal error. American Law And Economics Review25: 190.
36.
MunganMSamuelA (2019) Mimicking, errors, and the optimal standard of proof. Economics Letters174: 18.
37.
NewmanJ (1993) Beyond “reasonable doubt”. New York University Law Review68: 979.
38.
NeymanJPearsonE (1933) On the problem of the most efficient tests of statistical hypotheses. Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character231: 289.
39.
ObidzinskiMOytanaY (2019) Identity errors and the standard of proof. International Review of Law and Economics57: 73.
40.
PngI (1986) Optimal subsidies and damages in the presence of judicial error. International Review of Law and Economics6: 101.
41.
PolinskyMShavellS (2007) The theory of public enforcement of law. In: Polinsky M, Shavell S (eds.) Handbook of Law and Economics. North-Holland: Elsevier, pp.403–454.
42.
PosnerRA (2007) Economic Analysis of Law. New York: Aspen Publishers.
43.
RisingerM (2010) Tragic consequences of deadly dilemmas: A response to Allen and Laudan. Seton Hall Law Review40: 991.
44.
RizzolliMSaracenoM (2013) Better that ten guilty persons escape: Punishment costs explain the standard of evidence. Public Choice155: 395.
45.
SommerRSommerBHeidmetsM (1991) Release of the guilty to protect the innocent. Criminal Justice and Behavior18: 480.
46.
StiglerG (1970) The optimum enforcement of laws. Journal of Political Economy78: 526.
47.
VolokhA (1997) n guilty men. University of Pennsylvania Law Review146: 173.