Abstract
The public goods theory of alliances exerts substantial influence on scholarship and policy, especially through its claim that small alliance participants free-ride on larger partners. Prior statistical tests of free-riding suffer from model specification and generalizability problems, however, so there is little reliable and general evidence about this prediction. In this study, I address those limitations with a new test of the free-riding hypothesis. Using data on 204 alliances from 1919 to 2007, I examine how often states with a small share of total GDP in an alliance decrease military spending while states with a large share of allied GDP increase military spending. I find little evidence to support this expression of the free-riding hypothesis. This implies that free-riding based on economic weight is unusual in alliance politics, which may be due to limits on alliance security as a public good or bargaining between alliance members.
Introduction
Olson and Zeckhauser (1966) argue that international alliances generate a collective action problem. According to their theory, security from an alliance is a public good, so smaller alliance participants “free ride” on the contributions of larger members. Free-riding is reflected by disproportionate allocations of resources to defense, where smaller alliance members spend a lower share of their national income on the military, relative to larger partners.
The prominence of the public goods model in academic and policy discussions of alliances belies a major problem. Fifty-four years after the publication of “An economic theory of alliances,” there is little reliable and general evidence about Olson and Zeckhauser’s prediction that small states are prone to free-ride. Here, I attempt to fill that empirical gap in the literature.
In this paper, I test the free-riding prediction with a statistical model that addresses model specification and generalizability issues in previous research. I use the public goods logic to predict that states with a low share of total GDP in an alliance will decrease percentage changes in military spending, and states with a high share of total allied GDP will increase military spending. Then I employ a Bayesian model to estimate the association between economic weight and percentage changes in military spending for 204 alliances. I find little evidence of free-riding based on economic weight.
My research design addresses two key limitations in existing tests of the public goods logic. First, many empirical estimates of free-riding within alliances have a model specification problem with the dependent variable. Olson and Zeckhauser use military spending as a share of GDP to measure contributions to the alliance, and GDP to measure state size. This approach is widely emulated, but it is problematic because GDP is part of the independent and dependent variables. Changes in GDP shift the defense burden, and this deterministic component affects correlation and regression estimates. 1
One notable paper addresses the dependent variable specification problem, but may not pro- duce general findings. Plümper and Neumayer (2015) examine how growth in military spending by North Atlantic Treaty Organization (NATO) members responds to changing US and Soviet spending. They demonstrate that NATO members are unresponsive to US and Soviet military spending, and present this as evidence of free-riding. Plümper and Neumayer (2015) also find no correlation between NATO member size and the extent of free-riding, however, which they argue contradicts Olson and Zeckhauser (1966). This paper’s focus on NATO brings me to the second limitation: a lack of generalizability.
NATO is the epicenter of free-riding discussions. Following Olson and Zeckhauser’s emphasis on military spending as a share of GDP, accusations of free-riding emphasize that NATO members have lower defense burdens than the US. Scholars, pundits and policymakers have spent decades arguing over how well the public goods model applies to NATO (e.g. Pryor, 1968; Sandler and Forbes, 1980; Palmer, 1990; Hilton and Vu, 1991; Boyer, 1993; Gates and Terasawa, 1992; Sandler and Hartley, 2001; Lanoszka, 2015; Plümper and Neumayer, 2015; Kim and Sandler, 2020).
Most studies of the public goods model focus on NATO, but NATO is a difficult case for making general conclusions. NATO is exceptionally large, durable and capable. There are only seven tests of the public goods model outside of NATO, most of which examine one or two alliances (Russett, 1970; Starr, 1974; Reisinger, 1983; Thies, 1987; Conybeare and Sandler, 1990; Oneal and Whatley, 1996; Siroky, 2012). Six of these studies estimate correlations between GDP and defense burdens, so they suffer from the aforementioned specification problem. This leaves a need for a general examination of the public goods logic.
Using a Bayesian model that estimates the association between economic weight and percentage changes in military spending within many alliances, I find little evidence that small states are more inclined to free-ride than their larger allies. This finding expands the economic size results of Plümper and Neumayer (2015) from the NATO case. Given established theoretical skepticism of the public goods model (e.g. Palmer, 1990; Gates and Terasawa, 1992; Sandler and Hartley, 2001; Norrlof, 2010; Niou and Zeigler, 2019), what do these findings add? Some existing theoretical skepticism of the public goods model is motivated by inconsistent correlations between GDP and defense burdens. These correlations do not provide reliable evidence, however. Without a reliable and general test, theoretical revisions of a parsimonious public goods model may be premature.
There are three reasons to undertake another test of the public goods model. First, although academic theory has progressed since 1966, the public goods model retains an important place in discourse about alliances (e.g. Walt, 1990; Sandler, 1993; Mearsheimer, 1994; Goldstein, 1995; Sandler and Hartley, 2001; Garfinkel, 2004; Walt, 2009; Norrlof, 2010; Barrett, 2010; Plümper and Neumayer, 2015). Second, policy and popular discussions of alliances employ collective action ideas. Pundits and American policymakers often refer to allied “free-riding” (Lanoszka, 2015). For example, Barack Obama complained in 2016 that “Free riders aggravate me” and US allies “have to pay your fair share.” Donald Trump has implied that the United States would not protect allies who spend too little on defense.
Third, increasing great power competition makes assessing Olson and Zeckhauser’s argument important for policy debates. Competing visions of US grand strategy hinge in part on the explanatory power of the public goods model. Advocates of retrenchment and “restraint” use the public goods logic to claim that US allies free-ride, so the US should withdraw from many alliances (Preble, 2009; Posen, 2014). Others assert that alliances do not provide a public good and the benefits of alliance participation outweigh the costs (Brooks et al., 2013; Brands and Feaver, 2017).
The paper proceeds as follows. First, I briefly summarize the public goods theory of alliances, then I describe the model and results. In the final section, I discuss some implications of the findings for scholarship and policy.
Free-riding in alliances
To identify observable implications of free-riding, I use the public goods argument to derive predictions about economic size and military spending in alliances. This exercise does not critique the public goods model. Instead, I take the public goods logic as given, but employ another measure of defense effort. Following Plümper and Neumayer (2015), I use percentage changes in military spending to measure defense effort. Like Olson and Zeckhauser, I conceptualize state size using economic weight.
Olson and Zeckhauser connect alliances and defense spending through a collective action problem. As the aggregate military capability of an alliance provides security for members, states contribute security by investing in their military. Because an alliance cannot exclude members without undermining its purpose, alliance security is a public good. Olson and Zeckhauser expect that larger members of the alliance will invest more in defense because they value security from the alliance more. Small alliance members rely on larger partners for security and reduce their defense burdens. As a result, small states free-ride on larger alliance participants.
Olson and Zeckhauser compare alliance member size using economic resources, specifically GDP. Economic weight within an alliance, or each state’s share of total allied GDP, is a related way to conceptualize differences in state size. Using economic weight facilitates state size comparisons across diverse alliances, where states may be large or small members. A greater share of total economic resources in an alliance gives a state more economic weight and increases their potential military spending contribution.
In the public goods model, economic weight shapes defense expenditures. Smaller members can free-ride and lower military spending. Because larger states place higher absolute value on security from an alliance, alliance participation will increase their investment in military capability. Therefore, alliance participation will decrease military spending by small states and increase military spending by large states.
I test the public goods predictions of military spending by small and large alliance members in a sample of all non-microstates from 1919 to 2007, using percentage changes in spending to facilitate a reliable and general empirical test. 2 State-year observations are the unit of analysis. My assessment of the public goods model estimates the extent of free-riding within individual alliances using alliance-specific economic weight parameters.
Testing the public goods logic
I use a Bayesian model to estimate alliance-specific effects of differences in economic weight on military spending. 3 This approach provides general evidence about the association between economic weight and military spending across 204 alliances. I use Bayesian estimation because it regularizes the 204 alliance parameter estimates. Partial pooling in this Bayesian model draws on information from all alliances to inform each alliance parameter. 4 With hundreds of alliance- specific parameters, Bayesian estimation shares information across alliance parameters with less risk of spurious inferences from multiple comparisons.
For each of the 204 offensive and defensive treaties in the Alliance Treaty Obligations and Provisions (ATOP) data (Leeds et al., 2002), I estimate the association between economic weight and military expenditures. An economic weight parameter for each alliance captures the consequences of alliance participation for military spending. I code economic weight in alliances such that a positive economic weight parameter implies more military spending for large members and less spending by small members, as the public goods model predicts. A negative economic weight parameter indicates that larger alliance members spend less on the military, and small members spend more.
The model starts with state-year percentage changes in military spending yit, transformed with an inverse hyperbolic sine.
5
I model this variable using a t-distribution with degrees of freedom ν to account for heavy tails. The expected value of military spending µit depends on a constant α, state and year varying intercepts, and state-year control variables
The
This coding of economic weight facilitates a simple test of the public goods hypothesis across multiple alliances. Multiplying a positive γ by negative one for small states reduces military spending growth. Multiplying the same positive economic weight parameter by positive one increases military spending growth for large states. Therefore, positive γ parameters will change military spending growth in ways that match the public goods model.
γ is a vector of 204 alliance-specific economic weight parameters. Because
Each alliance has unique effects on military spending, but the economic weight parameters have a common prior distribution where γ ∼ N (θ, σall), which shares information across the alliance estimates. Partial pooling estimates the dispersion of the alliance parameters from the data, so the prior for γ has mean θ and variance σall, and relies on a normal distribution, which is standard in most multilevel models. θ is the mean hyperparameter of the alliance coefficients and each γ deviates from the overall mean θ based on a variance hyperparameter σall. Every economic weight parameter holds the impact of other alliances constant.
Results
The public goods model predicts many positive economic weight parameters. Because I employed Bayesian modeling, each γ has a posterior distribution. I focus interpretation on the posterior mean and 90% credible intervals. 8 The posterior mean is the expected value of γ, while the credible intervals capture uncertainty around that estimate.
No alliance has a clear positive economic weight parameter. Figure 1 plots the γ parameter for each alliance against the start year of the treaty. Points mark the posterior mean. The error bars encapsulate the 90% credible interval.

Estimated association between economic weight and defense spending growth in 204 defensive and offensive alliances from 1919 to 2007. Positive estimates match the predictions of the public goods model. Points represent the posterior mean and the error bars cover the 90% credible interval. The dashed line marks the posterior mean of the θ parameter, which is the average association between economic weight and percentage changes in military spending.
No alliances have a positive 90% credible interval to match the public goods model’s predictions. Most credible intervals are consistent with military spending growth between -0.02 to 0.015. Only 13 of the 204 alliances have a positive posterior mean. Therefore, there is little evidence of free-riding based on economic weight as the public goods model predicts.
To give further evidence, I use the γ parameters to predict the effect of participation in specific alliances for large and small states. Figure 2 summarizes estimates for several notable alliances. Once again, there is limited evidence of lower defense spending by small states and higher spending by large states. Even alliances with the expected pattern of military spending across large and small members have uncertain implications for military spending growth.

Estimated impact of alliance participation on defense spending growth in large and small members of the Organization of American States (OAS) (ATOPID 3150), NATO (ATOPID 3180), the Arab League (ATOPID 3205), the U.S.-Korea alliance (ATOPID 3240) and the Warsaw Pact (ATOPID 3285). Points mark the mean estimated impact of alliance participation on military spending and the error bars cover the 90% interval.
First, the NATO estimates in Figure 2 corroborate the size result of Plümper and Neumayer (2015), as there is no evidence that small economic weight encourages lower military spending. Expressing the impact of alliance participation through economic weight gives a small estimate of the impact of NATO participation on large and small members. Junior NATO members may still spend less on the military by relying on allied capability (George and Sandler, 2017), however. Furthermore, the US–South Korea alliance (ATOPID 3240) estimates in Figure 2 are the reverse of the free-riding predictions.
Among other alliances, the Arab League (ATOPID 3205), Organization of American States (ATOPID 3150) and Warsaw Pact (ATOPID 3285) have the expected results for the public goods model, albeit with substantial uncertainty. Although large alliance members increase spending and small members decrease spending in expectation, all these estimates are consistent with substantial positive and negative, as well as null effects. This suggests that highly asymmetric multilateral alliances are the most likely cases of free-riding, but that free-riding based on economic weight is weak even then. There are no clear differences in military spending growth based on economic weight, even in some of the best-case alliances for the public goods model’s free-riding predictions.
Conclusion
Although Olson and Zeckhauser’s model is parsimonious, there is little evidence to match its prediction of free-riding based on economic weight. These results provide general evidence that economic weight is not a determinant of free-riding, which matches the finding of Plümper and Neumayer (2015) in NATO. The results do not rule out that alliances provide public goods, or that alliance participation reduces small states’ military spending, however.
If alliances do provide public goods, they may only do so in particular circumstances. Small states may only reduce military spending if they believe allied commitments are credible (Goldstein, 1995; DiGiuseppe and Poast, 2018), or leaders are inclined to lower spending (Fuhrmann, 2020). If that is the case, inquiry should emphasize sources of leverage in bargaining between alliance members (Morrow, 1991; Norrlof, 2010; Brooks et al., 2013; Johnson, 2015; Kim, 2016). Alternatively, the extent to which security from an alliance is a public good may vary with factors like technology and strategic doctrine (Sandler and Hartley, 2001).
Beyond scholarship, the results have implications for policymakers and pundits’ inclination to use “free-riding” rhetoric in alliance politics. Free-riding is inextricable from a public goods under-standing of alliances. But without a clear sense of when or whether alliances provide public goods, free-riding is an inaccurate description of reduced defense effort by alliance participants. Low defense effort could reflect cheap-riding on allied capability, or efficiency gains from specialization in pooled military resources.
Despite these results, collective action remains a central concept in international politics. It may be inappropriate to use alliances to understand collective action problems in international organizations more generally, as Olson and Zeckhauser (1966: 266–267) advocate, but collective action applies to other international organizations. Still, with little evidence of free-riding based on economic weight, policymakers and scholars should be cautious about relying on the public goods model to understand alliance politics.
Supplemental Material
sj-pdf-1-rap-10.1177_20531680211005225 – Supplemental material for Reassessing the public goods theory of alliances
Supplemental material, sj-pdf-1-rap-10.1177_20531680211005225 for Reassessing the public goods theory of alliances by Joshua Alley in Research & Politics
Footnotes
Acknowledgements
Thanks to Molly Berkemeier, William Clark, Matthew Fuhrmann, Hyeran Jo, Carlisle Rainey, Weiwin Yin, the editor and two anonymous reviewers for helpful comments.
Correction (June 2025):
Declaration of conflicting interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author received no financial support for the research, authorship, and/or publication of this article.
Supplementary materials
The supplementary files are available at http://journals.sagepub.com/doi/suppl/10.1177/20531680211005225. The replication files are available at
.
Carnegie Corporation of New York Grant
This publication was made possible (in part) by a grant from the Carnegie Corporation of New York. The statements made and views expressed are solely the responsibility of the author.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
