Abstract
Scholarship claims that diversity undermines trust and cooperation. Critiques focus on studies’ inability to discern diversity’s causal effects. In fact, most studies are unable to distinguish diversity (i.e., mixture) and marginalized group share (e.g., percentage Black). The authors argue for preserving this distinction and identify obstacles to doing so. First, homogeneously disadvantaged communities are acutely underrepresented in North America and Europe, the settings of most diversity research. The second issue, a case of the ecological fallacy, concerns our inability to infer associations between individual outcomes and diversity from associations between macro-level outcomes and diversity. Much diversity research would be better served by using group share measures that align with the in-group/out-group theories they draw on to motivate research and explain findings. The authors clarify the data and analytic requirements for research that seeks to draw conclusions about diversity per se. Practically, the distinction between diversity and marginalized group share is also relevant for policy.
An active, cross-disciplinary, and controversial social science literature has linked diversity to undesirable outcomes including lower trust, civic participation, and public goods provision (for reviews, see Dinesen, Schaeffer, and Sønderskov 2020; Dinesen and Sønderskov 2018; van der Meer and Tolsma 2014). Scholars have challenged these studies for inferring a causal effect of diversity without accounting for confounders (Abascal and Baldassarri 2015; Portes and Vickstrom 2011).
In this article, we draw attention to a more fundamental issue that precedes considerations of causality: studies about the relationship between racial/ethnic diversity and social outcomes do not distinguish racial/ethnic diversity from non-White or immigrant shares, 1 conceptually or empirically. Although the reasons for this have not gone entirely unacknowledged (Kustov and Pardelli 2018; Schaeffer 2013; Uslaner 2010), scholars have generally failed to fully appreciate the consequences for data collection, modeling, and inference.
Analytical clarity is imperative to the study of diversity. In this article, we make the case for preserving the distinction between diversity (i.e., heterogeneity) 2 and non-White or immigrant shares (e.g., percentage Black or Latino in the United States or percentage foreign-born or children of immigrants in Europe). 3 Our contributions are twofold. First, we argue that most research on racial/ethnic diversity in North America and Europe is unable to disentangle heterogeneity from non-White or immigrant shares. A major obstacle stems from the underrepresentation of relatively homogeneously non-White or homogeneously immigrant communities in most Western contexts. 4 Without homogeneously non-White communities, homogeneous communities (and their residents) may be better off because they are homogeneous or because they are predominantly White (or, more generally, advantaged). Aggravating this empirical limitation is the fact that most studies of diversity explain their results using theories that are predicated on in-group/out-group shares, not heterogeneity, and are thus not captured by measures of heterogeneity.
Our second contribution is to demonstrate that macro-level correlations between diversity and social outcomes are not especially informative of micro-level behavior. This case of the ecological fallacy has gone unnoticed in the diversity literature. We provide analytical proof that a macro-level association with diversity is consistent with at least two different micro-level processes. On one hand, individual attitudes and behavior might be associated with diversity; on the other, they might be associated with group shares. This problem can be avoided by focusing on individual- or group-level outcomes 5 rather than population-level outcomes.
To conclude, we delineate the empirical requirements—in terms of data, analysis, and assumptions—for research to advance claims about associations with diversity.
A Review of the Diversity Literature
Putnam (2007) famously reported a negative association between racial/ethnic diversity in U.S. census tracts and residents’ reported trust in neighbors, concluding that people “hunker down” in the face of diversity, at least in the short term. Alesina and La Ferrara (2000) similarly documented a negative association between diversity and reported trust across U.S. localities. Similar results obtain in Australia (Leigh 2006), the United Kingdom (Laurence 2011), Germany (Gundelach and Freitag 2014; Gundelach and Traunmüller 2014), and Denmark (Dinesen and Sønderskov 2015). And, cross-nationally, diversity is associated with lower average reported trust (Anderson and Paskeviciute 2006; Delhey and Newton 2005; Knack and Keefer 1997). Looking beyond trust, research has linked local diversity to lower participation in voluntary organizations (Alesina and La Ferrara 2002; Costa and Kahn 2003b), lower community attachment (Laurence and Bentley 2016), weaker civic norms (Knack and Keefer 1997), more military desertions (Costa and Kahn 2003a), and lower collective action (Vigdor 2004).
These studies have repeatedly attracted criticism for advancing causal claims about the effects of diversity without fully accounting for compositional differences between diverse and homogeneous communities and their residents. Examining Putnam’s (2007) data, Abascal and Baldassarri (2015) found the negative association between diversity and trust disappears when they control for differences between the residents of diverse and homogeneous communities in terms of race/ethnicity, economic conditions, and residential stability. Race/ethnicity, in particular, plays a central role: non-Whites are more likely than Whites to live in diverse communities, and they also report lower trust. 6 Additional work underscores marked socioeconomic differences between heterogeneous and homogeneous communities. For example, studies in the United States, the United Kingdom, and Europe have found that heterogeneous communities are characterized by higher levels of economic deprivation, 7 which are, in turn, associated with lower trust. When studies control for economic factors, the negative association between diversity and trust is weakened or disappears (Fieldhouse and Cutts 2010; Ivarsflaten and Strømsnes 2013; Letki 2008; Marschall and Stolle 2004; Sturgis et al. 2011). 8 Some studies that controlled for economic deprivation have even found that diversity predicts higher reported trust under certain circumstances (Bécares et al. 2011; Tolsma, van der Meer, and Gesthuizen 2009).
Finally, residential segregation and thus limited opportunities for intergroup contact, rather than diversity, might be at the root of lower trust (Laurence 2017; Stolle, Soroka, and Johnston 2008; Sturgis et al. 2011; Uslaner 2012). Indeed, racial/ethnic diversity has grown even in U.S. communities that have witnessed steady or rising White–non-White segregation (Lichter, Thiede, and Brooks 2023; see also Menendian, Gambhir, and Gailes 2021). In fact, field experiments on the effects of actual contact, rather than geographic proximity, have found qualified support for the premise that contact reduces prejudice (Finseraas and Kotsadam 2017; Lowe 2021; Mousa 2020). Contact was also shown to increase native majorities’ trust toward immigrants (measured behaviorally) in a field experiment in which soldiers were assigned to rooms with or without ethnic minorities (Finseraas et al. 2019).
Besides confoundedness with other community characteristics, scholars have critiqued the heterogeneity indexes used to capture diversity for conceptual reasons: for quantifying differences along a single dimension (Steele et al. 2022), for obfuscating the distinction between advantaged and disadvantaged groups (Abascal and Baldassarri 2015), and for equating different numeric compositions (Posner 2004). These critiques complement a rich, mostly qualitative literature on conceptualizations of “diversity,” many of which accuse the diversity concept of equating different kinds of difference (Bell and Hartmann 2007; Berrey 2015; Lentin and Titley 2008). These critiques remind us that if heterogeneity indexes fail to distinguish between different kinds of groups, it is because, and not despite, such measures are faithful to a conception of diversity that depends only on the number of groups in a population and the distribution of people across them (Blau 1977).
Scholars will no doubt continue to debate whether correlations with diversity measures imply that diversity affects social outcomes, or whether they are compositional artifacts of deep-seated differences between diverse and homogeneous communities and their residents. We do not enter that fray here. Our concern precedes these debates: we call for preserving the analytic distinction between diversity on one hand and non-White or immigrant share on the other, against the overarching tendency to confuse the two, especially in contexts, such as North America and Western Europe, in which homogeneously non-White and immigrant communities are underrepresented. For instance, in a recent review and meta-analysis of 87 studies on the association between diversity and self-reported trust, Dinesen et al. (2020) acknowledged this problem, noting that in almost all of the studies they consider, measures of diversity overlap strongly with non-White or immigrant shares. Regardless, the authors adopt a “broad” definition of diversity that encompasses both aspects. Leaning on this definition in their meta-analysis, the authors count as measures of diversity the same measures that other authors have described as confounding factors, such as non-White share and immigrant share. By our count, almost 60 percent of the studies contributed one or more estimates on the basis of group share to the meta-analysis. 9 Pooling estimates from these studies, Dinesen et al. (2020) found a significant, modest association between measures of diversity and non-White or immigrant share on one hand and lower trust on the other. From this, the authors concluded, “the observed negative relationship thus first and foremost reflects a contextual effect . . . of ethnic diversity on social trust” (pp. 455–56). This conclusion clashes with one of the author’s own readings a few years earlier: recognizing that various diversity measures are essentially indistinguishable from “mere minority concentration,” Schaeffer (2013) wrote, “much of the [European] research on ethnic diversity and social cohesion is actually about majority responses to minority concentration and tells us little about diversity effects per se” (p. 762).
In the remainder of the article, we do not distinguish between the range of desirable social outcomes considered by the diversity literature, which include interpersonal trust, civic participation, and public goods provision, among others. Instead, we use the terms prosociality and prosocial outcomes as shorthand for all of them.
The Case for Distinguishing Diversity from Marginalized Group Share
The distinction between diversity and non-White or immigrant shares may seem subtle, but the empirical expectations implied by claims about the effects of diversity diverge from those that follow from theories that hinge on non-White or immigrant shares, or out-group share generally.
How Scholars Explain Associations with Diversity
How do scholars motivate the conclusion that diversity engenders negative social outcomes? In most cases, negative associations between diversity and prosocial outcomes—whether at the individual (e.g., Putnam 2007) or aggregate level (e.g., Knack and Keefer 1997)—are explained with reference to individual attitudes and behavior toward in-group and out-group members. 10
Most commonly, scholars draw on classic social psychological theories of intergroup conflict and threat. 11 According to intergroup conflict and threat theories, spatial and social proximity intensify hostility between groups competing for scarce resources (Blalock 1967; Sherif et al. 1961). 12 Reworking conflict/threat theories, Dinesen and Sønderskov (2015) posited that people read the presence of out-group members as a cue that the average, or generalized, other is more likely to be an out-group member (i.e., someone they mistrust) (see also Dinesen and Sønderskov 2018). Mistrust and aversion toward out-group members could be rooted in competition or simply in ignorance, as implied by contact theory (Allport 1954).
According to other explanations, negative outcomes arise not from out-group hostility, but from coordination challenges across groups (Habyarimana et al. 2007). Trust and solidarity may be harder to develop toward out-group members, because of a lack of communal knowledge, social ties and coordination opportunities, as well as biased expectations concerning out-group behavior. Sanctioning might also be less effective toward out-group members, both because it is less likely to be meted out and because the costs of sanctioning by out-group members may be lower (Habyarimana et al. 2007). Finally, van der Meer and Tolsma (2014) theorized that the absence of a shared culture with out-group neighbors may foster personal feelings of anomie.
Whether they stress competition, ignorance, opportunities or expectations, all of these theories rely on a categorical distinction between the in-group and the out-group and they imply that interactions with in-group versus out-group members produce better outcomes. (We therefore refer to these collectively as “in-group/out-group theories.”) None of these arguments imply that people’s attitudes or behavior respond to diversity, as we demonstrate next.
Diverging Predictions of Diversity and Group Share Theories
Studies that aim to test theories of in-group/out-group dynamics—or use them to explain their results—should use measures of group share, rather than measures of diversity (i.e., heterogeneity). This is not mere semantics. The predictions of in-group/out-group theories diverge from those implied by claims about diversity and by the use of diversity measures.
Consider a scenario with only two groups: Whites and non-Whites. Figure 1 plots the expected relationships bet-ween prosocial behavior and the proportion of non-Whites, for Whites. The straight dashed line represents expectations on the basis of in-group/out-group theories: as the share of non-Whites increases, prosocial behavior among Whites declines. The curved solid line represents the relationship between non-White share and prosocial behavior, again for Whites, implied instead by claims about diversity. If diversity undermines prosociality, we should expect a nonmonotonic, curvilinear relationship, in which prosociality is maximized in relatively homogeneously White and relatively homogeneously non-White contexts and minimized where the population is evenly split. This is the relationship that studies tacitly test when they use heterogeneity indexes, like the Herfindahl-Hirschman index, which are sensitive only to the number and relative sizes of groups, not their identities (for discussions see, e.g., Abascal and Baldassarri 2015; Posner 2004; Steele et al. 2022).

Hypothesized effects of diversity and group share on prosocial behavior for Whites in a world with two groups: Whites and non-Whites.
Figure 1 reveals where the two approaches diverge. In majority-White communities, both diversity and non-White/group share accounts predict that Whites become less prosocial as non-White share rises. In this part of the distribution, heterogeneity and non-White share increase in tandem. This is true until communities hit 50 percent non-White. After that, more non-Whites generate less heterogeneity, not more, and the theoretical accounts diverge. If prosocial behavior hinges on out-group share, then Whites will continue to become less prosocial as non-White share rises. If, however, diversity drives Whites’ behavior, Whites will become more prosocial as communities become more homogeneous, albeit homogeneously non-White. In other words, claims about diversity and diversity measures imply that Whites will be more prosocial in a community that is nearly 100 percent non-White than a community that is just 50 percent non-White. This prediction not only contradicts the in-group/out-group theories on which diversity studies draw, it is also implausible.
Why has the issue been overlooked? We believe two factors play a role. The first is widespread elision—both rhetorical and conceptual—between diversity and non-White or immigrant shares. Research from the United States suggests that, although people uniformly associate the term diversity with heterogeneity, some of them, including liberal Whites, also associate it with non-White share (Abascal and Ganter 2022; Abascal, Xu, and Baldassarri 2021). 13 The second factor stems from (1) the dearth of non-White scholars working in this area combined with (2) the fact that relatively homogeneous communities in North America and Europe, where this literature has flourished, are largely homogeneously White communities. For White scholars in majority-White contexts (and White readers), more diversity means more out-group members (i.e., more non-Whites). Equipped with theories of intergroup conflict/threat, it makes sense to them to expect that people will react badly to the presence of out-group members. But for, say, a Black American scholar, more diversity may correspond to more in-group members. And for her, the consequences of being around relatively more in-group members, especially if as a result of exclusion from White spaces, are neither obvious nor obviously desirable. The overlap between diversity and minority share in majority-White contexts has not only obscured the distinction between them, it also represents a major empirical challenge to research about diversity.
Obstacle I: The Underrepresentation of Homogeneously Non-White Communities
The first empirical hurdle that faces research on diversity stems from the fact that in most Western European and North American countries, relatively homogeneous communities are, by and large, predominantly White or native-born communities (Baldassarri and Abascal 2020; Koopmans and Schaeffer 2016; Kustov and Pardelli 2018; Uslaner 2012). For example, more than 88 percent of all metropolitan and micropolitan statistical areas in the United States are majority White, and about 12 percent are majority non-White (Figure 2). In fact, of 990 areas in the United States, just 20 are more than 75 percent non-White. Majority-non-White communities are also underrepresented among census tracts (Abascal and Baldassarri 2015). A few scholars have also remarked on the underrepresentation of “majority-minority” communities in European countries and its implications for diversity research (Koopmans and Schaeffer 2016; Schaeffer 2013). More generally, rising diversity in North America and Western Europe is driven by non-White immigration (Baldassarri and Abascal 2020).

Association between diversity and non-White share: U.S. metropolitan and micropolitan statistical areas. Diversity (i.e., heterogeneity) is represented by a Herfindahl-Hirschman index based on five groups: Whites, Blacks, Latinos, Asians, and all others.
When homogeneous communities are mostly native, White communities, it is not possible to disentangle correlations with diversity from correlations with non-White or immigrant shares. Homogeneous communities might be better off not because they are homogeneous, but because they are homogeneously advantaged. The solution is to study contexts where different kinds of relatively homogeneous communities are represented. Kustov and Pardelli (2018) did this in a study of Brazil; they found that diverse municipalities have lower public goods provision than homogeneously White communities, but higher public goods provision than homogeneously Black communities.
If diversity and marginalized group share overlap in majority-White contexts, is the distinction relevant for research on North America and Western Europe? It is, for at least three reasons. First, in such contexts, diversity and out-group share coincide only for majority-group members. In our two-group example, for instance, an increase in diversity means more out-group members for Whites but fewer out-group members for non-Whites. By ignoring this and drawing conclusions about the overall effects of diversity, studies assume (and imply) that the effects of diversity are the same for all groups. Despite relying on a predominantly White sample, for example, Putnam’s (2007) abstract reads, “in ethnically diverse neighborhoods residents of all races tend to ‘hunker down’” (p. 137). This tendency to generalize about the effects of diversity stands in contrast with experimental studies on intergroup contact, which focus on how members of one group (typically the majority) reacts to contact with out-group members, rather than relying on designs or drawing conclusions about the synthetic effects of a generic “diversity” (e.g., Finseraas et al. 2019).
The distinction between diversity and marginalized group share is also relevant because many majority-White countries are witnessing the growth of non-White and immigrant populations. This is why many scholars have become interested in the effects of diversity to begin with. As a result of demographic changes, even majority-White countries will come to have more majority-minority communities, that is, communities where diversity and marginalized group share are at odds. This is already the case for subcontexts such as schools and workplaces. Thus, the distinction between diversity and marginalized group share will become increasingly important moving forward.
Finally, as we have shown, the theories mobilized in studies of diversity in fact make predictions about out-group share and exposure to it. Group share measures, rather than heterogeneity measures, are a better operationalization of the theoretical constructs embedded in conflict, threat, and other theories used in studies of diversity.
Although we are not the first scholars to recognize the overlap between heterogeneity and non-White or immigrant shares, empirical research has yet to effectively address this limitation (Kustov and Pardelli 2018 is an exception). Moreover, the underrepresentation of homogeneously non-White and immigrant communities is not the only obstacle for diversity research.
Obstacle II: The Problem of Aggregation
Social scientists have long been warned about the risks of drawing conclusions at the individual level from associations at the ecological level (“ecological fallacy”; Robinson 1950), and a rich literature attempts to identify methods to circumvent this problem (Goodman 1953, 1959; King 1997). The same caveat applies here: an aggregate-level association with diversity can arise from different micro-level processes, and it is not unequivocal evidence that individual residents are less prosocial in more diverse communities. This is, however, what prior studies imply, when they explain aggregate associations by claiming that individuals respond to diversity or when they invoke aggregate associations to justify the use of diversity indexes in individual-level analyses.
What could explain an aggregate association between diversity and prosocial behavior? Figure 3 illustrates several possible scenarios (of many) and reports group-level and population-level associations for each. First, consider the case where individual prosociality is correlated with diversity at the individual level and across groups (Figure 3A). The curved dotted and dashed lines depict the associations between group share and prosociality among White and non-White individuals, as implied by studies that frame their questions and findings in terms of associations with “diversity” at the individual level or use diversity indexes in individual-level analyses. Here, members of both groups are less prosocial in heterogeneous communities, and they are more prosocial behavior in relatively homogeneous communities, whether homogeneously White or homogeneously non-White. The solid gray line depicts aggregate levels of prosocial behavior in this case, obtained by averaging over the group-level curves. The macro-level association between prosocial behavior and non-White share is curvilinear. From an aggregate perspective, the macro-level association makes sense: everyone is least prosocial in the most diverse communities, and as a result, aggregate levels of prosocial behavior are also lowest in these communities.

Examples of macro- and micro-associations between group share and prosocial behavior. Dotted and dashed lines represent group-level average prosocial behavior, and the thick solid line represents the population average prosocial behavior, derived from the aggregation of the group-level curves. In (A), levels of prosociality are similar in both groups and are associated with heterogeneity; the three curves perfectly overlap. In (B), prosociality is associated with in-/out-group share. In (C), prosociality is associated with heterogeneity but groups vary in their overall level of prosociality.
However, a different micro-level process is also consistent with a curvilinear association between group share and prosocial behavior at the macro level. In Figure 3B, prosociality for individuals in both groups is correlated not with diversity but with out-group share. The greater the share of non-Whites, the less prosocial Whites are and the more prosocial non-Whites are. This micro-level process also yields a curvilinear association between group share and aggregate prosocial behavior wherein prosocial behavior is lower in relatively heterogeneous communities and higher in more homogeneous ones. This happens because individuals are less prosocial as their in-groups come to represent a smaller and smaller share of a community. 14
In the Appendix, we develop and generalize on these examples, showing that a curvilinear association between group share and prosocial behavior at the macro level—the kind of association that might suggest a role for diversity—tells us little about the micro-level process that produced it. At the aggregate level, prosociality may vary across neighborhoods not only because residents are exposed to different racial compositions, but also because they may react differently to variations in racial composition. Both dynamics are simultaneous and indistinguishable at the aggregate level. Figure 3C even illustrates a case in which prosociality is indeed associated with diversity among both groups, but because baseline prosociality differs between groups, prosociality declines almost monotonically with non-White share at the macro level. In settings with more than two groups, or in which individuals behave differently within identifiable groups, the connection between individual-level and population-level associations is even looser.
Note that our recommendation to examine micro data does not address questions regarding causal inference and potential confounding. Abascal and Baldassarri (2015), for example, showed that the association identified by Putnam (2007) is accounted for by the fact that “nonwhites report lower trust and are overrepresented in heterogeneous communities” (p. 722).
The Way Forward: How to Study Diversity at Macro and Micro Levels
How can research that is genuinely interested in investigating and advancing claims about diversity per se overcome the obstacles we have identified? First, to claim an outcome is associated with diversity at a population level, one needs to observe the full range of relatively homogeneous communities, and not only (or overwhelmingly) homogeneously native, White communities. This is difficult, but not impossible in observational studies of real-world communities. For one, researchers can look beyond North America and Western Europe. Kustov and Pardelli’s (2018) study of public goods provision across Brazilian municipalities is an excellent example, and it is not the only one (see also Levine et al. 2014). In fact, studies in developing countries that predate most of those covered here have fruitfully investigated the relationship between ethnic diversity and societal problems, such as stalled economic growth (Collier 1998; Easterly and Levine 1997; Posner 2004) and violent conflict (Blattman and Miguel 2010; Fearon and Laitin 2003). These studies, most of which make cross-national comparisons, focus on parts of the world, particularly Africa, with the full range of relevant groups represented in both relatively homogeneous and heterogeneous communities as well as long-standing histories of ethnic diversity (Posner 2004).
Even within North American or European countries, researchers could home in on areas within which lower level communities span the spectrum of racial compositions. Recall that fewer than 12 percent of all metropolitan and micropolitan statistical areas in the United States are more than 50 percent non-White. In major U.S. cities, such as New York and Chicago, however, non-White neighborhoods are the norm. Of 2,111 census tracts in New York City, 15 for example, about two thirds (68.4 percent) are more than 50 percent non-White. In Chicago, that figure is 67.5 percent.
Scholars can also leverage experimental designs to “create” the kinds of “communities” that are underrepresented in the real world and also to evaluate causal claims about the effects of diversity. For example, a researcher could randomly assign participants to groups that range in composition from predominantly White, to heterogeneous White and Black, to predominantly Black. Experimental researchers have explored behavior in relatively homogeneous and heterogeneous settings (Habyarimana et al. 2007) and systematically varied the number and characteristics of in-group and out-group members (Adida et al. 2016). Surprisingly, these designs rarely incorporate homogeneously marginalized groups (Gereke, Schaub, and Baldassarri 2022). Even in the largely experimental organizational literature on diversity and its relationship to deliberation, decision making and performance (for a review, see Carter and Phillips 2017), experiments on racial/ethnic diversity operationalize homogeneity using homogeneously majority groups, most commonly homogeneously White ones (e.g., Antonio et al. 2004; Levine et al. 2014; Sommers 2006).
To study associations with diversity at the group level—say, for Whites—researchers must observe a substantial number of relatively homogeneously non-White or homogeneously immigrant communities as well as a substantial number of Whites in homogeneously White and homogeneously non-White communities. This is not easy: the individual members of any group are, by definition, underrepresented in homogeneously out-group communities. If the researcher does not want to assume, reasonably so, that Whites react similarly to all non-White groups, she must further observe a substantial number of Whites in specifically homogeneously Asian communities, homogeneously Black communities, and so on. Or she can restrict her claims to Whites in diverse White and Asian communities, for example, a constraint that may prove unworkable given the rise of multiethnic communities (Zhang and Logan 2016).
Importantly, group-level analyses require researchers to assume that within-group variation is negligible, that is, that subgroups that cannot be identified in the data (e.g., in many cases, Mexican and Cuban Americans) do not vary widely in how they react to community composition from each other or from the larger group that can be identified (e.g., Latinos). If they do, the obstacles that face “group-level” analyses are indistinguishable from those that face population-level analyses.
Alternatively, individual-level data can be used to investigate associations with diversity for all individuals, regardless of background. Such research must clear additional hurdles. Say a researcher is studying a context with just two groups, Whites and Blacks; to avoid the implausible assumption that Whites respond to Blacks in the same way that Blacks respond to Whites (see Uslaner 2012), she would need to observe a substantial number of Whites in predominantly Black communities and a substantial number of Blacks in homogeneously White communities. Where homogeneously Black communities are scarce, the first set of observations will be exceedingly difficult to come by. With more groups, the difficulty grows exponentially. With three groups—say, Whites, Blacks, and Latinos—a researcher would need substantial numbers of observations of (1) Whites in predominantly White, (2) predominantly Black, and (3) predominantly Latino communities; (4) Blacks in predominantly Black, (5) predominantly White, and (6) predominantly Latino communities; and (7) Latinos in predominantly Latino, (8) predominantly White, and (9) predominantly Black communities, among others. A researcher who wanted to study the five major U.S. racial/ethnic groups would need substantial numbers of observations in more than 25 cells.
All diversity research, whether at the group or individual level, requires researchers to define what counts as a “group.” Naturally, the people who are classified as members of a racial/ethnic group may not be recognized, or recognize themselves, as members of a coherent group with shared interests. This issue is obvious for analyses that use categories, such as “non-White” or “immigrant,” that people do not immediately recognize, for example, from census forms. However, the critique applies to all analyses that treat identification as an unexamined proxy for group membership. All racial/ethnic categories are constructed, they encompass heterogeneous individuals, and their “groupness” is an empirical possibility rather than an ontological necessity (Brubaker 2004).
Importantly, diversity researchers do not need to assume that only those racial/ethnic categories defined in the data are consequential, or even that people explicitly identify with those categories. They do need to assume that fairly salient and consequential lines of social division separate people who identify with, or are identified with, different racial/ethnic categories. Indeed, if the gap between lay and analytic categories runs too deep, it can foreclose interpretation of attitudes and behavior altogether.
In sum, research that seeks to study and advance claims about associations with diversity among individuals, without reference to their race/ethnicity, must clear an exceptionally high bar. Although many studies share this ambition (see Dinesen et al. 2020), most fall short.
Conclusion
Thirty-six hundred journal articles published in 1999 16 refer to “racial diversity.” 17 In 2020, that number was 17,300. In that interval, the number of articles published annually about diversity increased almost fivefold, and this was not due to increases in publishing on the topic of race/ethnicity. 18
Research has advanced myriad claims about the effects diversity for trust, participation, and cooperation. This research has been criticized for advancing causal claims without compelling causal identification strategies. However, the diversity literature suffers from a more elementary problem: most studies cannot distinguish diversity from non-White or immigrant shares, conceptually or empirically. Instead, scholars frame their questions and results using theories that make predictions about group shares and that are not captured by diversity measures.
In this article, we have identified two obstacles to distinguishing diversity from marginalized group share, and in so doing, clarified best practices for diversity research moving forward. The first obstacle stems from the underrepresentation of homogeneously non-White or homogeneously immigrant communities, especially in North America and Europe, where most diversity research is based. The second obstacle concerns the inability to infer micro-level associations with diversity from macro-level associations, in line with insights from the ecological inference literature.
To recover the association between prosociality and diversity for a specific racial/ethnic group, say Whites, research requires data on sufficient numbers of Whites in homogeneously White and homogeneously non-White communities. To make inferences about the association between prosociality and diversity for individuals, regardless of race/ethnicity, research requires data on sufficient numbers of Whites in homogeneously White and homogeneously non-White communities and sufficient numbers of non-Whites in homogeneously White and homogeneously non-White communities (in a two-group scenario). If we think a context is better understood in terms of three, four, five or more groups, rather than two (e.g., White and non-White), the types of observations we need grow exponentially. Without these observations, claims about diversity should be appropriately restrained. The result, ironically, is that to understand diversity in diverse groups, we should not rely primarily on measures of diversity.
Toward a Theory of Diversity
We have documented a lack of empirical evidence to support the claim that individual prosociality is associated with diversity, rather than group shares. Here, we briefly touch upon a related shortcoming, namely, the lack of well-developed theories that could account for individual responses to diversity, as opposed to group shares. In fact, we have been able to identify only three mechanisms that could generate a curvilinear relationship between group shares and prosociality that diversity studies imply: normative homogeneity of sanctioning, status ambiguity, and expectations related to community composition. 19
First, people might be more wary of sanctioning in homogeneous communities, where a univocal normative order is inferred from racial/ethnic homogeneity. Sanctioning is effective at promoting prosocial behavior, and coethnics might be better equipped to find, and hence sanction, each other (Habyarimana et al. 2007). If sanctioning, real or perceived, is a function not just of coethnicity, but of inferred normative homogeneity within communities, then individuals might behave more prosocially where (they think) their neighbors are more similar to each other.
Second, prosocial behavior could be depressed in diverse communities as a result of rank ambiguity between individuals from different backgrounds. Conflict is more likely to emerge between individuals whose relative symbolic, economic, or political status is ambiguous (Gould 2003). If status is also constructed on the basis of numbers (Koopmans and Schaeffer 2015; see also Legewie and Schaeffer 2016), the relative status of two individuals from different groups is more ambiguous where those groups are evenly split. Here, interpersonal conflict is more likely.
Third, people might engage in prosocial behavior where they expect others will do the same. If expectations are affected not just by others’ identities, but also by community composition, then expectations and subsequent behavior could assume a curvilinear shape across group shares. Take the hypothetical example of Whites in White and Black neighborhoods. If Whites believe other Whites become less prosocial as Black share grows, and they believe Blacks become more prosocial as Black share grows, then Whites in 50 percent White and 50 percent Black neighborhoods will think their neighbors are least prosocial, and they will follow suit.
The Practical Implications of Diversity Research
The conclusions of diversity research have implications outside academia, where studies on this topic have garnered considerable attention. For example, Putnam (2007) was covered in both mainstream and right-wing media, and the study continues to circulate in far-right corners of the Internet. Coverage of this research has tended to converge on similar policy lessons: because diversity has negative consequences, then policies should aim to curb interracial mixing (Krikorian 2007); at the very least, they should not promote it (e.g., Thernstrom et al. 2012).
Consider the policy lessons that would flow from a different conclusion: not that people “of all races” report lower trust as communities become more mixed, but that Whites report lower trust as the share of non-Whites in their communities grows. The second interpretation might set off a quest to allay Whites’ biases toward non-Whites or mitigate its consequences, not to curb the number of non-Whites admitted into majority-White schools, neighborhoods, or countries.
These recommendations are at odds, but they follow from subtly different interpretations of the same empirical patterns. The point is not that scholars should cease to make claims about diversity because these claims support right-wing policies. 20 The point is that they need to distinguish diversity from marginalized group share, because this distinction has practical implications, as well as conforming to best practices of aligning theoretical constructs and empirical measures.
Supplemental Material
sj-pdf-1-srd-10.1177_23780231231196507 – Supplemental material for Greater Diversity or Fewer Whites? Disentangling Heterogeneity and Marginalized Group Share at Macro and Micro Levels
Supplemental material, sj-pdf-1-srd-10.1177_23780231231196507 for Greater Diversity or Fewer Whites? Disentangling Heterogeneity and Marginalized Group Share at Macro and Micro Levels by Maria Abascal, Flavien Ganter and Delia Baldassarri in Socius
Footnotes
Acknowledgements
For valuable feedback, we are grateful to Elizabeth Bruch, Filiz Garip, Andreas Wimmer, and the participants of the Social Demography Seminar at Harvard University, the Center for Migration and Development at Princeton University, the INTERACT final workshop at Bocconi University, and the Seminar for Comparative Social Analysis (SOC 237) at the University of California, Los Angeles. Thanks to Shannon Rieger for research support.
WITH AN APPENDIX BY
Daniel Lacker
Columbia University
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This project received funding from the National Science Foundation [CAREER #1845177] and the European Research Council [INTERACT Project #639584].
Supplemental Material
Supplemental material for this article is available online.
Notes
Author Biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
