Abstract
Research on linguistic biases shows that stereotypic expectancies are implicitly reflected in language and thereby subtly communicated to message recipients. Research on the Negation Bias shows that the use of negations (e.g., not stupid vs. smart) is more pronounced in descriptions of stereotype-inconsistent compared with stereotype-consistent behaviors. This article reports a replication study of the original research conducted in Dutch, using newly developed materials, and in five different languages: English, Dutch, Hungarian, Finnish, and Serbian. The results validate the existence of the Negation Bias in all five languages. This suggests that negation use serves a similar stereotype-maintaining function across language families.
Language use plays a crucial role in the formation and maintenance of social-category stereotypes (Beukeboom, 2014; Maass, 1999). Research has revealed a number of linguistic biases that show how speakers’ stereotypic expectancies result in systematic variations in language use when describing behaviors of categorized individuals. These biased descriptions, in turn, contribute to the consensualization of stereotypes within cultural groups (for a review, see Beukeboom & Burgers, 2019).
One linguistic bias is the Negation Bias (Beukeboom, Finkenauer, & Wigboldus, 2010). Research on the Negation Bias reveals that the use of negations (compared with affirmations) is more pronounced in descriptions of stereotype-inconsistent compared with stereotype-consistent behaviors. Negations (e.g., The junkie was not deceitful) imply exceptions to the rule and simultaneously introduce stereotype-consistent concepts in communications about stereotype-inconsistent information. Consequently, negation use—in behavior descriptions of categorized individuals—tends to reaffirm and maintain existing stereotypes.
In this article, we aim to find further evidence for the negation bias by conducting a conceptual replication study of Beukeboom et al. (2010, Study 2) using larger samples and newly developed materials. Moreover, we aim to extend the original work by testing whether the Negation Bias (originally obtained across multiple experiments in the Dutch language) can be validated in other languages. The present study was conducted in five different languages from three language families: English and Dutch (Germanic languages), Hungarian and Finnish (Finno–Ugric language family), and Serbian (Slavic language family). This allowed us to test whether the Negation Bias functions in a similar manner across different types of languages.
Linguistic Biases and the Negation Bias
To understand how stereotypes become shared knowledge, it is crucial to focus closely on language use in communications about socially categorized people (e.g., Beukeboom, 2014; Beukeboom & Burgers, 2019; Collins & Clément, 2012; Maass, 1999; Wigboldus & Douglas, 2007). In often quite subtle ways, language reflects, constructs, and maintains beliefs about social categories. With regard to describing behaviors of categorized individuals, Beukeboom and Burgers (2019) distinguished two types of stereotype-maintaining biases. One type relates to what information is communicated, and predicts that speakers tend to predominantly communicate about stereotype-consistent (rather than stereotype-inconsistent) information (e.g., Bratanova & Kashima, 2014). The second type of stereotype-maintaining biases, which we focus on here, relates to how information about categorized individuals is formulated. For example, the Linguistic Intergroup Bias (Maass, Salvi, Arcuri, & Semin, 1989), Linguistic Expectancy Bias (Wigboldus, Semin, & Spears, 2000), Stereotypic Explanatory Bias (Sekaquaptewa, Espinoza, Thompson, Vargas, & von Hippel, 2003), and Irony Bias (Burgers & Beukeboom, 2016) all show how subtle variations in verbal formulations communicate what is (un)expected for a categorized target or category.
The significance of these linguistic biases in descriptions of others lies in the fact that they implicitly communicate stereotypes to message recipients. The language used in stereotype-inconsistent messages (i.e., concrete, explanatory, ironic, negations) causes recipients to infer lower essentialism for the described behavior, which implies that the behavior is seen as unexpected and likelier caused by situational circumstances than by enduring dispositional factors. In contrast, the language used in stereotype-consistent messages (i.e., abstract, unmarked, literal, affirmations) implies that the behavior is expected and likelier caused by the actor’s enduring dispositional characteristics than by situational circumstances. Thus, through subtle variations in language use, speakers implicitly transmit their stereotypic expectancies to recipients, leading to the stereotype being shared and maintained interpersonally (Beukeboom, 2014; Beukeboom & Burgers, 2019; Maass, 1999; Wigboldus & Douglas, 2007).
The Negation Bias (Beukeboom et al., 2010) fits this line of work. It hypothesizes that when people describe stereotype-inconsistent behavior of an actor, they will be more likely to use negations than when they describe stereotype-consistent behavior (Beukeboom et al., 2010). The linguistics literature posits that negations’ main function in natural language is denial, either of something explicitly stated or of an implicit pre-supposition implied in context (Tian & Breheny, 2016; Tottie, 1991). For example, a negation like “the train was not late this morning” is likelier used when the train is normally late than when it is normally punctual. Similarly, a statement like “Arsenal did not win” implies that someone expected that Arsenal would win, which is subsequently denied (Jordan, 1998; Moxey & Sanford, 2000). Thus, negations are meaningful in that they indicate that something is different, unusual, or contrary to an existing (implicit) expectancy (Jordan, 1998).
Experimental studies on processing and comprehension corroborate that negations are relatively more appropriate in communication about expectancy-inconsistent information. For instance, sentences with negations typically take longer to process and comprehend than affirmation sentences (Carpenter & Just, 1975; Deutsch, Gawronski, Strack, 2006; Gough, 1965), but this difference disappears when negations are used in an appropriate context (Glenberg, Robertson, Jansen, & Johnson-Glenberg, 1999; Tian & Breheny, 2016). One appropriate context (i.e., “a context of plausible denial”; Wason, 1965) is when an expectation needs to be noted or denied. Thus, negations seem more appropriate, and easier to process and comprehend, when denoting information that is inconsistent with expectations.
The Negation Bias (Beukeboom et al., 2010) links these ideas to stereotypes. When communicating about other people’s behavior, people automatically activate social-category stereotypes associated with a discussed target, either primed by a target’s appearance or by a category label used in the communication (Devine, 1989; Dijksterhuis & Van Knippenberg, 1996; Fiske, 1998; Lepore & Brown, 1997). Such (implicitly) activated social-category stereotypes facilitate the use of associated concepts in language use (e.g., junkies–deceitful). This can be in the form of an affirmation (e.g., the junkie is deceitful) to describe stereotype-consistent behavior, or a negation (e.g., the junkie was not deceitful), when target behavior is inconsistent with the activated stereotype. Moreover, following the pragmatic function of negations described above, in case of stereotype-inconsistent behavior, negations can function to denote an exception to this implicitly activated stereotype. Based on this, the original work on the Negation Bias (Beukeboom et al., 2010) hypothesized that negation use is likelier when a target person’s behavior is more unexpected.
Beukeboom et al. (2010) provided evidence for the Negation Bias in three experiments showing that people have a higher preference for negations to describe stereotype-inconsistent (vs. stereotype-consistent) behaviors of others. The first study was a within-participants, forced-choice experiment. Participants were presented with a series of sentence pairs describing an actor’s behavior. Each sentence in a pair described the same actor and behavior and had the same structure: one sentence contained a negated adjective (e.g., The rock musician did not leave a messy hotel room) and the other contained an affirmative antonym (e.g., The rock musician left a neat hotel room). Participants were asked to choose the sentence of each pair of which the choice of words felt most natural. Stereotype consistency was manipulated by varying the social category of the actors. The same sentence pairs where thus presented once with an actor for whom the described behavior was stereotypically unexpected (e.g., rock musician—leaving a neat hotel room), and once with an actor for whom the same described behavior was stereotypically expected (e.g., cleaner—leaving a neat hotel room). Results showed that participants were significantly more likely to choose the negation alternative in sentence pairs in which the actor’s behavior was stereotype-inconsistent, compared with sentence pairs in which the actor’s behavior was stereotype-consistent.
The second experiment used new stimulus sets (other actors and stereotypical behaviors) and the actor’s behavior in the sentence, and descriptions of this behavior (either using a negation or affirmation) were separated. That is, participants were presented with a factual sentence about an actor’s behavior (e.g., The professor scored high on the IQ test), followed by two possible descriptions: a negation description (e.g., He is not stupid) and an affirmation description (e.g., He is smart). For both descriptions, participants indicated on 7-point scales the extent to which they found it an appropriate description to use when describing the behavior in the sentence. Again, stereotype consistency was manipulated by varying the social category of the actor in the behavioral sentence (e.g., professor, garbage man). Results revealed a significant interaction between stereotype consistency and description type. In line with the Negation Bias hypothesis, participants rated negation descriptions (e.g., He is not smart) as more appropriate to describe stereotype-inconsistent behaviors (e.g., The professor scored low on the IQ test) than for stereotype-consistent behaviors (e.g., The garbage man scored low on the IQ test). In addition, for affirmation descriptions the opposite pattern was observed; affirmation descriptions (e.g., He is smart) were rated as more appropriate to describe stereotype-consistent behaviors (e.g., The professor scored high on the IQ test) than stereotype-inconsistent behaviors (e.g., The garbage man scored high on the IQ test).
The third experiment tested whether people spontaneously produce the Negation Bias when they are free to formulate their own person descriptions. Participants spontaneously described their impression of a male or female person performing either the stereotypically female sport jazz ballet, or the stereotypically male sport rugby. Results showed that participants spontaneously used more negations in descriptions of persons performing a stereotype-inconsistent behavior (Man performing jazz ballet and woman playing rugby) than in descriptions of persons performing stereotype-consistent behavior (Man playing rugby or woman performing jazz ballet).
In addition to the effects of the experimental manipulations, mediational (Study 2) and correlational analyses (Study 3) demonstrated that perceived unexpectedness of the described behavior was positively related to (rated appropriateness of) negation usage. Finally, a fourth experiment focused on what message recipients infer from behavior descriptions containing either a negation or affirmation. This showed that, compared with affirmations, negations induce recipients to infer that the described characteristics were unexpected and of lower essentialism (i.e., less enduring, dispositional) for the target. Together, this provided evidence for the notion that negation use varies as a function of stereotypic expectancies, induces stereotype-confirming inferences in recipients, and thereby plays a role in stereotype transmission and maintenance.
In the present article, we aim to replicate and extend the main study on biased negation production (i.e., Beukeboom et al., 2010, Study 2). Many scholars nowadays agree that replication studies are essential for cumulative theoretical development (Brandt et al., 2014; Open Science Collaboration, 2015). Well-conducted replications, using large samples, can provide confirmation or disconfirmation of published results, and, when successful, greater confidence about the veracity of the predicted effect. This is also what we aim to achieve with regard to the Negation Bias.
Second, we extend the original study by testing whether the Negation Bias (originally obtained in Dutch) generalizes to other languages. Interestingly, all human languages have elements to express negation (Horn, 1989; Tian & Breheny, 2016). As opposed to affirmations, negation descriptions contain a negation marker (such as not or no) to deny the truth value of a particular proposition. Languages usually have more than one grammatical way to express negation. The Negation Bias focuses on syntactic negations that are expressed by means of a separate particle like NOT (e.g., not smart). We conducted five identical replication studies in five different languages belonging to different language families: English and Dutch (Germanic language family), Finnish and Hungarian (Finno–Ugric language family), and Serbian (Slavic language family). Each of these languages allow for syntactic negations, for example, not friendly is translated as, respectively, niet vriendelijk (Dutch), ei ystävällinen (Finnish), nem barátságos (Hungarian), and nije druželjubiv (Serbian). Because it can be expected that the pragmatic and cognitive mechanisms behind the Negation Bias function similarly across languages, we hypothesized that the Negation Bias would replicate in each of these languages.
Third, we extend the original study by testing whether the Negation Bias generalizes to other materials with other social categories and stereotypes. We developed new material sets with social-category stereotypes that were applicable across languages. This also allowed us to slightly improve the materials. While in the original study (Beukeboom et al., 2010) a few stimuli were based on general expectancies about situations (e.g., man attending a birthday party vs. funeral) rather than stereotypic expectancies (i.e., expectancies about social categories), the present new materials only included social categories, as this is the focus of the negation bias.
Method
We provide all materials and report how we determined sample size, all data exclusions (if any), all manipulations, and all measures in the studies.
Participants
Five participant samples of adult native speakers of English, Dutch, Hungarian, Finnish, and Serbian were recruited to complete an online Qualtrics questionnaire in their native languages. The survey link was spread via email to a Dutch panel Kieskompas, 1 and/or via social media (Hungarian, Finnish, Serbian). These participants were not paid, but were offered a ticket in a lottery for cinema vouchers. English participants were recruited in the United States from the Prolific panel (www.prolific.ac) and were paid as part of their panel membership.
Our design (see below) employs 6 material sets each with 2 observations in both the stereotype-inconsistent and stereotype-consistent conditions. We aimed for sample sizes > 100 (no maximum was set), so that we would have at least 1,200 (6 × 2 × 100) observations per condition. Based on the large effect obtained in the original study (Beukeboom et al., 2010, Study 2), 2 we estimated that this sample size enabled a sufficiently powered study (Brysbaert & Stevens, 2018). In all samples, analyses were only conducted after data collection had ended. 3
Before analyzing data, we excluded participants who failed to meet our predetermined inclusion criteria: that is, age <18 years, nonnative speakers of the study language, incomplete questionnaires, participants who took extremely long (>a full day). Only the English version included attention-check items (Oppenheimer, Meyvis, & Davidenko, 2009), based on which four cases were excluded. In both the English and Serbian sample, we excluded one case for being extremely fast in combination with series of equal responses. No other data were excluded.
Size and demographics of the final samples were as follows:
Dutch: N = 234; 71% male (n = 167); Age M = 56.9, SD = 15.1, range (21-92)
English (United States): N = 111; 56% male (n = 62); Age M = 28.4, SD = 9.1, range (18-59)
Hungarian: N = 115; 38% male (n = 44); Age M = 30.6, SD = 10.4, range (19-68)
Finnish: N = 111; 17% male (n = 19); Age M = 28.7, SD = 7.5, range (20-58)
Serbian: N = 122; 40% male (n = 49); Age M = 33.9, SD = 13.0, range (18-67)
See also Appendix Table A1 (available online) for an overview.
Design and Materials
The study employed a 6 (Material set) × 2 (stereotype consistent vs. stereotype inconsistent) × 2 (negation vs. affirmation) within-participants design. The method is a conceptual replication of Study 2 in Beukeboom et al. (2010) with (largely) new stimulus materials. The main dependent variable was the perceived appropriateness of behavior descriptions with negations and affirmations.
Participants were presented with 24 sentences from 6 material sets, each describing an actor’s behavior that was either stereotype consistent or inconsistent. Each set consisted of two actors described with a social-category label (e.g., student; housewife) and two behaviors (e.g., leaves the dishes in the sink for a week; washes the dishes immediately after dinner). One behavior was stereotype consistent for Actor 1 but inconsistent for Actor 2, whereas, for the second (mirrored) behavior, this was reversed. Each set of actors and behaviors allowed for creating four sentences, two of which were stereotype consistent and two of which were stereotype inconsistent, by varying combinations of actor and behavior in the sentence. Each sentence, in turn, was associated with two relevant trait descriptions (one negation and one affirmation, e.g., She is not tidy; She is messy or She is not messy; She is tidy). Traits used in the negation and affirmation descriptions were antonyms, such that the two relevant descriptions of each sentence provided semantically comparable descriptions of the behavior. Note that in this design, actors, behaviors, and the judged descriptions were identical across stereotype-consistent and inconsistent conditions.
Material sets were selected after a pilot test on Hungarian (N = 39) and Dutch (N = 28) student samples testing 10 potential sets, partly based on the original Beukeboom et al. (2010) materials. Participants judged the actor–behavior sentences on unexpectedness and actor–trait associations using the same items as in the manipulation checks reported below. We selected sets in which—in both the Dutch and Hungarian samples—stereotype-inconsistent actor–behavior sentences were evaluated as significantly more unexpected than stereotype-consistent actor–behavior sentences, for both actors. Also, in all selected sets, actor–trait associations were higher for stereotype-consistent (vs. inconsistent) pairings. We assumed that the results of the Dutch and Hungarian pilot test would also apply to the other language samples; this was confirmed in the manipulation checks of the actual experiment. See Appendix Tables A2 and A3 for selected sets and manipulation check scores per item, for all five language samples.
All instructions and materials were translated from English into the other four languages by native speakers of each language (see https://osf.io/tvre4/ for all materials and the appendix with additional data and analyses).
Procedure
The questionnaire first explained that the study focused on behavior descriptions, and that participants would be presented with sentences describing a person’s behavior followed by two possible descriptions of that behavior. Participants were asked to indicate how appropriate they found each description to describe the behavior in the sentence. Subsequently, the sentences were presented (e.g., The professor scores high on the IQ test), each followed by two possible descriptions on the same page: a negation description (e.g., He is not stupid) and an affirmation description below (e.g., He is smart). Participants indicated how appropriate they found each description on a 7-point scale ranging from 1 = very inappropriate to 7 = very appropriate. Sentences were presented in a mixed random order: a random sentence from Set 1 was followed by a random sentence from Set 2, and so forth till Set 6, and again starting with a remaining sentence from Set 1, until all 24 sentences were presented.
Next, we measured our manipulation check variable expectedness of the actor’s behavior in the sentence. The behavior sentences were presented once more, with order randomized in the same manner as above, and participants indicated how expected they found the behavior for the person in the sentence on 7-point scales ranging from −3 = very unexpected, to +3 = very expected (recoded to 1-7 in results).
Next, we measured actor–trait associations. This variable is not included in the analyses, but results are reported in the Appendix, Table A3, as additional information about the stimuli. Participants were asked to rate the degree to which they associated the persons they just saw in the sentences with certain characteristics. This was asked for the two actors and two traits of each material set, again in mixed randomized order. For each of the four combinations per set, participants were asked to which degree they in general associated actors (e.g., professors) with the trait (e.g., smart), and answered on scales ranging from 1 = not at all to 7 = very much.
Finally, we asked participants to judge entitativity of each category 4 and to report demographics.
Results
Manipulation Checks
In each language sample, we conducted a repeated-measures analysis of variance (ANOVA) on the mean rated expectedness of the stereotype-consistent and stereotype-inconsistent behavior sentences across the six material sets. We observed significant effects of stereotype consistency on mean-rated expectedness in
Dutch F(1, 233) = 1799.25, p < .001, η2p = .89,
English F(1, 110) = 600.53, p < .001, η2p = .85,
Hungarian F(1, 114) = 626.43, p < .001, η2p = .85,
Finnish F(1, 110) = 816.41, p < .001, η2p = .88, and
Serbian F(1, 121) = 515.07, p < .001, η2p = .81.
In all language samples, participants rated the stereotype-consistent sentences as significantly more expected (MDutch = 5.59, SD = 0.55; MEnglish = 5.56, SD = 0.58; MHungarian = 5.56, SD = 0.70; MFinnish = 5.44, SD = 0.51; MSerbian = 5.56, SD = 0.68) than the stereotype-inconsistent sentences (MDutch = 2.74, SD = 0.61; MEnglish = 2.99, SD = 0.69; MHungarian = 2.82, SD = 0.77, MFinnish = 2.97, SD = 0.58; MSerbian = 2.90, SD = 0.79).
In all language samples, these differences were also significant (p < .001) for all six material sets separately. This confirms that our manipulation was successful. See Appendix Tables A2 and A3 for detailed results per sentence and actor–trait associations in each set.
Appropriateness of Negation and Affirmation Behavior Descriptions
The Negation Bias predicts that the use of negations is perceived as relatively more appropriate in descriptions of stereotype-inconsistent (vs. stereotype-consistent) behaviors. To test this hypothesis, we computed the means of perceived appropriateness for the affirmation and negation descriptions for both the stereotype-inconsistent and consistent behaviors. This resulted in four scores which were included in a 2 (stereotype consistency of behavior: inconsistent vs. consistent) × 2 (description type: affirmation vs. negation) ANOVA with repeated measures on both factors, which we employed in all language samples separately.
First, in line with the original study (Beukeboom et al., 2010, Study 2), in all language samples, we found a main effect of description type,
Dutch F(1, 233) = 28.10, p < .001, η2p = .11,
English F(1, 110) = 32.81, p < .001, η2p = .23,
Hungarian F(1, 114) = 101.46, p < .001, η2p = .47,
Finnish F(1, 110) = 37.35, p < .001, η2p = .25, and
Serbian F(1, 121) = 49.70, p < .001, η2p = .29.
In all language samples, the use of affirmation descriptions was rated as significantly more appropriate (MDutch = 4.86, SE = 0.06; MEnglish = 5.34, SE = 0.07; MHungarian = 4.89, SE = 0.08; MFinnish = 5.05, SE = 0.08; MSerbian = 4.90, SE = 0.08) than the use of negation descriptions (MDutch = 4.64, SE = 0.06; MEnglish = 4.92, SE = 0.08; MHungarian = 4.15, SE = 0.10; MFinnish = 4.54, SE = 0.10; MSerbian = 4.34, SE = 0.08).
We also observed small significant main effects of stereotype consistency in Dutch, Hungarian, Finnish, and Serbian samples, but not in the English sample.
Dutch F(1, 233) = 9.32, p = .003, η2p = .04,
English F (1,110) = 2.50, p = .12, η2p = .02,
Hungarian F(1, 114) = 7.80, p = .006, η2p = .06,
Finnish F(1, 110) = 9.25, p = .003, η2p = .08, and
Serbian F(1, 121) = 3.97, p = .05, η2p = .03.
This peripheral finding indicates that descriptions of stereotype-inconsistent behavior sentences were judged as somewhat more appropriate (MDutch = 4.80, SE = 0.06; MEnglish = 5.16, SE = 0.07; MHungarian = 4.57, SE = 0.09; MFinnish = 4.84, SE = 0.08; MSerbiant = 4.67, SE = 0.08) than descriptions of stereotype-consistent behaviors (MDutch = 4.71, SE = 0.06; MEmglish = 5.10, SE = 0.07; MHungarian = 4.47, SE = 0.08; MFinnish = 4.75, SE = 0.08; MSerbian = 4.57, SE = 0.07). Note that this finding stands in contrast to the stereotype consistency bias (Bratanova & Kashima, 2014) discussed above. It might be the case that stereotype-inconsistent (compared with consistent) information is perceived as more appropriate because this information is new/unexpected and therefore more informative, in line with Grice’s (1975) conversational maxims.
More relevant for the Negation Bias, however, is the interaction between stereotype consistency and description type. This interaction was significant for
Dutch F(1, 233) = 4.35, p = .04, η2p = .02,
English F(1, 110) = 6.14, p = .01, η2p = .05,
Hungarian, F(1, 114) = 50.24, p < .001, η2p = .31, and
Serbian, F(1, 121) = 28.70, p < .001, η2p = .19,
and marginally significant for
Finnish, F(1, 110) = 3.61, p = .06, η2p = .03.
In line with the Negation Bias, pairwise comparisons with Bonferroni corrections (see Table 1) showed that, in all language samples, the use of negations was perceived as significantly more appropriate in descriptions of stereotype-inconsistent (vs. stereotype-consistent) behaviors. The affirmation descriptions only differed significantly in the Hungarian and Serbian samples, where the use of affirmations was judged as more appropriate in descriptions of stereotype-consistent compared to inconsistent behaviors, in line with the original study.
Means (and Standard Deviations) of Judged Appropriateness of Negation and Affirmation Descriptions as a Function of Stereotype Consistency (Consistent, Inconsistent) of Described Behavior, in Dutch, English, Hungarian, Finnish, and Serbian language samples.
Note. Means in rows with different superscript within negations (a, b) and affirmations (x, y) description types are significantly different according to Bonferroni-adjusted pairwise comparisons with a certainty of p < .02 in English data, and p < .005 in all other languages.
Alternative Analyses
The hypothesis tests mentioned above follow the methods for statistical testing in the original article (Beukeboom et al., 2010, Study 2). This test generalized across material sets and participants. In linguistics, scholars recommend to analyze similar designs with mixed-effects modelling with random crossed effects for stimulus sets and participants (Baayen, Davidson, & Bates, 2008). This analysis is advantageous, because it not only distinguishes fixed effects related to the independent variables, but also takes random effects due to (random) variation between different participants and different stimulus sets into account. To apply this procedure, we used R (R Core Team, 2017), with R packages lme4 (1.1.21) for using linear mixed effects models (Bates, Maechler, Bolker, & Walker, 2015) and sjPlot (2.6.3) for interpreting the results (Lüdecke, 2017). The sjPlot package includes several established techniques for computing p values and model fit. For p values we used the conditional F tests with Kenward–Roger approximation, and the model fit is reported as the marginal and conditional R2, using the method proposed by Nakagawa, Johnson, and Schielzeth (2017).
We first transposed the datasets of the five language samples such that each actor–behavior sentence became a case, leading to 24 cases per participant (6 stimulus sets × 4 sentences). Next, we estimated mixed-effects models with stereotype consistency as fixed factor, and random intercepts for participants and stimulus sets thus correcting for potential differences between stimulus sets and individual participants. See Appendix Tables A5-A9, first and third columns (black font) and Figure 1 for plots for each language sample. Results of this more precise analysis confirm the previously presented findings. We find a negative direct effect of stereotype consistency on the appropriateness of negation use in all five languages, indicating that negations are more appropriate to describe stereotype-inconsistent (vs. consistent) behaviors. This confirms the Negation Bias and shows the findings are not due to the type of analyses conducted.

Fixed effects of stereotype consistency (inconsistent vs. consistent) on the judged appropriateness (y-axis) of negation and affirmation descriptions in the five language samples (full models in Appendix Tables A5-A9).
With regard to affirmations, we find a direct effect of stereotype consistency on the appropriateness of affirmation use for two languages (Hungarian, Serbian), indicating that affirmations are judged as more appropriate in description of stereotype-consistent (vs. inconsistent) situations. For the other three languages, we found no direct effect of consistency on the appropriateness of affirmations.
The same analyses also allowed us to further explore one hypothesized explanatory variable of the effect of stereotype consistency on appropriateness of negation (and affirmation) use; perceived expectedness of the actor’s behavior. These exploratory analyses (see Appendix for a detailed description) provide evidence that perceived expectedness of the actor’s behavior is an explanatory variable in predicting the appropriateness of negation (and affirmation) descriptions in Hungarian and Serbian, but not in the other language samples. This suggests there are other explanatory variables which will be discussed in the next section.
General Discussion and Conclusion
The findings corroborate the Negation Bias in five different languages. In all language samples, negation use was rated as more appropriate in descriptions of stereotype-inconsistent (vs. stereotype-consistent) behavior. Specifically, to describe the same behaviors (e.g., washes the dishes immediately after dinner), the use of negation descriptions (e.g., not messy) was judged as more appropriate when the actor was stereotypically unexpected to show this behavior (e.g., student) compared with when the actor was expected to show this behavior (e.g., housewife). These findings were confirmed both by the original statistical tests using repeated measures ANOVA (Beukeboom et al., 2010, Study 2) and advantageous mixed effects models correcting for random effects between participants and stimulus sets. This provides confirming evidence, using new experimental materials, for the notion that negations function to communicate that an actor’s behavior is stereotype inconsistent. The fact that these findings replicate across two Germanic languages (Dutch, English), two Uralic languages (Finnish, Hungarian) and one Slavic language (Serbian) suggests that the Negation Bias serves a similar function in the transmission and maintenance of social stereotypes within these language families.
A number of findings are noteworthy in comparison with the original study.
First, the effect sizes were smaller than in the original study and also varied considerably between the five language samples. The effect size of the stereotype consistency × description type interaction on description appropriateness was large in the original study (ηp2 = .46), compared with the present Hungarian (ηp2 = .31), Serbian (ηp2 = .19), English (ηp2 = .05), Finnish (ηp2 = .03), and Dutch (ηp2 = .02) samples. An important reason is that the interaction in the original study is for a large part driven by differences in affirmations (i.e., more appropriate for stereotype-consistent vs. inconsistent). Note that affirmations are not part of the Negation-Bias hypothesis which focuses on the different use of negations, but are included in the design to allow comparing negation appropriateness in stereotype consistent and inconsistent situations while keeping actors, behaviors, and the judged descriptions the same across conditions. Nevertheless, the judged appropriateness of affirmations is interesting in itself. In the present study, we only observed significant effects for affirmations in the Hungarian and Serbian sample, and in all samples the difference is more pronounced for negations (See Tables 1, A5-A9 and Figure 1 for a visualization).
These differences may be due to the fact that the present studies used different materials than the original. Overall (across six sets) the effect sizes of the manipulation check (i.e., judgements of expectedness of behaviors in sentences) were large and comparable to the original study, but there obviously were differences in the strength of stereotypic expectancies between material sets within the different language samples. We aimed to manipulate more or less universal stereotypes in our materials, but such differences between cultural groups are unavoidable. Still, the fact that the findings were replicated in all language samples and were confirmed in our alternative analyses using a mixed effects model which corrected for potential differences between material sets and individual participants, substantiates the robustness of the negation bias effect.
Another difference is that, in the present study, data were collected in online experimental surveys rather than laboratory environments as in the original study. Although sample sizes in the present studies are much larger than in the original study, it is very well possible that a part of these online participants was relatively less focused on the questionnaire. The noise this introduces may also explain the lower effect sizes.
Finally, it is interesting to consider the underlying process of the Negation Bias effect. The increased use of negations to describe stereotype-inconsistent events may be driven by two explanatory variables that could function similarly across languages. The first relates to work in the field of pragmatics that argues that negations function to indicate something that is different, unusual, or contrary to an existing (implicit) presupposition or expectancy that is relevant in context (e.g., Jordan, 1998; Moxey & Sanford, 2000; Wason, 1965). In communication about categorized individuals, negations then function to denote an exception to implicitly activated stereotypes. This means that the use of negations should be likelier to the extent that a target person’s behavior is more unexpected. This explanation does not pertain to the use of affirmations. In an additional exploratory analysis (see Appendix), we checked for the relation between expectedness of the described behavior and appropriateness of negation use and found mixed results (Appendix Tables A5-A9, second and fourth columns; gray font). As expected, and in line with the original study, for Hungarian and Serbian expectedness negatively predicted negation appropriateness. For Dutch, English, and Finnish, expectedness was unlike the (different type of) additional analyses in the original study, not related to negation appropriateness. This suggests that other explanatory variables besides expectedness play a role in driving the Negation Bias.
One second probable explanatory variable relates to work within the social-cognition field showing that social categorization automatically activates associated stereotypes (Devine, 1989; Dijksterhuis & Van Knippenberg, 1996; Fiske, 1998; Lepore & Brown, 1997). A categorization, for instance by means of a label, should cognitively activate stereotype consistent terms, and inhibit inconsistent terms, which makes their use in person descriptions likelier. This means that the more strongly an activated category is associated with a given concept (e.g., junkies–deceitful), the likelier it will be used when describing a categorized target. This should both result in an increased use of negations in stereotype inconsistent situations and an increased use of affirmations in stereotype consistent situations. To test for the effect of this explanatory variable a measure of actor–trait associations is needed. We did measure actor–trait associations in our material sets, and these associations were in general very strong (see Table A3). Yet the nature of the current design (only 2 actor–traits scores per set and 4 actor–behavior sentences) prevents a proper mediation analysis using this variable (see Appendix).
All in all, the evidence for the underlying mechanism of the Negation Bias (both in the original and the present studies) is not conclusive. It seems likely that the biased use of negations may be driven by a combination of the above mechanisms. Future research might be able to further explore this and also test whether actor–trait associations (the social cognition explanation) can explain the differences in effects on affirmations we observed in the different language samples.
Implications and Future Research
The increased use of negations to describe stereotype-inconsistent behaviors has several implications for stereotype maintenance. First, in line with their pragmatic function, negations imply to message recipients that the described target’s behavior is unexpected, an exception to the rule, and likelier caused by situational circumstances than by enduring dispositional factors (Beukeboom et al. 2010, Study 4). By describing information using a negation, people also convey a mitigated, more neutral version of the described event (Giora, Fein, Ganzi, & Alkeslassy Levi, 2005). For instance, using not smart conveys a less negative impression than stating stupid because the former introduces a positive concept. Likewise, not stupid conveys a less positive impression than smart by introducing a negative concept. Moreover, negations introduce stereotype-consistent concepts in communication about stereotype-inconsistent information, and thereby communicate what was expected instead. Thus, by describing stereotype-inconsistent behavior with negations rather than affirmations, the behavior is framed as unexpected, transient, more neutral in valence, while re-affirming existing associations with a category. The Negation Bias thereby functions to share and maintain social-category stereotypes (Beukeboom & Burgers, 2019). Interestingly, the present study suggests that this mechanism generalizes across various languages from different language families. Future studies might explore whether the Negation Bias also extends to non-European languages like Chinese, Arabic, and Hindi.
A number of boundary conditions are important. First, it should be noted that, although all languages have elements to express negation (Horn, 1989; Tian & Breheny, 2016), various grammatical forms to express negations exist (Verhagen, 2005), and these forms may differ between languages. The Negation Bias focuses on syntactic negations, which are expressed by means of a separate particle NOT (e.g., not smart) and can be found in most languages. It may work differently, however, for morphological negations, which are expressed by prefix (e.g., un- as in unhappy) or suffix (e.g., -less as in hopeless). It has been argued that syntactic negations (“The clown is not happy”) are processed in two steps, with recipients first activating the negated concept (happy) followed by the negation (not), thereby activating the stereotypic inference that clowns are usually happy (e.g., Fraenkel & Schul, 2008; Giora et al, 2005; Mayo et al., 2004). In contrast, morphological negations (e.g., “The clown is unhappy”) are typically processed in one step, with addressees immediately coming to the intended meaning of “unhappy” without first activating the negated concept (“happy”; e.g., Mayo, Schul, & Burnstein, 2004). This could imply that the stereotype maintaining effect of the Negation Bias would primarily function for syntactic (but not for morphological) negations. Yet this may be different for different languages. Future research might take these different types of formulations into account and also further address recipient inferences (e.g., Beukeboom et al., 2010, Study 4) in addition to negation versus affirmation production.
Along with other linguistic variations, negations are an important linguistic means by which stereotypes are interpersonally shared and maintained. The present studies show that the Negation Bias functions similarly in various languages, just like it has for instance been shown for the Linguistic Intergroup Bias (Maass, 1999). Research into these processes is important, because awareness of the subtle linguistic ways through which people share potentially harmful stereotypes is needed for any attempt to prevent or correct them.
Supplemental Material
e_Negation_bias_replication_DEF_APPENDIX_manipulation_checks_and_analyses – Supplemental material for The Negation Bias in Stereotype Maintenance: A Replication in Five Languages
Supplemental material, e_Negation_bias_replication_DEF_APPENDIX_manipulation_checks_and_analyses for The Negation Bias in Stereotype Maintenance: A Replication in Five Languages by Camiel J. Beukeboom, Christian Burgers, Zsolt P. Szabó, Slavica Cvejic, Jan-Erik M. Lönnqvist and Kasper Welbers in Journal of Language and Social Psychology
Footnotes
Acknowledgements
The authors would like to thank various anonymous reviewers for their valuable feedback on earlier drafts of this article.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
Notes
Author Biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
