Gender Stereotypes in Natural Language: Word Embeddings Show Robust Consistency Across Child and Adult Language Corpora of More Than 65 Million Words

Abstract

Stereotypes are associations between social groups and semantic attributes that are widely shared within societies. The spoken and written language of a society affords a unique way to measure the magnitude and prevalence of these widely shared collective representations. Here, we used word embeddings to systematically quantify gender stereotypes in language corpora that are unprecedented in size (65+ million words) and scope (child and adult conversations, books, movies, TV). Across corpora, gender stereotypes emerged consistently and robustly for both theoretically selected stereotypes (e.g., work–home) and comprehensive lists of more than 600 personality traits and more than 300 occupations. Despite underlying differences across language corpora (e.g., time periods, formats, age groups), results revealed the pervasiveness of gender stereotypes in every corpus. Using gender stereotypes as the focal issue, we unite 19th-century theories of collective representations and 21st-century evidence on implicit social cognition to understand the subtle yet persistent presence of collective representations in language.

Keywords

collective representations gender stereotypes machine learning natural-language processing word embeddings open data open materials

Get full access to this article

View all access options for this article.

References

Abele

A. E.

Uchronski

Suitner

Wojciszke

(2008). Towards an operationalization of the fundamental dimensions of agency and communion: Trait content ratings in five countries considering valence and frequency of word occurrence. European Journal of Social Psychology, 38, 1202–1217. doi:10.1002/ejsp.575

Bailey

A. H.

LaFrance

Dovidio

J. F.

(2019). Is man the measure of all things? A social cognitive account of androcentrism. Personality and Social Psychology Review, 23, 307–331. doi:10.1177/1088868318782848

Beyer

(2018). Low awareness of occupational segregation and the gender pay gap: No changes over a 16-year span. Current Psychology, 37, 373–389. doi:10.1007/s12144-016-9521-4

Caliskan

Bryson

J. J.

Narayanan

(2017). Semantics derived automatically from language corpora contain human-like biases. Science, 356, 183–186. doi:10.1126/science.aal4230

Charlesworth

T. E. S.

Banaji

M. R.

(2019). Gender in science, technology, engineering, and mathematics: Issues, causes, solutions. The Journal of Neuroscience, 39, 7228–7243. doi:10.1523/jneurosci.0475-18.2019

Croft

Schmader

Block

Baron

A. S.

(2014). The second shift reflected in the second generation: Do parents’ gender roles at home predict children’s aspirations? Psychological Science, 25, 1418–1428. doi:10.1177/0956797614533968

Cvencek

Meltzoff

A. N.

Greenwald

A. G.

(2011). Math-gender stereotypes in elementary school children. Child Development, 82, 766–779. doi:10.1111/j.1467-8624.2010.01529.x

DeFranza

Mishra

(2020). How language shapes prejudice against women: An examination across 45 world languages. Journal of Personality and Social Psychology, 119, 7–22. doi:10.1037/pspa0000188

Dunham

Baron

A. S.

Banaji

M. R.

(2016). The development of implicit gender attitudes. Developmental Science, 19, 781–789. doi:10.1111/desc.12321

10.

Durkheim

(2009). Sociology and philosophy. New York, NY: Taylor & Francis (Original work published 1898).

11.

Eagly

A. H.

Mladinic

(1994). Are people prejudiced against women? Some answers from research on attitudes, gender stereotypes, and judgments of competence. European Review of Social Psychology, 5, 1–35. doi:10.1080/14792779543000002

12.

Eagly

A. H.

Nater

Miller

D. I.

Kaufmann

Sczesny

(2020). Gender stereotypes have changed: A cross-temporal meta-analysis of U.S. public opinion polls from 1946 to 2018. American Psychologist, 75, 301–315. doi:10.1037/amp0000494

13.

Eagly

A. H.

Wood

(2012). Social role theory. In Van Lange

P. A. M.

Kruglanski

A. W.

Higgins

E. T.

(Eds.), Handbook of theories of social psychology (pp. 458–476). Thousand Oaks, CA: SAGE.

14.

Ellemers

(2018). Gender stereotypes. Annual Review of Psychology, 69, 275–298. doi:10.1146/annurev-psych-122216-011719

15.

Ethayarajh

Duvenaud

Hirst

(2020). Understanding undesirable word embedding associations. In Korhonen

Traum

Màrquez

(Eds.), Proceedings of the 57th annual meeting of the Association for Computational Linguistics (pp. 1696–1705). Stroudsburg, PA: Association for Computational Linguistics. doi:10.18653/v1/p19-1166

16.

Fiske

S. T.

Cuddy

A. J. C.

Glick

(2002). A model of (often mixed) stereotype content: Competence and warmth respectively follow from perceived status and competition. Journal of Personality and Social Psychology, 82, 878–902. doi:10.1037//0022-3514.82.6.878

17.

Garg

Schiebinger

Jurafsky

Zou

(2018). Word embeddings quantify 100 years of gender and ethnic stereotypes. Proceedings of the National Academy of Sciences, USA, 115, E3635–E3644. doi:10.1073/pnas.1720347115

18.

Gaucher

Friesen

Kay

A. C.

(2011). Evidence that gendered wording in job advertisements exists and sustains gender inequality. Journal of Personality and Social Psychology, 101, 109–128. doi:10.1037/a0022530

19.

Godfrey

Holliman

(1993). Switchboard-1 Release 2 (Catalog No. LDC97S62). Retrieved from https://catalog.ldc.upenn.edu/LDC97S62

20.

Günther

Rinaldi

Marelli

(2019). Vector-space models of semantic representation from a cognitive perspective: A discussion of common misconceptions. Perspectives on Psychological Science, 14, 1006–1033. doi:10.1177/1745691619861372

21.

Hill

Bordes

Chopra

Weston

(2016, May). The Goldilocks principle: Reading children’s books with explicit memory representations. Paper presented at the 4th International Conference on Learning Representations (ICLR 2016), San Juan, Puerto Rico. Retrieved from https://arxiv.org/abs/1511.02301

22.

Koenig

A. M.

Eagly

A. H.

(2014). Evidence for the social role theory of stereotype content: Observations of groups’ roles shape stereotypes. Journal of Personality and Social Psychology, 107, 371–392. doi:10.1037/a0037215

23.

Kurdi

Mann

T. C.

Charlesworth

T. E. S.

Banaji

M. R.

(2019). The relationship between implicit intergroup attitudes and beliefs. Proceedings of the National Academy of Sciences, USA, 116, 5862–5871. doi:10.1073/pnas.1820240116

24.

Lewis

Lupyan

(2020). Gender stereotypes are reflected in the distributional structure of 25 languages. Nature Human Behaviour, 4, 1021–1028. doi:10.1038/s41562-020-0918-6

25.

Liben

L. S.

Bigler

R. S.

Krogh

H. R.

(2002). Language at work: Children’s gendered interpretations of occupational titles. Child Development, 73, 810–828. doi:10.1111/1467-8624.00440

26.

MacWhinney

(2000). The CHILDES project: Tools for analyzing talk (3rd ed.). Mahwah, NJ: Erlbaum.

27.

Martin

C. L.

Ruble

D. N.

(2010). Patterns of gender development. Annual Review of Psychology, 61, 353–381. doi:10.1146/annurev.psych.093008.100511

28.

Mikolov

Grave

Bojanowski

Puhrsch

Joulin

(2018). Advances in pre-training distributed word representations. In Calzolari

(Ed.), Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018) (pp. 52–55). Retrieved from https://www.aclweb.org/anthology/L18-1008.pdf.

29.

Miller

D. I.

Nolla

K. M.

Eagly

A. H.

Uttal

D. H.

(2018). The development of children’s gender-science stereotypes: A meta-analysis of 5 decades of U.S. Draw-A-Scientist studies. Child Development, 89, 1943–1955. doi:10.1111/cdev.13039

30.

Moscovici

(1988). Notes towards a description of social representations. European Journal of Social Psychology, 18, 211–250. doi:10.1002/ejsp.2420180303

31.

Moscovici

(2000). Social representations: Explorations in social psychology. Cambridge, England: Polity Press.

32.

Nosek

B. A.

Smyth

F. L.

Hansen

J. J.

Devos

Lindner

N. M.

Ranganath

K. A.

. . . Banaji

M. R.

(2007). Pervasiveness and correlates of implicit attitudes and stereotypes. European Review of Social Psychology, 18, 36–88. doi:10.1080/10463280701489053

33.

Payne

B. K.

Vuletich

H. A.

Lundberg

K. B.

(2017). The bias of crowds: How implicit bias bridges personal and systemic prejudice. Psychological Inquiry, 28, 233–248. doi:10.1080/1047840X.2017.1335568

34.

Peabody

(1987). Selecting representative trait adjectives. Journal of Personality and Social Psychology, 52, 59–71. doi:10.1037/0022-3514.52.1.59

35.

Pennington

Socher

Manning

C. D.

(2014). GloVe: Global vectors for word representation. In Pang

Daelemans

(Chairs), Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 1532–1543). doi:10.3115/v1/D14-1162

36.

Powlishta

K. K.

(1995). Gender bias in children’s perceptions of personality traits. Sex Roles, 32, 17–28. doi:10.1007/BF01544755

37.

Rhodes

Leslie

S.-J.

Yee

K. M.

Saunders

(2019). Subtle linguistic cues increase girls’ engagement in science. Psychological Science, 30, 455–466. doi:10.1177/0956797618823670

38.

Schwarzer

(2020). Package ‘meta’: General package for meta-analysis. Retrieved from https://cran.r-project.org/web/packages/meta/meta.pdf

39.

U.S. Bureau of Labor Statistics. (1998). Labor force statistics from the current population survey: 1995–1999 annual averages - household data - tables from employment and earnings (Table 10). Retrieved from https://www.bls.gov/cps/cps_aa1995_1999.htm

40.

U.S. Bureau of Labor Statistics. (2019). American time use survey—2019 results (Table A-1). Retrieved from www.bls.gov/tus/a1-2019.pdf

41.

Williams

J. E.

Bennett

S. M.

(1975). The definition of sex stereotypes via the adjective check list. Sex Roles, 1, 327–337. doi:10.1007/BF00287224

42.

The World Bank. (2020). Labor force participation rate, female (% of female population ages 15-64) (modeled ILO estimate). Retrieved from https://data.worldbank.org/indicator/SL.TLF.ACTI.FE.ZS

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.37 MB