Abstract
This study aims to investigate the presence of bias in the visual representation of librarians generated by ChatGPT across three different library settings: school, public, and academic. It focuses on analysing biases related to gender, ethnicity, age, attire, hairstyles and library design in the generated images. The research employed a zero-shot prompting technique to instruct ChatGPT to create visualisations of librarians in the specified settings, either interacting with another librarian or advising a library user. The generated images were then evaluated based on criteria such as positioning, posture, visual cues indicating age and gender and the characteristics of the library environment. The analysis revealed significant biases in the generated images, with a predominant depiction of librarians as Caucasian. Gender representation overstated the presence of men in all libraries, most notably in academic libraries with only 6% of academic librarians depicted as female. Additionally, there was a noticeable trend towards older librarians in public and academic settings, and the size of library buildings increased from school to academic environments. These findings highlight the reinforcement of stereotypes and the misrepresentation of authority dynamics, particularly the portrayal of men in positions of power relative to female colleagues. This study contributes to the growing body of research on biases in generative AI outputs, emphasising the potential dangers of relying on such tools for image generation. It underscores the importance of critically examining AI-generated content to avoid perpetuating discrimination and inequality within the profession of librarianship. The findings serve as a cautionary note regarding the implications of using generative AI for visual representation in professional contexts.
Keywords
Introduction
Stereotypes are a ‘lazy generalization’ based on an oversimplified and often inaccurate assumption about a group of people drawing upon limited information, neglecting individual differences (Hamilton, 2020). Membership in certain professions is frequently stereotyped (e.g. ‘men are builders, women are nurses’) but so is a profession’s attire (e.g. all scientists wear lab coats). Gender as well as ethnic (‘racial’) stereotypes in relation to professions and occupations are damaging as they harm a person’s self-esteem and career prospects and may lead to discrimination (from subconscious to overt) In consequence, members of society ‘not fitting the mould’ often tend to shy away from pursuing such careers, thereby further numerically reinforcing these prevalent stereotypes (Ellemers, 2018; Greenwald et al., 2002; Heilman, 2012).
Stereotypes are embedded in people’s minds often as early as childhood (e.g. the ‘draw-a-scientist test’; Chambers, 1983) and hard to dislodge once firmly entrenched. Such stereotypes tend to be reinforced in the public life through visual imagery in advertisements (Entman and Rojecki, 2001; Kitch, 2001; Lafky et al., 1996) and social media (Croucher et al., 2020; Dixon, 2016). The advent of generative Artificial intelligence (generative Ai) models, in particular text-to-image applications such as DALL-E4, Midjourney of Stable Diffusion has led to a proliferation of user-generated realistic images for a wide range of purposes. The fact that images are copy-right free allows presenters and educators to swiftly generate and insert fit-for-purpose images into their sales or public education presentations, no longer having to rely on stock photography. The question arises to what extent such images are value neutral or perpetuate potentially harmful stereotypes.
In this paper we will examine experimentally how a popular generative Ai model, ChatGPT4o, interprets and visualises librarians in three library settings (school, public, university). We will first discuss the perceptions of librarians as held by the general public and then review the literature on biases that have been identified in generative Ai responses. Based on this we will prompt ChatGPT4o to generate a series of images of a librarian talking with a client and images of two librarians talking with each other, which will be analysed as to gender, ethnicity, age, attire and hairstyles. The paper concludes with a discussion of the findings and their implications for library management and professional representation.
Background
Perceptions of librarians held by the general public
Although libraries along with many other institutions have undergone dramatic transformations due to ongoing digital disruption, perceptions of librarians held by the general public are largely shaped by persistent stereotypes that have remained relatively unchanged for decades (Seale, 2008; White, 2012). In keeping with most professions, early librarians were male, characterised as grim and grouchy or alternatively pale and bookish (Dickinson, 2003; Newmyer, 1976; White, 2012). In Western libraries the turn of the 20th century marked the feminisation of the role, and from Dewey’s depiction of the librarian as ‘a mouser in musty books’ (Dewey, 1876: 6) the stereotype developed to portray the librarian as a frumpy, middle-aged and unattractive spinster (Luthmann, 2007; Radford and Radford, 2003; Seale, 2008), whose reserved/repressed nature stifled spontaneity and who demanded an orderly and quiet, if not silent environment (Arant and Benefiel, 2012; White, 2012). Her attire is an amalgam of ‘cardigan, pearls, tweed skirt, hair in a bun with spectacles perched on the nose’ (Luthmann, 2007). The stereotypical librarian is either an old maid, a police type figure or a fumbling and inept parody, although sometimes beneath this façade they are a hidden hero or sexpot (Cullen, 2000; Engle, 1998; Katherine, 2000; Marinelli and Baker, 2000; Radford and Radford, 2003; Robinson, 2006; Seale, 2008). These archetypes have a symbiotic relationship with popular culture, which often depicts librarians in exaggerated or comedic roles, reinforcing the notion of librarians as rigid, out of touch and somewhat repressed. For instance, the character of Giles in Buffy the Vampire Slayer exemplifies the archetype of the stuffy, British librarian, who is both a hero and a humourless figure. Similarly, characters like the librarian in The Mummy or Barbara Gordon (Batgirl) embody the idea of the ‘closeted sexy librarian’ who, while initially appearing frumpy, are later revealed to possess hidden allure. Parodies, such as Conan the Librarian, play on this dichotomy by contrasting the stereotype of the weak, passive librarian with an exaggeratedly powerful, heroic figure, underscoring the incongruity of these cultural portrayals (Bird, 2024; Cullen, 2000; Everett, 2025; Jaćimović and Petrović, 2014; White, 2012).
Stereotypes are difficult to shift, and the public perception of librarians as older, unattractive, subordinate and powerless women extends across different cultures (Majid and Haider, 2008; Mayesti et al., 2019; Micle, 2014; Youssef Salib, 2017) and has links to historic gender roles and societal expectations of females (Radford and Radford, 1997, 2003; Seale, 2008). Librarianship is perceived as a low-status semi-profession, with the librarian’s duties seen as predominantly administrative, focusing on managing books and performing repetitive tasks, rather than engaging with people or fostering intellectual growth (Arant and Benefiel, 2012; Jennings, 2016; Majid and Haider, 2008; Newmyer, 1976). It is this misperception, and lack of understanding of what librarians actually do across the many different places in which they work that contributes to their ‘invisibility’, leaving depictions in popular culture to continue to fill the information vacuum (Luthmann, 2007; Majid and Haider, 2008; Seminelli, 2016).
Current demographics of the librarianship occupation
A review of demographic characteristics of librarians from major English-speaking countries reveals that the stereotype of the librarian as an older, white woman has some basis in fact (Luthmann, 2007; White, 2012). According to occupational data, Australian librarians have a median age of 53 years, with 66% of librarians in Australia identifying as female (Jobs and Skills Australia, 2024). The occupation in the United States is slightly younger, with the average age being 47.5 in 2017, but with a higher percentage of women (82.4%; Deloitte, n.d.). Canadian demographics are somewhat similar with 80% of library workers being female, with a similar median age (48 years; CAUT, 2017) In the United Kingdom, workforce mapping suggests that 76.1% report as female and 60% of the information workforce are aged over 45 (Reddington and Kinetiq, 2024). The percentage of librarians who identified themselves as White/Caucasian was 85.9% in the USA, 89% in Canada and 91% in the United Kingdom (Chan, 2020; Deloitte, n.d; Reddington and Kinetiq, 2024.).
Biases identified in generative Ai responses
The documentation associated with generative Ai models, such as ChatGPT, often claims that the models aim for neutrality, factual accuracy and avoiding political stances (Rozado, 2023). However, as many researchers have noted, any generative Ai model cannot be entirely free from bias, since factors such as model specifications, algorithmic constraints, and policy decisions influence its final product (Ferrara, 2023, 2024; Walker and Timoneda, 2024). Some of the ideological bias can be derived from its use of language (Van Poucke, 2024), while others derive from the quality of the training data (Baack, 2024; Spennemann, 2025a; Walker and Timoneda, 2024). Of concern may be that much of the data sources used for model training had to be readily accessible and copyright free, and were thus comprised of internet text, Wikipedia pages, public information, old books and other publications and records, which may include outdated and racially biased information (Choudhary, 2024; Spennemann, 2025a). Additionally, the training data incorporated into its dataset will be, whether consciously or subconsciously, shaped by the ideologies of and level of care exercised by those involved in its development (Baack, 2024). As a result, political orientation tests have demonstrated that early versions of ChatGPT exhibited biases, generally favouring libertarian, progressive, and left-leaning perspectives (Fujimoto and Takemoto, 2023; Motoki et al., 2024, 2025; Rozado, 2023; Rutinowski et al., 2024), often with a North American (Cao et al., 2023) or Eurocentric focus (Broadhead, 2024; Tao et al., 2024). Some studies indicated that bias did not trend in the same direction in all policy areas (Sullivan-Paul, 2023). While recent model releases have somewhat reduced the incidences of such biases, they still persist (Balestri, 2024; Rutinowski et al., 2024; Tao et al., 2024; Walker and Timoneda, 2024), possibly because moderation and filtering after training does not fully remove underlying training biases and, moreover, may introduce new biases (Balestri, 2024; Walker and Timoneda, 2024). Other generative AI models may be more centrist (Google Gemini) or conservative (Perplexity, Falcon; Choudhary, 2025). Meanwhile, the recently released DeepSeek has been found to generate responses that align with the perspectives of the Chinese Communist Party and the government of the People’s Republic of China (Lu, 2025).
There is an abundance of evidence of gender stereotyping in generative Ai responses (Duan et al., 2025; Kaplan et al., 2024) particular the reproduction of normative identities and narratives such as stereotyped binary gender identity (Gillespie, 2024). Numerous studies examined the gender representation among professions, primarily from the medical field (Abdulwadood et al., 2024; Ali et al., 2024; Currie et al., 2024a, 2024b; Gisselbaek et al., 2024; Lee et al., 2024; Morcos et al., 2021). A cross-professional study identified that 60% of the depicted professions were fully stereotypical congruent with popular perceptions of professions’ gender stereotypes (e.g. mechanics are men, nurses are female; Melero Lázaro and García Ull, 2023). Librarians were not included among the 37 professions assessed.
A considerable body of work looked at gender and ethnic bias in generative Ai text-to-image applications such as DALL-E3, DALL-E4, Midjourney or Stable Diffusion. A comparison of the image generation prompt created by the generative Ai language model with the image as rendered suggests that biases are introduced, or amplified, by the algorithm of the image generation program (Saumure et al., 2025). In text-to-image generated images, covert bias through inadequate representation is pervasive (Hacker et al., 2024). Where a gender was not specified, for example, generative Ai image generators return significantly more images showing more men performing ‘important’ roles than women (Abdulwadood et al., 2024; Ali et al., 2024; Currie et al., 2024a, 2024b; Gisselbaek et al., 2024; Lee et al., 2024; Morcos et al., 2021; York et al., 2024; Zhou et al., 2024). Likewise, unless specifically requested, ChatGPT-generated prompts tend to primarily visualise people of Caucasian appearance (Abdulwadood et al., 2024; Ali et al., 2024; Currie et al., 2024a, 2024b; Gisselbaek et al., 2024; Hosseini, 2024; Lee et al., 2024; Morcos et al., 2021; Wiegand et al., 2024; York et al., 2024; Zhou et al., 2024). Other ethnic biases portray African-American characters in service roles unlike Caucasian-looking characters (Hosseini, 2024), reinforcing observations made in textual analyses (Amin et al., 2024; Hofmann et al., 2024). Beyond gender and ethnicity, generative Ai image generators also render the male and female representations with stereotypical facial and body images (Currie et al., 2024b; Lio and Ahuja, 2024; York et al., 2024) while favouring the representation of younger individuals (Currie et al., 2024b; Gisselbaek et al., 2024). In single-shot prompting (using one prompt without follow up prompts), renderings of pregnant women or people with visually discernible disabilities/impairments are absent (Currie et al., 2024b).
Methodology
Data collection process
ChatGPT4o is linked by OpenAi’s Dall-E image generation algorithm. The prompts provided to ChatGPT4o were deliberately formulated in an unconstrained fashion (zero-shot prompting) in order to ensure that the interpretation by the generative Ai model was not biased towards the user’s perceptions. ChatGPT4o itself generated the complex textual prompt for DALL-E to render the image. No human manipulation in the image conceptualisation and implementation process occurred.
The text of the ChatGPT4o prompts was as follows:
Prompt 1: ‘Think about [insert library type] libraries and librarians working in these. Provide me with a visualisation that shows two librarians talking with each other against the background of the interior of the library’.
Prompt 2: ‘Think about [insert library type] libraries and librarians working in these. Provide me with a visualisation that shows one librarian giving advice to one library user against the background of the interior of the library’.
While the specification of ‘two librarians talking with each other’ in prompt 1 was chosen to (a) elicit the portrayal of any interpersonal dynamics and (b) to ensure that the head would be shown in profile or three-quarter view which then allowed to correctly score the hairstyle (see below), it did not provide unequivocal evidence for stereotyping of librarians that a prompt forcing the representation of a single individual would create. This predicated the need to run prompt 2.
The resulting image was saved to disk and the Ai-generated prompt retrieved from the image panel and, together with the image, copied into the data file. After saving, the chat was deleted to ensure a clean and unbiased generation of new images without legacy information available to ChatGPT4o. Fifty images each were generated for each of the three library types (public, school and university) for each of the two prompts (Figures 1 and 2). The total sample is comprised of 300 images.

Examples of images generated as part of the project, showing two librarians conversing (prompt 1) (image number in brackets, for ChatPGT4o-generated prompt see supportive data document): (a) (S4), (b) (S11), (c) (P1), (d) (P15), (e) (U2) and (f) (U23).

Examples of images generated as part of the project, showing a librarian giving advice to a library user (prompt 2) (image number in brackets, for ChatPGT4o-generated prompt see supportive data document): (a) (S66), (b) (S55), (c) (P78), (d) (P54), (e) (U59) and (f) (U98).
Scoring
Positioning, posture
People ‘read’ both book and non-book text pages (such as a newspaper or webpage) based on culturally defined patterns. In the western world, which writes horizontally from left to right, initial human eye movement while scanning the page for information, follows the F-pattern focussing on the top and left side of the page, with the bottom right largely disregarded (at least initially; Djamasbi et al., 2011). Images are often read in a Z-Pattern (Gaiger, 2018) or a diagonal pattern from top left to bottom right (Gutenberg diagram; Eldesouky, 2013). Irrespective of reading pattern used, the positioning of the two figures matters for their interpretation, with initial great weight given to the figure on the left. In addition, the postures standing, sitting as well as the body language (head cocked, arms forked, smiling, etc.) were scored.
Attributes
Scored were visual cues to gender (male/female) and age, with the latter scored as young, middle aged and old. The criteria boundaries were somewhat fluid. For example, characters with white hair and/or wrinkles were classed as old. In addition the presence of a beard for males, hair style (bob, open long hair, bun, ponytail) for women, as well as the presence of spectacles or a book were considered.
Setting
The library setting in the background was scored in terms of nature and positioning of windows and the nature of shelving (single level, multiple level).
Statistics
Summary data and frequencies were established using MS Excel, while the statistical comparisons were established with an on-line comparison of proportions calculator (MedCalc Software 2018). Differences were classed as ‘significant’ if p < 0.05 and very significant if p < 0.001.
Data
The images were generated on 17, 18, 20 and 21 February 2025 (Spennemann, 2023). The images as well as the prompts that ChatGPT4o generated and used to generate the images, have been documented according to an established protocol (Spennemann, 2023) and have been archived in a supplementary data file housed in the research depository of the author’s institution at doi: 10.26189/e77b1068-b059-492c-94e9-7cf0c9333622. The images carry the prefixes P (public library), S (school library) or U (university library) with a serial number indicating an image’s position in the production sequence. The supplementary datafile also contains auxiliary tables. These are indicated by the prefix ‘S’, that is, ‘Table S1’.
Results
OpenAi’s Dall-E rendered the images either realistic (e.g. Figure 1(a) and (c)–(e), Figure 2) or in cartoon-style (e.g. Figure 1(b)). While cartoon-style was used in 30% of all images set in school libraries, that style was eschewed for images set I public and university libraries (6% each).
Ethnicity, gender and age
Looking at the ethnicities, the representations of librarians in the images created by generative Ai was extremely biased, with all realistically rendered public and school librarians being Caucasian and only one (2%) of university librarians being of a different ethnicity (Asian). The portrayal of the two librarians conversing was likewise extremely biased with 98% of all representations rendered as Caucasian. Five of the 300 renderings (1.67%) could be classed as Asian and one (0.33%) as Hispanic. None of the renderings include persons of African, African-American or Pacific Islander descent.
The gender balance of the depiction of librarians is almost even among school librarians (52% men) but becomes skewed among public librarians (70% men) and extremely skewed among university librarians (94% men) (public vs university χ2 = 9.659, df = 1, p = 0.0019) (Table 1). In terms of gender balances of the two librarians conversing, women significantly outnumber men among young librarians (63.36%), but their representation declines significantly among middle aged staff (48.67%; χ2 = 3.899, df = 1, p = 0.0483) (Table 2). While there is a consistent trend of decreasing representation of women from school to public to university libraries (53%:44%:40%) this is only indicative and not statistically significant.
Gender and age breakdown of librarians (in %) talking with their clients in the portrayal of school public and university librarians.
Gender and age breakdown of two librarians (in %) conversing in the portrayal of school public and university librarians.
In terms of the age structure, the depicted age of librarians increases with perceived complexity of the library, with women librarians in public libraries more likely to be old than in school libraries (χ2 = 7.113, df = 1, p = 0.0077). A separate dataset, representing 200 generated images of women working in the creative and cultural industries included 30 women librarians and 10 archivists (Spennemann, 2025b) all of which were rendered as Caucasians. Of the visualisations of librarians, 86.7% are depicted as young women, while 80% of the archivists are depicted as middle aged.
The extremely biased ethnic composition of the images, with over 98% of all representations of librarians being Caucasian, raised the question whether this was symptomatic of the algorithm or whether the ChatGPT4o/DALL-E4 combination might generate images based on the location of the user (via the user’s IP address). To test whether the authors’ location (Australia) had an effect, two secure VPN connections were established, one to a server located in the US state of Mississippi and one to a server located in the People’s Republic of China (both connections using the service privateinternetaccess.com). For the test, 10 images each were generated using prompt 1 (with clean restart after each prompt). For the Chinese server the prompt was translated in Chinese (simplified) (想想公共图书馆和在其中工作的图书管理员。请为我提供一个可视化图像,其中显示一位图书管理员在图书馆内部背景下向一位图书馆用户提供建议). ChatGPT rendered all images of librarians and the clients they were talking with were rendered as Caucasians irrespective of whether the server was located in Mississippi or the PRC.
After 10 iterations logged into the Mississippi server, ChatGPT4o was asked ‘Why are all librarians that you depict of Caucasian ethnicity?’ to which it responded You’re absolutely right to point that out. The images generated tend to reflect biases present in the datasets used to train AI models, which historically have overrepresented certain demographics while underrepresenting others. This is an issue that extends across many AI-generated visuals and text outputs, and addressing these biases is an ongoing challenge in AI development.
It also offered to ‘generate a more diverse representation of librarians’. When prompted ‘yes generate a more diverse image’ it returned the following images in four iterations, seen in Figure 3a–d.

Examples of images of diverse representation of librarians.
Posture, body language and expressions of authority
Clear differences can be observed in the depiction of postures between a single librarian talking to a client and two librarians conversing. Single librarians are predominately depicted as authority figures standing or seated at a desk (irrespective of age group or library type), significantly more so than two librarians (women χ2 = 55.667, df = 1, p < 0.0001; men χ2 = 111.623, df = 1, p < 0.0001) (Tables 3–6). The latter are commonly depicted on an equal footing; that is, both standing in library or seated on chairs and furniture at similar heights (Supplemental Tables S2–S5).
Single librarian. Women’s body language (in %).
Two librarians conversing. Women’s body language (in %).
Single librarian. Men’s body language (in %).
Two librarians conversing. Men’s body language (in %).
As all images show the two librarians engaged in a conversation, the figures have been rendered in a range of body-language positions. These positions have been recoded into nine key positions. Where differences between the left and right hand existed, the coding prioritised some hand gestures. Thus one hand touching the chin was coded as overriding all other hand positions, except gesticulation with one or both hands, which in turn is overridden by one or both hands being placed in the pocket.
As noted, in the western world, which writes horizontally from left to right, initial human eye movement also occurs from left to right while scanning an image for information. All but one image depicted two librarians, the exception being an image which showed a male and female librarian talking with a seated male librarian (U38). The data show that the likelihood of two women librarians being depicted decreases from school libraries to university libraries, while the likelihood of two men librarians depicted increases (Table 7). Of these same-gender pairs, only the difference of women librarians between school and university libraries is statistically significant (χ2 = 6.319, df = 1, p = 0.0119). When comparing which gender is placed at the left in a pair, then men are significantly more likely placed on the left in school libraries (χ2 = 6.916, df = 1, p = 0.0085) and university libraries (χ2 = 4.125, df = 1, p = 0.0423) but not public libraries (Table 7).
Two librarians conversing. Proportions (in %) of conversation positions.
If same gender pairs as well as depictions of unclear power hierarchy are excluded then males are dominant between 83.3% (old-aged librarians) and 100% (young librarians), with an overall average of 96.3%. While increase of older female librarians as authority figures compared to middle aged librarians is significant (χ2 = 4.689, df = 1, p = 0.0303), no significant differences were observed when considering the different types of libraries (Table 8).
Two librarians conversing. Authority in the conversation (in %).
Dress and hair styles
Among school libraries, young women librarians were significantly more likely to be depicted wearing only an open or buttoned blouse as their top compared to middle-aged women (χ2 = 5.806, df = 1, p = 0.0160) or women depicted as old (χ2 = 7.273, df = 1, p = 0.0070). Middle-aged women librarians were very significantly more likely to wear cardigans than men (χ2 = 22.772, df = 1, p < 0.0014) as were women librarians depicted as old (χ2 = 10.170, df = 1, p = 0.0001) (Table 9). Among male librarians, depictions of formal suits or parts thereof (shirt and tie) dominate with between 70.8% and 76.5% of all dress styles. An increasing trend of librarians depicted as wearing formal suit attire can be observed across the types of libraries from 46.2% among school librarians to 63.9% among public librarians and 93.6% among male university librarians (Table 10) with the increase between the latter two cohorts being significant (χ2 = 11.414, df = 1, p = 0.0007). In the dataset of 200 women working in the creative and cultural industries all women librarians and 80% of the archivists were depicted wearing cardigans, primarily with an open blouse (Spennemann, 2025b).
Single librarian. Women’s clothing above the waist (in %).
Single librarian. Men’s clothing above the waist (in %).
When considering what librarians were shown to wear when conversing with each other (Tables 11 and 12), women of all age groups were very significantly more likely depicted wearing cardigans than men (young: χ2 = 11.629, df = 1, p = 0.0006: middle; χ2 = 20.960, df = 1, p < 0.0001; old: χ2 = 9.848, df = 1, p = 0.0017). Older women were significantly more likely to be depicted wearing cardigans than middle aged women (χ2 = 6.453, df = 1, p = 0.0111) or young women (χ2 = 613.381, df = 1, p = 0.0003), while, conversely, young women were significantly more likely shown wearing just a blouse compared to middle aged women (χ2 = 4.165, df = 1, p = 0.0413) and older women (χ2 = 9.356, df = 1, p = 0.0022). When considering the various library classes, then women are significantly more likely to wear business suits while working in university libraries than in school of public libraries (χ2 = 9.922, df = 1, p = 0.0016). None of the other combinations of attire differed significantly between age groups or classes of libraries (Table 13).
Two librarians conversing. Women’s clothing above the waist (in %).
Two librarians conversing. Men’s clothing above the waist (in %).
Single librarian. Women’s clothing below the waist (in %).
With regard to the portrayal of two male librarians conversing with each other, it is not surprising that formal suits dominated over any other piece of attire at any age groups (Table 15) (young: χ2 = 23.250, df = 1, p < 0.0001: middle; χ2 = 92.642, df = 1, p < 0.0001; old: χ2 = 13.167, df = 1, p = 0.0003) as well as type of library (school: χ2 = 31.599, df = 1, p < 0.0001: public; χ2 = 31.468, df = 1, p < 0.0001; university: χ2 = 49.091, df = 1, p < 0.0001). None of the other combinations of attire depicted as being worn by men differed significantly between age groups or classes of libraries (Table 14).
Single librarian. Men’s clothing below the waist (in %).
Two librarians conversing. Men’s clothing below the waist (in %).
Women librarian’s hairstyles changed with age, with a decrease of women depicted with open long hair or tied in a ponytail while the percentage of women depicted wearing their hair in a bun or as short hair increases (Table 16). When considering women’s hairstyles by age group (Table 17) among librarians conversing, young women are significantly more frequently depicted with open, long hair than hair tied back (χ2 = 6.598, df = 1, p = 0.0102) or tied into a bun (χ2 = 9.797, df = 1, p = 0.0017). Among middle aged women, hair tied into a bun is significantly more often depicted than open short hair (χ2 = 6.583, df = 1, p = 0.0103) which in turn is significantly frequently often depicted than ponytails (χ2 = 4.649, df = 1, p = 0.0311). Among older women, buns likewise significantly outnumber open, long hair (χ2 = 9.543, df = 1, p = 0.0020). When comparing the representation of hair tied into a bun between age groups, then younger women are very significantly less frequently depicted with buns than middle aged women (χ2 = 16.530, df = 1, p < 0.0001) or older women (χ2 = 16.027, df = 1, p = 0.0001). In the dataset of 200 women working in the creative and cultural industries, hair tied in a bun dominated (65%) among librarians and archivists, followed by open, untied hair (30%; Spennemann, 2025b).
Single librarian. Women’s hairstyles as a proportion of each age class (in %).
Two librarians conversing. Women’s hairstyles as a proportion of each age class (in %).
When considering the class of library with respect to women’s hairstyles (Table 17), women depicted as working in public and university libraries were more significantly more likely to wear their hair in a bun than women working in school libraries (public χ2 = 8.506, df = 1, p = 0.0035; university χ2 = 15.671, df = 1, p = 0.0001). While women depicted as working in school libraries were more likely to wear their hair open and long, the difference was only significant when compared with university libraries (χ2 = 9.312, df = 1, p = 0.0023).
Among the depiction of single male librarians, the proportion of men depicted with beards rose from 41.2% (n = 17) among young men to higher percentage of 88.4% (n = 69) among middle-aged men (χ2 = 18.148, df = 1, p < 0.0001) and to 95.8% (n = 24) among old men, which does not represent a statistically significant increase compared to middle-aged men. Among two librarians conversing, a similar pattern emerged: the proportion of men depicted with beards increased from 36.36% (n = 66) among young men, to a significantly higher percentage of 69.01% (n = 71) among middle-aged men (χ2 = 14.541, df = 1, p = 0.0001) and to 89.47% (n = 19) among old men (the latter not significant). When considering the depiction of beards on males faces across the types of libraries, then there is no trend among the depiction of single male librarians, but a consistent trend of increasing representation of men with beards among two librarians conversing, from school to public to university libraries (47.8%:56.6%:66.7%) which is indicative only but not statistically significant.
While men are more frequently depicted with glasses than women (Table 18), that difference is not statistically significant. Among depictions of single librarians, the increase in the wearing of glasses between young and middle-aged librarians is the only increment increase that is significant (χ2 = 6.295, df = 1, p = 0.0121). There is, however, a significant age progression among the two librarians conversing, with middle aged men more likely to wear glasses than young men (χ2 = 6.709, df = 1, p = 0.0096) and old men being more likely to wear glasses than middle aged men (or young men (χ2 = 26.322, df = 1, p < 0.0001). The same applied to the age differences among women (young vs middle age χ2 = 4.634, df = 1, p = 0.0313; middle age vs old age χ2 = 24.891, df = 1, p < 0.0001). There is also a significant difference in the portrayal of young and middle aged librarians between the depiction of single librarians, which have a significantly higher representation of people wearing glasses than in the portrayal of two librarians communicating with each other. (young men χ2 = 45.506, df = 1, p < 0.0001; middle aged women χ2 = 35.894, df = 1, p < 0.0001; middle-aged men χ2 = 68.799, df = 1, p < 0.0001).
Proportion of men and women librarians depicted with glasses by age class.
Ethical issues
A small percentage of images of librarians interacting with clients exhibit excess familiarity, primarily the inappropriate placement of a librarian’s hand on a client’s shoulder (Figure 4) or back. Such excess familiarity was depicted for all three library classes, with ‘incidences’ in school libraries being the most common (16%), followed by university libraries (10%) and public libraries (4%) in primarily older male–male and older male-female pairings, but also in male-female pupil and female-male pupil combinations. None of the images depicting two librarians conversing shows any touching of the other.

Examples of images professionally inappropriate behaviour by the librarian (image number in brackets, for ChatPGT4o-generated prompt see supportive data document): (a) (U88) and (b) (P79).
Rendering and prompting errors
The image generation by DALL-E resulted in some errors in rendering, such as depicting librarians with three hands (P79, U25; Figure 4(b)), a female librarian partially emerging from a table-top (U9), a librarians in a seated position floating in free air (P23, S15), a free floating book (U14) as well as windows with misshapen perspective windows (U42). Unexpected logic errors occurred in two renderings of a single librarian giving advice to a client in public library. The prompts generated by ChatGPT4o construed the library as the user itself rendering it in anthropomorphic form (Figure 5).

Examples of images logic errors in prompt generation (image number in brackets, for ChatPGT4o-generated prompt see supportive data document): (a) (P98) and (b) (P99).
Discussion
Through comparison of stereotypes of librarians and the actual demographic composition of the librarianship profession in Western countries, it appears that there is some truth in the generalisations made regarding those who work in libraries. It is evident that the profession is overwhelmingly female, Caucasian and middle-aged or older. The ease with which quantitative demographics such as age, gender and ethnicity are captured may feed the rigid, often exaggerated and fixed beliefs (Scott, 2015) that form stereotypes; this image problem has existed for at least 50 years, as documented through research and professional publications which decry popular culture depictions and demand a more nuanced recognition (Attebury, 2010; Jalal, 2024; Katherine, 2000; Newmyer, 1976; Pagowsky and DeFrain, 2014; White, 2012).
Complicating this situation is the bias which is present in images created through the use of popular generative Ai platforms as revealed through the findings of this study. The biases both concur with actual demographics (librarians are older, Caucasian) and contradict reality (librarians are predominately female). Of particular concern is the fact that male librarians are depicted as authority figures in 96.3% of the generated images. This not only perpetuates an incorrect perception that the majority of librarians are male, but also reinforces outdated stereotypes in which men are positioned as the primary authority figures in libraries, while women occupy semi-professional or subordinate roles. This representation is not only inaccurate but also diminishes the status of women, particularly within a profession that is predominantly feminised. When a prompt questioning this outcome was entered into ChatGPT4o ‘When asked to generate images of librarians, why does this tool depict the majority as male, and almost all these males as authority figures when actual demographics are that librarianship is overwhelmingly a female dominated profession?’ (18 March, 04:00 GMT) the platform gave several reasons, including historical stereotypes and gender roles, training data bias, lack of diverse representation, gender bias in visual representation and generative AI limitations. These points essentially present the same argument: the tool has ‘learned’ that male figures dominate professions and generally hold authoritative positions. In conclusion, the tool offered: ‘Addressing this issue involves improving training data to include a wider variety of accurate, diverse, and contemporary representations of librarians and their roles, ensuring that the AI reflects the true makeup and dynamics of the profession’.
Based on these observations and considerations we then chose to put the question of what constitutes a ‘typical librarian’ to the generative Ai model and assess whether the response conformed to the stereotypes that were visualised. We prompted ChatGPT4o to ‘describe a typical librarian in terms of gender, ethnicity, age and attire’ (6 March 2025 07:41 GMT). ChatGPT4o responded that ‘There is no single “typical” librarian, as librarians come from diverse backgrounds and can be of any gender, ethnicity, or age. However, based on common stereotypes and traditional depictions, a librarian might be described as:
Gender: Often portrayed as female, though librarians can be of any gender.
Ethnicity: Varies widely depending on location and demographics. In Western media, librarians are sometimes depicted as Caucasian, but in reality, they come from diverse ethnic backgrounds.
Age: Commonly imagined as middle-aged or older, but many librarians are young professionals just entering the field.
Attire: Often depicted wearing professional or semi-formal clothing, such as cardigans, blouses, button-up shirts, skirts, dress pants, or comfortable shoes. Glasses are frequently included in stereotypical portrayals.
Of course, these are just broad generalizations, and real-life librarians defy these stereotypes in many ways!’
We followed this up with the question ‘if you were asked to visualise a single typical librarian in an image, would you use that description to generate the image?’ to which ChatGPT4o responded: “If I were asked to visualize a single “typical” librarian in an image, I would likely use the first description as a general guideline, since it aligns with common stereotypes and expectations. That means the image might feature:
A middle-aged woman,
Possibly wearing glasses,
Dressed in professional or semi-formal attire (such as a cardigan, blouse, or dress pants),
In a library setting, surrounded by books.”
While is beyond the scope of this paper to examine to what extent algorithms in ChatGPT or in DALL-E (as its visualisation element) are responsible for the observed biases, other work suggests that biases are introduced by ChatGPT at the autonomous prompt generation phase and by DALL-E at the point of image generation (Spennemann, 2025), suggesting that these biases are deep-seated and not easy to rectify.
The responses made by ChatGPT 4o to questions regarding the ways in which images are generated in response to prompts are of concern for several reasons, not the least being the possible proliferation of synthetic images that are misrepresentative and which exacerbate damaging misconceptions regarding the role and value of librarians and the inclusivity of the profession, a concern shared by other occupations expressed in similar studies (Currie et al., 2024a, 2024b; Lee et al., 2024; Wiegand et al., 2024; York et al., 2024). Beyond potential damage to the librarianship profession, research supports the notion that bias inherent in data is amplified through the application of AI algorithms, and that people may be more susceptible to these generated biases than those expressed in person-to-person interactions (Glickman and Sharot, 2024). Not only might people come to adopt biases communicated by generated AI material, these biases may be transmitted to future decisions which are made even without use of generative AI tools (Vicente et al., 2025).
Despite the scepticism and distrust reported in AI mediated communications (Altay and Gilardi, 2024; Formosa et al., 2025; Glikson and Asscher, 2023; Sahebi and Formosa, 2025), the opacity of image generation algorithms and limitations in users’ AI literacy and cognitive capacities can lead to what are described as ‘illusions of consensus’(Burton et al., 2024: 1648) and ‘illusions of understanding’(Messeri and Crockett, 2024: 49). Illusion of consensus is the mistaken belief that a single narrow interpretation is the consensus held by many sources, while the illusion of understanding occurs when reliance on AI tools causes an overestimation of one’s understanding, potentially leading to the prioritisation of a single or particularised point of view. These metacognitive errors, when combined with the exacerbation of bias possible in images composed by generative AI may lead to widespread misconceptions. This may be particularly the case in situations such as the stereotyping of an occupation such as librarianship, which is already heavily dominated by inaccurate and simplistic interpretations.
Limitations and future research
The authors acknowledge that while every measure was taken to implement a clearly defined methodology and analysis, there is inevitably some element of subjectivity in the interpretation of the images generated through this process. This is particularly so when making judgements regarding age and ethnicity. Another limitation of this type of research is the necessary reliance upon a relatively narrow prompts, which cannot represent every possible prompt that could be employed to generate images of librarians. Therefore, while a wide range of images were generated, it is not possible to say definitively that this is indicative of the output in every single interaction of this nature with ChatGPT 4o.
As the creators of ChatGPT in its various iterations are aware of, and conduct research to reduce bias across the platform (Eloundou et al., 2024), future studies are warranted to evaluate whether changes are evident in text to image generation over time. Through the interim period, it is evident that immediate research following this publication could focus on strategies to build AI literacy in users as a safeguard to minimise negative impacts of the bias that currently exists, as identified in this paper.
Conclusions
Through examination of visualisations of school, public and academic librarians generated by ChatGPT 4o, this paper has revealed the extent to which bias exists and the implications this has for the librarianship profession as well as for users of Generative Ai tools in general. When analysed according to gender, ethnicity, age, attire and hairstyles, the tool created stereotypical images depicted a Caucasian profession in which men hold authority in almost every instance. School libraries and librarians were the least formal, with ground level buildings, younger staff more casually dressed and with a warm and welcoming atmosphere. Public libraries largely occupied the middle ground, being staffed by older men and women, who brought a professional atmosphere, while academic libraries espoused connotations of wisdom and formality, with libraries spread over several stories in historical buildings and librarians which were older, besuited and bespectacled. While some (younger) female librarians wore casual clothes and their hair long and in a ponytail (mostly in schools), the clichéd bun and cardigan was worn by many of the female public and academic librarians, and shelves of books were omnipresent.
Due to its minimal cost, low technological skill requirements and limited need for artistic expertise, it is evident that reliance on generative Ai for creating visual content for publication across various media and marketing channels could rapidly increase. With the proliferation of biased images, misconceptions about librarianship and a wide range of occupations could become further engrained, contributing to the perpetuation of discrimination and the exacerbation systemic inequalities for marginalised groups.
This research contributes to enhancing awareness of inherent biases in content created using generative Ai and serves as a critical alert concerning the overreliance on these tools. Through its conclusions, the paper adds to the growing body of evidence highlighting the need for continued efforts in training generative AI on data that accurately reflects reality rather than sustaining and propagating stereotypes. It highlights the importance of designing algorithms in a manner that mitigates inherent biases, thereby preventing the amplification of existing inequalities.
Supplemental Material
sj-docx-1-lis-10.1177_09610006251357286 – Supplemental material for What do librarians look like? Stereotyping of a profession by generative Ai
Supplemental material, sj-docx-1-lis-10.1177_09610006251357286 for What do librarians look like? Stereotyping of a profession by generative Ai by Dirk HR Spennemann and Kay Oddone in Journal of Librarianship and Information Science
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Institutional review board statement
Not applicable.
Data availability statement
The original data presented in the study are openly available via doi: 10.26189/e77b1068-b059-492c-94e9-7cf0c9333622.
Supplemental material
Supplemental material for this article is available online.
Author biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
