Sage Journals: Discover world-class research

Abstract

Perceptual biases surrounding the credibility of female sports reporters has been a robust area of research for decades, taking on many methodological forms. One vein of this research has been experimental designs where reporter sex is systematically manipulated to examine impacts on perceived credibility. Despite remarkable similarities in overall study design, findings from these studies have been mixed, variably demonstrating biases against female reporters, in favor of female reporters, or no biases at all. This paper reports results of a systematic review of this literature, highlighting differences in stimuli (e.g., medium, visual prominence of reporters), theoretical mechanisms, and measures employed in order to illuminate possible reasons for these varied findings.

Keywords

gender (biological) sex credibility sports journalism experiments

Exploration of the challenges and biases faced by female sports reporters has been a vibrant stream of scholarship in the field for decades. Researchers have probed potential biases against female reporters using multiple methodological approaches, including surveys of journalists (Hardin & Shain, 2006) and editors (Laucella et al., 2016), interviews with female sports journalists (Cramer, 1994), ethnographic (Genovese, 2015) and netnographic observation (Demir & Ayhan, 2022), content analysis of media coverage (Boczek et al., 2022; Eastman & Billings, 2000), and more.

In addition to these, one consistent vein of research in this area over the past three decades is experimental designs that seek to establish theoretically supported cause-effect relationships in these biases (e.g., Charitat & Cianfrone, 2023; Ordman & Zillmann, 1994). In some respects, these experimental studies bear striking similarities. Quite often a between-subjects manipulation is affected such that nearly identical stimuli are presented between groups of participants, where the sex of the reporter/author/commentator is manipulated. In some studies, these manipulations are somewhat subtle (e.g., a change in byline and reporter photograph; Mudrick & Lin, 2017), whereas in others the manipulations are more pronounced (e.g., female versus male reporter appearing onscreen; Greer & Jones, 2012). Samples of research participants are asked to read/watch/listen to the stimulus and then evaluate the reporter along some salient criteria, typically some assessment of credibility, knowledgeability, authoritativeness, or similar.

Despite this general uniformity in study design, outcomes from these studies are quite varied. For example, Etling and Young’s (2007) experiment found a bias against female reporters such that they were rated as less authoritative than male sports reporters. A few years later, Baiocchi-Wagner and Behm-Morawitz (2010) found no difference between female and male reporters in rated credibility. Later Greer and Jones (2012) report results indicating that female reporters were rated as more competent than males. Thus, broadly generalizing what is known about biases regarding female sports reporters from these seemingly similar experimental studies is limited, potentially inhibiting the accumulation of knowledge that is a hallmark of “normal science” (Kuhn, 1996, p. 10). With this in mind, the purpose of this paper is to systematically review this specific form of empirical research examining biases against female sports reporters in order to illuminate differences across study outcomes, and then identify major characteristics of these studies that might help explain the assortment of findings (i.e., differences in study stimuli, measures, theoretical foundations).

Literature Review

As previously noted, a wealth of scholarship has examined the challenges that women have faced in sports journalism and broadcasting, illuminating longstanding biases in both the volume and nature of sports coverage (Schoch, 2020), challenges faced by female sports reporters both in the newsroom (Harris & Bowes, 2025) and with audiences (Johnson et al., 2023). The total body of literature on the topic is vast, and given the attention allocated to the subject, one might expect some consistency across the literature. Indeed, some consistency can be found. Consider the programmatic studies that examine newsroom practices and lived experiences of reporters (Hardin & Shain, 2005, 2006; Hardin & Whiteside, 2009), or over-time content analysis of sports media (Cooky et al., 2013, 2015, 2021) in order to explore these biases. However, studies employing experimental designs over the past 30 years in order to establish cause/effect relationships have yielded less consistency.

Normal Science

“Normal science…is a highly cumulative enterprise, eminently successful in its aim, the steady extension of the scope and precision of scientific knowledge.”

(Kuhn, 1996, p. 52)

Systematic progression of the body of knowledge is a defining characteristic of science. In the opening pages of his landmark essay on The Structure of Scientific Revolutions, Kuhn (1996) repeatedly notes that scientific advancement is a “piecemeal process” (p. 1), an “incremental process” (p. 2) and a “cumulative process” (p. 3). In times of normal science, which Kuhn argues constitutes the bulk of scientific discovery, scholars operate on a shared foundation of previous work. Within this shared framework, science progresses through “miniscule” (p. 24) advancements, and the work of normal science is to advance the body of knowledge slowly but additively. By Kuhn’s view, in pursuing normal science, we add detail to our further understanding of a phenomenon. The goal is not so much to branch off into new directions, but instead to better map known terrain and gain a more granular understanding of current territory.

Kuhn’s conception and definition of paradigms, crisis, and scientific revolution are not adopted here in a strict sense. Instead, they are invoked as a useful framework for examining the research employing a specific methodological approach to a somewhat unified body of literature probing a generally singular question—what is the impact of a reporter’s biological sex on audience perceptions of his/her perceived credibility? To the extent that experimental studies examining the effect of reporter sex on audience perceptions are executed remarkably uniformly, they reflect a shared paradigm of sorts. Similarities in general study design, measurement, and execution unite research in this area, and in this sense, they somewhat reflect Kuhn’s view of scientific inquiry operating within a shared paradigm during periods of normal science. Despite these surface-level consistencies in design, findings generated from these studies have been less uniform, inhibiting the forward progression of what is known about the relationship between reporter sex and audience perceptions. To illuminate this inconsistency, this study systematically examined the constellation of published articles that reflect the general properties described earlier in the introduction: (a) experimental studies that (b) manipulated reporter biological sex (c) in order to examine impacts on credibility perceptions.

Method

Sample

To identify all published studies that meet these criteria, an exhaustive search was conducted using the following parameters:

• An advanced search of multiple EBSCO databases (Academic Search Complete, Communication Source, and SPORTDiscuss) was conducted for various permutations of “reporter,” “sex,” “gender,” “experiment,” and “sport.”

• Similar searches were conducted via Google Scholar.

• Once articles were identified, reference lists within each article were also consulted for possible works to include. Furthermore, subsequent articles citing each identified source were reviewed using the above search terms. This process was repeated as additional works were identified.

This process yielded a final sample of 20 articles¹. Articles within the sample were published between 1994 and 2025 and appear in a variety of scholarly outlets. Four articles were in Communication & Sport and the Journal of Sports Media each. The International Journal of Sports Communication contained two of the sampled articles, as did Journalism & Mass Communication Quarterly. The remainder were from an assortment of journals.

Article Characteristics

Although not a formal content analysis in that assessments were not subjected to intercoder reliability, this review recorded manifest properties of these studies that require little judgment or interpretation but instead reflect mere clerical documentation (e.g., sample size; sport depicted). This included multiple study properties: the nature of manipulations employed in the study designs (e.g., reporter sex; athlete sex); sport(s) examined in each study; nature of stimuli (i.e., video, audio, print, etc.); nature and size of sample (i.e., undergraduate students, paid panels of research participants); theoretical framework(s) employed; measures employed; statistical tests employed; and findings. Results of this review are summarized in the table below and then discussed (Table 1).

Table 1.

Summary of Attributes of Experimental Studies Examining the Effect of Reporter Sex on Credibility

Authors	Theoretical Framework(s)	Sport	Stimuli	Medium	Sample	IVs	DVs	Covariates	Abbreviated results	Bias observed
Ordman and Zillmann (1994)	None.	Men’s college basketball; Women’s gymnastics	Magazine column with byline and reporter photograph; Radio broadcast.	Print; audio	Convenience sample of undergraduate students (N = 132)	Reporter sex; Sport/Medium (2)	Competence; Persuasiveness	Reporter attractiveness	Overall/main effect such that female reporters were rated as less competent.	Bias against female reporters.
Etling and Young (2007)	Male hegemony	Various; Baseball	Audio recording of “topical stories and scores” and an opinion piece on steroids in baseball.	Audio	Convenience sample of undergraduate students (N = 244)	Participant sex; Reporter sex; Sexist attitudes (Benson & Vincent, 1980; Swim et al., 1995)	Authoritativeness (McCroskey, 1966)	None	Female reporters were rated as less authoritative than males.	Bias against female reporters.
Etling and Young (2007)	Male hegemony	Various; Baseball		Audio	Convenience sample of undergraduate students (N = 244)		Authoritativeness (McCroskey, 1966)	None	Sexist attitudes negatively predicted perceptions of female sportscaster authoritativeness.	Bias against female reporters.
Baiocchi-Wagner and Behm-Morawitz (2010)	Social identity theory	Men’s and Women’s college basketball	Print news story comparing two teams with manipulation of author name and athletes’ sex.	Print (but tested via online survey)	Convenience sample of undergraduate students (N = 316)	Participant sex; Reporter sex; Athlete sex	Credibility (McCroskey, 1966); Reporter Persuasiveness	Sport fandom; Sport media use; News media credibility	No effect of reporter gender on credibility.	None.
Davis and Krawczyk (2010)	Stereotypes; media richness theory; Parasocial Interaction	Football	Audio recording, video recording, or audio/visual recording of news story about football delivered by female sportscaster.	Audio/Video	Convenience sample of undergraduate students (N = 112)	Participant sex; Presentation format (3); Level of attractiveness (3)	Attractiveness; Dynamism; Expertness; credibility	None	Did not manipulate reporter gender.	N/A
Davis and Krawczyk (2010)	Stereotypes; media richness theory; Parasocial Interaction	Football		Audio/Video	Convenience sample of undergraduate students (N = 112)		Attractiveness; Dynamism; Expertness; credibility	None	Attractiveness was generally positively correlated with competence, expertness, and trustworthiness.	N/A
Etling et al. (2011)	Hegemony; schema theory	Various; Baseball	Audio recording of “numerous current stories and scores.”	Audio	Convenience sample of undergraduate students (N = 290)	Participant sex; Reporter sex; Vocal Pitch (2)	Authoritativeness (McCroskey, 1966); Sexist attitudes (Benson & Vincent, 1980; Swim et al., 1995); Gender role orientation (Bem, 1981)	None	Female participants rated male reporters as more authoritative.	Contingent bias against female reporters among female participants.
Greer and Jones (2012)	Gender schema theory	College football; College volleyball	Video of sports commentator offering opinion. No competition footage.	Video	Convenience sample of undergraduate students (N = 172)	Reporter sex	Competency (McCroskey, 1966); Likability (Reysen, 2005)	Ethnicity; Sports participation; Time spent following sports	Overall/main effect such that female reporters were rated as more competent than male reporters, regardless of sport being discussed.	Bias in favor of females.
Mastro et al. (2012)	Communication accommodation theory; social norms; stereotypes	Hockey, football, gymnastics, Women’s basketball, tennis	Print news excerpts featuring a male or female reporter covering various sports.	Print	Convenience sample of students (N = 244)	Reporter sex; reporter race; sex stereotype of sport (Masculine/Feminine)	Expertise; contribution; Character; Likability.	Gender & Racial attitudes (Swim et al., 1995)	Reporter Race/Gender X sport interactions on rated expertise such that ratings generally aligned with race- and gender-based sport stereotypes. For stereotypically feminine sports (i.e., gymnastics, women’s basketball), differences were only observed within gymnastics and not basketball.	Contingent bias such that male reporters were generally rated as more expert for stereotypically male sports. For female sports, contingent bias was evident only for gymnastics and not women’s basketball.
Hahn and Cummins (2014)	Gender schema theory	Men’s & Women’s college basketball	Print Q&A story with basketball player featuring a picture of the reporter and athlete.	Print	Convenience sample of undergraduate students (N = 230)	Reporter sex; athlete sex; Attractiveness (2)	Credibility (Ohanian, 1990)	None	Did not report comparison of male/female reporters.	Contingent bias such that unattractive reporters of the opposite sex as the athlete were rated as most credible.
									Reporters covering male athletes were perceived as more credible regardless of participant sex.
									Unattractive reporters covering the opposite sex were seen as more credible.
Harris (2013)	Stereotypes	Men’s & Women’s college basketball	45s video excerpt from a college basketball game featuring sportscaster voiceover. Reporter not depicted/shown.	Video	Sample of 18-65 year olds recruited online (N = 249)	Reporter sex; athlete sex; Amount of sports watched (3)	Qualifications, dynamism (Berlo et al., 1969)	None	For men’s basketball, no gender effect was observed.	Contingent bias such that for women’s basketball, female reporters were rated as more qualified. This was particularly true among viewers who viewed the most sports.
Harris (2013)	Stereotypes	Men’s & Women’s college basketball		Video	Sample of 18-65 year olds recruited online (N = 249)	Reporter sex; athlete sex; Amount of sports watched (3)	Qualifications, dynamism (Berlo et al., 1969)	None	For women’s basketball, the author reports a marginally significant effect of reporter gender, with female reporters rated as more qualified than males. This effect was particularly pronounced for viewers who watched the most sports.
Mudrick et al. (2016)	Social role theory	Men’s professional basketball	4m video excerpt of a female and male sportscaster debating NBA topics.	Video	Paid online sample (N = 544)	Reporter sex	Credibility (Ohanian, 1990); Dynamism (Berlo et al., 1969); Attitude toward source (Bagozzi et al., 2001); Behavioral intentions/media consumption; Masculine gender role norms (Levant et al., 2007); Modern sexism instrument (Swim et al., 1995)	Sports viewing frequency (Davis & Krawczyk, 2010)	No difference in sportscaster credibility.	Contingent bias, such that sexist attitudes predicted decreased credibility of female sportscasters.
Mudrick et al. (2016)	Social role theory	Men’s professional basketball		Video	Paid online sample (N = 544)	Reporter sex		Sports viewing frequency (Davis & Krawczyk, 2010)	Endorsement of gender-role stereotypes and sexist attitudes predicted decreased credibility of female sportscasters.
Mudrick and Lin (2017)	Gender role theory; Source credibility theory	College football; College volleyball	Mock news story/game summary featuring an [un]attractive male or female sportscaster. Reporter photo and byline included.	Print	Convenience sample of undergraduate students (N = 328)	Reporter sex; Reporter attractiveness (2); Sport (2)	Trustworthiness, expertise (Ohanian, 1990); Perceived fit (Till & Busler, 2000); Reader loyalty	Sport involvement (Trail & James, 2001); Sport news consumption	No overall/main effect of reporter gender.	Contingent bias such that female reporters were rated as more credible for “female” sports.
									Reported an interaction effect between reporter sex X sport such that perceived fit was greater for a female reporter covering a female- appropriate sport
									Than a male- appropriate sport.
									Similarly, they also found impacts of attractiveness on expertise, but only for male reporters.
Pratt et al. (2018)	Social identity theory	Softball, Women’s gymnastics, Women’s Figure Skating	Highlights of women competing in feminine sports. Duration not specified.	Video	Convenience sample of undergraduate students (N = 235)	Reporter sex; Participant sex	Credibility (Kemp, 2007); Likeability (Reysen, 2005)	None.	Overall/main effect for reporter gender such that female reporters were rated as more credible and more likable than males.	Bias in favor of female sportscasters.
Cummins et al. (2018)	Objectification theory; gender schema theory	Professional/College football	Video compilation of 6 reporter standups featuring 3 male and 3 female reporters discussing football. No competition footage.	Video	Convenience sample of undergraduate students (N = 66)	Participant sex; reporter sex	Visual attention; Credibility, attractiveness (Ohanian, 1990)	Sport fanship (Wann et al., 1995)	No overall/main effect of reporter gender.	Contingent bias against female reporters among male participants.
									Observed a reporter sex X participant sex interaction such that male participants rated female reporters as less credible.
									Attractiveness was correlated with expertise, most notably among female reporters.
Luisi et al. (2020)	Social role theory	Professional football	4.5 m video excerpt of NFL game ending in a scoring play. Sex of play-by-play announcer was systematically varied, but sex of color commentator (male) was consistent.	Video	Convenience sample of undergraduate students and adults recruited online (N = 188)	Participant sex; Reporter sex	Credibility (combined measures of qualifications & dynamism; Berlo et al., 1969)	Unclear	Female play-by-plan announcers were rated as less credible than males, regardless of viewer gender.	Bias against female commentators.
Bell et al. (2022)	Social identity theory	Professional football	41-54s videos of professional football with commentary by female pairs, male pairs, or mixed-sex pairs. Reporter not shown/depicted.	Video	Paid online sample of sports fans (N = 415)	Participant sex; Commentator sex; Sports Spectator identification	Qualifications, dynamism (Berlo et al., 1969); Authority, character (McCroskey, 1966)	None	Did not report overall comparisons of commentators based on reporter sex.	Contingent bias against female commentators among female participants.
Bell et al. (2022)	Social identity theory	Professional football		Video	Paid online sample of sports fans (N = 415)			None	Reported separate comparisons of female/male/mixed-sex commentators based on participant sex. Results suggest an effect such that that female participants rated male commentators higher in credibility than female commentators. For male participants, no effect was suggested.
Boczek et al. (2022)	Hegemonic masculinity	Soccer (European football)	“Teaser” for an op-ed article on the national football team, featuring a headshot of the reporter and name.	Online article	Online panel (N = 635)	Participant sex; Reporter sex; athlete sex	Expertise (Ohanian, 1990); Reading Intention	Interest in topic; right-wing authoritarianism (as a measure of support for traditional gender roels)	No overall/main effect of reporter gender.	Contingent bias against female reporters among female participants.
Boczek et al. (2022)	Hegemonic masculinity	Soccer (European football)		Online article	Online panel (N = 635)	Participant sex; Reporter sex; athlete sex	Expertise (Ohanian, 1990); Reading Intention		Reported a reporter sex X participant sex interaction such that female participants rated female reporters as lower in expertise compared to male.
Brisbane et al. (2021)	Social identity theory; stereotyping	Professional football	35s video of sports reporter discussing a rule change. No competition footage.	Video	Paid online sample (N = 491)	Reporter sex; Participant sex; information type (fact v. Opinion)	Credibility; Perceived knowledge	Attractiveness; sports media use; Propensity to stereotype (Shrivastava & Gregory, 2009)	No effect of reporter sex on credibility or perceived knowledge.	None.
Charitat and Cianfrone (2023)	Source credibility theory	Esports	One 4-min video of league of legends competition with commentary by male shoutcasters or a mixed-gender pair.	Video	Online sample of league of legends viewers (N = 239)	Commentator sex (all-male pair; mixed-sex pair)	Trustworthiness; Expertise (Ohanian, 1990)	Involvement; Identification	Female shoutcasters were rated as more trustworthy than males.	Bias in favor of females.
Charitat and Cianfrone (2023)	Source credibility theory	Esports		Video	Online sample of league of legends viewers (N = 239)	Commentator sex (all-male pair; mixed-sex pair)	Trustworthiness; Expertise (Ohanian, 1990)	Involvement; Identification	No effect of shoutcaster sex on expertise.	Bias in favor of females.
Yang et al. (2022)	Source credibility theory; Social role theory	Men’s professional basketball	Audio podcast featuring a female reporter discussing the NBA all-star game.	Audio	Snowball sample of Chinese NBA viewers (N = 242)	Auditory cuteness (2)	Perceived attractiveness, perceived expertise (Ohanian, 1990); information Satisfaction (Kourouthanassis et al., 2017); gender role beliefs (Brown & Gladstone, 2012); Continuance Intentions (Sartore-Baldwin & Walker, 2011)	Gender, sexism, fanship, age, topic familiarity, experience listening to sports podcasts	Attractiveness was positively related to perceived expertise.	N/A
Yang et al. (2022)	Source credibility theory; Social role theory	Men’s professional basketball		Audio	Snowball sample of Chinese NBA viewers (N = 242)	Auditory cuteness (2)			Adherence to traditional gender roles intensified this effect, such that the more listeners endorsed traditional gender roles, the stronger the relationship between attractiveness and perceived expertise.	N/A
Olgemöller and Schupfner (2025)	Role theory; social identity theory	Soccer (European football)	74-s Audio recording of in-game commentary by male or female announcer.	Audio	Online convenience sample of Germans (N = 304)	Sex of original commentator; Speaker sex (3, original, male, female)	Credibility (McCroskey & Teven, 1999); Trust (Matthes & Kohring, 2003); sexism (Swim et al., 1995) Soccer fandom (Pentecost & Spence, 2009)	None	Rated credibility of female commentators decreased as participant sexism increased.	Contingent bias against female commentators among participants high in soccer fandom.
Olgemöller and Schupfner (2025)	Role theory; social identity theory	Soccer (European football)		Audio	Online convenience sample of Germans (N = 304)			None	The interaction of participant sexism and commentor sex was not significant for rated trust.

Results

Bias Observed

Clearly, the most important attribute of these studies is the nature of the outcomes demonstrated therein with respect to potential biases against female sports reporters. Perhaps owing to the first of these studies, many have sought to test an assumed or predicted bias against female reporters, based on the stereotypical notion that sports is a masculine domain. As will soon be discussed in the review of theoretical frameworks, traditional gender-role stereotypes (e.g., Mudrick & Lin, 2017) and hegemonic masculinity (e.g., Etling & Young, 2007) undergird such assumptions. According to this stereotypical view, men are superior in terms of their knowledge and ability to offer insights and reportage on the topic, which would translate to enhanced credibility.

Bias Against Female Reporters

One of the earliest known studies exploring perceptions of female sports reporters arguably cast the mold for much of what followed. In their study, Ordman and Zillmann (1994) reported a bias against female reporters that cut across both male and female sports (i.e., men’s college basketball; women’s gymnastics). For both sports, female reporters were rated as less competent than male reporters. Just over a dozen years later, Etling and Young (2007) observed a similar bias, where a female sports reporter was rated as less authoritative than a male reporter delivering an identical audio recording of an opinion/editorial on steroid use in baseball.

More recently, Luisi et al. (2020) likewise found a bias against female reporters in the context of 4.5-min play-by-play commentary of a professional (American) football. In their study, female commentators were rated as less credible than males, independent of participants’ sex. Moreover, these evaluations were also independent of participants’ sexist attitudes. Thus, select studies examining print (Ordman & Zillmann, 1994) and audio (Etling & Young, 2007) news stories, as well as play-by-play commentary of athletic competition (Luisi et al., 2020) have demonstrated a bias against female sports reporters/commentators across multiple sports.

Bias in Favor of Female Reporters

Other scholarship on the matter has been more mixed, sometimes reporting findings precisely in opposition to these studies. Again, it bears repeating that these studies generally follow the same general methodological approach with respect to study design.

First, consider Greer and Jones (2012) and their study examining commentary surrounding collegiate football or women’s collegiate volleyball. In that study, participants watched a short video of a female/male sports reporter discussing a female/male athlete’s recent injury and explicitly offering opinion on its impact on the team’s future performance. Contrary to theoretically derived predictions, participants in that study reported a bias in favor of female reporters such that they were rated as more competent than male reporters. The authors interpreted this to be the result of numerous factors, including changing social norms, greater rated attractiveness of the female reporter, or heavy female composition of the sample who judged the female reporter more favorably.

A few years later, Pratt et al. (2018) likewise found a bias in favor of female reporters whereby they were rated as more credible than males delivering audio commentary within highlight videos. However, it bears noting that in Pratt et al.’s (2018) study, only female athletes competing in stereotypically feminine sports (i.e., softball, women’s gymnastics, women’s figure skating) were shown. The authors offer several interpretations for their findings, including salience of social identity among female participants leading to more favorable evaluations, and increased recognition of female participation in sports.

Most recently, a bias in favor of favor of females was observed in the context of esports, where Charitat and Cianfrone (2023) adapted a similar experimental design within this relatively new athletic context. In their study, participants viewed a 4-min video of League of Legends competition, where either an all-male or mixed-sex pair of shoutcasters offered running commentary. Although no difference was found for rated expertise, findings showed that compared to their male counterpart, the female shoutcaster was rated as significantly more trustworthy. The authors speculated that participants recognized the professional nature of the shoutcasters, which would mitigate any traditional biases against female reporters. Thus, a select few studies have offered results suggesting that female reporters were rated as more credible than their male counterparts, but no consensus explanation for the findings emerged.

No Bias Observed

Just as studies have demonstrated biases both against and for female reporters, a number of studies have failed to find any differences in terms of perceived credibility. For example, Baiocchi-Wagner and Behm-Morawitz (2010) failed to observe any bias in their comparison of reported credibility of male/female authors of a print news story that “compares and contrasts two basketball teams’ strengths and weaknesses before concluding with his or her ‘projection’ of the regional championship winner” (p. 267). The stories attributed to female/male authors were identical except for a manipulation of the authors’ names/byline and athlete sex, so the subtlety of the manipulation could potentially be interpreted as an explanation for this lack of between-groups differentiation.

However, nearly a dozen years later, Brisbane et al. (2021) similarly observed no sex-based difference in rated credibility or perceived knowledge in a context where the manipulation of reporter sex was more overt. Whereas Baiocchi-Wagner and Behm-Morawitz’s (2010) employed a text stimulus, Brisbane et al. (2021) employed 35-s video stimuli that featured female/male journalists reporting either fact-based or opinion-based stories discussing a proposed rule change in professional (American) football. Their results found no difference in the rated credibility or perceived knowledge of female/male reporters. In sum, studies over the decades have variably found biases against female reporters, in support of female reporters, or none at all.

Contingent Bias Observed

A more common observation across the literature is some contingent bias variably in support of or against female sports reporters, depending on multiple factors including reporter characteristics, sport being examined, participant sex, or other individual characteristics.

In terms of contingent biases against female reporters, these at times depend on the sex of the participant. For example, Etling et al. (2011) compared listener response to audio recordings of female/male reporters covering “numerous current stories and scores” (p. 9). They found that female study participants rated male reporters as more authoritative, although no such difference was evident among male participants. A near identical pattern was observed by Bell et al. (2022) who compared evaluations of all-male, all-female, or mixed-sex pairs of commentators in an NFL broadcast. Their results suggest that female study participants rated the all-female and mixed-sex pairs of commentators as less credible than the all-male pair. No similar pattern was observed among male participants, who rated all pairs of commentators similarly. Boczek al. (2022) reported an identical bias in the context of soccer (European football). In their study, participants read a short teaser for an opinion-based column. Their results similarly indicated that female participants rated male reporters as significantly higher in perceived expertise than female reporters. Cummins et al. (2018) observed a similar contingent bias against female reporters; however, it was only among male study participants who rated female reporters as less credible when delivering news and information on (American) football. Thus, these biased perceptions have variably been observed at times among female but not male participants, and vice versa. Furthermore, these variable contingent biases were observed across varied sports (i.e., professional American football, European football) or sports news.

In addition to participant sex, another relatively uniform contingent condition involves the sex of the athlete or the closely related concept of sport gender typing (i.e., stereotypically masculine v. feminine sports). Such studies often emphasize reporter-sport congruence or fit, with the assumption being that men are presumably better qualified to speak on men’s sports and vice-versa. One such example is Harris’s (2013) study examining viewer response to female/male commentators offering voiceover commentary in a 45-s video excerpt of women’s/men’s college basketball. That study found that for women’s basketball, female reporters were rated as more qualified. This was particularly true among viewers who viewed the most sports. However, for men’s basketball, no such effect emerged. Likewise, Mastro et al. (2012) observed contingent biases that fell along gender lines, with male reporters being evaluated more favorably when covering stereotypically masculine sports (i.e., hockey, football), and female reporters being evaluated more favorably when covering feminine sports (i.e., gymnastics; women’s basketball). Similarly, Mudrick and Lin (2017) examined evaluations of female/male reporters covering a women’s versus men’s sports (i.e., college volleyball and college football, respectively). Their findings showed that that rated fit was greater for a female reporter covering a female sport than a stereotypically male sport. Notably, this difference in perceived fit did not carry over to perceptions of reporter expertise or trustworthiness, which did not differ by reporter sex. Lastly, it also bears emphasizing that some studies employing variation in athlete sex have failed to find such contingent biases based on presumed fit or congruence (Baiocchi-Wagner & Behm-Morawitz, 2010; Ordman & Zillmann, 1994).

Other attributes of both the reporter and study participants have also been shown to govern credibility perceptions. For example, reporter attractiveness has been a topic of examination in multiple studies that have yielded mixed results. For example, Hahn and Cummins (2014) manipulated reporter sex, athlete sex, and reporter attractiveness in their study of reader evaluations of a print Q&A story with female/male college basketball players. Although no overall bias was observed, they report a contingent bias as a function of reporter attractiveness and athlete sex, such that unattractive reporters covering the opposite sex were rated as most credible. Mudrick and Lin’s (2017) aforementioned study examining evaluations of female/male reporters likewise examined attractiveness as a categorical attribute of the reporter (i.e., high/low reporter attractiveness conditions). However, their findings were mixed and not in concert with Hahn and Cummins (2014). Specifically, Mudrick and Lin (2017) report that for male reporters, more attractive reporters were judged as more trustworthy. For female reporters, no impacts of attractiveness on expertise or trustworthiness were observed. In addition, a number of studies have employed attractiveness as an evaluation of female/male reporters, not a manipulated variable. Such studies have reported that reporter attractiveness was correlated with perceived expertise (Cummins et al., 2018; Yang et al., 2022).

Lastly, a growing number of recent studies have demonstrated contingent effects as a function of an individual’s endorsement of sexist views or traditional gender roles. For example, Mudrick and Lin (2017) found that greater endorsement of gender-role stereotypes and sexist attitudes predicted decreased credibility of female sportscasters. Similarly, both Yang et al. (2022) and Olgemöller and Schupfner (2025) employed measures of sexist attitudes or support for traditional gender roles in their predictive models, both demonstrating causal relationships between those individual characteristics and credibility perceptions. Thus, inclusion of such individual-level characteristics illustrates the utility of examining not just biological sex but individual differences as an explanatory mechanism.

Having reviewed the varied outcomes across these studies, the question then turns to explanations for these inconsistencies. Differences across an assortment of study attributes may help illuminate this assortment of findings.

Differing Stimuli

Testing for effects across a robust and diverse array of message types, contexts, and repetitions is an important aspect of experimental designs in communication research (Jackson et al., 1989; Jackson & Jacobs, 1983). Doing so helps aid generalizability of study findings and provides weight in support of the argument that cause-effect relationships are explored via broader concepts and not the idiosyncratic function of a singular message or unique stimulus. Thus, in this sense, the diversity of media content reflected across these studies could be viewed as a strength. However, differences in textual/audio/visual modalities reflected in the media employed, visual appearance of reporters, nature of the information (e.g., opinion vs. fact-based reportage), and more could clearly contribute to different study outcomes. Moreover, studies in this literature frequently fail to employ any type of message repetition, and many reflect single-stimulus designs where study participants see and evaluate only a single message or exemplar of a broader phenomenon (i.e., one reporter versus multiple sports reporters or stories).

Medium Effects

Among the studies reviewed, stimuli spanned print/text formats (n = 6), audio (n = 6), video only (n = 1), and audio-visual content (n = 10)² reflecting a variety of types of stories or messages. Only one study reviewed here directly compared media formats to specifically test for medium effects.³ Davis and Krawczyk (2010) drew upon media richness theory in order to examine how reporters’ visual attractiveness impacts credibility perception, but that study did not offer any specific predictions. Their stimuli were either audio recordings, a video recording without sound, or an audio-video recording of a female sports reporter reading a script about (American) football. However group means were not reported, and differences in audience response between these formats were hard to discern.

Even within a single medium, differences in the nature of the content reflected by these experimental materials also makes generalization difficult. For example, although video content reflects the most common form of experimental stimuli, these videos reflect a variety of sports-related content, including fact- and opinion-based sports reporting (e.g., Brisbane et al., 2021; Greer & Jones, 2012), or play-by-play or color commentary (e.g., Harris, 2013; Luisi et al., 2020). In some cases, the reporter was visibly depicted (e.g., Cummins et al., 2018), whereas in other stimuli the reporter was not shown (e.g., Bell et al., 2022). Thus, video stimuli varied in the nature of the information as well as visual salience or prominence of the reporter.

Visual Depiction of Reporter

Independent of medium, the visual depiction of female/male reporters is arguably another important (and inconsistent) property of stimuli employed across the literature. Most typically, studies testing for effects of reporter sex did not include visual depictions of reporters (n = 10). In these studies, manipulation was achieved via game voiceover from female/male commentators (e.g., Charitat & Cianfrone, 2023) or delivering news reports (e.g., Etling & Young, 2007). A smaller number of studies (n = 7) included visual depiction of reporters, although this was manifested in different ways. For example, studies employing text/print stimuli manipulated reporter sex via a reporter photo and byline that accompanied a text passage (Boczek et al., 2022; Hahn & Cummins, 2014; Mastro et al., 2012; Mudrick & Lin, 2017), arguably a somewhat subtle manipulation. Other studies employed more overt manipulations where reporters appeared onscreen delivering news or editorial content (e.g., Brisbane et al., 2021; Greer & Jones, 2012). In a select few studies, reporters were visually depicted in some experimental conditions but not all (Davis & Krawczyk, 2010; Ordman & Zillmann, 1994). In sum, studies across the literature may have uniformly varied the biological sex of reporters, but the visual representation of those manipulations varied in their prominence.

Sport Examined and Nature of Information

Although potentially more superficial, another possible reason for different findings across the body of literature is the various sports employed as the context for these studies. Again, the previous discussion of contingent effects due to sport gender typing and alignment or fit with reporter employed may be more salient. Nonetheless, totaling across the individual sports, men’s sports were more commonly employed (n = 19) at a rate twice that of women’s sports (n = 10). Thus, even within the scholarly literature focusing on perceptions of female sports reporters, a bias toward men’s sports was found.

Within men’s sports, (American) football was the most common (n = 8), and stimuli focusing on this sport included depictions of actual competition (e.g., Bell et al., 2022), talking-head coverage (e.g., Cummins et al., 2018), or print news stories (e.g., Mudrick & Lin, 2017). The next most common men’s sport employed was basketball, and again, stimuli included both athletic competition (e.g., Harris, 2013), sports talk (Mudrick et al., 2016), and print (e.g., Hahn & Cummins, 2014). Baseball was featured less frequently (n = 2), as was soccer/European football (n = 2), and hockey (n = 1).

With respect to women’s athletics, the most common sport employed within stimuli was basketball (n = 4). In those studies, three employed print stories on the topic and only one depicted actual athletic competition (Harris, 2013). Women’s gymnastics (n = 3) and volleyball (n = 2) served as the context for investigation an equal number of times, with stimuli reflecting actual athletic competition (e.g., Pratt et al., 2018) as well as print news coverage (Ordman & Zillmann, 1994). Lastly, softball (n = 1) and women’s figure skating (n = 1) were also employed, and 1 study included tennis as a gender-neutral sport.

Differing Theories

Despite the varied study outcomes, some greater consistency emerges with respect to the theoretical or conceptual frameworks employed as a vantage point for testing potential biases surrounding female sports reporters. Although precise articulations of and sources for these theories/perspectives vary, many of these studies are rooted in the notion of sports as a traditionally masculine domain. As such, women are systemically disadvantaged when discussing the topic. This sentiment cuts across a variety of related perspectives, including male hegemony/hegemonic masculinity (Boczek et al., 2022; Etling et al., 2011; Etling & Young, 2007); gender schema theory (Cummins et al., 2018; Etling et al., 2011; Greer & Jones, 2012; Hahn & Cummins, 2014); social/gender role theory (Luisi et al., 2020; Mudrick et al., 2016; Mudrick & Lin, 2017; Olgemöller & Schupfner, 2025; Yang et al., 2022), or merely stereotypes (Davis & Krawczyk, 2010; Harris, 2013; Mastro et al., 2012). Likewise, notions of “fit” also fall under this umbrella based on the stereotypical notion that female reporters are better suited for coverage of female athletes (Mudrick & Lin, 2017).

These studies have generally (although not consistently) yielded findings in support of this argument, demonstrating overall biases or contingent biases against female sports reporters as credible sources of sports information (Cummins et al., 2018; Etling et al., 2011; Etling & Young, 2007; Luisi et al., 2020). However, findings employing this perspective are not perfectly uniform and also provide evidence directly contradictory to schema/stereotype-oriented frameworks (e.g., Greer & Jones, 2012).

A second common theoretical framework employed across this literature is Social Identity Theory (SIT; Tajfel & Turner, 1979). That theory generally holds that individuals associate or identify with broader social categories or in-groups based on some salient criteria (e.g., race/ethnicity; sex/gender; team affiliation). Furthermore, individuals seek to maintain positive social status through association with positive or successful ingroups and derogation of salient outgroups. For example, SIT was employed in the broader context of sports to explain differences in the extent to which fans display in-group affiliation via “basking in reflected glory” after team victories (BIRGing; Cialdini et al., 1976), or “cutting off reflected failure” (CORFing; Synder et al., 1986) or “cutting off future failure” (COFFing; Wann et al., 1995) after team losses.

In the context of sports reporting, studies have generally posited that these biases impact individual response such that membership in associated in-groups based on reporter/participant sex will impact perceptions of or responses to reporters (Baiocchi-Wagner & Behm-Morawitz, 2010; Bell et al., 2022; Brisbane et al., 2021; Pratt et al., 2018).⁴ However, among the studies employing this theoretical framework, findings have been inconsistent. For example, several studies employing SIT have failed to find predicted biases in perceived credibility based on reporter/participant sex (Baiocchi-Wagner & Behm-Morawitz, 2010; Bell et al., 2022; Brisbane et al., 2021). Furthermore, although Pratt et al. (2018) found support for their hypotheses, examination of the stated predictions suggest better fit with aforementioned schema/gender role/stereotype-based perspectives than Social Identity Theory. Thus, support for SIT as a mechanism explaining potential gender/sex biases remains in question.

Lastly, an assortment of additional theoretical or conceptual frameworks has been invoked in studies testing for sex-based biases in credibility perceptions. Although virtually all the studies reviewed here discussed source credibility extensively as a central concept, several studies have named source credibility as the theoretical basis for study predictions (Charitat & Cianfrone, 2023; Mudrick & Lin, 2017; Yang et al., 2022). Additional frameworks include Media Richness Theory (Davis & Krawczyk, 2010), Objectification Theory (Cummins et al., 2018), or Communication Accommodation Theory (Mastro et al., 2012) to support predictions surrounding differences in individual response to competing media forms or female/male sports reporters, respectively.

Differing Samples

Given the present focus on experimental designs, a common attribute of these studies is the use of human subjects as sources of data with respect to the impacts of the manipulated message attributes. However, the nature of these samples varies in terms of demographics, means of recruitment, as well as sample size.⁵ The total number of participants employed across these studies was 5,676, with an average of 283.80 (SD = 147.06) participants per study. The smallest sample reported was N = 66 (Cummins et al., 2018), and the largest was N = 635 (Boczek et al., 2022).

With respect to the demographic composition of samples, the majority of articles reviewed here (n = 12) generally relied on samples of undergraduate students, reflecting just under half the participants employed in this literature (undergraduate participant n = 2,369; 41.75%). At times, these were explicitly labeled as convenience samples (e.g., Pratt et al., 2018) whereas other articles simply referred to participants as undergraduate students (e.g., Baiocchi-Wagner & Behm-Morawitz, 2010). Furthermore, the nature of these student samples was at times acknowledged as a study limitation (e.g., Luisi et al., 2020) or possible explanation for study outcomes. For example, Greer and Jones (2012) noted that a majority of participants in their study were “women majoring in communication” (p. 76), and offered that as a possible explanation for their observation that female reporter was rated as more competent than the male reporter employed their stimuli.

The composition of the remainder of the samples varied, although most reflect the use of online survey-experiments with participants recruited through a variety of approaches. For example, several studies (Bell et al., 2022; Boczek et al., 2022; Brisbane et al., 2021; Mudrick et al., 2016) employed paid participants recruited through Amazon Mechanical Turks (MTurks), Qualtrics, or unspecified panel vendors, reflecting roughly 40% of the total body of participants across these studies (n = 2,389; 42.09%). Other online studies employed ostensibly purposive samples, such as Chinese NBA viewers (Yang et al., 2022) or League of Legends players (Charitat & Cianfrone, 2023) appropriate to the specific study.

Finally, some studies also referenced some means of checking the quality of participant responses. Such efforts reflect deleting participants due to quickly completing a study (Boczek et al., 2022) or other attention checks, such as correctly answering questions regarding the study stimuli (Bell et al., 2022; Yang et al., 2022).

Differing Measures

One important property of empirical research that contributes to the accumulation of knowledge during normal science is uniformity in measurement. Unlike our peers in the STEM disciplines who may have the advantage of highly standardized measures endorsed by a governing body (e.g., the National Institute of Standards and Technology), those of us within the social sciences operate with greater latitude, particularly with respect to the central focus of this review, “credibility.”

On the one hand, studies reviewed here have generally drawn upon a few consistent and popular measures of credibility. On the other, actual application of these measures has varied in important ways, including use of only select subscales, abbreviated or adapted versions of the measures, or amalgamations of separate subscales into idiosyncratic composite measures for analysis. The result is at times an apples-to-oranges comparison across the literature, or at the very least, comparison of different varieties of apples.

McCroskey’s (1966) measure of credibility is a broadly cited multi-dimensional instrument that captures authoritativeness and character. The original measure consisted of 22 and 20 seven-point Likert-type statements, or abbreviated 12-item semantic differential items to capture authoritativeness and character respectively. However, scholars examining credibility of sports reporters have employed that scale in differing ways. Baiocchi-Wagner and Behm-Morawitz (2010) acknowledged that the original instrument consisted of separate subscales measuring authoritativeness and character, but they then report results suggesting that all items were combined into a single measure of credibility. Etling and Young (2007) also employed the measure from McCroskey to assess perceived authoritativeness, but they reported that they adapted a smaller subset of 15 of 22 items from the original scale for use. Greer and Jones (2012) also report drawing upon McCroskey (1966), focusing on “competency” as their construct of interest. Lastly, Olgemöller and Schupfner (2025) report using the competence sub-scale from McCroskey and Teven (1999). Thus, all four studies drew upon/cited the same general measure, but in different ways, focusing on different components of credibility.

Another common measure of credibility is the work of Berlo et al. (1969). That scale was likewise a multi-dimensional scale designed to capture three aspects of source credibility (i.e., safety, qualification, and dynamism), each measured via five semantic differential items. A small number of studies reviewed here (n = 4) drew upon items from this measure. For example, Harris (2013) employed discrete measures of dynamism and qualifications. Similarly, Luisi et al. (2020) also employed measures of dynamism and qualifications from Berlo et al. (1969). However, they combined these subscales into a single measure of credibility for subsequent analysis. Thus, dependent measures from the two studies that employed the same credibility scale are again close but not perfectly uniform.

Permutations of Ohanian’s (1990) measure may be the most common across the literature, appearing in 7 of the studies reviewed here. Ohanian’s (1990) scale captures the dimensions of expertise, trustworthiness, and attractiveness, each via 5 adjectives. However again, applications of this measure vary. Some studies report employing discrete subscales, such as Boczek et al. (2022) and Yang et al. (2022), who employed the expertise subscale. Likewise, Mudrick and Lin (2017) and Charitat and Cianfrone (2023) employed separate measures of expertise and trust(worthiness) in their studies. However, other studies report using variable permutations. For example, Hahn and Cummins (2014) and Cummins et al. (2018) report combining the items from that scale based on factor analytic results that suggested only two dimensions they labeled “credibility” and “attractiveness.”

Furthermore, some studies report varied combinations of multiple measures. For example, Mudrick et al. (2016) report using Ohanian’s (1990) measure of trustworthiness, attractiveness, and expertise, along with Berlot et al.’s (1969) measure of dynamism, and then combining all into a single construct of credibility. Likewise, Bell et al. (2022) report using measures of dynamism and qualifications from Berlo et al. (1969) as well as authority and character from McCroskey (1966). Furthermore, their method reports separate reliability measures for each subscale. However, their results report findings regarding a single credibility measure, suggesting that all items were combined into a single dependent measure.

Lastly, a small number of studies do not explicitly cite a source for the credibility measures employed. For example, Ordman and Zillmann (1994) and Mastro et al. (2012) report the items used within their method, but a source for these measures was not offered. Similarly, Davis and Krawczyk (2010) fully report the items used in their study via an appendix. But again, a source was not offered, and review of those items suggests single-item measures of expertise, dynamism, etc. Brisbane et al. (2021) report four items used to capture perceived credibility, but a source is not offered, and results suggest that items were combined into a single measure. Thus, across these studies generally studying “credibility,” varied differences in operationalization contribute to measurement error (Loken & Gelman, 2017) and prohibit direct comparisons.

Beyond these measures of credibility, studies have employed an assortment of other tools to capture other variables of interest. As previously noted, adherence to or endorsement of traditional gender roles/norms have emerged as an important variable to help explain perceived credibility of female/male sports reporters (Mudrick & Lin, 2017; Yang et al., 2022). However, measurement of these concepts varied. For example, Etling and Young (2007) and Etling et al. (2011) used items from Benson and Vincent’s (1980) Sexist Attitudes Toward Women Scale and Swim et al.’s (1995) Old Fashioned Sexism Scale. Mastro et al. (2012) likewise cited the work of Swim et al. (1995), noting that items were modified from that study to measure gender-based attitudes as a covariate in their analysis. Mudrick et al. (2016) also cite the work of Swim et al. (1995) but instead refer to the measure as the modern sexism instrument, and they also used Levant et al.’s (2007) Male Role Norms Inventory-Revised to assess support for traditional male gender roles. Olgemöller and Schupfner (2025) report using items that “align closely” with the Swim et al. (1995) measure (p. 7).

Boczek et al. (2022) also assert the importance of individual endorsement of traditional gender roles, but they assessed this trait via a measure of right-wing authoritarianism (RWA; Manganelli Rattazzi et al., 2007). Lastly, Yang et al. (2022) employed support for traditional gender roles as a moderator in their study of the effect of auditory cuteness on perceived expertise, citing Brown and Gladstone’s (2012) measure of the construct. Thus, although multiple studies have asserted the importance of gender role beliefs, they have measured it via different scales.

Discussion

Despite growth in female participation in sports (McGuire, 2025) and sports viewing (Nielsen, 2023), biases surrounding women’s role in sports journalism persist in varied forms. This paper reviewed 30 years of scholarly research using experimental studies designed to test the impact of reporter sex on audience perceptions of reporter credibility in order to document consistencies and discrepancies in study findings, as well as various study attributes that may contribute to differing outcomes. Among the articles reviewed here, an equal number of studies alternately found biases against female reporters (e.g., Ordman & Zillmann, 1994), in favor of female reporters (e.g., Pratt et al., 2018), or no bias whatsoever (e.g., Baiocchi-Wagner & Behm-Morawitz, 2010). More commonly, these biases were dependent upon various study attributes, such as participant sex and/or athlete sex (e.g., Etling et al., 2011), reporter attractiveness (e.g., Mudrick & Lin, 2017), or endorsement of traditional gender roles (e.g., Yang et al., 2022). Thus, consistent accumulation of knowledge is hampered by inconsistency in findings and sometimes directly contradictory outcomes, leading to the question posted by this essay’s title: What do we know? Despite these multiple studies collectively employing more than 5,000 research participants, the answer remains somewhat inconclusive.

One possible and intuitive explanation for these varied study findings over the decades is broader social change regarding normative perceptions surrounding female participation in sport. For example, the National Collegiate Athletic Association (NCAA) reports double-digit growth in women’s participation and leadership in college sports over the past decade (McGuire, 2025). Ongoing coverage of Olympic games has documented increased representation of women’s sports both in newspapers (Dean & Somaini, 2024) and televised coverage of the games (Angelini & Arth, 2022). In professional sports, increased viewership of WNBA has been widely celebrated, attributed to interest in star athletes such as Caitlin Clark (Bachman, 2024) and increased interest overall (Poole, 2025). With this growth, changes in normative perceptions could arguably follow, which would seemingly be reflected in the diminished biases regarding credibility perceptions. However, the three studies reviewed here that documented overall biases against female sports reporters were published at equal time intervals, each separated by more than a decade (i.e., Ordman & Zillmann, 1994; Etling & Young, 2007; Luisi et al., 2020). Thus, changes in normative perceptions would not seem to be reflected. Furthermore, scattered among these chronologically were other studies variably demonstrating opposite effects, no effects, or contingent effects. As such, changes in social norms may be plausible, but the scholarship reviewed here cannot reliably speak to such changes as a singular explanatory mechanism.

Accumulation of Knowledge during Times of Normal Science

Returning to Kuhn’s (1996) thoughts on the work that characterizes normal science, many of the studies reviewed here certainly reflect that type of research. Recall that during periods of normal science, researchers operate under a shared paradigmatic framework, employing similar approaches to study design and measurement to develop more granular knowledge of the area of inquiry. If one were to ascribe to this view, the early work of Ordman and Zillmann (1994) clearly establishes a blueprint for much of the research that followed—between-subjects manipulations of media messages in order to affect variation in reporter sex, random assignment of male/female research participants to experimental conditions, measurement of some form of perceived or rated credibility. Much of what followed largely mimics this general design, incrementally adding other variables (e.g., athlete gender, audience, and reporter characteristics) to provide more nuanced detail to what’s known about this cause-effect relationship. However, the assortment of at-times contradictory findings does not reflect a consistent accumulation of knowledge, and the sometimes unsystematic way this research has collectively unfolded could possibly help explain differences in study findings.

Stimuli, Sport, and Replication

As noted here, research in this area reflects notable differences in media platforms and modalities, visual prominence of the manipulation of reporter sex, sport and sex of the athlete, theories used to undergird the predicted effects, and measures used to capture potential effects. The end result is arguably a failure to systematically replicate study findings in order to more confidently demonstrate robustness of effects over time and context. To be fair, this is hardly limited to research on this specific topic, and the “replication crisis” has been observed in other social scientific disciplines (Jensen et al., 2023; Maxwell et al., 2015). Although straightforward replication could help illuminate sometimes contrasting findings, careful consideration of various study and design attributes noted here could also aid systematic progression within the literature.

With respect to the messages tested in the studies reviewed here, their generally common feature is media coverage of sports. However, this reflects a remarkably large universe, and wide differences in study stimuli likely have some bearing on varied study outcomes. Moving forward, exploring these differences in a way that is explicitly grounded in the meaningful differences afforded by competing media platforms is needed in order to better speak to potential medium impacts. Questions surrounding the impact of competing media platforms on credibility perceptions are hardly novel or unique to sports communication (e.g., Lee, 1978; Newhagen & Nass, 1989), and impacts of media form have been broadly documented (e.g., Bracken, 2006; Kiousis, 2006). In short, what differences between media platforms and messages should make a difference, and why?

With respect to both the medium being tested as well as type of message, differences can variably magnify or minimize the role of the reporter, a central aspect of this research. For example, text-based media lend themselves to easy experimental manipulation in the form of alternate bylines/reporter photographs next to a story (e.g., Boczek et al., 2022; Hahn & Cummins, 2014; Mastro et al., 2012). However, this also may moderate potential impacts on credibility perceptions if readers selectively attend to or choose to ignore this message element. In audio-visual stimuli, reporter sex is more overt and persistent in the form of the reporter’s voice (e.g., Bell et al., 2022) or physical appearance (e.g., Greer & Jones, 2012), which arguably would yield greater impacts on viewer response.

With respect to the visual inclusion of the reporter within the message, depiction is important to the extent that it helps make reporter sex an overt cue that could influence perceptions. Too subtle a manipulation could serve as a possible explanatory mechanism for lack of effects/differences in perception (e.g., Baiocchi-Wagner & Behm-Morawitz, 2010). Moreover, studies employing Social Identity Theory rest upon salience of reporter sex to denote in-group/out-group status among study participants (e.g., Brisbane et al., 2021), and prominence of the reporter could play a key role in helping activate that salience. Lastly, visual depiction of the reporter is also important in study designs that have incorporated reporter attractiveness in some fashion (e.g., Yang et al., 2022). All these considerations may have direct bearing on reader/viewer response, depending on the proposed theoretical mechanism at work.

With respect to the sport being examined, a key consideration is obviously athlete sex, which has been a factor in some studies (e.g., Harris, 2013; Mudrick & Lin, 2017). Beyond this, broader normative perceptions surrounding stereotypically masculine and feminine sports also merit persistent consideration (e.g., Pratt et al., 2018) in order to more clearly and consistently map the terrain of extant knowledge.

Theory and Replication

With respect to theoretical frameworks, scholarship in this domain has relied on competing perspectives, and no single approach has emerged as dominant. Thus, a goal of future work is to identify theories that are supported by study findings and rule out those that fail to explain observed biases. To be certain, these competing perspectives are not necessarily a weakness or at odds—they’re just different. Furthermore, study findings have variably supported (Etling & Young, 2007) or failed to support (Baiocchi-Wagner & Behm-Morawitz, 2010) theoretically derived predictions. In the case of the former, future work should seek to continue to explore relationships in order to continue to explore limiting conditions or meaningfully expand contexts and applications. For example, if future work continues to examine esports (e.g., Charitat & Cianfrone, 2023) or other new forms of competition, researchers should explicitly articulate precisely what attributes surrounding this novel context could/should present novel findings. In the case of theoretically driven studies where predictions are not supported, genuine consideration must be given to why the lack of support was discovered: Is the theory no longer valid (and why)?; Was there a flaw in the research design that yielded the finding (e.g., lack of salience of reporter sex within the stimulus)?

In both scenarios (i.e., studies demonstrating support for predictions, as well as lack of support), it is paramount that researchers more directly test the theoretical mechanisms at work, as this allows greater confidence in support for or elimination of a given theory. Take the notion of stereotypes/schema as an explanatory framework. Brisbane et al.’s (2021) study predicting sex-based differences in perceptions of female/male journalists failed to find evidence supporting this hypothesis. In that study, they offered stereotyping as a theoretical mechanism, arguing that stereotyping is a two-stage process whereby stereotypes are first automatically activated in the first stage and then applied to an encounter in the second stage. Thus, failure to support the predicted findings could alternately be the result of (a) broad, normative changes in these stereotypes, (b) individual-level differences in endorsement of these stereotypes, (c) failure within study participants to automatically activate the stereotype, or (d) failure to apply the stereotype. Careful, precise measurement is needed to verify both individual-level belief in as well as activation of these stereotypes and not the mere presumption of stereotypical views. In the case of the final possibility, failure to apply stereotypes when evaluating a target could be the result of sensitization to the study purpose (Leustek, 2017). Thus, it could be that the theory is incorrect, or it could be that it was not directly tested.

Similarly, studies relying on Social Identity Theory as the explanatory mechanism would be well served to employ manipulation checks to affirm individual recognition of and membership to a social group. Furthermore, that group membership must be salient during viewing in order to impact subsequent evaluations, and failure to support SIT-derived predictions could be a function of lack of salience of the specific identity in question. Lastly, Social Identity Theory also holds that individuals may hold multiple identities that are not mutually exclusive (Campo et al., 2019; Dalai & Naraine, 2024). This invites questions of which identity is most salient when making subsequent judgements. In sum, careful and more direct measurement of concepts and theoretical mechanisms is needed to help explain and hopefully alleviate inconsistent findings across the literature.

Measurement (and Inclusion) of Key Concepts

This careful testing of theoretical mechanisms also invites careful attention to measurement of key concepts, as well as their integration into research designs. As noted at the outset of this essay, a common hallmark of these studies is factorial manipulation of reporter sex and oftentimes participant sex. However, observers have long noted the distinction between biological sex and more sociologically oriented concept of gender, which invokes broader discussions of identity and social norms (Butler, 1990). Furthermore, gender identity has been robustly embraced by some exploring the nexus of sport and communication (Kane et al., 2013; Lenskyj, 2012). Despite this, most of the work reviewed here adheres to binary definitions of reporter/participant sex.

Thus, perhaps the greatest need in advancing this literature is to more strongly integrate gender, both as a function of an individual participant’s identity as well as endorsement of traditional gender-role norms as an explanatory mechanism. On the latter, a select few studies reflect this shift to varying degrees (Boczek et al., 2022; Brisbane et al., 2021; Mudrick et al., 2016; Yang et al., 2022). Future work exploring biased perceptions need to more precisely capture the more nuanced concept of gender instead of, or in combination with biological sex. Furthermore, use of these measures in more sophisticated analyses (e.g., mediation/moderation) has potential to advance our understanding of these biases beyond statistical procedures that more strongly rely on categorical measures. For example, Yang et al.’s (2022) recent work employing moderation analysis found that the more participants embraced traditional/stereotypical gender roles, the lower the perceived expertise of a female podcaster. Furthermore, gender role beliefs also moderated the relationship between expertise and intent to continue the podcast. Thus, future work should directly assess gender identity as well as support of stereotypical gender roles for inclusion in analyses, as recent studies demonstrate the potentially greater predictive ability of these concepts compared to simple biological sex or sport gender typing (Olgemöller & Schupfner, 2025; Yang et al., 2022;).

In addition, as previously noted, the key outcome across all this body of literature is the notion of credibility or a close analog. However, operational measurement of this outcome varies across the literature, inhibiting direct comparisons between studies. On the one hand, differing scales may be required due to differences in context. For example, returning to earlier concerns regarding stimuli selection, some media may emphasize select attributes of credibility that are less salient in other contexts (e.g., dynamism; Metzger et al., 2003). Furthermore, credibility could be variably tied to the reporter, news organization, or medium, further adding imprecision to measurement. Thus, a modicum of flexibility may be justified.

Nonetheless, recall that one attribute of a paradigm is shared approaches to measurement among scholars working within an area of research (Kuhn, 1996). Thus, future work exploring biased perceptions should work to not only identify the source of measurements employed (i.e., what measure is being used) but work to embrace greater uniformity in use of these measures. Greater consistency in measurement represents one modest means of ruling out the potential explanations of differing study outcomes. For example, although a case could certainly be made for alternate measures depending on study context, Ohanian’s (1990) scale including expertise, trustworthiness, and attractiveness has been most commonly employed and also reflects important dimensions that encapsulate distinct aspects of reporter performance and characteristics.

Conclusion

Despite the differing outcomes reflected within the literature suggested here, the wealth of research exploring biased perceptions of female reporters offers one silver lining—these biases have been broadly recognized by scholars who are committed to continued investigation using an array of tools, methods, or approaches. Fervent interest in this topic remains, and the hope is that this review illuminate some of the possible reasons for these differing study outcomes and encourages greater consistency and systematic accumulation of knowledge to help identify, explain, and overcome these biased perceptions. In closing, the following recommendations are offered:

• Fully and exhaustively consult the relevant literature in order to draw parallels with past studies. Review of the articles cited herein reveals that some fail to consult all the scholarship in this area, overlooking some contributions. Future work should fully connect to the literature in this area in order to compare and contrast findings. Consistencies within cited literature should be explicitly noted, and discrepancies must be carefully and thoroughly accounted for in future work.

• Researchers should design research scenarios and employ study stimuli that embrace the salient attributes that may have causal impact (e.g., visual prominence of the reporter). As noted above, differences in study stimuli can variably minimize or emphasize the role of the reporter, their visual and aural prominence, and the type of information presented. All these should be carefully considered and explicitly accounted for in research designs and again, connected back to relevant literature to illuminate consistent and discrepant findings.

• Carefully and, when possible, directly measure theoretical processes and causal variables (e.g., identity salience; endorsement of traditional gender roles). Explicitly examining these can help better test theoretically derived predictions, explain study findings, and rule out unsupported theoretical approaches.

• Draw upon established measures appropriate to the study context, balancing the unique attributes of a given research design while also connecting with the literature broadly. Furthermore, use these measures in a manner consistent with their development and with past literature. Again, consistency in measurement is a relatively modest means of aiding the systematic advancement of the literature.

Lastly, it bears repeating that the present study has focused solely on experimental designs with categorical manipulation of independent variables, and a host of additional literature can offer valuable insights. Furthermore, recent studies have embraced more holistic predictive/causal models that can integrate a wider constellation of variables and characteristics, and such approaches hold great potential for illuminating important variables in addition to and in conjunction with reporter sex. Thus, greater use of such approaches can help further advance this decades-old line of inquiry.

Footnotes

ORCID iD

R. Glenn Cummins

Ethical Considerations

This study does not involve collection of data from human subjects but instead reflects the analysis of existing information/documents. No ethical approval or informed consent was sought for the conduct of this research.

Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Declaration of Conflicting Interests

The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: The author whose name is listed immediately above certify that he has NO affiliations with or involvement in any organization or entity with any financial interest, or non-financial interest in the subject matter or materials discussed in this manuscript.

Data Availability Statement

Data used in this analysis is summarized within of this manuscript. No additional data was used.*

Notes

References

Angelini

J. R.

Arth

Z. W.

(2022). A U.S. medal agenda? Clock-time and salience analyses of biological sex representation in the 2020 and 2022 NBC Olympic telecasts. Communication & Sport, 11(6), 1042–1057. https://doi.org/10.1177/21674795221132830

Bachman

(2024). Caitlin Clark is already the GOAT of TV ratings. Wall Street Journal. Retrieved from. https://www.wsj.com/sports/basketball/caitlin-clark-tv-audiences-b15d193e

Bagozzi

R. P.

Lee

K. H.

Van Loo

M. F.

(2001). Decisions to donate bone marrow: The role of attitudes and subjective norms across cultures. Psychology and Health, 16, 29–56. https://doi.org/10.1080/08870440108405488

Baiocchi-Wagner

E. A.

Behm-Morawitz

(2010). Audience perceptions of female sports reporters: A social-identity approach. International Journal of Sport Communication, 3(3), 261–274. https://doi.org/10.1123/ijsc.3.3.261

Bell

T. R.

Sadri

S. R.

Billings

A. C.

(2022). The dichotomy of male sports and female announcing: Examining the credibility of gendered pairs for NFL announcing teams. Journalism & Mass Communication Quarterly, 101(4), 1026–1048. https://doi.org/10.1177/10776990221117778

Bem

S. L.

(1981). Gender schema theory: A cognitive account of sex typing. Psychological Review, 88(4), 354–364. https://doi.org/10.1037/0033-295X.88.4.354

Benson

P. L.

Vincent

(1980). Development and validation of the Sexist Attitudes Toward Women Scale (SATWS). Psychology of Women Quarterly, 5(2), 276–291. https://doi.org/10.1111/j.1471-6402.1980.tb00962.x

Berlo

D. K.

Lembert

J. B.

Mertz

R. J.

(1969). Dimensions for evaluating the acceptability of message sources. Public Opinion Quarterly, 33(4), 563–576. https://doi.org/10.1086/267745

Boczek

Dogruel

Schallhorn

(2022). Gender byline bias in sports reporting: Examining the visibility and audience perception of male and female journalists in sports coverage. Journalism, 24(7), 1462–1481. https://doi.org/10.1177/14648849211063312

10.

Bracken

C. C.

(2006). Perceived source credibility of local television news: The impact of television form and presence. Journal of Broadcasting & Electronic Media, 50(4), 723–741. https://doi.org/10.1207/s15506878jobem5004_9

11.

Brisbane

G. J.

Ferrucci

Tandoc

(2021). Side-by-side sports reporters: A between-subjects experiment of the effect of gender in reporting on the NFL. Communication & Sport, 11(1), 115–134. https://doi.org/10.1177/2167479521995462

12.

Brown

M. J.

Gladstone

(2012). Development of a short version of the gender role beliefs scale. International Journal of Psychology and Behavioral Sciences, 2(5), 154–158. https://doi.org/10.5923/j.ijpbs.20120205.05

13.

Butler

(1990). Gender trouble: Feminism and the subversion of identity. Routledge.

14.

Campo

Mackie

D. M.

Sanchez

(2019). Emotions in group sports: A narrative review from a social identity perspective. Frontiers in Psychology, 10, 666. https://doi.org/10.3389/fpsyg.2019.00666

15.

Charitat

Cianfrone

B. A.

(2023). An examination of the effects of source gender on perceived credibility of esports shoutcasters. Journal of Electronic Gaming and Esports, 1(1), 1–10. https://doi.org/10.1123/jege.2022-0036

16.

Cialdini

R. B.

Borden

R. J.

Thorne

Walker

M. R.

Freeman

Sloan

L. R.

(1976). Basking in reflected glory: Three (football) field studies. Journal of Personality and Social Psychology, 34(3), 366–375. https://doi.org/10.1037/0022-3514.34.3.366

17.

Cooky

Council

L. D.

Mears

M. A.

Messner

M. A.

(2021). One and done: The long eclipse of women’s televised sports, 1989–2019. Communication & Sport, 9(3), 347–371. https://doi.org/10.1177/21674795211003524

18.

Cooky

Messner

M. A.

Hextrum

R. H.

(2013). Women play sport, but not on TV: A longitudinal study of televised news media. Communication & Sport, 1(3), 203–230. https://doi.org/10.1177/2167479513476947

19.

Cooky

Messner

M. A.

Musto

(2015). “It’s Dude Time!”: A quarter century of excluding women’s sports in televised news and highlight shows. Communication & Sport, 3(3), 261–287. https://doi.org/10.1177/2167479515588761

20.

Cramer

J. A.

(1994). Conversations with women sports journalists. In Creedon

P. J.

(Ed.), Conversations with women sports journalists (pp. 159–180). Sage Publications, Inc. https://doi.org/10.4135/9781483326764.n6

21.

Cummins

R. G.

Ortiz

Rankine

(2018). “Elevator eyes” in sports broadcasting: Visual objectification of male and female sports reporters. Communication & Sport, 7(6), 789–810. https://doi.org/10.1177/2167479518806168

22.

Dalai

Naraine

M. L.

(2024). Contextualizing fans’ divergent experiences of sport activism through a social identity threat lens. Sport Marketing Quarterly, 33(3), 242–257. https://doi.org/10.32731/smq.333.092024.04

23.

Davis

D. C.

Krawczyk

(2010). Female sportscaster credibility: Has appearance taken precedence? Journal of Sports Media, 5(2), 1–34. https://doi.org/10.1353/jsm.2010.0004

24.

Dean

Somaini

(2024). Olympian women’s representation in U.S. newspapers has improved according to a content analysis of three local dailies. Newspaper Research Journal, 45(4), 449–471. https://doi.org/10.1177/07395329241267031

25.

Demir

Ayhan

(2022). Being a female sports journalist on Twitter: Online harassment, sexualization, and hegemony. International Journal of Sport Communication, 15(3), 207–217. https://doi.org/10.1123/ijsc.2022-0044

26.

Dirks

Sadri

S. R.

Bell

T. R.

Jackson

J. R.

Billings

A. C.

(2023). Psychophysiological responses to gendered sports announcing: Effects of announcer gender on audience arousal and emotion. Journal of Broadcasting & Electronic Media, 67(4), 487–506. https://doi.org/10.1080/08838151.2023.2245935

27.

Eastman

S. T.

Billings

A. C.

(2000). Sportscasting and sports reporting: The power of gender bias. Journal of Sport & Social Issues, 24(2), 192–213. https://doi.org/10.1177/0193723500242006

28.

Etling

Young

(2007). Sexism and the authoritativeness of female sportscasters. Communication Research Reports, 24(2), 121–130. https://doi.org/10.1080/08824090701304816

29.

Etling

L. W.

Young

R. W.

Faux

W. V.

Mitchell

J. C.

(2011). Just like one of the guys? Perceptions of male and female sportscasters' voices. Journal of Sports Media, 6(2), 1–21. https://doi.org/10.1353/jsm.2011.0010

30.

Genovese

(2015). Sports television reporters and the negotiation of fragmented professional identities. Communication, Culture and Critique, 8(1), 55–72. https://doi.org/10.1111/cccr.12069

31.

Greer

J. D.

Jones

A. H.

(2012). A level playing field? Audience perceptions of male and female sports analysts. International Journal of Interdisciplinary Social Sciences, 6(8), 67–79. https://doi.org/10.18848/1833-1882/CGP/v06i08/52137

32.

Hahn

D. A.

Cummins

R. G.

(2014). Effects of attractiveness, gender, and athlete–reporter congruence on perceived credibility of sport reporters. International Journal of Sport Communication, 7(1), 34–47. https://doi.org/10.1123/IJSC.2013-0113

33.

Hardin

Shain

(2005). Strength in numbers? The experiences and attitudes of women in sports media careers. Journalism & Mass Communication Quarterly, 82(4), 804–819. https://doi.org/10.1177/107769900508200404

34.

Hardin

Shain

(2006). “Feeling much smaller than you know you are”: The fragmented professional identity of female sports journalists. Critical Studies in Media Communication, 23(4), 322–338. https://doi.org/10.1080/07393180600933147

35.

Hardin

Whiteside

(2009). Token responses to gendered newsrooms: Factors in the career-related decisions of female newspaper sports journalists: Factors in the career-related decisions of female newspaper sports journalists. Journalism, 10(5), 627–646. https://doi.org/10.1177/14648849090100050501

36.

Harris

Bowes

(2025). Still the outsiders? Women in sport journalism. International Journal of Sport Communication, 18(2), 213–223. https://doi.org/10.1123/ijsc.2025-0031

37.

Harris

(2013). Gender stereotypes, gender segregation, and credibility: Crossing the lines in sports media. The International Journal of Sport and Society, 3(2), 137–159. https://doi.org/10.18848/2152-7857/CGP/v03i02/53912

38.

Jackson

Jacobs

(1983). Generalizing about messages: Suggestions for design and analysis of experiments. Human Communication Research, 9(2), 169–191. https://doi.org/10.1111/j.1468-2958.1983.tb00691.x

39.

Jackson

O’Keefe

D. J.

Jacobs

Brashers

D. E.

(1989). Messages as replications: Toward a message‐centered design strategy. Communication Monographs, 56(4), 364–384. https://doi.org/10.1080/03637758909390270

40.

Jensen

T. I.

Kelly

Pedersen

L. H.

(2023). Is there a replication crisis in finance? The Journal of Finance, 78(5), 2465–2518. https://doi.org/10.1111/jofi.13249

41.

Johnson

R. G.

Al-khateeb

Forbes

Cupido

(2023). Targeted social media harassment: A comparative analysis of toxicity directed at men and women sports reporters. Communication & Sport, 12(3), 443–465. https://doi.org/10.1177/21674795231213330

42.

Kane

M. J.

LaVoi

N. M.

Fink

J. S.

(2013). Exploring elite female athletes’ interpretations of sport media images: A window into the construction of social identity and “selling sex” in women’s sports. Communication & Sport, 1(3), 269–298. https://doi.org/10.1177/2167479512473585

43.

Kemp

D. G.

(2007). Source credibility and public information campaigns: The effect of audience evaluations of organizational sponsors on message acceptance. [Master’s thesis. University of South Florida]. Digital Commons at the University of South Florida. http://scholarcommons.usf.edu/etd/2241.

44.

Kiousis

(2006). Exploring the impact of modality on perceptions of credibility for online news stories. Journalism Studies, 7(2), 348–359. https://doi.org/10.1080/14616700500533668

45.

Kourouthanassis

P. E.

Mikalef

Pappas

I. O.

Kostagiolas

(2017). Explaining travelers online information satisfaction: A complexity theory approach on information needs, barriers, sources and personal characteristics. Information & Management, 54(6), 814–824. https://doi.org/10.1016/j.im.2017.03.004

46.

Kuhn

T. S.

(1996). The structure of scientific revolutions (3rd ed.). University of Chicago Press.

47.

Laucella

P. C.

Hardin

Bien-Aimé

Antunovic

(2016). Diversifying the sports department and covering women’s sports: A survey of sports editors. Journalism & Mass Communication Quarterly, 94(3), 772–792. https://doi.org/10.1177/1077699016654443

48.

Lee

R. S. H.

(1978). Credibility of newspaper and TV news. Journalism Quarterly, 55(2), 282–287. https://doi.org/10.1177/107769907805500209

49.

Lenskyj

H. J.

(2012). Reflections on communication and sport: On heteronormativity and gender identities: On heteronormativity and gender identities. Communication & Sport, 1(1-2), 138–150. https://doi.org/10.1177/2167479512467327

50.

Leustek

(2017). Demand characteristics. In Allen

(Ed.), The SAGE encyclopedia of communication research methods (pp. 371–373). Sage.

51.

Levant

R. F.

Smalley

K. B.

Aupont

House

A. T.

Richmond

Noronha

(2007). Initial validation of the male role norms inventory-revised (MRNI-R). The Journal of Men’s Studies, 15(1), 83–100. https://doi.org/10.3149/jms.1501.83

52.

Loken

Gelman

(2017). Measurement error and the replication crisis. Science, 355(6325), 584–585. https://doi.org/10.1126/science.aal3618

53.

Luisi

Adams

K. L.

Kilgore

(2020). Roughing the caster! Sexism and perceived female sports broadcasters’ credibility. Atlantic Journal of Communication, 29(4), 262–274. https://doi.org/10.1080/15456870.2020.1754822

54.

Mastro

Seate

A. A.

Blecha

Gallegos

(2012). The wide world of sports reporting: The influence of gender- and race-based expectations on evaluations of sports reporters. Journalism & Mass Communication Quarterly, 89(3), 458–474. https://doi.org/10.1177/1077699012447922

55.

Matthes

Kohring

(2003). Operationalisierung von Vertrauen in Journalismus. Medien & Kommunikationswissenschaft, 51(1), 5–23. https://doi.org/10.5771/1615-634x-2003-1-5

56.

Maxwell

S. E.

Lau

M. Y.

Howard

G. S.

(2015). Is psychology suffering from a replication crisis? What does “failure to replicate” really mean? American Psychologist, 70(6), 487–498. https://doi.org/10.1037/a0039400

57.

McCroskey

J. C.

(1966). Scales for the measurement of ethos. Speech Monographs, 33(1), 65–72. https://doi.org/10.1080/03637756609375482

58.

McCroskey

J. C.

Teven

J. J.

(1999). Goodwill: A reexamination of the construct and its measurement. Communication Monographs, 66(1), 90–103. https://doi.org/10.1080/03637759909376464

59.

McGuire

(2025). Celebrating progress: Women’s representation in NCAA sports, leadership roles. NCAA. Retrieved from. https://www.ncaa.org/news/2025/3/1/media-center-celebrating-progress-womens-representation-in-ncaa-sports-leadership-roles.aspx

60.

Metzger

M. J.

Flanagin

A. J.

Eyal

Lemus

D. R.

Mccann

R. M.

(2003). Credibility for the 21st century: Integrating perspectives on source, message, and media credibility in the contemporary media environment. Annals of the International Communication Association, 27(1), 293–335. https://doi.org/10.1080/23808985.2003.11679029

61.

Mudrick

Burton

Lin

C. A.

(2016). Pervasively offside: An examination of sexism, stereotypes, and sportscaster credibility: An examination of sexism, stereotypes, and sportscaster credibility. Communication & Sport, 5(6), 669–688. https://doi.org/10.1177/2167479516670642

62.

Mudrick

Lin

C. A.

(2017). Looking on from the sideline: Perceived role congruity of women sports journalists. Journal of Sports Media, 12(2), 79–101. https://doi.org/10.1353/jsm.2017.0011

63.

Newhagen

Nass

(1989). Differential criteria for evaluating credibility of newspapers and TV news. Journalism Quarterly, 66(2), 277–284. https://doi.org/10.1177/107769908906600202

64.

Nielsen . (2023). Women’s sports viewership on the rise. Retrieved from. https://www.nielsen.com/insights/2023/womens-sports-viewership-on-the-rise/

65.

Ohanian

(1990). Construction and validation of a scale to measure celebrity endorsers’ perceived expertise, trustworthiness, and attractiveness. Journal of Advertising, 19(3), 39–52. https://doi.org/10.1080/00913367.1990.10673191

66.

Olgemöller

Schupfner

(2025). Perceptions of female soccer announcers – Is the gender gap closing? European Journal for Sport and Society, 1–19. https://doi.org/10.1080/16138171.2025.2556503

67.

Ordman

V. L.

Zillmann

(1994). Women sports reporters: Have they caught up? Journal of Sport & Social Issues, 18(1), 66–75. https://doi.org/10.1177/019372394018001005

68.

Pentecost

Spence

(2009, November 27-28). Fanship: A measure of hedonic intensity and its mediating effect on consumer behaviour in sports. Paper presented at the annual meeting of the Sport Management Association of Australia and New Zealand. Retrieved from http://hdl.handle.net/10072/31954

69.

Poole

(2025). Women’s sports growth is a win for investors, brands and the planet. Forbes. Retrieved from. https://www.forbes.com/sites/clairepoolesp/2025/03/08/womens-sports-growth-is-a-win-for-investors-brands-and-the-planet/

70.

Pratt

A. N.

Tadlock

M. E.

Watts

L. L.

Wilson

T. C.

Denham

B. E.

(2018). Perceptions of credibility and likeability in broadcast commentators of women's sports. Journal of Sports Media, 13(1), 75–97. https://doi.org/10.1353/jsm.2018.0003

71.

Rattazzi

A. M. M.

Bobbio

Canova

(2007). A short version of the Right-Wing Authoritarianism (RWA) scale. Personality and Individual Differences, 43(5), 1223–1234. https://doi.org/10.1016/j.paid.2007.03.013

72.

Reysen

(2005). Construction of a new scale: The Reysen likeability scale. Social Behavior and Personality, 33(2), 201–208. https://doi.org/10.2224/sbp.2005.33.2.201

73.

Rogers

(2020). Boys in the Booth: The impact of announcer gender on audience demand. Journal of Sports Economics, 21(6), 610–627. https://doi.org/10.1177/1527002520921231

74.

Sartore-Baldwin

M. L.

Walker

(2011). The process of organizational identity: What are the roles of social responsiveness, organizational image, and identification? Journal of Sport Management, 25(5), 489–505. https://doi.org/10.1123/jsm.25.5.489

75.

Schoch

(2020). The gender of sports news: Horizontal segregation and marginalization of female journalists in the Swiss press. Communication & Sport, 10(4), 746–766. https://doi.org/10.1177/2167479520951162

76.

Shrivastava

Gregory

(2009). Exploring the antecedents of perceived diversity. Journal of Management & Organization, 15(4), 526–542. https://doi.org/10.5172/jmo.15.4.526

77.

Snyder

C. R.

Lassegard

Ford

C. E.

(1986). Distancing after group success and failure: Basking in reflected glory and cutting off reflected failure. Journal of Personality and Social Psychology, 51(2), 382–388. https://doi.org/10.1037/0022-3514.51.2.382

78.

Swim

J. K.

Aikin

K. J.

Hall

W. S.

Hunter

B. A.

(1995). Sexism and racism: Old-fashioned and modern prejudices. Journal of Personality and Social Psychology, 68(2), 199–214. https://doi.org/10.1037/0022-3514.68.2.199

79.

Tajfel

Turner

J. C.

(1979). An integrative theory of intergroup conflict. In Austin

W. G.

Worchel

(Eds.), The social psychology of intergroup relations (pp. 33–48). Brooks/Cole.

80.

Till

B. D.

Busler

(2000). The match- up hypothesis: Physical attractiveness, expertise, and the role of fit on brand attitude, purchase intent and brand beliefs. Journal of Advertising, 29(3), 1–13. https://doi.org/10.1080/00913367.2000.10673613

81.

Trail

G. T.

James

(2001). An analysis of the sport fan motivation scale. Journal of Sport Behavior, 24(1), 108–127.

82.

Wann

D. L.

Hamlet

M. A.

Wilson

T. M.

Hodges

J. A.

(1995). Basking in reflected glory, cutting off reflected failure, and cutting off future failure: The importance of group identification. Social Behavior and Personality, 23(4), 377–388. https://doi.org/10.2224/sbp.1995.23.4.377

83.

Yang

Atkin

D. J.

Mudrick

Qin

(2022). Auditory cuteness in sports podcasting: A new lookism? Communication & Sport, 11(5), 929–948. https://doi.org/10.1177/21674795221117783

Sports Reporting,Reporter Sex,and Perceived Credibility: After 30 Years,What Do We Know?

Abstract

Keywords

Literature Review

Normal Science

Method

Sample

Article Characteristics

Results

Bias Observed

Bias Against Female Reporters

Bias in Favor of Female Reporters

No Bias Observed

Contingent Bias Observed

Differing Stimuli

Medium Effects

Visual Depiction of Reporter

Sport Examined and Nature of Information

Differing Theories

Differing Samples

Differing Measures

Discussion

Accumulation of Knowledge during Times of Normal Science

Stimuli, Sport, and Replication

Theory and Replication

Measurement (and Inclusion) of Key Concepts

Conclusion

Footnotes

ORCID iD

Ethical Considerations

Funding

Declaration of Conflicting Interests

Data Availability Statement

Notes

References