Sage Journals: Discover world-class research

Abstract

The persistent underrepresentation of girls among top performers in STEM has long been a concern in talent development research. Recent studies, however, suggest that this gender gap may be narrowing. This study investigates whether gender and academic achievement shape how students perceive STEM classroom situations, using the DIAMONDS framework, a taxonomy of psychologically meaningful situational characteristics (e.g., Duty, Intellect). Data were analyzed from 1,024 German eighth-grade students. In contrast to historical trends, our sample showed equal representation of boys and girls among the top 10% of STEM achievers. While no interaction effects were found between gender and achievement, consistent main effects emerged: girls reported higher levels of Duty and Intellect, but also greater Negativity and Deception; boys perceived STEM lessons more positively overall. High-achieving students, regardless of gender, experienced STEM situations more positively than their lower-achieving peers. In our context, these findings suggest that gender disparities in top STEM performance may diminish, but that gender differences in classroom perception persist.

Keywords

high achievement, DIAMONDS, gender differences, situational perception, STEM

Introduction

A persistent and troubling pattern in talent development research is the underrepresentation of girls and women among the highest achievers in STEM fields (Charlesworth & Banaji, 2019; Wang & Degol, 2017). While gender gaps in average performance have narrowed considerably in recent decades, disparities at the extreme right tail of the ability distribution remain pronounced. Among students performing at the highest levels—often those identified as gifted or showing precocious talent—boys continue to outnumber girls in math-intensive domains (Bahar, 2021). This pattern was first documented in the seminal longitudinal work of Julian Stanley and Camilla Benbow (Benbow & Stanley, 1980, 1983), whose Study of Mathematically Precocious Youth (SMPY) revealed substantial sex differences in mathematical reasoning ability among intellectually talented adolescents. Based on SAT-Math scores at age 12, they found that boys were significantly overrepresented at the highest percentiles, with ratios as high as 13.5:1 in some early cohorts. Crucially, these differences emerged before students’ educational experiences diverged meaningfully, leading the authors to suggest biological explanations, including the controversial “greater male variability hypothesis.”

Subsequent research has consistently confirmed male overrepresentation among top performers in math-related STEM fields (Baye & Monseur, 2016; Makel et al., 2016; Nunez et al., 2023; Wai et al., 2010, 2018), particularly among those exhibiting a pronounced “mathematical ability tilt.” However, the gender gap has narrowed markedly over time—from 13.5:1 in the 1980s, to 3.8:1 in the 1990s, and now closer to 2:1 (Charlesworth & Banaji, 2019; see also Bahar, 2021)—casting doubt on purely biological explanations. Furthermore, findings are increasingly inconsistent across age groups, grade levels, and educational stages. For example, O’Dea et al. (2018, p. 1) concluded that

[…] the gender differences in both mean and variance of grades are smaller in STEM than non-STEM subjects, suggesting that greater variability is insufficient to explain male over-representation in STEM. Simulations of these differences suggest the top 10% of a class contains equal numbers of girls and boys in STEM, but more girls in non-STEM subjects. (See also Miller & Wai, 2015; Oakley et al., 2024)

It remains unclear whether male overrepresentation at the highest performance levels in math-related STEM fields is a persistent reality or a diminishing artifact of outdated structures.

What is increasingly evident, however, is that environmental and sociocultural factors play a far greater role than initially assumed (Ceci & Williams, 2011; Charlesworth & Banaji, 2019; Cheryan et al., 2017; Keller et al., 2022). From an equity perspective, this is deeply concerning: talent development systems must ensure that girls have the same opportunities as boys to reach their full potential. This implies a need for structural reforms in education and policy (Ceci & Williams, 2011). Without such changes, math-related STEM fields may follow the same trajectory as the creative domains, where meta-analyses show that women perform just as well—or better—on creativity tests (Abdulla Alabbasi et al., 2025), but still lag behind men in creative achievements (Hora et al., 2022). Such findings suggest that women possess at least as much creative potential as men but are either less able or less enabled to translate that potential into recognized accomplishments.

A critical setting for the educational and policy interventions called for by Ceci and Williams (2011) is the classroom. While the causes of girls’ underrepresentation among top STEM performers are undoubtedly multifaceted, ranging from early socialization and stereotype threat to opportunity structures and personal interests, one relatively underexplored factor is how students perceive the learning situations they encounter in school, particularly in STEM contexts (Ketscher et al., 2025). Situational perception, that is, the subjective appraisal of a learning environment as demanding, threatening, intellectually challenging, or pleasing, has been made measurable through various taxonomies (e.g., Parrigon et al., 2017; Rauthmann et al., 2014) and correlates with behavior in many ways, including students’ commitment and motivation in STEM contexts. Since both personal factors and contextual cues shape such perceptions, they may differ systematically across gender (e.g., Cheryan et al., 2009, 2011, 2015) and performance level. For instance, Leiner et al. (2018) showed that test situations are perceived differently by boys and girls, which may influence affective and cognitive responses. To investigate these differences in the classroom context, the present study draws on the DIAMONDS framework (Rauthmann et al., 2014) to explore how high-achieving boys and girls perceive STEM lessons and whether these perceptions might help explain persistent gender disparities in advanced STEM achievement. Compared to other situation taxonomies, such as CAPTION (Parrigon et al., 2017) or more domain-specific classroom climate models, the DIAMONDS framework offers a broader and more psychologically grounded set of dimensions that encompass both affective and cognitive aspects of situational appraisal. This makes it particularly suited for analyzing how students experience STEM instruction, where emotional engagement, perceived demands, and social dynamics often interact in complex ways.

Prior studies have successfully applied the DIAMONDS framework in educational and developmental contexts (Ketscher et al., 2025; Konaszewski et al., 2025; Zager Kocjan & Avsec, 2017), including secondary school classrooms, suggesting its suitability for capturing meaningful situational differences across student groups. These studies document variation in dimensions such as Duty and Intellect in STEM instruction, and show that adolescents’ situational perceptions are systematically shaped by personal characteristics, including gender and achievement.

Theoretical Framework

Situational Perception and Proximal Influences in STEM Education

Traditional models in educational psychology have often emphasized distal factors—such as socioeconomic status, cultural norms, and long-term aspirations—in explaining student outcomes. However, recent research underscores the significance of proximal, immediate experiences in shaping students’ engagement and achievement, particularly in STEM education. Situational perception, the subjective interpretation of immediate learning environments, influences students’ motivation, behavior, and academic self-concept (Rauthmann et al., 2014). While situational perception includes affective interpretations of a setting (e.g., perceiving it as pleasant or stressful), it is conceptually distinct from the emotional responses that may follow. In this framework, perception refers to how students cognitively and affectively appraise the features of a situation, not necessarily how they feel during it.

In STEM classrooms, students’ perceptions of tasks as intellectually stimulating, threatening, or socially engaging can directly impact their participation and persistence. These perceptions are not uniform; they vary based on individual differences and contextual factors. For instance, stereotype threat can lead to heightened anxiety and reduced performance among female students in math-intensive settings (Maries et al., 2018). Similarly, the Big-Fish–Little-Pond Effect suggests that students’ academic self-concept is influenced by the achievement levels of their peers, with high-achieving students in high-performing environments potentially experiencing diminished self-confidence (Marsh & Parker, 1984).

Understanding these proximal influences is essential for developing interventions that foster equitable and supportive learning environments in STEM education.

The DIAMONDS Framework

To systematically assess situational perceptions, Rauthmann et al. (2014) introduced the DIAMONDS framework, which categorizes situations based on eight dimensions:

Duty: Situations involving obligations or responsibilities.

Intellect: Contexts requiring cognitive and intellectual demands.

Adversity: Scenarios characterized by conflict or threats.

Mating: Situations related to romantic or sexual relationships or interests.

pOsitivity: Experiences perceived as enjoyable or pleasant.

Negativity: Contexts associated with negative emotions or outcomes.

Deception: Situations involving mistrust or dishonesty.

Sociality: Environments emphasizing social interaction and relationships.

This taxonomy allows researchers to quantify and compare individuals’ perceptions of various situations, facilitating a deeper understanding of how these perceptions influence behavior and outcomes. The DIAMONDS framework has since been used to analyze various educational contexts (Abrahams et al., 2021, 2025; Konaszewski et al., 2025; Witte et al., 2024), revealing the interplay between distinct situational characteristics and their respective outcomes.

Gender, Achievement, and Variations in Situational Perception

Growing evidence suggests that girls’ and boys’ situational experiences in STEM learning contexts are shaped by emotional engagement, perceived classroom climate, and sense of belonging (Cheryan et al., 2017; Fairhurst et al., 2023; Good et al., 2003). For instance, girls are more likely to interpret classroom settings as less supportive or more evaluative, particularly in male-typed domains—perceptions that can contribute to differential motivational outcomes and identity formation (Meece et al., 2006; Wang & Degol, 2017). In addition, gender and achievement levels significantly influence how students perceive and respond to classroom situations. Female students often report lower self-efficacy in STEM subjects, which has been linked to persistent societal stereotypes and underrepresentation in these fields (Chan, 2022; Sebastián-Tirado et al., 2023).

These perceptions can lead to decreased participation and interest in STEM disciplines. Moreover, high-achieving students may experience classroom environments differently than their peers. For example, they might perceive a lack of intellectual challenge or insufficient support, leading to disengagement. Conversely, lower-achieving students might find specific tasks overwhelming, perceiving them as high in Adversity or Negativity (Bouton et al., 2025; Li & Xue, 2023).

These differential perceptions may offer a promising explanatory lens for longstanding gender disparities in advanced STEM achievement. As outlined in the introduction, the historical overrepresentation of males among top-performing STEM students is well-documented, but recent trends show that this gap is narrowing. One plausible contributing factor may be shifts in how girls and boys perceive and engage with STEM learning environments. If female students increasingly experience STEM classrooms as more intellectually engaging, socially supportive, or personally meaningful—dimensions captured by the DIAMONDS framework—this could help explain the growing presence of girls among high achievers. Conversely, if differences in situational perception persist or emerge in new forms, they may continue to shape achievement patterns in subtle but significant ways. By applying the DIAMONDS model to investigate these dynamics, the present study aims to shed light on how gender and achievement intersect in students’ perceptions of STEM lessons—and how such perceptions may contribute to the evolving distribution of talent in STEM education.

Aims of the Study

As outlined in the preceding sections, situational perception is increasingly recognized as a meaningful proximal factor in shaping student engagement and achievement. However, little is known about how high-achieving boys and girls, in particular, interpret STEM learning environments. Findings on how situational perception varies with personal factors, particularly gender and academic achievement, can inform expectations about relevant differences among students in STEM.

To investigate this, we apply the DIAMONDS framework (Rauthmann et al., 2014), a taxonomy of psychologically relevant situation characteristics, to systematically assess how students interpret the classroom environment. Prior findings suggest that Duty and Intellect are particularly pronounced in the context of education (Ketscher et al., 2025; Konaszewski et al., 2025; Zager Kocjan & Avsec, 2017) and may vary by both gender and achievement. Accordingly, we formulate the following five research questions:

RQ1: How do students perceive STEM lessons with regard to the DIAMONDS dimensions?

RQ1 is descriptive in nature and seeks to establish a baseline profile of situational perception in STEM classrooms, which serves as the foundation for the subsequent questions. In line with prior research, we expect students to report elevated levels of Duty and Intellect, reflecting the structured and cognitively demanding nature of STEM instruction (Ketscher et al., 2025; Konaszewski et al., 2025; Zager Kocjan & Avsec, 2017).

RQ2: Does academic achievement affect the situational perception of STEM lessons?

This research question addresses whether high-, average-, and low-achieving students differ in how they perceive the same classroom environments. Drawing on prior research, we hypothesize that achievement level is associated with distinct situational profiles. For instance, high-achieving students may perceive greater intellectual stimulation (Intellect) and more classroom support (pOsitivity), while lower-achieving students may perceive greater Adversity or Negativity. Clarifying this question also lays the groundwork for future studies in the domain.

RQ3: Does gender affect the situational perception of STEM lessons?

This research question examines gender-based differences in how students interpret STEM classrooms and lays the groundwork for subsequent gender-focused research. Based on previous findings of gendered differences in academic self-concept and STEM engagement, we expect male students to report more positive perceptions, such as higher pOsitivity and lower Adversity, Negativity, or Deception.

RQ4: Are girls and boys equally represented among high-achieving STEM students?

Previous research has documented a persistent overrepresentation of boys among top STEM achievers (e.g., Benbow & Stanley, 1980, 1983; Wai et al., 2010, 2018). Such overrepresentation may arise if high-achieving girls experience greater pressure to prove their competence in male-typed domains (cf. stereotype threat; Steele, 1997) or if their situational perception is more sensitive to subtle cues of exclusion or lack of belonging (Cheryan et al., 2009; Good et al., 2003). Conversely, high-achieving boys may benefit from stereotype lift (Walton & Cohen, 2003), reinforcing confidence and positive perceptions in STEM contexts. However, more recent findings suggest that this gender gap may be narrowing (Bahar, 2021). RQ4 investigates whether this pattern is still evident in the current sample by examining whether gender distribution differs significantly across academic achievement levels.

RQ5: Do gender and achievement interact in the situational perception of STEM lessons?

This research question builds on the preceding ones. It investigates whether gender differences in situational perception vary depending on students’ academic achievement level. Based on previous findings about the persistent underrepresentation of girls among the highest-performing STEM students, we hypothesize that gender differences may be particularly pronounced among high achievers. For example, high-achieving girls may perceive STEM lessons as less supportive, more demanding, or less socially inclusive compared to high-achieving boys, which could contribute to their underrepresentation at the top performance levels.

Method

Procedure

The study was conducted via an online survey distributed to secondary schools across Germany with a strong emphasis on STEM education. Such schools offer their students the opportunity to engage in STEM activities that go beyond the regular curriculum, such as institutionalized visits to natural science museums and student science laboratories. These activities are supported by active collaboration with external STEM initiatives, which provide the required infrastructure. Schools were included in the sample if they met either of two criteria: membership in a nationwide STEM association or explicit mention of STEM initiatives.

After obtaining the necessary permissions from relevant educational authorities in the federal states, participating schools were asked to administer the survey in their computer labs to ensure standardized testing conditions. The survey included informed consent procedures, and data were only retained when explicit parental or legal guardian consent could be verified. After a thorough data cleaning process, which included the removal of mostly incomplete or non-consenting cases, the final sample comprised 1,024 students from 26 schools.

Sample

Participants were 1,024 eighth-grade students enrolled in the lower secondary school track of the German secondary school system. The average age was 13.72 years (SD = 0.44). The sample included 442 boys (M = 13.75, SD = 0.42) and 582 girls (M = 13.70, SD = 0.45).

Measures

Demographics

Students self-reported their gender, their age, and their socio-economic status.

Situational Perception

Students’ perceptions of their STEM lessons were measured using an adapted version of the S8-1 DIAMONDS scale developed by Rauthmann and Sherman (2018). The original eight items were tailored to reflect the classroom context of STEM education, e.g., by adding the phrase “in STEM lesson.” For example, the item assessing Negativity was adapted to: “I have negative feelings (e.g., stress, anxiety, guilt) in STEM lessons.” Items were rated on a 7-point Likert scale ranging from 1 (not at all) to 7 (totally). The use and adaptation of the scales were guided by the theoretical and empirical work of Rauthmann et al. (2014) and Rauthmann and Sherman (2016a, 2016b). Recent validation by Ketscher et al. (2025) confirmed the scale's applicability for STEM settings, demonstrating convergent, criterion-related, and explanatory validity.

Academic Achievement

Students self-reported their most recent grades in six STEM subjects commonly taught in German secondary schools: mathematics, computer science, biology, physics, chemistry, and technology. Grades range from 1 (very good) to 6 (insufficient). Because participation in individual STEM classes varies among the German federal states, an average STEM grade was calculated for each student. Students were categorized into three performance groups based on their average STEM grade: high-performing students (top 10% of the sample), average-performing students (middle 80%), and below-average-performing students (bottom 10%). This approach allows students with above-average and below-average performance to be compared systematically. Where students shared the average grade at a cut-off value (grades: 1.33 and 3.33), a random number was used to assign them to academic achievement groups.
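
The grouping rule described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' procedure: the function name `assign_groups`, the fixed seed, and the example grades are hypothetical. The sketch ranks students by average STEM grade (lower is better in the German system), breaks ties at random, and labels the best 10% as high achievers and the worst 10% as low achievers.

```python
import random

def assign_groups(avg_grades, share=0.10, seed=0):
    """Label each student 'high', 'average', or 'low' by average STEM grade.

    German grades run from 1 (best) to 6 (worst), so the lowest averages
    form the high-achieving group. Ties at a cut-off are broken randomly.
    """
    rng = random.Random(seed)  # fixed seed only to keep the sketch reproducible
    n = len(avg_grades)
    k = round(n * share)       # size of the top group and of the bottom group
    # Sort student indices by grade, using a random number as tie-breaker.
    order = sorted(range(n), key=lambda i: (avg_grades[i], rng.random()))
    groups = ["average"] * n
    for i in order[:k]:
        groups[i] = "high"
    for i in order[n - k:]:
        groups[i] = "low"
    return groups

# Hypothetical averages for 20 students (not data from the study).
grades = [1.0, 1.33, 1.33, 1.33, 1.5, 2.0, 2.0, 2.17, 2.33, 2.5,
          2.5, 2.67, 2.83, 3.0, 3.0, 3.17, 3.33, 3.33, 3.33, 4.0]
labels = assign_groups(grades)
print(labels.count("high"), labels.count("average"), labels.count("low"))
```

With 20 hypothetical students, two land in each extreme group. Exactly one of the three students tied at 1.33 joins the high group, and which one depends on the random draw.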

Data Analysis

Data were analyzed using IBM SPSS Statistics (Version 29.0.2.0) (IBM Corp, 2023) and R Version 4.3.0 (R Core Team, 2023). The five research questions guided the analytical approach, which had two objectives. First, RQ1, RQ2, and RQ3 establish a foundation that can be compared with existing research findings. Second, RQ4 and RQ5 provide additional insights regarding high-achieving students and the long-term promotion of talent.

RQ1 (descriptive): Mean scores and standard deviations of the DIAMONDS dimensions were reported, and pairwise t-tests were used to compare dimensions.

RQ2 (achievement differences): One-way analyses of variance (ANOVAs) with post hoc comparisons were conducted to test for differences across achievement groups (low-achiever, average-achiever, high-achiever) and to gain a preliminary understanding of how DIAMONDS perceptions relate to achievement level. No corrections were applied to the ANOVA results, so as not to reduce potential effects. For the pairwise comparisons, which give insight into distinct situational profiles, Games-Howell post hoc tests were used.

RQ3 (gender differences): Due to unequal group sizes and variance heterogeneity, Welch's t-tests were used to compare male and female students. The aim was to gain new insights and compare results with previous research.

RQ4 (equal representation among high-achievers): A chi-square test was used to assess whether boys and girls were equally represented in the high-achieving group. This analysis served two purposes: to allow comparison with previous research and to inform long-term talent development.

RQ5 (gender × achievement interaction): Two-way ANOVAs were conducted to examine interaction effects between gender and achievement level. This analysis builds on the preceding research questions and is intended to yield further insights for future research.

Although parametric tests were employed, data did not meet strict normality assumptions according to the Kolmogorov–Smirnov and Shapiro–Wilk tests (e.g., Adversity, Mating). However, the decision to proceed with parametric methods was supported by previous research indicating the robustness of these methods under conditions of non-normality (e.g., de Winter & Dodou, 2010; Kubinger et al., 2009; Schmider et al., 2010).

Figure 1.

Means and standard deviations of the DIAMONDS dimensions. Note. Error bars represent ±1 standard deviation.

Results

Descriptive statistics for all eight DIAMONDS dimensions are presented in Table 1 (see also Figure 1). Students reported the highest mean scores for Intellect (M = 4.89, SD = 1.49), followed by Duty (M = 4.77, SD = 1.53) and Sociality (M = 4.49, SD = 1.61). The lowest ratings were observed for Adversity (M = 2.76, SD = 1.71) and Mating (M = 2.64, SD = 1.88). Notably, five of the eight dimensions had mean values above 3.5, and four of them (Intellect, Duty, Sociality, and pOsitivity) also exceeded the scale midpoint of 4.

Table 1.

Means and Standard Deviations of DIAMONDS Dimensions by Gender.

Dimension	Total mean (SD)	Male mean (SD)	Female mean (SD)	t	df	p	d
Duty	4.77 (1.53)	4.53 (1.56)	4.94 (1.49)	−4.23	901.35	<.001	−.27
Intellect	4.89 (1.49)	4.75 (1.53)	4.99 (1.45)	−2.52	899.24	.012	−.16
Adversity^a	2.76 (1.71)	2.68 (1.74)	2.83 (1.69)	−1.39	911.80	.166	−.09
Mating^a	2.64 (1.88)	2.57 (1.81)	2.68 (1.92)	−0.97	952.55	.332	−.06
pOsitivity	4.36 (1.61)	4.60 (1.57)	4.18 (1.62)	4.14	938.64	<.001	.26
Negativity	3.20 (1.76)	2.87 (1.69)	3.45 (1.77)	−5.26	946.19	<.001	−.33
Deception	3.60 (1.74)	3.31 (1.71)	3.81 (1.74)	−4.51	934.55	<.001	−.29
Sociality	4.49 (1.61)	4.50 (1.63)	4.48 (1.60)	0.19	913.65	.851	.01

Note. Bold numbers indicate significant gender differences (Welch t-test; two-tailed). Mean values and standard deviations (SD) are reported. t = t-statistic, df = degrees of freedom, p = p-value, d = Cohen's d.

^a These two DIAMONDS dimensions do not differ significantly from each other (pairwise t-test; two-tailed).

Standard deviations were consistently high across all dimensions (most exceeding 1.5), indicating substantial variation in students’ situational perceptions. Pairwise t-tests revealed significant differences between most dimensions, except between Adversity and Mating.

To examine the effect of academic achievement on situational perception (RQ2), a series of one-way ANOVAs was conducted (see Table 2). Significant differences emerged for three dimensions: pOsitivity (η² = .043), Negativity (η² = .031), and Deception (η² = .025). All three effects were small. Post hoc comparisons (Games-Howell tests) showed that high-achieving students reported significantly higher pOsitivity and lower Negativity and Deception than both average- and low-achieving students (see Appendix Table A1). These findings indicate the presence of a distinct achievement-based situational perception profile in STEM education.

Table 2.

Means and Standard Deviations of DIAMONDS Dimensions by Academic Achievement Group and ANOVA Results for Group Differences in DIAMONDS Dimensions.

Dimension	High-achiever mean (SD)	Average-achiever mean (SD)	Low-achiever mean (SD)	F	df	p	η²
Duty	4.83 (1.59)	4.78 (1.52)	4.54 (1.53)	1.19	2, 1003	.304	.002
Intellect	5.00 (1.46)	4.87 (1.50)	4.98 (1.41)	0.57	2, 1003	.563	.001
Adversity	2.78 (1.78)	2.74 (1.70)	2.88 (1.75)	0.31	2, 1003	.737	.001
Mating	2.96 (2.11)	2.58 (1.85)	2.74 (1.85)	2.00	2, 1002	.137	.004
pOsitivity	5.24 (1.42)^ab	4.32 (1.60)^ac	3.79 (1.51)^bc	22.56	2, 1002	<.001	.043
Negativity	2.34 (1.43)^de	3.26 (1.75)^d	3.62 (1.88)^e	15.95	2, 1002	<.001	.031
Deception	2.89 (1.59)^fg	3.62 (1.75)^fh	4.09 (1.64)^gh	12.75	2, 1002	<.001	.025
Sociality	4.62 (1.64)	4.47 (1.61)	4.47 (1.60)	0.37	2, 1002	.691	.001

Note. Identical superscripts (a–h) indicate that academic achievement groups differ significantly (Games–Howell test). Mean values and standard deviations (SD) are reported. F = F-statistic, df = degrees of freedom, p = p-value, η² = eta-squared.
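
For a one-way ANOVA, eta-squared can be recovered from the F-statistic and its degrees of freedom, since η² = SS_between / SS_total = (F × df_between) / (F × df_between + df_within). The short sketch below applies this identity to the values reported in Table 2 as a plausibility check. The function name `eta_squared` is illustrative and is not taken from the study's analysis scripts.

```python
def eta_squared(f_value, df_between, df_within):
    """Recover eta-squared from a one-way ANOVA F-statistic and its df."""
    effect = f_value * df_between
    return effect / (effect + df_within)

# F-values and degrees of freedom as reported in Table 2.
reported = {
    "pOsitivity": (22.56, 2, 1002),
    "Negativity": (15.95, 2, 1002),
    "Deception": (12.75, 2, 1002),
}
for name, (f, df1, df2) in reported.items():
    print(f"{name}: eta^2 = {eta_squared(f, df1, df2):.3f}")
# pOsitivity: eta^2 = 0.043
# Negativity: eta^2 = 0.031
# Deception: eta^2 = 0.025
```

The recovered values agree with the η² column of Table 2 to three decimals.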

RQ3 was examined using Welch's t-tests (see Table 1). These tests revealed small but significant gender differences in five of the eight DIAMONDS dimensions. Female students reported higher levels of Duty (d = −.27), Intellect (d = −.16), Negativity (d = −.33), and Deception (d = −.29), while male students reported higher pOsitivity (d = .26). No significant differences were found for the social dimensions of Adversity (d = −.09), Mating (d = −.06), or Sociality (d = .01). These results mirror earlier findings on gender-based differences in classroom experience and emotional responses in STEM contexts (Ketscher et al., 2025).
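
To make the reported statistics concrete, the sketch below computes Welch's t, the Welch–Satterthwaite degrees of freedom, and Cohen's d from the summary statistics for Duty in Table 1. Two assumptions are made here that the source does not confirm: the group sizes are taken to be the full-sample counts (442 boys, 582 girls), although item-level n may be smaller, and d is computed with the pooled standard deviation. The resulting t and df therefore only approximate the published values, while d reproduces the reported −.27.

```python
import math

def welch_t(m1, s1, n1, m2, s2, n2):
    """Welch's t-statistic and Welch-Satterthwaite degrees of freedom."""
    v1, v2 = s1**2 / n1, s2**2 / n2
    t = (m1 - m2) / math.sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))
    return t, df

def cohens_d(m1, s1, n1, m2, s2, n2):
    """Cohen's d using the pooled standard deviation (an assumption here)."""
    pooled = math.sqrt(((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2))
    return (m1 - m2) / pooled

# Duty, from Table 1: male 4.53 (1.56), female 4.94 (1.49).
# ASSUMPTION: n = 442 and n = 582 are the full-sample counts; the per-item n
# is not reported, so t and df will differ slightly from the table.
t, df = welch_t(4.53, 1.56, 442, 4.94, 1.49, 582)
d = cohens_d(4.53, 1.56, 442, 4.94, 1.49, 582)
print(f"t = {t:.2f}, df = {df:.1f}, d = {d:.2f}")
```

The negative sign follows the table's convention of subtracting the female mean from the male mean, so negative values indicate higher scores for girls.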

To examine RQ4, a chi-square test was conducted to determine whether boys and girls were equally represented across academic achievement levels in STEM (see Table 3). The test revealed no significant association between gender and achievement group membership, χ²(2) = 0.811, p = .667, Cramér's V = .028. In the high-achieving group, 48 students (47.1%) were male and 54 (52.9%) were female. Similar gender distributions were observed in the average-achieving group (boys: 42.9%; girls: 57.1%) and the low-achieving group (boys: 41.2%; girls: 58.8%). These findings indicate that girls in our sample were not underrepresented among the top STEM performers, in contrast to historical trends (see Table 4).

Table 3.

Pearson Chi-Square Testing.

		Male	Female	Total	χ²	df	p	Cramér's V
Low-Achiever	Observed	42	60	102	.811	2	.667	.028
	Expected	44.0	58.0	102.0
	% within Academic Achievement	41.2%	58.8%	100.0%
	% within Gender	9.5%	10.4%	10.0%
	% of Total	4.1%	5.9%	10.0%
Average-Achiever	Observed	350	465	815
	Expected	351.9	463.1	815.0
	% within Academic Achievement	42.9%	57.1%	100.0%
	% within Gender	79.5%	80.3%	80.0%
	% of Total	34.3%	45.6%	80.0%
High-Achiever	Observed	48	54	102
	Expected	44.0	58.0	102.0
	% within Academic Achievement	47.1%	52.9%	100.0%
	% within Gender	10.9%	9.3%	10.0%
	% of Total	4.7%	5.3%	10.0%
Total	Observed	440	579	1019
	Expected	440.0	579.0	1019.0
	% within Academic Achievement	43.2%	56.8%	100.0%
	% within Gender	100.0%	100.0%	100.0%
	% of Total	43.2%	56.8%	100.0%

Note. Asymptotic significance (two-tailed). χ² = chi-square test, df = degrees of freedom, p = p-value.
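
The test statistic in Table 3 can be reproduced directly from the observed counts. The following sketch uses only the Python standard library; it is an independent recomputation for illustration, not the SPSS or R routine used in the study. It relies on the fact that, for 2 degrees of freedom, the chi-square survival function has the closed form exp(−x/2).

```python
import math

# Observed counts from Table 3: rows = low, average, high; columns = male, female.
observed = [[42, 60], [350, 465], [48, 54]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Pearson chi-square: sum of (observed - expected)^2 / expected over all cells.
chi2 = 0.0
for i, row in enumerate(observed):
    for j, obs in enumerate(row):
        expected = row_totals[i] * col_totals[j] / n
        chi2 += (obs - expected) ** 2 / expected

n_rows, n_cols = len(observed), len(observed[0])
df = (n_rows - 1) * (n_cols - 1)
# Cramér's V = sqrt(chi2 / (n * (min(rows, cols) - 1))).
cramers_v = math.sqrt(chi2 / (n * (min(n_rows, n_cols) - 1)))
# Valid only for df = 2: the chi-square survival function is exp(-x / 2).
p_value = math.exp(-chi2 / 2)

print(f"chi2({df}) = {chi2:.3f}, p = {p_value:.3f}, V = {cramers_v:.3f}")
# chi2(2) = 0.811, p = 0.667, V = 0.028
```

The output matches the values reported in Table 3.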

Table 4.

Descriptive Data: Gender, Academic Achievement, and Their Interaction on DIAMONDS Dimensions.

Dimension	Gender	Academic achievement	Mean (SD)
Duty	Male	Low-Achiever	4.45 (1.69)
		Average-Achiever	4.53 (1.54)
		High-Achiever	4.63 (1.63)
	Female	Low-Achiever	4.61 (1.43)
		Average-Achiever	4.97 (1.48)
		High-Achiever	5.02 (1.54)
Intellect	Male	Low-Achiever	4.74 (1.61)
		Average-Achiever	4.71 (1.52)
		High-Achiever	5.08 (1.46)
	Female	Low-Achiever	5.15 (1.23)
		Average-Achiever	4.98 (1.47)
		High-Achiever	4.93 (1.47)
Adversity	Male	Low-Achiever	2.74 (1.95)
		Average-Achiever	2.67 (1.70)
		High-Achiever	2.67 (1.85)
	Female	Low-Achiever	2.98 (1.60)
		Average-Achiever	2.80 (1.70)
		High-Achiever	2.89 (1.73)
Mating	Male	Low-Achiever	2.33 (1.57)
		Average-Achiever	2.57 (1.83)
		High-Achiever	2.79 (1.89)
	Female	Low-Achiever	3.03 (1.98)
		Average-Achiever	2.60 (1.86)
		High-Achiever	3.11 (2.29)
pOsitivity	Male	Low-Achiever	3.88 (1.52)
		Average-Achiever	4.59 (1.54)
		High-Achiever	5.31 (1.57)
	Female	Low-Achiever	3.73 (1.51)
		Average-Achiever	4.12 (1.62)
		High-Achiever	5.17 (1.29)
Negativity	Male	Low-Achiever	3.21 (1.89)
		Average-Achiever	2.94 (1.68)
		High-Achiever	2.08 (1.37)
	Female	Low-Achiever	3.92 (1.82)
		Average-Achiever	3.50 (1.77)
		High-Achiever	2.57 (1.46)
Deception	Male	Low-Achiever	3.81 (1.78)
		Average-Achiever	3.36 (1.70)
		High-Achiever	2.54 (1.44)
	Female	Low-Achiever	4.29 (1.52)
		Average-Achiever	3.81 (1.76)
		High-Achiever	3.20 (1.65)
Sociality	Male	Low-Achiever	4.48 (1.64)
		Average-Achiever	4.47 (1.63)
		High-Achiever	4.73 (1.66)
	Female	Low-Achiever	4.46 (1.59)
		Average-Achiever	4.48 (1.60)
		High-Achiever	4.52 (1.63)

Note. Mean values and standard deviations (SD) are reported.

In order to examine RQ5, two-way ANOVAs were conducted to identify potential interaction effects between gender and academic achievement (see Table 5). No significant gender × achievement interactions were found for any DIAMONDS dimension. While gender and achievement level showed main effects in several dimensions, their interaction did not reach statistical significance (ps > .05). Given the moderate sample size within each subgroup (e.g., high-achieving boys vs. girls), it remains possible that subtle effects went undetected. Future studies with larger or stratified samples may clarify whether interaction effects exist under specific conditions.

Table 5.

Two-Way ANOVA: Gender, Academic Achievement, and Their Interaction on DIAMONDS Dimensions.

Dimension	Groups	F	p	η²_p
Duty	Gender	5.00	.026	.005
	Academic Achievement	1.08	.341	.002
	Gender × Academic Achievement	0.38	.686	.001
Intellect	Gender	1.49	.223	.001
	Academic Achievement	0.66	.519	.001
	Gender × Academic Achievement	1.13	.325	.002
Adversity	Gender	1.41	.235	.001
	Academic Achievement	0.26	.773	.001
	Gender × Academic Achievement	0.08	.926	.000
Mating	Gender	3.69	.055	.004
	Academic Achievement	1.79	.167	.004
	Gender × Academic Achievement	1.55	.213	.003
pOsitivity	Gender	2.83	.093	.003
	Academic Achievement	22.05	<.001	.042
	Gender × Academic Achievement	0.84	.430	.002
Negativity	Gender	12.19	<.001	.012
	Academic Achievement	15.14	<.001	.029
	Gender × Academic Achievement	0.10	.902	.000
Deception	Gender	10.18	.001	.010
	Academic Achievement	12.35	<.001	.024
	Gender × Academic Achievement	0.17	.841	.000
Sociality	Gender	0.21	.644	.000
	Academic Achievement	0.40	.670	.001
	Gender × Academic Achievement	0.21	.810	.000

Note. F = F-statistic, df = degrees of freedom, p = p-value, η²_p = partial eta-squared.
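
The partial eta-squared values in Table 5 can be related to the F statistics through a standard identity, η²_p = (F × df_effect) / (F × df_effect + df_error). The short Python sketch below illustrates this for a few rows. Table 5 does not list the degrees of freedom, so the values used here are assumptions: df_effect = 1 for gender, df_effect = 2 for the three achievement groups, and df_error = 1018 (1,024 students minus six gender × achievement cells, ignoring any missing responses). The function name is illustrative only.

```python
def partial_eta_squared(f_value: float, df_effect: int, df_error: int) -> float:
    """Recover partial eta-squared from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# ASSUMPTION: the error df is not reported in Table 5; 1018 = 1024 students - 6 cells.
DF_ERROR = 1018

# (dimension, effect, F from Table 5, assumed df_effect)
rows = [
    ("Duty", "Gender", 5.00, 1),
    ("pOsitivity", "Academic Achievement", 22.05, 2),
    ("Negativity", "Gender", 12.19, 1),
    ("Negativity", "Academic Achievement", 15.14, 2),
    ("Deception", "Academic Achievement", 12.35, 2),
]

recovered = {
    (dimension, effect): partial_eta_squared(f_value, df_effect, DF_ERROR)
    for dimension, effect, f_value, df_effect in rows
}

for (dimension, effect), eta in recovered.items():
    print(f"{dimension:<11} {effect:<21} eta2_p = {eta:.3f}")
```

Under these assumed degrees of freedom, the recovered values agree with the η²_p column of Table 5 to three decimals (e.g., .042 for the achievement effect on pOsitivity). A different error df would shift them slightly.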

Discussion

This study investigated how students’ perceptions of STEM classroom situations, measured through the DIAMONDS framework, relate to gender and academic achievement. The overarching motivation was to explore potential psychological and situational mechanisms contributing to the persistent underrepresentation of girls among top STEM achievers, as reported in earlier research (e.g., Benbow & Stanley, 1980, 1983; Wai et al., 2010, 2018). Notably, the present study challenges this narrative: In our sample of German secondary school students, girls were not underrepresented among the top 10% of STEM achievers (54 of 102 students; 52.9%). The distribution was nearly equal. This aligns with recent findings by O’Dea et al. (2018), who showed that while boys and girls are equally represented in the top decile of performance in STEM subjects, girls tend to outperform boys in non-STEM subjects. These findings suggest that gender disparities at the top of the STEM achievement distribution may be diminishing, at least within younger cohorts, which raises important questions about how current educational systems support or fail to support high-potential students across domains.

Nevertheless, consistent gender and achievement main effects were observed in how students perceived STEM classroom situations. Boys reported significantly higher levels of pOsitivity (d = .26) and lower levels of Negativity (d = −.33) and Deception (d = −.29) than girls. High-achieving students, regardless of gender, perceived lessons more positively and less negatively than their average- or low-achieving peers (see Appendix Table A1). These patterns reflect differentiated emotional and cognitive classroom experiences and align with prior research indicating that classroom perceptions influence students’ sense of belonging and motivation in STEM (e.g., Cheryan et al., 2009, 2011, 2015). The findings raise the question of how far students’ personal characteristics and specific situational cues in STEM lessons shape whether lessons are perceived as positive or negative, and thereby influence motivational and engagement processes. However, the present effect sizes can largely be classified as small (|d| < .50; Cohen, 2009), and the large standard deviations of individual DIAMONDS dimensions should also be kept in mind.
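
To make the effect-size classification concrete, the following Python sketch labels the gender effects reported above using Cohen's conventional benchmarks (.20, .50, .80) and, as a separate illustration, computes a standardized mean difference from two cell means in the table of means and standard deviations. Several elements are assumptions rather than part of the study: the function names are hypothetical, the "negligible" label for |d| < .20 is a common convention rather than Cohen's own term, and the d computation assumes equal cell sizes because the table does not report them.

```python
import math


def cohens_d_equal_n(mean_1: float, sd_1: float, mean_2: float, sd_2: float) -> float:
    """Cohen's d using the root mean square of two SDs (exact pooling only for equal n)."""
    pooled_sd = math.sqrt((sd_1 ** 2 + sd_2 ** 2) / 2)
    return (mean_1 - mean_2) / pooled_sd


def cohen_label(d: float) -> str:
    """Classify |d| with Cohen's conventional benchmarks (.20, .50, .80)."""
    size = abs(d)
    if size < 0.20:
        return "negligible"
    if size < 0.50:
        return "small"
    if size < 0.80:
        return "medium"
    return "large"


# Gender effects reported in the Discussion.
reported = {"pOsitivity": 0.26, "Negativity": -0.33, "Deception": -0.29}
labels = {dimension: cohen_label(d) for dimension, d in reported.items()}
print(labels)

# Illustration from the descriptive table: pOsitivity of high-achieving boys
# (M = 5.31, SD = 1.57) vs. high-achieving girls (M = 5.17, SD = 1.29).
# ASSUMPTION: equal cell sizes, which the table does not report.
d_high = cohens_d_equal_n(5.31, 1.57, 5.17, 1.29)
print(round(d_high, 2), cohen_label(d_high))
```

All three reported gender effects fall into the "small" band under these benchmarks. The within-cell comparison is only a rough approximation, since unequal group sizes would change the pooled standard deviation.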

The DIAMONDS framework appeared to be a promising initial approach for capturing the situational experiences of students in lower secondary school tracks in STEM classrooms. As anticipated, the dimensions Duty and Intellect emerged as most salient, in line with previous observations (Ketscher et al., 2025; Konaszewski et al., 2025; Rauthmann et al., 2014; Zager Kocjan & Avsec, 2017). These dimensions may mirror the disciplinary expectations of STEM learning environments and were perceived differently across genders, with female students reporting stronger experiences of these cognitively oriented aspects (Duty: d = −.27; Intellect: d = −.16). In contrast, no significant differences in Duty and Intellect were found with regard to academic achievement. Group differences were more pronounced in the affective dimensions, particularly pOsitivity (η² = .043), Negativity (η² = .031), and Deception (η² = .025), which appear more closely tied to emotional engagement and perceptions of social-emotional support within the classroom context. These affective differences may help explain gendered patterns of long-term engagement in STEM. Prior research has shown that emotional responses to classroom environments, particularly feelings of belonging or alienation, play a central role in shaping students’ motivation and persistence (Good et al., 2003; Walton & Cohen, 2003). If STEM lessons are consistently perceived as less supportive or more negative by certain groups, this may contribute to later attrition despite equal academic performance.

Importantly, the lack of an interaction effect between gender and academic achievement suggests that gendered perceptions of STEM education are consistent across performance levels. This challenges earlier assumptions that high-achieving girls might experience the STEM classroom differently than high-achieving boys in a way that would deter persistence or success. Instead, our findings suggest that although boys and girls differ in how positively they perceive the learning environment, these differences are not exacerbated (or reduced) by academic achievement status. This has two implications. First, the absence of a gender gap at the top level in this sample may reflect the success of equity interventions, improved representation, or changing cultural norms among younger cohorts. Second, it suggests that gender disparities in STEM pathways may now emerge not primarily from achievement differences, but from differential emotional or motivational experiences in the classroom, an area where targeted interventions could be particularly effective. It should be noted, however, that further analyses of the data set with a view to generalizing the results may lead to divergent conclusions.

Limitations

While the study offers novel insights, several limitations must be acknowledged. First, the sample consisted of eighth-grade students from German secondary schools, especially those in the lower secondary school track (K-12), which may limit the accuracy of the findings and their generalizability to other educational systems or age groups. Cultural and curricular variations can influence achievement patterns and situational perceptions (e.g., Brown & Rauthmann, 2016; Noftle & Gust, 2019; Zager Kocjan et al., 2025), and replication across contexts is needed.

Second, the measurement of academic achievement was based on self-reported grades across several STEM subjects. While this allowed a broad assessment of STEM performance, it may introduce inaccuracies (Kuncel et al., 2005), and students’ perceptions of grades may be shaped by prior situational experiences. Additionally, the composition of STEM subjects (e.g., the inclusion or exclusion of technology) varied across federal states, potentially affecting comparability. Future studies may consider teacher-reported or standardized performance data to enhance objectivity.

Third, while the DIAMONDS framework captures a wide range of situational dimensions, some, such as Adversity and Mating, were less relevant or more ambiguous in the school context. These findings are consistent with prior critiques (Parrigon et al., 2017) and suggest the need to refine or combine situational taxonomies when applying them to educational research. It therefore appears necessary to revalidate the adapted scale (S8-I; see Rauthmann & Sherman, 2018) in other age groups or to adapt comparable scales (e.g., S8*; see Rauthmann & Sherman, 2016a) to the STEM area.

Fourth, the possibility of self-selection bias must be considered. Students with higher socio-economic status, greater interest, or confidence in STEM domains may have been more inclined to participate in the survey, potentially influencing the observed perception patterns.

Finally, the choice to use parametric statistics despite non-normal distributions may be questioned. However, this decision was supported by a robust sample size and prior literature demonstrating the robustness of parametric methods under such conditions (e.g., de Winter & Dodou, 2010; Kubinger et al., 2009; Schmider et al., 2010).

Conclusion

The present study provides initial descriptive and research-question-based results on the importance of how STEM instruction is perceived in the lower secondary school track. While no gender gap was found at the top of the performance distribution, boys and girls differed systematically in their emotional perceptions of STEM lessons. High-achieving students, regardless of gender, experienced lessons more positively than their lower-achieving peers. These findings suggest that classroom perception, particularly in relation to affective dimensions such as pOsitivity and Negativity, may play a pivotal role in shaping long-term motivation and enrollment in the STEM domain. The DIAMONDS framework proved useful in capturing situational nuances, although further refinement is necessary to improve its accuracy in future research. As STEM talent education continues to grapple with equity and talent development (Ziegler & Stoeger, 2023), future work should focus not only on performance metrics but also, for instance, on how students in various national contexts feel in the learning environment, especially when designing interventions to support high-potential STEM students of all genders. Furthermore, teachers’ perceptions of STEM education, and especially their instructional behavior, can significantly influence students’ perceptions of STEM education, which makes this a promising field for follow-up research.

Classroom situations are not only contexts for learning—they are perceived experiences that, when viewed in a positive light, may powerfully shape talented students’ decisions to persist, achieve, and thrive in STEM.

Supplemental Material

Supplemental material, sj-docx-1-joa-10.1177_1932202X251396238 for Classrooms Through the Eyes of High Achievers: Gender Differences in Situational Experiences Based on the DIAMONDS Framework by Lukas Ketscher, Heidrun Stoeger, and Albert Ziegler in Journal of Advanced Academics

Footnotes

Acknowledgments

This publication resulted from the joint project “FösaMINT—Förderung schulisch-außerschulischer MINT-Kooperation mit Genderschwerpunkt.” The project is funded by the Federal Ministry of Education and Research (BMBF) under the project grant number 16MF1091. The following institutions are practice partners in the project: CyberMentor, Deutsche Telekom Stiftung, Körber-Stiftung, matrix GmbH, MINT-EC [Nationales Excellence-Netzwerk für Schulen], MINTvernetzt. The responsibility for the content of this publication lies with the authors.

Ethical Approval and Consent to Participate

Our concept for protecting participants’ data was based on national standards. It was approved by the participating institutions (e.g., the ministries of participating German states and the principals of participating schools). The participating students and their legal guardians provided written informed consent to participate in this study. As part of the informed consent, participants were informed that only anonymized data would be published.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This publication resulted from the joint project “FösaMINT—Förderung schulisch-außerschulischer MINT-Kooperation mit Genderschwerpunkt.” The project is funded by the Federal Ministry of Education and Research under the project grant number 16MF1091.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Data Availability Statement

The datasets presented in this article are not readily available, as the project will run until 2027. The data is expected to be made available in anonymized form after the funded project has been finalized.

Supplemental Material

Supplemental material for this article is available online.

Author Biographies

Lukas Ketscher (Educational Science M.A.) is currently a research fellow and doctoral student in psychology at the Chair of Educational Psychology and Research on Excellence (chair holder: Prof. Drs. Albert Ziegler) at the University of Erlangen-Nuremberg, Germany. His research focuses on differences in perceptions of educational pathways, with particular attention to giftedness and gender. The data for his research are obtained through his participation in the research project FoesaMINT, funded by the Federal Ministry of Education and Research (BMBF).

Heidrun Stoeger is a full professor at the University of Regensburg. She also has an honorary professorship at the Pontificia Universidad Católica del Perú (Lima). Her research interests include mentoring, talent development in STEM, and teacher training to improve students’ learning strategies. She directs several major grant-funded research projects focused on, for example, developing diagnostic tools to optimally support learners in different talent domains, the role of fine motor skills in cognitive and academic development, and increasing access to curricular and extracurricular STEM education. Dr. Stoeger has over 300 publications and advises various governmental organizations and foundations in different countries about implementing effective research-based educational services.

Albert Ziegler is Chair Professor of Educational Psychology and Research on Excellence at the University of Erlangen-Nuremberg, Germany. He is the founding director of the nationwide Counselling and Research Centre for the Gifted. He has published around 450 books, chapters, and articles in the field of educational psychology. He developed the Actiotope Model of Giftedness, which promotes a systemic view of giftedness. His research interests include learning resources and effective learning environments, self-regulated learning, mentoring, and gifted identification. He is the immediate past Vice-President of the European Council for High Ability (ECHA) and the founding Chair of the European Talent Support Network (ETSN). Since 2005, he has been Director of the Statewide Research and Counseling Center at the University of Erlangen-Nuremberg.

Appendix

Table A1.

Post Hoc Comparisons (Games-Howell) Between Academic Achievement Groups.

Dimension	Groups	Mean difference (I-J)	95% CI	p	|d|
Duty	Low-Achiever–Average-Achiever	−0.24	[−0.62, 0.15]	.313	|.16|
	Low-Achiever–High-Achiever	−0.29	[−0.81, 0.23]	.386	|.19|
	Average-Achiever–High-Achiever	−0.05	[−0.45, 0.34]	.946	|.03|
Intellect	Low-Achiever–Average-Achiever	0.12	[−0.24, 0.47]	.724	|.07|
	Low-Achiever–High-Achiever	−0.02	[−0.49, 0.45]	.995	|.01|
	Average-Achiever–High-Achiever	−0.13	[−0.50, 0.23]	.656	|.09|
Adversity	Low-Achiever–Average-Achiever	0.14	[−0.30, 0.58]	.732	|.08|
	Low-Achiever–High-Achiever	0.10	[−0.49, 0.68]	.919	|.06|
	Average-Achiever–High-Achiever	−0.04	[−0.48, 0.40]	.972	|.02|
Mating	Low-Achiever–Average-Achiever	0.16	[−0.30, 0.62]	.698	|.09|
	Low-Achiever–High-Achiever	−0.22	[−0.87, 0.44]	.713	|.11|
	Average-Achiever–High-Achiever	−0.38	[−0.89, 0.14]	.201	|.20|
pOsitivity	Low-Achiever–Average-Achiever	−0.53	[−0.90, −0.14]	.004	|.33|
	Low-Achiever–High-Achiever	−1.44	[−1.93, −0.96]	<.001	|.99|
	Average-Achiever–High-Achiever	−0.92	[−1.28, −0.56]	<.001	|.58|
Negativity	Low-Achiever–Average-Achiever	0.36	[−0.10, 0.83]	.161	|.20|
	Low-Achiever–High-Achiever	1.28	[0.73, 1.83]	<.001	|.77|
	Average-Achiever–High-Achiever	0.92	[0.55, 1.29]	<.001	|.54|
Deception	Low-Achiever–Average-Achiever	0.47	[0.05, 0.88]	.023	|.27|
	Low-Achiever–High-Achiever	1.20	[0.66, 1.73]	<.001	|.74|
	Average-Achiever–High-Achiever	0.73	[0.33, 1.13]	<.001	|.42|
Sociality	Low-Achiever–Average-Achiever	−0.01	[−0.41, 0.39]	.999	|.00|
	Low-Achiever–High-Achiever	−0.15	[−0.69, 0.39]	.782	|.09|
	Average-Achiever–High-Achiever	−0.14	[−0.55, 0.26]	.682	|.09|

Note. Games-Howell (post-hoc test) was calculated. Mean differences and 95% confidence intervals are reported. p = p-value; |d| = Cohen's d as absolute values.
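
One simple way to read Table A1 is to note that the three pairwise mean differences for each dimension should be additive: the Low–High difference equals the Low–Average difference plus the Average–High difference, up to rounding. The Python sketch below checks this for the tabulated values. The tolerance and the helper name are assumptions introduced for illustration; the check says nothing about the p-values or confidence intervals, which depend on group sizes not shown in the table.

```python
# Mean differences from Table A1: (Low-Average, Low-High, Average-High).
table_a1 = {
    "Duty": (-0.24, -0.29, -0.05),
    "Intellect": (0.12, -0.02, -0.13),
    "Adversity": (0.14, 0.10, -0.04),
    "Mating": (0.16, -0.22, -0.38),
    "pOsitivity": (-0.53, -1.44, -0.92),
    "Negativity": (0.36, 1.28, 0.92),
    "Deception": (0.47, 1.20, 0.73),
    "Sociality": (-0.01, -0.15, -0.14),
}

# ASSUMPTION: each of the three values is rounded to two decimals, so each can
# be off by up to 0.005; a small epsilon absorbs floating-point noise.
TOLERANCE = 0.015 + 1e-9


def additivity_gap(low_avg: float, low_high: float, avg_high: float) -> float:
    """Absolute gap between Low-High and the sum of the two adjacent differences."""
    return abs((low_avg + avg_high) - low_high)


gaps = {dim: additivity_gap(*diffs) for dim, diffs in table_a1.items()}
consistent = {dim: gap <= TOLERANCE for dim, gap in gaps.items()}

for dim, gap in gaps.items():
    print(f"{dim:<11} gap = {gap:.2f}  consistent = {consistent[dim]}")
```

All eight dimensions pass this check, with the largest gap being about .01 (Intellect and pOsitivity), which is what two-decimal rounding would be expected to produce.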

References

Abdulla Alabbasi, A. M., Thompson, T. L., Runco, M. A., Alansari, L. A., & Ayoub, A. E. A. (2025). Gender differences in creative potential: A meta-analysis of mean differences and variability. Psychology of Aesthetics, Creativity, and the Arts, 19(1), 87–100. https://doi.org/10.1037/aca0000506

Abrahams, Rauthmann, J. F., & De Fruyt (2021). Person-situation dynamics in educational contexts: A self- and other-rated experience sampling study of teachers’ states, traits, and situations. European Journal of Personality, 35(4), 598–622. https://doi.org/10.1177/08902070211005621

Abrahams, Rauthmann, J. F., & De Fruyt (2025). Understanding person-situation dynamics at work: Effects of traits, states, and situation characteristics on teaching performance. Social Psychological and Personality Science, 16(5), 572–584. https://doi.org/10.1177/19485506241236812

Bahar, A. K. (2021). Will we ever close the gender gap among top mathematics achievers? Analysis of recent trends by race in advanced placement (AP) exams. Journal for the Education of the Gifted, 44(4), 331–365. https://doi.org/10.1177/01623532211044540

Baye & Monseur (2016). Gender differences in variability and extreme scores in an international context. Large-Scale Assessments in Education, 4(1), 1–16. https://doi.org/10.1186/s40536-015-0015-x

Benbow, C. P., & Stanley, J. C. (1980). Sex differences in mathematical ability: Fact or artifact? Science, 210(4475), 1262–1264. https://doi.org/10.1126/science.7434028

Benbow, C. P., & Stanley, J. C. (1983). Sex differences in mathematical reasoning ability: More facts. Science, 222(4627), 1029–1031. https://doi.org/10.1126/science.6648516

Bouton, Yosef, & Asterhan, C. S. C. (2025). Differences between low and high achievers in whole-classroom dialogue participation quality. Learning and Instruction, 96, 102088. https://doi.org/10.1016/j.learninstruc.2025.102088

Brown, N. A., & Rauthmann, J. F. (2016). Situation characteristics are age graded: Mean-level patterns of the situational eight DIAMONDS across the life span. Social Psychological and Personality Science, 7(7), 667–679. https://doi.org/10.1177/1948550616652207

Ceci, S. J., & Williams, W. M. (2011). Understanding current causes of women’s underrepresentation in science. Proceedings of the National Academy of Sciences, 108(8), 3157–3162. https://doi.org/10.1073/pnas.1014871108

Chan, R. C. H. (2022). A social cognitive perspective on gender disparities in self-efficacy, interest, and aspirations in science, technology, engineering, and mathematics (STEM): The influence of cultural and gender norms. International Journal of STEM Education, 9(1), 37. https://doi.org/10.1186/s40594-022-00352-0

Charlesworth, T. E. S., & Banaji, M. R. (2019). Gender in science, technology, engineering, and mathematics: Issues, causes, solutions. The Journal of Neuroscience, 39(37), 7228–7243. https://doi.org/10.1523/JNEUROSCI.0475-18.2019

Cheryan, Master, & Meltzoff, A. N. (2015). Cultural stereotypes as gatekeepers: Increasing girls’ interest in computer science and engineering by diversifying stereotypes. Frontiers in Psychology, 6, 49. https://doi.org/10.3389/fpsyg.2015.00049

Cheryan, Meltzoff, A. N., & Kim (2011). Classrooms matter: The design of virtual classrooms influences gender disparities in computer science classes. Computers & Education, 57(2), 1825–1835. https://doi.org/10.1016/j.compedu.2011.02.004

Cheryan, Plaut, V. C., Davies, P. G., & Steele, C. M. (2009). Ambient belonging: How stereotypical cues impact gender participation in computer science. Journal of Personality and Social Psychology, 97(6), 1045–1060. https://doi.org/10.1037/a0016239

Cheryan, Ziegler, S. A., Montoya, A. K., & Jiang (2017). Why are some STEM fields more gender balanced than others? Psychological Bulletin, 143(1), 1–35. https://doi.org/10.1037/bul0000052

Cohen (2009). Statistical power analysis for the behavioral sciences (2nd ed., reprint). Psychology Press.

De Winter, J. F. C., & Dodou (2010). Five-point Likert items: T test versus Mann-Whitney-Wilcoxon (addendum added October 2012). Practical Assessment, Research, and Evaluation, 15(1), 1–16. https://doi.org/10.7275/BJ1P-TS64

Fairhurst, Koul, & Sheffield (2023). Students’ perceptions of their STEM learning environment. Learning Environments Research, 26(3), 977–998. https://doi.org/10.1007/s10984-023-09463-z

Good, Aronson, & Inzlicht (2003). Improving adolescents’ standardized test performance: An intervention to reduce the effects of stereotype threat. Journal of Applied Developmental Psychology, 24(6), 645–662. https://doi.org/10.1016/j.appdev.2003.09.002

Hora, Badura, K. L., Lemoine, G. J., & Grijalva (2022). A meta-analytic examination of the gender difference in creative performance. Journal of Applied Psychology, 107(11), 1926–1950. https://doi.org/10.1037/apl0000999

IBM Corp. (2023). IBM SPSS statistics for windows (version 29.0.2.0) [Computer software]. IBM Corp. https://www.ibm.com/products/spss-statistics

Keller, Preckel, Eccles, J. S., & Brunner (2022). Top-performing math students in 82 countries: An integrative data analysis of gender differences in achievement, achievement profiles, and achievement motivation. Journal of Educational Psychology, 114(5), 966–991. https://doi.org/10.1037/edu0000685

Ketscher, Stoeger, Vialle, & Ziegler (2025). Same classroom, different reality: Secondary school students’ perceptions of STEM lessons—A pioneering study. Education Sciences, 15(4), 467. https://doi.org/10.3390/educsci15040467

Konaszewski, Fajkowska, Rogoza, & Karwowski (2025). Personality types and educational situation perception in juveniles from youth and probation centers. Personality and Individual Differences, 236, 113005. https://doi.org/10.1016/j.paid.2024.113005

Kubinger, K. D., Rasch, & Moder (2009). Zur Legende der Voraussetzungen des t-tests für unabhängige Stichproben [On the myth of requirements for t-tests for independent samples]. Psychologische Rundschau, 60(1), 26–27. https://doi.org/10.1026/0033-3042.60.1.26

Kuncel, N. R., Credé, & Thomas, L. L. (2005). The validity of self-reported grade point averages, class ranks, and test scores: A meta-analysis and review of the literature. Review of Educational Research, 75(1), 63–82. https://doi.org/10.3102/00346543075001063

Leiner, J. E. M., Scherndl, & Ortner, T. M. (2018). How do men and women perceive a high-stakes test situation? Frontiers in Psychology, 9, 2216. https://doi.org/10.3389/fpsyg.2018.02216

Xue (2023). Dynamic interaction between student learning behaviour and learning environment: Meta-analysis of student engagement and its influencing factors. Behavioral Sciences, 13(1), 59. https://doi.org/10.3390/bs13010059

Makel, M. C., Wai, Peairs, & Putallaz (2016). Sex differences in the right tail of cognitive abilities: An update and cross cultural extension. Intelligence, 59, 8–15. https://doi.org/10.1016/j.intell.2016.09.003

Maries, Karim, N. I., & Singh (2018). Is agreeing with a gender stereotype correlated with the performance of female students in introductory physics? Physical Review Physics Education Research, 14(2), 020119. https://doi.org/10.1103/PhysRevPhysEducRes.14.020119

Marsh, H. W., & Parker, J. W. (1984). Determinants of student self-concept: Is it better to be a relatively large fish in a small pond even if you don’t learn to swim as well? Journal of Personality and Social Psychology, 47(1), 213–231. https://doi.org/10.1037/0022-3514.47.1.213

Meece, J. L., Glienke, B. B., & Burg (2006). Gender and motivation. Journal of School Psychology, 44(5), 351–373. https://doi.org/10.1016/j.jsp.2006.04.004

Miller, D. I., & Wai (2015). The bachelor’s to Ph.D. STEM pipeline no longer leaks more women than men: A 30-year analysis. Frontiers in Psychology, 6, 37. https://doi.org/10.3389/fpsyg.2015.00037

Noftle, E. E., & Gust, C. J. (2019). Age differences across adulthood in interpretations of situations and situation–behaviour contingencies for Big Five states. European Journal of Personality, 33(3), 279–297. https://doi.org/10.1002/per.2203

Nunez, H.-P., & Ziegler (2023). Can eminence in STEAM produce more female role models? Recent trends in prizes known as the Nobel or the highest honors of a field. Contemporary Educational Research Quarterly, 31(3), 3–31. https://doi.org/10.6151/CERQ.202309_31(3).0001

Oakley, C. M., Pekrun, & Stoet (2024). Sex differences of school grades in childhood and adolescence: A longitudinal analysis. Intelligence, 107, 101857. https://doi.org/10.1016/j.intell.2024.101857

O’Dea, R. E., Lagisz, Jennions, M. D., & Nakagawa (2018). Gender differences in individual variation in academic grades fail to fit expected patterns for STEM. Nature Communications, 9(1), 3777. https://doi.org/10.1038/s41467-018-06292-0

Parrigon, Woo, S. E., Tay, & Wang (2017). CAPTION-ing the situation: A lexically-derived taxonomy of psychological situation characteristics. Journal of Personality and Social Psychology, 112(4), 642–681. https://doi.org/10.1037/pspp0000111

Rauthmann, J. F., Gallardo-Pujol, Guillaume, E. M., Todd, Nave, C. S., Sherman, R. A., Ziegler, Jones, A. B., & Funder, D. C. (2014). The situational eight DIAMONDS: A taxonomy of major dimensions of situation characteristics. Journal of Personality and Social Psychology, 107(4), 677–718. https://doi.org/10.1037/a0037250

Rauthmann, J. F., & Sherman, R. A. (2016a). Measuring the situational eight DIAMONDS characteristics of situations: An optimization of the RSQ-8 to the S8*. European Journal of Psychological Assessment, 32(2), 155–164. https://doi.org/10.1027/1015-5759/a000246

Rauthmann, J. F., & Sherman, R. A. (2016b). Ultra-brief measures for the situational eight DIAMONDS domains. European Journal of Psychological Assessment, 32(2), 165–174. https://doi.org/10.1027/1015-5759/a000245

Rauthmann, J. F., & Sherman, R. A. (2018). S8-I situational eight DIAMONDS-I—deutsche Fassung [Verfahrensdokumentation und Fragebogen] [S8-I situational eight DIAMONDS-I—German version [Procedure documentation and questionnaire]]. In Leibniz-Institut für Psychologie (ZPID) (Ed.), Open Test Archive. ZPID. https://doi.org/10.23668/psycharchives.6568 https://www.testarchiv.eu/de/test/9007481

R Core Team. (2023). R: A Language and Environment for Statistical Computing [Computer software]. R Foundation for Statistical Computing. https://www.R-project.org/

Schmider, Ziegler, Danay, Beyer, & Bühner (2010). Is it really robust?: Reinvestigating the robustness of ANOVA against violations of the normal distribution assumption. Methodology, 6(4), 147–151. https://doi.org/10.1027/1614-2241/a000016

Sebastián-Tirado, Félix-Esbrí, Forn, & Sanchis-Segura (2023). Are gender-science stereotypes barriers for women in science, technology, engineering, and mathematics? Exploring when, how, and to whom in an experimentally-controlled setting. Frontiers in Psychology, 14, 1219012. https://doi.org/10.3389/fpsyg.2023.1219012

Steele, C. M. (1997). A threat in the air: How stereotypes shape intellectual identity and performance. American Psychologist, 52(6), 613–629. https://doi.org/10.1037/0003-066X.52.6.613

Wai, Cacchio, Putallaz, & Makel, M. C. (2010). Sex differences in the right tail of cognitive abilities: A 30 year examination. Intelligence, 38(4), 412–423. https://doi.org/10.1016/j.intell.2010.04.006

Wai, Hodges, & Makel, M. C. (2018). Sex differences in ability tilt in the right tail of cognitive abilities: A 35-year examination. Intelligence, 67, 76–83. https://doi.org/10.1016/j.intell.2018.02.003

Walton, G. M., & Cohen, G. L. (2003). Stereotype lift. Journal of Experimental Social Psychology, 39(5), 456–467. https://doi.org/10.1016/S0022-1031(03)00019-2

Wang, M.-T., & Degol, J. L. (2017). Gender gap in science, technology, engineering, and mathematics (STEM): Current knowledge, implications for practice, policy, and future directions. Educational Psychology Review, 29(1), 119–140. https://doi.org/10.1007/s10648-015-9355-x

Witte, Spinath, & Ziegler (2024). Dissecting achievement motivation: Exploring the link between states, situation perception, and trait-state dynamics. Learning and Individual Differences, 112, 102439. https://doi.org/10.1016/j.lindif.2024.102439

Zager Kocjan & Avsec (2017). Bringing the psychology of situations into flow research: Personality and situation characteristics as predictors of flow. Psychological Topics, 26(1), 195–210. https://doi.org/10.31820/pt.26.1.9

Zager Kocjan, Avsec, Buško, & Sočan (2025). State conscientiousness and perceptions of duties and intellectual demands in daily life: A continuous-time modeling approach. Personality and Individual Differences, 236, 113030. https://doi.org/10.1016/j.paid.2024.113030

Ziegler & Stoeger (2023). Talent denied: Equity and excellence gaps in STEMM. Annals of the New York Academy of Sciences, 1530(1), 32–45. https://doi.org/10.1111/nyas.15083
