Abstract
In this study, we examined the measurement invariance of the Everyday Discrimination Scale (EDS) across Black older men and women (ages 66–90). Participants were 141 Black men and 287 Black women drawn from the Kaiser Health Aging and Diverse Life Experiences study. An exploratory factor analysis of EDS items resulted in a unidimensional scale, and results were corroborated with each group using confirmatory factor analysis (CFA). Measurement invariance across groups was tested using multiple group CFA. Data fit was tested for configural, metric, and scalar invariance. There was measurement noninvariance at the metric level, suggesting nonequivalent factor loadings. The extent of factor score misestimation between the unadjusted and final model was assessed. Among Black women who reported more frequent experiences of discrimination, the unmodified EDS overestimated their total factor score. Administering the EDS without refinement may lead to incorrect measurement particularly among older Black women. Accounting for the gendered nature of interpersonal discrimination may improve estimates of discrimination prevalence and more accurately estimate the relationship between discrimination and health among Black older adults.
Interpersonal discrimination describes the unjust and prejudicial action of one person towards another on the basis of difference (e.g., race, religion, sexual orientation). A number of studies have linked discrimination on the basis of race to poor mental health (Berger & Sarnyai, 2015; Jones et al., 2022; Loyd et al., 2024; Madubata et al., 2018; Pascoe & Smart Richman, 2009; Seblova et al., 2022; Williams, 2018; Wycoff et al., 2024) and physical health outcomes (Barnes et al., 2012; Gamba et al., 2024; Hill-Jarrett & Jones, 2022; Lewis et al., 2015; Lewis & Van Dyke, 2018; Majoka & Schimming, 2021; Robinson-Lane et al., 2022; Williams, 2012; Zahodne et al., 2023), leading to its recognition as a major public health crisis (Causadias & Korous, 2019; Vandiver, 2020). Converging evidence has also indicated that Black Americans report more experiences of discrimination and unjust treatment compared to any other racial or ethnic group (Bleich et al., 2019; Everson-Rose et al., 2015; Kessler et al., 1999; Lanier et al., 2016; Lewis et al., 2013).
A primary limitation of many studies that have examined the impact of discrimination on health is the application of a single-axis framework of investigation, for example, examining race without considering intersecting identities such as gender. Considering only one aspect of identity flattens researchers’ ability to understand the impact of interlocking systems of discrimination and oppression on those who exist with multiply marginalized identities. Further, viewing Black individuals as a monolithic group introduces bias as it limits understanding of within-group heterogeneity and substantial socio-cultural differences across Black inhabitants of the United States (Lewis & Van Dyke, 2018; Onwong’a et al., 2021).
According to intersectionality theory, which has origins rooted in Black feminist thought, systems of oppression, such as racism and sexism, are cross-cutting and mutually reinforcing and should therefore be analyzed and addressed simultaneously (Bowleg, 2008; Cole, 2009; Combahee River Collective, 1995; Crenshaw, 1990). Thus, measures that focus exclusively on racial discrimination may not adequately capture the experiences of Black women as they are frequently erased in studies speaking to the experience of racial discrimination or gender discrimination, which most often align with the experiences of Black men or White women, respectively (Bowleg, 2008; Crenshaw, 1990). Per intersectionality theory, people’s social identities (e.g., race and gender) are not experienced in isolation, and they navigate and engage in society as gendered-racialized individuals (Monnat, 2010). As discussed by Harnois and Ifatunji (2011), the experience of racial discrimination is a “gendered phenomenon” and the way it is enacted against Black men and women may differ.
A burgeoning body of evidence supports both quantitative and qualitative differences in the experience of racial discrimination between Black men and women (Brownlow et al., 2019; Hoggard et al., 2019; Kwate & Goodman, 2015; Lewis & Van Dyke, 2018) so much so that the term gendered racism has been used to describe the simultaneous experience of racism and sexism experienced by Black women (Essed, 1991; Jones et al., 2022; Lewis & Neville, 2015; Spates et al., 2020). Some studies have shown that Black women report interpersonal incivilities at higher rates compared to Black men (Kwate & Goodman, 2015), whereas others have documented higher rates reported by Black men (Broman, 1996; Ifatunji & Harnois, 2016), or no differences in the frequency of everyday discrimination (Sellers et al., 2013). These findings may be mixed due to a limited number of questions on existing measures of discrimination that fully account for the nuanced and simultaneous experience of both race and gender, but this hypothesis remains to be fully explored particularly among older Black adults.
Studies have also suggested that there are differences in the type of discrimination experienced by Black men and women. For instance, Black men are more likely to experience fear/suspicion-based discrimination (Evans, 2011). Qualitatively, Black men have described experiences of major discrimination such as police violence and racial profiling, whereas Black women described instances of microaggressions, invasive and inappropriate questions, and being treated as if they are invisible (Kwate & Goodman, 2015; Wong et al., 2025). The stereotyped imagery and faulty beliefs society maintains about Black men versus Black women also differ, which may influence the type of discrimination they experience. Black men are often presented as violent/criminal, lazy, athletes, and lacking intellect (Chavous et al., 2004; Evans, 2011; Taylor et al., 2019; Timberlake & Estes, 2007), whereas Black women have traditionally been portrayed, especially in the media, as a Mammy (i.e., homely, self-sacrificing, subordinate, meant to serve), Sapphire (i.e., loud, overbearing, domineering, unfeminine), or Jezebel (i.e., hypersexualized, temptress, lascivious; Collins, 2000; Gordon, 2015; West, 2008).
Lastly, interpretation of discrimination may differ across Black men and women. Gender differences in the perception of discrimination may shape self-reports of discrimination. For instance, one study found parental differences in racial socialization (i.e., the process through which parents provide explicit and implicit messages to a child about their race and racial discrimination) for Black boys versus girls (Brown et al., 2017; Holman, 2012). Other studies have found that Black girls generally received more parental socialization messages around racial pride and cultural identity (McHale et al., 2006; Smith-Bynum et al., 2016), and Black boys received more messages preparing discriminatory types of encounters and racial barriers (Bowman & Howard, 1985; Cooper et al., 2015; Smith-Bynum et al., 2016). If there are gender differences in the type of racial socialization messaging received in childhood, this may lead to differential awareness of discrimination enacted in adulthood. Additionally, there is evidence that a parent’s racial socialization messages to a child may be largely determined by the parent’s experience of racial discrimination (Holloway & Varner, 2021; McNeil Smith et al., 2016; Saleem et al., 2016). Thus, early childhood socialization practices may also contribute to gender differences in the perception of discrimination in adulthood depending on whether socialization was predominantly from a male parent, a female parent, or any combination of two parents. Taken collectively, this body of evidence has suggested the need for more nuanced approaches to examining the gendered and racialized experiences of discrimination faced by Black Americans.
Everyday Discrimination Scale
In considering the measurement of discrimination, one of the most widely used research measures of day-to-day discrimination is the Everyday Discrimination Scale (EDS; Williams et al., 1997), which was originally comprised of nine items and developed to examine the association between discrimination and self-reported physical and mental health. The EDS has been used for almost three decades with a variety of populations (Goldman, 2022; Gonzales et al., 2016; Keys et al., 2019; Pearl et al., 2018), across countries (Caqueo-Urizar et al., 2021; Dambrun, 2007; Faerstein et al., 2014; Siddiqi et al., 2017; Williams et al., 2017) and to examine complex relationships between discrimination and health outcomes (e.g., see Lawrence et al., 2022). Given the pervasive use of the EDS across varying demographic groups, and the importance of understanding how experiences of discrimination shape health and wellness, identifying the best approach to measure the construct of discrimination is critical, and ensuring measurement is adequate for populations of interest is equally important.
Psychometric evaluation of the original nine item EDS with Black American respondents has shown good internal consistency with Cronbach’s alphas ranging from .80-.87 (Clark et al., 2004; Taylor et al., 2004). Convergent validity has also been established as evidenced by significant correlational relationships of the scale with perceived stress (.39), depression (.35), negative affect (.37), and social conflict (.30) (Taylor et al., 2004).
Existing research examining the EDS dimensionality and factor structure has varied. Some have identified a unidimensional scale (Clark et al., 2004; Kessler et al., 1999; Kim et al., 2014; Krieger et al., 2005; Williams et al., 1997), or a two-factor scale (Barnes et al., 2004; Guyll et al., 2001). Others have indicated a four-factor scale and noted issues of local dependence potentially resulting from item order or wording (Stucky et al., 2011). As a result of local dependence, Stucky and colleagues (2011) reduced the scale to five items that consistently reflect the one-factor model. These inconsistent findings are further compounded by studies that suggest scale dimensionality is primarily driven by the first two items on the scale (“treated with less courtesy” and “treated with less respect”) and therefore use a combined two-item adapted EDS measure (Benner et al., 2024; Harris et al., 2019).
Ensuring measurement invariance is a psychometric prerequisite before use of scales like the EDS; however, invariance is rarely tested. Measurement invariance is achieved when a scale or psychological instrument measures the same latent construct equivalently across groups or time points. Two critical assumptions when using the EDS in studies are: (a) the latent, unobserved construct (discrimination) is the actual construct measured by the set of questions, and (b) this construct is measured with equivalent validity across all subpopulations the instrument is used with (Bastos & Harnois, 2020). Meeting these measurement assumptions is particularly critical for cross-group (e.g., Black male vs. Black female) comparisons if differences on the latent construct are interpreted. Thus, when assumptions of measurement invariance are violated, biased estimation may impact the calculation of mean differences in discrimination in Black men and women, or it may impact the calculated relationship between each group’s experience of discrimination and a health outcome (e.g., see Chen, 2008; Guenole & Brown, 2014; Meuleman et al., 2022).
Several studies have shown EDS measurement invariance across different racial/ethnic groups. Kim and colleagues (2014) determined the underlying construct of discrimination was equivalent across four racial/ethnic groups (non-Hispanic White, Black, Hispanic/Latino, and Asian), but found that the intercept for item 7 (“People act as if they’re better than you”) was lower for Hispanic/Latino and Asian groups. In a modified version of the EDS, measurement was invariant across racial/ethnic groups (Black/African Americans, non-Hispanic White, Asian American Native Hawaiian or Pacific Islander, American Indian/Alaska Native, Multiracial), but racial/ethnic differences were observed in the likelihood of endorsing a specific item (e.g., Black participants were more likely to endorse “they are afraid of you”; Shariff-Marco et al., 2011). Despite these findings, EDS measurement invariance across racial groups has not been consistently observed (Bastos & Harnois, 2020; Harnois et al., 2019). Some have called the EDS measurement properties into question particularly when comparing groups stratified by multiple aspects of identity including age, gender, and education level (Bastos & Harnois, 2020; Harnois et al., 2019). For example, in a sample of adults in Texas, Harnois and colleagues (2019) found EDS nonequivalence across race/ethnicity, gender, and educational groupings when the reason for discrimination was not asked as a follow-up. Additionally, a qualitative study examining participant patterns of interpretation of EDS questions revealed that the respondents of varied sociodemographic groups interpreted the survey and the purpose of the questions in substantively different ways: racial/ethnic minority women interpreted the EDS questions as being generally about social inequality, and racial/ethnic minority men described the survey as asking about experiences of racism (Harnois, 2022).
Results comparing the EDS measurement properties across Black men and women have not been uniform as findings have differed across samples. For example, using data from the 2001-2003 National Survey of American Life (NSAL; Jackson et al., 2004) and the 1995 Detroit Area Study (DAS; Jackson & Williams, 2002), Ifatunji and Harnois (2016) found EDS non invariance at the metric level for Black men and women across both samples using the two-factor model (i.e., separating the EDS into implicit and explicit discrimination). After allowing factor loadings to vary, they observed improvement in the model fit for the NSAL sample, but not the DAS sample. They then tested whether the observed noninvariance resulted in group mean differences in discrimination across Black men and women. They also examined whether accounting for this difference explains the “gender gap” in the rates of everyday discrimination (namely, higher rates of interpersonal discrimination). Findings were mixed; in the local sample, DAS, they found that adjusting for measurement error across Black men and women in the sample fully accounted for the gender differences in the factor mean scores for the implicit discrimination factor. Adjusting for the measurement error accounted for 25% of the explicit discrimination factor mean difference between Black men and women. However, these findings were not upheld with the national sample, NSAL. Additionally, older adults (i.e., individuals aged 65 and older) were underrepresented across both studies. In the NSAL, the age of participants ranged from 18 to 64 years, and in the DAS, the maximum age was 67, with only 3.78% of the sample reflecting individuals from age 65–67. To our knowledge, no additional psychometric studies of the EDS include information about distribution of the sample across axes of race, gender, and age therefore limiting our ability to make meaningful psychometric evaluations across older Black men and women.
Given the inconsistent prior results and limited information on older adults in existing psychometric studies of the EDS, we wanted to address an important gap in the literature by testing for measurement invariance of the EDS across Black men and women elders age 65 or older. Examination of the psychometric properties of the EDS among subgroups of older Black adults is particularly important as many older adults existed during the Jim Crow era within the United States. Under Jim Crow laws, de jure racial segregation was enforced in the United States South, and Black Americans of this time experienced blatant forms of racial discrimination that is likely distinct from the racial discrimination most prevalent today (Adkins-Jackson et al., 2022). This is supported by the fact that the implementation of racial discrimination shifts over time and space in subtle ways to uphold White supremacy and maintain social hierarchy (Desmond & Emirbayer, 2009; Hardeman et al., 2022; Seamster & Ray, 2018). Additionally, “Gender neutral” questionnaires developed to assess racial discrimination and experiences may not adequately capture the nuanced differences in stressors experienced by older Black men and women. While the consequences of misestimation could be costly, the degree of measurement bias remains unknown. It is thus important to evaluate the EDS psychometric properties in older Black adults. As a step towards incorporating intersectional methods into our understanding of the measurement and conceptualization of interpersonal discrimination, this study aims to (a) examine the measurement invariance of the EDS among community-dwelling older Black adults, and (b) measure the extent to which nonequivalence would impact estimates of discrimination and potentially comparisons of group means.
Method
Participants
Participant Descriptive Characteristics
Procedures
This dataset was archival. Participants were recruited into the KHANDLE study if they were 65 years of age or older on January 1, 2017. They were excluded if they had a medical diagnosis of dementia or another medical condition listed in their medical record that may interfere with study participation. KHANDLE was approved by the institutional review boards of Kaiser Permanente and the University of California, Davis. All participants provided written informed consent, which granted permission to use and publish data in future analyses.
Measures
Everyday Discrimination Scale
The Everyday Discrimination Scale (EDS; Williams et al., 1997) is a 9-item self-report instrument designed to measure perceived experiences of interpersonal discrimination. Respondents are presented with the following: “In your day-to-day life, how often do any of the following things happen to you?” Then, nine general scenarios are presented on how others have perceived or treated the respondent, such as being treated with less courtesy or respect, receiving poorer service at restaurants/stores, or being called names or insulted. The specific list of items is documented elsewhere (Williams, nd). Using a 6-point Likert scale, respondents rate the frequency of occurrence in their day-to-day life (6 = Almost everyday, 5 = At least once a week, 4 = A few times a month, 3 = A few times a year, 2 = Less than once a year, 1 = Never). For those who respond to a question indicating that the specific form of discrimination happens a few times a year or more often, the respondent is prompted to select what they think the main reason is for discrimination. They are provided a list of options to select from (e.g., race, gender, ancestry or national origins, height, other) and have the option to provide a free response. The specific reason(s) selected were not part of these data analyses; therefore, the study focused on discrimination without attribution. Total EDS score is summed across the nine ratings and ranges from 9 to 54, with higher scores reflecting more frequent instances of perceived discrimination.
Demographic Information
The demographic information used from the original data was gender, race/ethnicity, age, educational attainment, employment status, marital status, and total income (participant income + spouse/significant other income). Gender information was collected during baseline interview and is limited by a lack of query about transgender and gender nonconforming identities. Participants were asked, Do you consider yourself to be male or female and this information was coded as male, female, other, or refused.
Analytic Procedures
All factor analyses were completed using MPlus Version 8 (Muthén & Muthén, 1998-2017). We were first interested in identifying the best-fitting factor structure given inconsistencies in the literature. An exploratory factor analysis was therefore conducted using maximum likelihood estimation for extraction as has been done with other EDS data (Kim et al., 2014). This approach allows for non-normality (Shi et al., 2021). Geomin rotation was used to allow for correlations among factors. Upon identifying a viable factor solution, we then completed a confirmatory factor analysis (CFA) separately in each group to confirm adequate fit. We allowed for minor dissimilarities (i.e., differences in what residuals were correlated) in the final best-fitting baseline model for each group as has been recommended (Byrne, 2013) and conducted in other work (Harnois et al., 2019).
We next used multiple group CFA to test for measurement invariance of the EDS across gender for older Black men and women. Configural, metric, and scalar invariance tests were conducted by sequentially constraining parameters across groups, starting with the least restrictive (configural) model. With each step, additional parameters were constrained to equivalency across groups. Briefly, the baseline configural invariance of the EDS was first tested to analyze the factor loading pattern and the number of dimensions. Constraints consisted of fixing the first item loading and first item intercept to equivalence across groups. All remaining parameters, including means and variances, were allowed to vary freely across groups. Then, metric invariance was evaluated to test the strength of association between items and the latent construct. In this model, the factor loadings were constrained to equivalence across groups. Lastly, scalar invariance was examined to test for the equivalence of item intercepts and whether the comparison of mean values of discrimination across groups is appropriate. This step included constraining intercepts to equivalence across the two groups and testing measurement invariance (Putnick & Bornstein, 2016; Vandenberg & Lance, 2000; Widaman & Reise, 1997).
At each step of invariance testing, modification indices were examined and used to determine if freeing some item parameters from constraint would improve model fit with theoretical relevance in mind. Specifically, we examined what made the most sense based on the change in chi-square modification index. This index reflects the potential change in the chi-square statistics if an association between parameters is added to the model and if the association is allowed to differ across groups (Whittaker, 2012). Given that modification indices are completely data-driven, we only freed parameters or allowed for correlation of parameters where reasonable and if the adjustment fit within the broader theoretical context (Whittaker, 2012).
Per Hu and Benter’s (1999) recommendation to examine at minimum three goodness-of-fit indices, we considered: Comparative Fit Index (CFI), Tucker-Lewis Index (TLI), Root Mean Square Error of Approximation (RMSEA), Standardized Root Mean Square Residual (SRMSR), and chi-square fit index with corresponding degrees of freedom. While there are no universally accepted criteria to denote adequate fit, we use the recommended fit criteria: CFI and TLI ≥.95; RMSEA ≤.05; and SRMR ≤.06. Differences in nested models were examined to determine whether imposing additional equality constraints across the groups reduced fit, which would indicate a difference in measurement between the groups. While we reported chi-square change (∆χ2), it is sensitive to large sample size (Cheung et al., 2002; La Du & Tanaka, 1990). We therefore also included change in the CFI (∆ CFI). Specifically, a CFI reduction equal to or less than .01 was used to indicate measurement invariance (Cheung & Rensvold, 2002). R
Additional Analyses
Descriptive data were analyzed using IBM Statistical Package for Social Sciences (SPSS) version 28 (IBM Corp, 2021). Bias is the systematic error related to a scale and was calculated as the average difference between baseline (configural model) and adjusted (final model) factor scores across all participants. Upper and lower limits of agreement were calculated as Bias +/− (1.96 x standard deviation Factor difference ) to reflect the upper and lower limits of agreement, respectively. It is recommended that 95% of data fall within the limits of agreement (Giavarina, 2015). Because a measure that over-estimates at one end of the distribution but under-estimates at another may appear unbiased on average, a Bland-Altman plot (Altman & Bland, 1983; Bland & Altman, 1986) was used to examine the degree of agreement between the factor score estimates for the baseline configural model and the factor scores of the final adjusted model across the range of the measures. The Bland-Altman plot is a scatter plot with the y-axis representing the difference between factor scores estimated by different models (baseline factor score - adjusted factor score) and the x-axis reflecting the average of the factor score values ((baseline factor score + adjusted factor score)/2).
Results
Preliminary Analyses
As part of preliminary analyses, data were cleaned and checked for missing values. Of the 443 Black American participants in the dataset, 12 were missing gender information and were eliminated from subsequent analyses. Analysis of the patterns of missing the EDS data indicated that of the 431 remaining participants, three had completely missing data, and these cases were removed. Across the entire dataset, 1.11% of the data were missing. Most participants (96.0%) had no missing data. The total missingness within each variable ranged from 0.90% to 1.20%. Little’s MCAR test was significant, indicating the data were not missing completely at random, χ2 = 146.23 (76), p < .001. The data were not systematically missing by gender, χ2 = 2.76 (5), p = .737 or education level χ2 = 26.1 (25), p = .404. As a result, we used maximum likelihood with robust standard error (MLR) parameter estimates in our analyses as this has been identified as the optimal choice for latent models with missing data due to its yield of more accurate estimates (Shi et al., 2021). Participants who were 90 years of age or older were recoded as age 90 in the study as part of the study protocol (Black men: 2.84% were recoded; Black women: 5.23% were recoded).
Skewness and kurtosis of each item of the EDS was evaluated using cutoffs of skewness <2 and kurtosis <7 (Curran et al., 1996). Minor violations were observed for item 6 (skewness = 2.71, kurtosis = 7.49), item 8 (skewness = 2.41, kurtosis = 6.76), and item 9 (skewness = 3.46, kurtosis = 14.5). To address this, we elected to use MLR for parameter estimates as they are robust to non-normality and non-independence of observations and does not require data transformation (Shi et al., 2021).
Descriptive Statistics
Item-Level Means, Standard Deviations, and Intercorrelations of Everyday Discrimination Scale by Gender (Black Men n = 141; Black Women n = 287)
Note. Frequency scores from 6-point Likert scale ranging from 1 = never to 6 = almost everyday.
Exploratory Factor Analysis
The Kaiser-Meyer-Olkin (KMO) measure was 0.878, which indicated the data were suitable for factor analysis. This was further supported by Bartlett’s test of sphericity which was significant, (χ2 (36) = 1,614.50, p < .001) and indicated there are sufficient correlations between variables to proceed with factor analysis. Results from the factor analysis indicated three eigenvalues of: 4.54, .971, and .875. Visual inspection of the scree plot of eigenvalues showed a clear decline after the first factor, further supporting a one-factor solution/unidimensionality. Based on these data as well the Kaiser criterion (1960) indicating an eigenvalue greater than one is needed to be considered a tenable factor, we proceeded with a one-factor solution.
Confirmatory Factor Analysis: One-Factor Baseline Model
CFA Model Fit Statistics for One-Factor Model (Black Men n = 141; Black Women n = 287)
Note. Model A: baseline; Model B: correlated residuals for items 1 and 2; Model C: correlated residuals for items 8 and 9.
Measurement Invariance
Stepwise Measurement Invariance Tests Across Gender Groups (Black Men n = 141; Black Women n = 287)

Final Scalar EDS Model With Standardized Results for Black Men (n = 141)

Final Scalar EDS Model With Standardized Results for Black Women (n = 287)
Factor Coefficients of the Everyday Discrimination Scale Across Measurement Invariance Test Models for Black Men (n = 141) and Black Women (n = 287)
Note. Item four was noninvariant across groups and factor loadings were free to vary for partial metric and scalar models.
aResidual correlation for items 1 and 2.
bResidual correlation for items 8 and 9.
Measurement Bias
Given the adjustments made to the original baseline EDS model and the final scalar model to achieve measurement invariance, our final step of analysis was to examine the extent to which measurement bias had an impact on factor score estimates. We were also interested in whether the degree of measurement bias differed across groups and might be greater for Black women. We examined the degree of measurement bias with the Bland-Altman plot shown in Figure 3. The plot provides a visual indication of agreement between the factor scores resulting from the baseline unadjusted model and the final scalar model. Factor score differences on the y-axis were plotted against the average of the factor score values on the x-axis. Bland-Altman Plot Comparing Everyday Discrimination Scale Bias Between Black Men (n = 141) and Women (n = 287). Note. Solid Black Line Represents Bias. Dashed Gray Lines Represent the Upper and Lower Limits of Agreement, Respectively
Upon initial examination, the average discrepancy in factor score estimates between the two models, or the bias (Figure 3 solid black line), is close to zero. This suggests that, when collapsed across the two groups, there does not appear to be a meaningful difference in the estimates of the latent factors between adjusted and unadjusted models. However, in comparison across groups, the magnitude of difference in factor score estimates tends to increase for Black women only and exceeds the limits of agreement for Black women with higher factor scores. Figure 3 suggests that for Black women who report more frequent experiences of discrimination and have higher scores on the EDS, there may be an overestimation of the total score when using the scale as originally intended. Additionally, while still within the broad limits of agreement, Black women who reported less frequent experiences of discrimination had unadjusted total EDS scores that underestimated exposure; the lack of significance of this finding may represent a floor effect.
Discussion
The current study examined the measurement invariance of the EDS in a sample of 141 older Black men and 287 older Black women. Based on intersectionality theory, race and gender are co-constructed and reinforced in ways that guide perceptions and the enactment of prejudicial behavior, and this may contribute to differences in how discrimination is experienced across Black men and women. Whereas Black men are typically viewed as the target of racial discrimination (Goff & Kahn, 2013), this belief may lead to narrow constructions of racial discrimination that are reflected in the questions included in measures like the EDS.
In our initial analyses of the EDS structure, we found a one-factor model was best-fitting across the entire sample, which is consistent with some of the existing psychometric literature (Clark et al., 2004; Kessler et al., 1999; Kim et al., 2014; Krieger et al., 2005; Williams et al., 1997). We then sought to confirm this unidimensional factor structure using CFA. Post-hoc adjustments, namely correlating the residual error from item 1 (You are treated with less courtesy than other people are) with item 2 (You are treated with less respect than other people are) resulted in improved fit for Black men, which is consistent with other psychometric investigations of the EDS with Black Americans (Kim et al., 2014; Reeve et al., 2011). For Black women, correlating the residual error from item 8 (You are called names or insulted) and item 9 (You are threated or harassed) was additionally required for improved model fit and has been a modification made in other samples (Kim et al., 2014; Reeve et al., 2011; Stucky et al., 2011). The need to adjust residual error terms suggests there is shared variability across these items that is not captured by the latent discrimination construct. This shared variability is thought to reflect local dependence. The local dependence of EDS items 8 and 9 has been discussed by others (Harnois et al., 2019; Kim et al., 2014; Stucky et al., 2011). In a sample of Black Americans, Stucky and colleagues (2011) found that local dependence was due to the adjacent order of the EDS items on the scale as well as the overlap in the content captured by the items that extend beyond the underlying construct of discrimination (Steinberg, 1994; Stucky et al., 2011; Tourangeau & Rasinski, 1988). These findings provide further evidence that adjustments to the scale may be needed before use with older Black adults.
Next, we tested the adjusted baseline models to determine whether measurement using the EDS was invariant by gender. Findings were notable for invariance at the configural level. This suggests that the latent construct of discrimination has the same pattern of free and fixed loadings across gender groups, and the patterns of EDS item loadings on the latent discrimination factor is equivalent across gender (Putnick & Bornstein, 2016).
In the subsequent step of analyses, we found gender noninvariance at the metric level. Further examination of what contributed to metric noninvariance revealed item 4 (People act as if they think you are not smart), differentially related to the construct of discrimination across older Black men and women. This pattern of results has been confirmed in other Black American samples (Stucky et al., 2011). Our findings indicate that item 4 more strongly relates to the discrimination construct for older Black women than Black men. While this is contrary to existing literature on the racial gender stereotypes of intellectual ability and academic performance (Evans, 2011; Jaxon et al., 2019), these findings may be contextualized by the demographic background of our KHANDLE cohort. Most of the Black women in our sample were born in the U.S. South, and all were born in 1952 or earlier, therefore making them primary school aged prior to the passing of Brown v. Board. Thus, not only did the Black women in the sample possibly experience racially segregated education, but they were educated during a tumultuous period of education integration and may have been subjected to more overt stereotypes and prejudices based on race and gender (Hutson, 2022; Jones, 2019; Keels et al., 2017). Additionally, most Black women in the sample attended college and worked during a period where employment in “nontraditional” high paying roles outside of domestic service or low wage jobs was uncommon. Thus, given their high educational attainment, Black women in our sample may have been one of few (or one of one) Black women in their work setting. This may have placed them in environments where insidious forms of discrimination, such as having their intellect questioned on the job, was common. Further investigation is needed to examine how the intersections of race, gender, and age impact perceptions and reports of interpersonal discrimination.
There were differences in how the EDS operates across Black men and women resulting in measurement bias. First, we found that the total variance in discrimination explained by the EDS items was greater for Black men than Black women, which suggests the original scale may more accurately capture discrimination experienced by Black men. Additionally, there was evidence of biased measurement in that the unadjusted EDS poorly estimated scores for Black women at the extremes ends of the scale. That is, exposure was significantly overestimated for Black women who reported more frequent encounters of discrimination and trended toward underestimating scores for Black women who reported less frequent experiences of discrimination. With the scale adjustments, invariance was achieved across gender; however, use of the EDS as originally developed may result in poor estimates of interpersonal discrimination, particularly for Black women. Our findings indicated measurement bias would impact the comparison of mean EDS scores across gender groups, leading to faulty conclusions about group differences. This measurement bias could also potentially impact our understanding of the relationship between discrimination and other outcomes, such as health, specifically for Black women. Further scale refinement would benefit from the inclusion of discrimination experiences that more closely aligned with those of Black women. Some experiences of discrimination that are relevant to Black women may include silencing, invisibility, the projection of specific stereotypes and tropes (e.g., Angry Black woman), comments about appearance, and tone policing (Lewis et al., 2016; Lewis & Neville, 2015; Williams, 2023).
Overall, our finding that there are differences in the types of racial discrimination experienced between older Black men and women fits with an intersectional understanding of how our perception of race is shaped by gender. Further, our findings support one of the central tenants of intersectionality theory—that identity-based discrimination and its associated social inequalities are interdependent and mutually constitutive (Bowleg, 2008). We show how considering race and gender as mutually exclusive constructs within a racial discrimination instrument development, like the EDS, can lead to faulty measurement. Additionally, the use of such faulty logic reinforces existing social hierarchies by prioritizing the experience of Black men and contributing to the intersectional invisibility of Black women (Purdie-Vaughns & Eibach, 2008). Greater emphasis in research and clinical contexts should be given to the distinct forms of interpersonal discrimination experienced by Black men and Black women as opposed to discrimination due to race alone. We caution against androcentrism, whereby society centers men as a “gender neutral” standard, and all things health are measured in relation to men (Bailey et al., 2019, 2024; Gilman, 1911; Hibbs, 2014). Fortunately, instruments such as the Gender Racial Microaggressions Scale (Lewis & Neville, 2015) and the African American Men’s Gendered Racism Stress Inventory (Schwing et al., 2013) have been developed to capture the intersectional discrimination experienced by Black women and men, respectively. We encourage continued development measures that capture the distinct, nuanced experiences of racial gender groups.
Overall, the strengths of this paper and its findings should be considered with some limitations in mind. First, our data were limited to self-reported experiences of discrimination which relies on the individual’s perception of the incident and accurate self-report. Further, there is no timeframe indicated in the EDS to cue participants to a specific stage of life, so it is unclear whether participants responded with their entire life history in mind or only more recent encounters of discrimination. Additionally, there may be qualitative differences in individuals who are able to recall instances of discrimination more accurately compared those who do not, which likely impacts the approach to self-report. While the EDS offers the option to select the main reason for discrimination, we included all data in analyses irrespective of the main reason selected for discrimination. Perhaps selecting only responses that indicate discrimination was because of race would have led to a different pattern or results; however, there is also evidence that individuals with multiple marginalized identities may be unable to identify the sole reason for the act of discrimination, and this may impact the validity of scales that contain this line of questioning (Bowleg, 2008; Harnois et al., 2022).
The KHANDLE cohort has a relatively high socioeconomic status and is drawn from a single region in Northern California, although many older Black participants immigrated to California from the U.S. South. Additionally, our sample generally had a high level of education, which may influence generalizability as well as exposure to discrimination. Results may differ in other regions of the country and across social classes and education levels. However, the robust evidence that interpersonal discrimination is manifested differently towards Black men and Black women suggests our findings are likely to be broadly relevant.
Implications for Clinical Practice and Research
This study has implications for clinical practice. Awareness of intersectional experiences of discrimination and practicing cultural responsivity is paramount within a therapy and assessment setting. Not only is having the language to discuss and affirm lived experiences important to maintaining a therapeutic alliance (Haas et al., 2024) and providing adequate care (Mendoza et al., 2020), but having measures that accurately measure and reflect lived experiences is critical for identifying areas for potential intervention. For instance, assessing both the form and frequency of discrimination may aid a psychologist in the development of mitigation and stress coping strategies within a therapeutic setting (e.g., see Lewis et al., 2013). Additionally, understanding how discrimination is enacted across race and gender can aid in the assessment and conceptualization of an older adult presenting for cognitive changes.
With regards to research, there is currently no objective criterion for adequate measurement of the EDS. The implications of mismeasurement, therefore, depend on the specific use of the measure. For instance, in studies that attempt to understand how experiences of interpersonal discrimination shape health and contribute to health disparities, measurement nonequivalence may result in misestimation and limit our ability to make inferences about the nature of this relationship. Thus, not only are Black men’s and women’s experiences of discrimination different, but how discrimination relates to health outcomes may also differ and warrants future investigation. Rigorous measures of discrimination introduced by Williams et al. (1997) were a major step forward in research on racial health disparities, but it is essential that we take as much care for valid and reliable measures of discrimination as for any other domain. Our findings are thus an important step towards ensuring measurement of interpersonal discrimination is valid for both Black men and Black women.
Footnotes
Ethical Considerations
KHANDLE was approved by the institutional review boards of Kaiser Permanente and the University of California, Davis. All participants provided written informed consent, which granted permission to use and publish data in future analyses.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by National Institute on Aging K23AG084871 (PI: Hill-Jarrett); National Institute on Aging R13AG030995-15 (PI: Mungas); National Institute on Aging R01AG052132 (PI: Whitmer, Glymour, Mayeda, Gilsanz); The Alzheimer’s Association/The Judy Fund [2019-AARGD-644788 (PI: Gilsanz)].
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
