Sex in the medical machine: How algorithms can entrench bioessentialism in precision medicine

Abstract

Machine learning offers new possibilities for developing more precise diagnostics and treatments, but the increasing use of sex stratification in precision medicine algorithms raises concerns. Using Alzheimer's disease (AD) research as an example in which machine learning approaches are applied to a heterogeneous, socially patterned disease, this paper examines how the move toward sex-specific “pink” and “blue” algorithms reinforces biological sex essentialist assumptions and their attendant harms. We analyze three examples of sex-stratified algorithmic approaches in AD research and identify three interacting processes (effacing contested knowledge, obscuring social factors, and ossifying binary sex categories) that can occur when binary sex variables are incorporated into predictive models. These case studies demonstrate that even in models intended to be causally agnostic, sex categories are likely to be interpreted as decontextualized, self-evident health determinants in a manner that can imply causality of biological sex. We call for establishing ethical norms and empirical standards for including gender/sex variables in precision medicine algorithms to avoid perpetuating crude ontologies of sex and gender that undermine both scientific validity and health justice.

Keywords

Sex, gender, precision medicine, Alzheimer's, AI ethics, critical algorithm studies

Introduction

Machine learning offers new possibilities for medicine: data-driven tools that promise to tailor care to each patient's particular needs and circumstances. Those who develop such tools are, with increasing frequency, stratifying their models by sex or including sex as a predictor. In light of concerns about comparable accuracy and fairness of medical algorithms for men and women (Celeste et al., 2023; McCradden et al., 2020), the appeal of sex stratification is obvious; its dangers less so.

The embrace of sex stratification stands in contrast to discussions of “race corrections” in biomedical algorithms. Recent scholarship in medicine and science & technology studies argues that race-based corrections reinforce stereotypes about biological differences between groups; systematically mischaracterize risk for non-white groups; and suggest that race is a cause of health outcomes when racism is what is causally relevant (Braun et al., 2021; Delgado et al., 2022; Denny and Collins, 2021; Eneanya et al., 2022). As a result, medical researchers are reassessing the widespread inclusion of race categories in clinical algorithms, once believed to be necessary for accurate risk prediction and racially inclusive medicine. Reflecting this reversal, a 2020 perspective in the New England Journal of Medicine argued for “reconsidering race correction in order to ensure that our clinical practices do not perpetuate the very inequities we aim to repair” (Vyas et al., 2020: 881).

As we will argue, sex stratification calls for similar scrutiny. One place where sex stratification has taken particular hold is in research on Alzheimer's Disease and Related Dementias (ADRD or AD), a well-funded, politically powerful, and socially salient field of biomedicine (see, e.g., Alzheimer's Association, n.d. a, n.d. b, n.d. c) with a history of contentious debate over research ethics and the role of biological versus social factors in disease risk and prediction (Buckley et al., 2019; Morris et al., 2014; Piller, 2025). In this paper, we use AD research to characterize visions of binary sex-stratified solutions to multifactorial, costly, socially patterned health conditions, and anticipate how these visions may ossify binary sex categories across precision medicine platforms.

In medical research, sex is conventionally defined as “the different biological and physiological characteristics of males and females, such as reproductive organs, chromosomes, hormones, etc.,” and gender as “the socially constructed characteristics of women and men—such as norms, roles and relationships of and between groups of women and men” (WHO, 2021). In this paper, we also use the term “gender/sex” for cases in which “gender and sex cannot be easily or at all disentangled” (van Anders, 2015: 1181). The term “biological sex essentialism” refers to concepts and practices where

sex is defined by a set of binary, fixed variables that are facts of biology, found in nature across species and ecologies, uncontroversially ‘scientific,’ and omnipresent throughout the body so that every tissue at every level of biological organization can be characterized as male or female. (Richardson, 2022: 14)

Machine learning algorithms in precision medicine use large datasets to identify patterns and features relevant for predicting health outcomes or disease categories. In this paper, we use “sex-stratified algorithms,” sometimes also called “sex-specific algorithms” and “sex-sensitive algorithms,” as an umbrella term to refer to machine learning approaches that incorporate sex categories as a source of predictive information; build a separate computational model for each sex; and/or include different numerical cutoffs or relevant predictive features for each sex.
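The three forms covered by this umbrella term can be made concrete with a minimal Python sketch. It is an illustration only: the `Patient` fields, the coefficients, and the cutoffs are invented for this example and are not taken from any study discussed in this paper.

```python
from dataclasses import dataclass


@dataclass
class Patient:
    sex: str      # recorded category; this sketch assumes "F" or "M"
    age: float    # years
    score: float  # hypothetical cognitive test score, higher is better


def base_risk(p: Patient) -> float:
    """Toy linear risk score; every coefficient here is invented."""
    return 0.02 * (p.age - 60) + 0.05 * (30 - p.score)


# Form 1: a single model that takes the sex category as a predictor.
# Note that any value other than "F" is silently treated like "M".
def risk_sex_as_predictor(p: Patient) -> float:
    return base_risk(p) + (0.10 if p.sex == "F" else 0.0)


# Form 2: a separate model for each sex category.
SEX_SPECIFIC_MODELS = {
    "F": lambda p: 0.03 * (p.age - 60) + 0.04 * (30 - p.score),
    "M": lambda p: 0.01 * (p.age - 60) + 0.06 * (30 - p.score),
}


def risk_stratified(p: Patient) -> float:
    # Raises KeyError for any record outside the two assumed categories.
    return SEX_SPECIFIC_MODELS[p.sex](p)


# Form 3: one shared model, but a different decision cutoff per category.
SEX_SPECIFIC_CUTOFFS = {"F": 0.50, "M": 0.40}


def flagged(p: Patient) -> bool:
    return base_risk(p) >= SEX_SPECIFIC_CUTOFFS[p.sex]
```

In this sketch, two records with the same age and test score (for example, age 70 and score 25) receive different scores or different flags depending solely on the recorded category. A record carrying any other category either raises an error (forms 2 and 3) or is silently assigned to one of the two groups (form 1).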

Technically, these methods are often intended to be causally agnostic. That is, an algorithm can offer predictions—for example, whether an individual is more likely to develop AD or will benefit more from an early intervention—without positing a causal relationship between predictors and outcomes, and without elaborating mechanisms that explain why certain individuals are more at risk or why a given intervention works. Our analysis of sex-stratified algorithms will show that the application of such algorithms, even if intended to be causally agnostic, can reinforce biological sex essentialist assumptions and their attendant harms. Notably, this identifies a novel way in which sex essentialism can become embedded in scientific research, as compared with the explicit biological sex determinism of more traditional life and biomedical sciences that feminist scholars have previously analyzed (Fausto-Sterling, 2000; Jordan-Young, 2011; Richardson, 2013).

The clinical and direct-to-consumer technologies built using sex-stratified algorithms may harm individuals and communities and impede the advancement of scientific knowledge, both in the short and long term. In AD science, they may, for example, produce disproportionately inaccurate AD predictions for people with nonbinary or gender expansive identities, or people with cisgender identities who lie on the ends of statistical distributions for men and women. They may also uphold the notion of a woman's brain and a man's brain as innately different, deepening stereotypes about differences in intelligence, cognition, career preferences or motivations, and other psychometrics that, due to their link to the brain and cognition, carry stigma and status and have historically contributed to the subjugation of women (Fine, 2010; Jordan-Young, 2011; Lock, 2013).

We argue that the unreflective use of binary sex categories in machine learning for precision medicine research accelerates approaches to studying gender/sex disparities in health that focus exclusively on biological factors and entrench biological essentialist understandings of gender/sex categories. We show that this occurs through three interconnected processes: effacing contested knowledge; obscuring the social; and ossifying binary sex categories. We describe these processes through close analysis of major research initiatives advancing the study of sex in AD precision medicine, such as the Women's Brain Project (WBP), and through three worked examples of research programs attempting to execute sex-stratified predictive algorithms aligned with the vision of the WBP. We call for precision medicine scientists, ethicists, and critical data studies scholars to establish ethical norms and empirical standards for including gender/sex-related variables in precision medicine algorithms.

Sex as the “gateway to precision medicine”

Many women's health and sex-based biology advocates posit that sex represents a “gateway” into a precision medicine future (Clayton, 2016; Ferretti et al., 2018; Hampel et al., 2018; Miller et al., 2015; Stachenfeld and Mazure, 2022). Precision medicine is a vision of medical care that is tailored to each individual patient, offering customized diagnoses, treatment plans, and predictions of health risks based on an individual's lifestyle, environment, and, most prominently, genetic make-up (Erikainen and Chan, 2019; Ferryman and Pitcan, 2018; Joel et al., 2015; National Institutes of Health (NIH), 2020; Prainsack et al., 2018). Proponents of precision medicine claim that this future can be achieved by harnessing the potential of large digital datasets and machine learning to reveal differences in disease risk, biology, and progression amongst subpopulations (Behl et al., 2022; Denny and Collins, 2021; Schaefer et al., 2019). To reach this aim, advocates call for increasing investment in AI-powered models trained on larger and more granular datasets for use in medical research.

ADRD are “debilitating conditions that impair memory, thought processes, and functioning, primarily among older adults” (U.S. Dept. of Health and Human Services, n.d.), and are responsible for significant human suffering, health care costs, and caregiving labor. AD is a highly heterogeneous disease category, that is, one with large variability in disease presentation, progression, and patterns of pathology. Despite billions invested annually into research, existing hypotheses about underlying mechanisms remain contested (e.g., the amyloid hypothesis (Morris et al., 2014)), and effective treatments have proven challenging to develop (Moutinho, 2022). Funders and researchers thus consider AD an ideal target for precision medicine (Arafah et al., 2023; Hampel et al., 2018; NIA-AA Symposium Enabling Precision Medicine for Alzheimer's Disease Through Open Science, 2021; Yang et al., 2021). That is, machine learning looks especially well-suited to tackling AD, since etiological mechanisms are multiple, complex, and interlocking—all factors that make it difficult for human researchers to craft fruitful hypotheses or identify useful patterns.

More women live with AD than men, and women are more likely to be caretakers of individuals with AD. These sharp disparities in the burden of the disease have made AD a focus of both women's health advocacy and research on sex differences. Leading voices behind this focus include the Society for Women's Health Research (Society for Women's Health Research, n.d.), the Office of Research on Women's Health, the Alzheimer's Association (Alzheimer's Association, n.d. a, n.d. b, n.d. c), the UK's Alzheimer's Society (Why is dementia different for women?, 2024), Maria Shriver's Women's Alzheimer's Movement (The Women's Alzheimer's Movement, n.d.), and the WBP (Ferretti et al., 2021)—the last of which is an influential non-profit consortium advocating for the importance of sex-based analysis in precision medicine for AD. In 2020 the Alzheimer's Association created a formal professional interest area, “Sex and Gender Differences in Alzheimer's Disease” (ISTAART Community, n.d.), including an award program to support scientific research on “understanding the contributions of biological sex and gender…to address the gaps in our understanding of the role of sex assigned at birth and related genetic, biological, lifestyle and societal factors may play in increasing vulnerability to AD” (Alzheimer's Association, n.d. a, n.d. b, n.d. c), which also established scientific research cohorts to examine sex differences in AD risk factors.

A 2023 flow chart (Figure 1), published by the WBP as part of their efforts to launch a “Sex and Gender Precision Medicine Institute,” serves as an ideograph of these developments (Castro-Aldrete et al., 2023). In this figure, research begins in the laboratory with comparisons of male and female cells in petri dishes or rodent models. It then proceeds in iterative dialogue, as depicted by the two-headed arrows, to “sex-based disease modeling,” which might involve methods such as “stratification of algorithms by sex” and “sex-sensitive deep learning algorithms.” Multi-modal population data and patient data are then processed through these algorithms to develop “sex-sensitive clinical diagnosis and treatment.” Only at the last stage might features such as “socio-cultural determinants” of health be included, alongside “access to digital health tools” that presumably will make such sex-based algorithms a ubiquitous and everyday part of peoples’ lives.

Figure 1.

A flow chart from the Women's Brain Project illustrating a “sex-sensitive Alzheimer’s disease (AD) approach” to inform “precision medicine agendas” (Castro-Aldrete et al., 2023: 7).

The WBP asserts that “sex is an essential component in the phenotypic heterogeneity of AD” (Castro-Aldrete et al., 2023: 3, emphasis added). Citing evidence that women have a higher lifetime risk of AD and more women than men presently have AD, the WBP calls for “a new, AI-powered, biomarker-based clinical framework” that is “sex-sensitive” (Castro-Aldrete et al., 2023: 4) and promises to reveal differences in the predictive value of biomarkers for AD. The project envisions a future of “tailored preventative campaigns for men and women” and “stratified algorithms by sex” in digital technologies used in diagnosis (Castro-Aldrete et al., 2023: 4–5).

The push for sex stratification in biomedicine is not unique to AD research. The WBP's approach emerges within a powerful global movement towards stratifying by sex at all levels of biomedical research (Pape, 2021). Exemplifying this work is the mandate, introduced in 2016 by the National Institutes of Health (NIH), requiring all NIH-funded preclinical research to consider sex as a biological variable (SABV). Champions of this movement see precision medicine initiatives as naturally aligned. As Janine Clayton, Director of the NIH's Office of Research on Women's Health, puts it: attending to sex differences is “one step toward the more individualized approach to human health that is the trajectory of medical practice and the aim of the Precision Medicine Initiative” (Clayton, 2016). In close alignment with SABV policy, the WBP proposes implementing a sex-stratified approach at every stage of AD research, modeling, assessment, diagnosis, and treatment, “taking into account sex and gender differences to make a precise diagnosis and recommend a tailored and more effective treatment for each individual” (Cirillo et al., 2020: 81).

The WBP, along with similar initiatives rooted in sex-based biology and promoting gender-specific medicine, presents sex-stratified approaches to AD as not only scientifically important but also ethically essential for addressing a history of androcentrism in biomedicine. Appealing to a gender equity imperative to include sex and gender (Cirillo et al., 2020), the WBP envisions a program of research and clinical risk prediction that is centrally oriented around the category of sex, organizing data and pursuing analysis within a framework of binary sex difference.

Contested science

Sex-stratified machine learning approaches to AD risk prediction and diagnosis introduce binary sex categories into research designs in a space of underdetermination, in which knowledge about the existence of sex differences in the epidemiology of AD and the role of social variables such as education, socioeconomic class, and occupation in producing any disparities is contested. As in many areas of sex disparities science (Danielsen et al., 2022; Einstein, 2017; Lee et al., 2023; Rushovich et al., 2023), in AD research, both the existence of sex disparities in AD incidence and the relative contribution of biological as opposed to gendered and other social variables to such sex disparities represent contested science. Contested science is science that is not only empirically underdetermined by currently available evidence, but also involves persistent and unresolved disputes over the fundamental categories of knowledge, relevance of different forms of evidence, and authority of particular forms of expertise (Gallie, 1955; Richardson, 2021).

Due in large part to demographics of aging, with women living longer than men (Arias et al., 2019) and increases in AD incidence with age (Hebert et al., 2013; Mayeda et al., 2016), a greater number of women live with AD than men. Evidence on sex/gender differences in AD beyond longevity remains equivocal and contested (Mayeda, 2019). For example, claims of excess AD mortality and morbidity among women (e.g., Buckley et al., 2019) face methodological challenges resulting from survival bias, competing risks of death from other causes, and measurement challenges (Mayeda, 2019). Apart from survival differences, reported disparities in AD risk may also be shaped by the gender/sex patterning of known, modifiable risk factors, as suggested by Geraets and Leist (2023) who, for example, found no sex difference in the risk of dementia but rather “differences in the prevalence of modifiable risk factors for dementia,” such as childhood deprivation and low wealth. Anstey et al. (2021), as another example, demonstrated that gendered differences in midlife cardiovascular risk factors, such as physical activity and hypertension, largely explain observed sex differences in cognitive decline. Likewise, a growing literature also supports the theory that known, modifiable risk factors of AD are patterned by gender/sex and gendered experiences (Dekhtyar et al., 2015; Sindi et al., 2021; Wolters et al., 2020), including gender identity (Brady et al., 2024). Moreover, any sex differences noted in AD incidence are “slight” compared to disparities observed across socioeconomic and racial/ethnic groups (Lim et al., 2022).
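A small worked example shows how such patterning can produce a sex difference in crude risk even when risk is identical for women and men at every level of a risk factor. The counts below are invented for illustration and are not drawn from any cited study; education stands in for any modifiable, socially patterned risk factor.

```python
# Hypothetical counts: (sex, education) -> (number of people, number of cases).
# Within each education level the case rate is the same for both sexes;
# only the distribution of education differs between the two groups.
COUNTS = {
    ("F", "low"): (600, 120),
    ("F", "high"): (400, 40),
    ("M", "low"): (300, 60),
    ("M", "high"): (700, 70),
}


def rate_by_sex(counts, sex):
    """Crude case rate for one sex category, ignoring education."""
    people = sum(n for (s, _edu), (n, _c) in counts.items() if s == sex)
    cases = sum(c for (s, _edu), (_n, c) in counts.items() if s == sex)
    return cases / people


def rate_within_stratum(counts, sex, education):
    """Case rate for one sex category at one education level."""
    n, cases = counts[(sex, education)]
    return cases / n
```

With these toy counts, the crude rate is 16% for the "F" group and 13% for the "M" group, while the rate within each education level is the same for both (20% at the low level, 10% at the high level). A model given the sex category but not education would find sex predictive here, although the category carries no information once education is known.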

While it is likely that AD emerges from a complex of biological (e.g., genetics, cardiovascular disease, protein plaque accumulation, protein tangles) and lifestyle and environmental factors (e.g., education, employment, caregiving burden, socioeconomic status, lifetime stress, traumatic brain injury exposure), there is little consensus about the relative importance of and interaction between biological and social variables in gender/sex disparities in AD. Research indicating an important role for gender-related social factors—such as education level and occupation (e.g., Garibotto et al., 2008; Vemuri et al., 2014), experiences of violence and trauma (e.g., Severs et al., 2023), and social isolation (e.g., Shen et al., 2022)—in AD has greatly expanded over the past decade, yet considerable research effort and funding has been and continues to be predominantly directed toward biological mechanisms. For instance, the Alzheimer's Association distributed over half of its annual research funding to areas of “Molecular Pathogenesis and Physiology” and “Diagnosis, Assessment and Disease Monitoring” in 2022 and 2023 (Alzheimer's Association, n.d. a, n.d. b, n.d. c). Starting in 2018, the NIH's National Institute on Aging convened an effort to develop a “new biological research framework for Alzheimer's” that focuses on biomarker data (Silverberg et al., 2018). Biomedical researchers who advocate a “sex-specific or gender-specific focus in AD research” (Mielke et al., 2014) operate in this milieu and have set a research agenda at the highest levels of government, foundation, and pharmaceutical agencies focused on the discovery of biologically-driven, sex-specific differences in the prodromal (stage between initial symptoms and full disease onset), diagnostic, treatment, or late stage of the disease that must be taken into account in the development of therapies, screenings, and risk prediction tools (Nebel et al., 2018; Pike, 2017).

The persistence of biological explanations of observed sex disparities may reflect entrenched but outdated theories in Alzheimer's research, which have centered on hormonal explanations of AD since the 1990s. More specifically, growing interest in hormone therapy (HT) for postmenopausal women in the 1980s raised the hypothesis that estradiol decline plays a role in sex differences in AD. Indeed, the prospect of reducing AD risk became central to the explosion of hype around HT during the 1990s, with proponents suggesting that the therapy reduced the risk of cognitive decline in women (Henderson et al., 1994, 1996; Schmidt et al., 1996). Estradiol was not only touted as a potential cure for, or preventive measure against, AD; HT proponents also gestured towards a sex-specific etiological theory of AD, helping to cement, in the eyes of both researchers (Fillit et al., 1986) and the general public (Fillit, 1986), the idea of AD as a female-biased medical condition, linked to female sex-related biology.

By the 2000s, however, the tide had shifted sharply away from estradiol theories as new studies revisited the efficacy of HT and called into question the safety of long-term estradiol use. In 1992, the NIH started the Women's Health Initiative (WHI), a study intended to evaluate the efficacy and safety of HT as a preventative measure against heart disease and other health events. By 2002, the WHI elected to halt all study arms that involved estrogens and progestin treatments, citing increased risks for breast cancer, strokes, heart attacks, and other conditions (Writing Group for the Women's Health Initiative Investigators and Rossouw, 2002; for more recent developments see Manson et al., 2024). By 2004, the WHI canceled all programs involving any form of estrogen treatment. In addition to raising concerns about the risk of HT, the preliminary findings from the WHI research program on memory (Women's Health Initiative Memory Study) suggested that estrogen treatment could, in fact, increase dementia risk in women aged 65 years or older (Shumaker et al., 2003, 2004). Despite this, research on the role of biological sex-related factors in AD remains a top priority of major AD research funders, who are interested in molecular biological mechanisms and pathways for understanding AD pathology, including the interaction between apolipoprotein E ε4 (APOE; a risk factor for late-life AD) genotype and chromosomal sex (Riedel et al., 2016) and enduring interest in the role of the postmenopausal reproductive transition (Scheyer et al., 2018).

In sum, today, sex and gender research on AD is characterized by extensive debate and uneven evidence about the contributions of sex and gender factors to AD. As the examples in the following section demonstrate, the application of machine learning to this contested field of science can further obfuscate the role of gender/sex in disease.

Sex-stratified predictive algorithms in Alzheimer's precision medicine science: Examples

To characterize and understand models, claims, and assumptions within the emergent use of gender/sex categories in predictive precision medicine approaches, we analyzed three papers applying sex-stratified predictive algorithms in AD precision medicine science. To identify these papers, we surveyed biomedical literature that uses sex variables in predictive models for estimating AD risk. Using a keyword-driven snowball search in Google Scholar (keywords: Alzheimer's, dementia, prediction, predictive model, risk, sex, gender, algorithm, and sex-stratified), we identified 25 articles for close analysis (see Supplementary 1). From this pool, the authorship team of scholars of gender and sexuality studies, history and philosophy of science, public health, and social studies of science, selected three articles that represent emergent strategies for incorporating sex/gender categories in machine learning precision medicine, aligned with the vision of the WBP. We emphasize that these examples do not represent a systematic review of the field and that our focus is not on critiquing individual researchers, but on illuminating assumptions and approaches emerging in this particular area.

Across the 25 papers reviewed, studies use a variety of tools such as neural networks, decision trees, or classifier models. These models compute an AD risk score or assign an AD disease status on the basis of individual patient demographics, biomarkers, and/or behavioral data. In cases such as the WBP, sex categories are a central analytic because sex disparities and sexed disease pathways are an explicit focus of the research program. Elsewhere, sex categories are routinely incorporated in study designs for other reasons, including: the assumption that sex is always an important moderating factor in AD; institutional mandates by funders and publishers that gender/sex categories be included in biomedical research (e.g., “NIH Policy on Sex as a Biological Variable,” see Arnegard et al., 2020); a desire to make tools for clinical settings, where the sex category is a ubiquitous demographic variable accessible to physicians; or an everything-but-the-kitchen-sink approach that inputs all available variables and allows an algorithm to select the most predictive ones.

Below, we describe the three studies and characterize their methods. In Section 5, we reference these examples as we identify three interconnected processes (effacing contested knowledge, obscuring the social, and ossifying the sex binary) that occur in the application of sex-stratified algorithms. These processes can fix attention on binary sex in the explanation of disparities in disease burden and lead to a biological sex essentialist inference that gender/sex differences are driven by some property essential to maleness or femaleness. Though we emphasize that the three processes are interconnected and multiply manifested in publications within this field, for reasons of clarity, we illustrate each process using one example. We use the first example, Qiu et al. (2020), to demonstrate how contested knowledge is effaced when sex is incorporated as a risk predictor without acknowledgement of ongoing debates about sex differences in AD risk. Through the second example, Ang et al. (2019), we show how the social is obscured as observed differences in AD tests are assumed to reflect biological rather than social differences. Finally, Harms et al. (2022) illuminate how the sex binary is ossified through a machine learning classifier that groups participants into two sex categories.

Calculating individualized AD risk

Our first example, published in 2020, comes from the journal Brain. The study, conducted by Boston University computational biologist Shangran Qiu and colleagues, aimed to develop an algorithm for calculating individualized AD risk that can help neurologists in clinical contexts, especially those in under-resourced hospitals with limited, often incomplete patient data (Qiu et al., 2020). To do this, the team trained a neural network to take an MRI scan as input and generate a “disease probability map”: “a precise, intuitive visualization of individual Alzheimer's disease risk” (2020: 1920) across a person's brain anatomy. As illustrated in Figure 2, they then trained a second neural network to use this probability map to predict whether the individual has AD, testing its performance on three external datasets. The study found that adding non-neuroimaging data, specifically Mini Mental State Examination score (MMSE, a common questionnaire for assessing cognitive impairment), age, and “gender,” as inputs in the second neural network increased predictive performance by up to 15% in some datasets. The team's neural networks performed more accurately than a team of 11 neurologists who examined the MRI scan, age, gender, and MMSE score, prompting the news headline “Algorithm Beats Experts in Alzheimer's Diagnosis” (Jahnke, 2020).

Figure 2.

Part of a figure from Qiu et al. (2020), illustrating a neural network that combines neuroimaging and non-neuroimaging inputs, including “gender,” to predict disease status for the patient (AD or “normal cognition,” NC).

This study operationalizes “gender” as “male” or “female.” The authors thereby depart from conventional definitions of gender and offer no details as to how their “male” and “female” categories are defined, assessed or reported. In this study, “gender” is included because it is easily available and usable in clinical practice. The researchers describe gender and other non-imaging variables as “known Alzheimer's disease risk factors … easily obtained by non-Alzheimer's disease specialists” (Qiu et al., 2020: 1925). In contrast to other model inputs, the inclusion of gender is not elaborated. For example, the researchers further contextualize the inclusion of MMSE scores as a current standard of diagnosis. Likewise, they justify the inclusion of age as a means of controlling for “the natural progression of cerebral morphological changes over the lifespan,” citing literature showing a clear “proportionality between age and global cerebral atrophy” (Qiu et al., 2020: 1928).

Although “gender” is included here without a hypothesis about its causal relationship to AD, the authors report that “when age, gender and MMSE information were added to the model, then the performance increased significantly” (Qiu et al., 2020: 1928). Together with the statement that gender is a “known Alzheimer's disease risk factor,” the research team's findings imply that machine learning tools have identified binary sex categories as offering valuable predictive information about AD risk.

Sex-specific algorithms for AD diagnosis

Our second example is a 2019 paper published in the journal Alzheimer's and Dementia by a Boston-based team of researchers in computational biomedicine, neurology, and epidemiology. Using data from the Framingham Heart Study, the authors aimed to reduce subjectivity and variability in neurologists’ clinical assessments by providing a more “objective” (Ang et al., 2019: 264), computationally driven diagnostic process, and to develop screening algorithms that better capture the full heterogeneity of AD. They use an algorithm to combine and assess results from multiple neuropsychological tests, stratified by “different demographic and AD risk factors” (Ang et al., 2019: 265). Their primary goal was to build and compare different decision tree algorithms capable of classifying participants as having Alzheimer's disease, non-Alzheimer's dementia, or no dementia, using results from a wide variety of neuropsychological tests.

To do this, the authors stratified the dataset by sex, creating two sex-specific branches—one for males and one for females—in the decision tree algorithm. They then applied feature selection algorithms to the male and female subpopulations, which selected the most informative neuropsychological tests for predicting cognitive outcomes. The results showed that the set of top five most predictive tests for AD status in males differed from the set of top five most predictive tests in females, generating two different decision tree computations for males and females. Although the authors also performed stratifications by education level (receiving high school education and above, or not) and APOE ε4 status and found that optimal neuropsychological test profiles also differed in these stratifications, they elaborate only the results for sex stratification in the paper's discussion section.
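The general procedure described here (split the data by a grouping variable, rank the candidate tests within each stratum, and compare the top-ranked sets) can be sketched in a few lines of Python. This is not the authors' method: the ranking below uses absolute Pearson correlation as a stand-in for whatever feature selection algorithms the study applied, and the rows, field names (`test_a`, `test_b`, `dementia`), and values are invented for illustration.

```python
from collections import defaultdict
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either variable is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def top_k_by_stratum(rows, stratify_by, features, outcome, k):
    """Rank features by |correlation with outcome| separately in each stratum."""
    strata = defaultdict(list)
    for row in rows:
        strata[row[stratify_by]].append(row)
    ranked = {}
    for label, group in strata.items():
        ys = [r[outcome] for r in group]
        scores = {f: abs(pearson([r[f] for r in group], ys)) for f in features}
        ranked[label] = sorted(features, key=lambda f: scores[f], reverse=True)[:k]
    return ranked


FEATURES = ["test_a", "test_b"]
# Hypothetical records; every value is invented for this example.
ROWS = [
    {"sex": "F", "education": "high", "test_a": 1, "test_b": 1, "dementia": 0},
    {"sex": "F", "education": "high", "test_a": 2, "test_b": 2, "dementia": 0},
    {"sex": "F", "education": "low", "test_a": 3, "test_b": 1, "dementia": 1},
    {"sex": "F", "education": "low", "test_a": 4, "test_b": 2, "dementia": 1},
    {"sex": "M", "education": "low", "test_a": 1, "test_b": 1, "dementia": 0},
    {"sex": "M", "education": "low", "test_a": 2, "test_b": 2, "dementia": 0},
    {"sex": "M", "education": "high", "test_a": 1, "test_b": 3, "dementia": 1},
    {"sex": "M", "education": "high", "test_a": 2, "test_b": 4, "dementia": 1},
]

by_sex = top_k_by_stratum(ROWS, "sex", FEATURES, "dementia", k=1)
by_education = top_k_by_stratum(ROWS, "education", FEATURES, "dementia", k=1)
```

In this toy dataset, stratifying by sex selects `test_a` for the "F" stratum and `test_b` for the "M" stratum, and stratifying the same rows by education also yields different top tests for the two strata. Divergent rankings across strata therefore do not, on their own, indicate which grouping variable accounts for the divergence.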

The research team interprets these different optimal neuropsychological test profiles as evidence of a need for sex-specific algorithms and decision rules in AD screening. They also hypothesize that the sex-specific decision trees likely differ for males and females due to sex-related “cognitive heterogeneity” (Ang et al., 2019: 268) in AD phenotype, which might include sex differences in performance on cognitive tests and/or differential impacts of disease on cognitive domains in men as compared to women. The researchers conclude in support of the view that machine learning offers a promising approach for developing tailored diagnostics and “sex-specific decision rules” for AD (Ang et al., 2019: 268). Several co-authors of this paper have since collaborated with WBP researchers on a deeper probe to develop “sex-specific predictive models” based on male and female neuropsychological profiles (Ferretti et al., 2024). The result of this sex-specific machine learning approach to AD is to generate separate diagnostic or screening algorithms for men and women—creating pink and blue algorithms.

Predicting patient sex from AD data

Our third example is a paper from the WBP's research program published in 2022 in the European Association for Predictive, Preventive and Personalized Medicine Journal (Harms et al., 2022). It was co-authored by researchers from WBP in partnership with the company Altoida, Inc., whose mission is “to unleash the power of digital biomarkers to ignite an era of precision neurology” (Altoida, 2022).

In this study, the authors demonstrated that a machine learning classifier (a type of model that predicts the category of a given input) could use digital biomarker data to predict the self-reported sex of healthy patients. Using 793 biomarkers capturing features of cognitive processing and physiology, which were collected from participants’ completion of two motor tasks and two augmented reality tests, the sex classifier demonstrated that it could successfully distinguish the sex of participants with good predictive performance (0.75 AUC). This finding is, in turn, interpreted as validating the importance of sex classifiers in “precision neurology” (Harms et al., 2022: 310). The WBP-Altoida research team interprets the classifier's performance in predicting the sex of healthy subjects as evidence of sex differences in baseline healthy neurocognitive performance. As the authors write, “sex differences [are] expressed by the capacity of results to inform a sex classifier” (Harms et al., 2022: 302).
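
To give a rough sense of what an AUC of 0.75 measures, the sketch below converts an AUC into a standardized group difference and a distribution overlap. It rests on a simplifying assumption that is ours, not the study's: that the classifier's output behaves like a single score that is normally distributed with equal variance in each group. The function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def binormal_gap_and_overlap(auc):
    """Under an equal-variance two-normal assumption, convert an AUC into
    (standardized mean difference, overlap coefficient of the two distributions)."""
    # For two unit-variance normals separated by d, AUC = Phi(d / sqrt(2)).
    d = sqrt(2) * NormalDist().inv_cdf(auc)
    # Shared area under the two density curves, between 0 and 1.
    overlap = NormalDist(0, 1).overlap(NormalDist(d, 1))
    return d, overlap

d, overlap = binormal_gap_and_overlap(0.75)
print(f"standardized difference: {d:.2f}, distribution overlap: {overlap:.2f}")
```

Under this assumption, an AUC of 0.75 corresponds to score distributions that still share well over half of their area, so a classifier can perform at this level while many individuals in the two categories have similar scores.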

The researchers further found that the classifier performed more poorly on a dataset of patients with mild cognitive impairment or AD than on healthy patients. They interpret this difference in the classifier's predictive performance between healthy and AD subjects as evidence that sex-based neurocognitive profiles change across the course of AD progression. In other words, they interpret the differential performance of classifier algorithms as evidence of both a sex difference in healthy patients and a sex difference in how AD progresses.

In this case, the perceived successful use of neurocognitive data to predict sex is interpreted as evidence of a sex difference in disease progression, inferring etiological significance from a dataset's performance in categorizing sex. As such, Harms et al. illustrate how algorithms can be mobilized not only to generate pink and blue disease-predictive models, but also in reverse, to predict people's sex category. The researchers conclude by calling for an “integrated framework for sex-stratified prediction, monitoring, and personalized treatment” (Harms et al., 2022: 310), which they argue has clinical significance for early disease detection and tailoring preventative treatment for those at-risk or in early stages of dementia, and can guide the development of digital diagnostic and preventative tools.

From sex stratification to biological sex essentialism

Scholars across information science, science and technology studies, public health, and gender studies have warned of the complexities and risks of uncritically embracing big data and machine learning. These include the risks of perpetuating discrimination in areas such as policing, employment, healthcare, and biometrics (Benjamin, 2019; Chun, 2021; Hu and Kohler-Hausmann, 2020; Pierson, 2024; Scheuerman et al., 2021; Selbst and Barocas, 2018; Wang et al., 2023), as well as the dangerous bioessentialist trade-offs of “inclusion and difference” approaches to correcting histories of sexism in medicine and advancing women's health (Epstein, 2007; Keyes et al., 2020; Richardson et al., 2015). These literatures motivate a concern that algorithmic practices may uncritically bake essentialized gender/sex markers of human difference into medicine.

Limited previous work that has looked specifically at the inclusion of gender/sex variables in medical algorithms has flagged the exclusion and omission of women and gender minorities from research and the potential for precision data platforms to inadequately capture social dimensions of sex and gender oppression (Pot et al., 2019). Writing about the exclusion of nonbinary individuals in algorithms for calculating body composition, Albert and Delano (2022a) highlight how the use of binary sex categories perpetuates “category-based erasure, the idea that although a particular group or subgroup of people may be present in a dataset, categories have been constructed in such a way that their presence cannot be determined one way or the other” (Albert and Delano, 2022a: 4). For example, nonbinary or intersex persons may be included in a dataset but lumped together within male/men or female/women categories, invisibilizing them within the structure of research categories. Consequently, gender/sex data in electronic health records—which are frequently used for machine learning because of their scale and ubiquity—exhibit, among other things, “slippage” between sex and gender variables; ambiguity of what a sex category refers to (genitals, sex assigned at birth, chromosomes, etc.); and fixation on sex assigned at birth as ground truth (Albert and Delano, 2022b).

Sex-stratified precision medicine algorithms clearly raise concerns about sex/gender slippage and category-based erasure. But they also raise additional, distinct issues, demonstrated by our analysis of their application to AD research. Here, in the face of contested science, purportedly causally agnostic machine learning approaches carry the potential to introduce mutually reinforcing, looping processes that sediment essentialist, binary, and biological approaches to explaining disparities in human health.

In the three examples above, sex is incorporated into models without an explicit hypothesis about the causal pathways between sex and AD outcomes, that is, in a manner agnostic to the reasons for the predictive value of sex variables. This is a common feature of precision medicine machine learning approaches, which are often solely or primarily interested in predictive power, in contrast to approaches aimed at identifying the causal mechanisms. Predictive machine learning-based research need not posit a causal role for sex in AD etiology (the causal pathway of a disease). Likewise, it need not resolve whether sex variables are a measure of biological factors and/or a proxy for other variables, such as gender or gendered exposures. Such causal agnosticism is often touted as an advantage of predictive machine learning (Anderson, 2008; Mayer-Schönberger and Cukier, 2013).

However, examining sex-stratified algorithms in AD precision medicine research makes clear how this causal agnosticism can work to efface contested knowledge about the contributors to observed sex differences in health status. For instance, in Example 1, Qiu et al. (2020), female sex is referenced as a “known Alzheimer's disease risk factor” and incorporated into the neural network without acknowledgement of the ongoing debates about the underlying determinants of female risk. Causally agnostic machine learning strategies enter a space of speculation, hype, uncertainty, and contestation about the importance of sex differences in understanding the distribution of health and disease. Predictive machine learning models are purportedly agnostic to complicated questions of causality, described above in Section 3, that have historically divided the field of research on gender/sex disparities in AD. Nevertheless, these models operate in a world in which binary sex is ubiquitously available in datasets, and biomedical research programs are higher-resourced and hold greater epistemological status than research programs on social variables. Qiu et al. incorporate sex as a predictive variable without engaging the contestations over what that variable captures socially or biologically.

When sex-stratification in algorithms also works to obscure the role of social factors in health outcomes, the utilization of sex in predictive models without hypothesis or mechanism is likely to endorse and sediment essentialist, binary, and biological approaches to explaining health disparities. This can be seen in Example 2, the Ang et al. (2019) study. Observed sex differences in optimal neuropsychological test profiles could reflect differences in gendered educational and/or occupational experiences—which impact testing comfort and performance in these birth cohorts—rather than inherent differences between groups in aspects of sex-linked biology. However, the question of whether these findings are due to gender/sex differences in testing or measurement validity, rather than underlying AD heterogeneity, goes unasked. Here, the social is obscured.

The risks of effacing contested knowledge and obscuring the social are amplified by the potential for looping, self-confirmatory processes to ossify categorical machine learning approaches in biomedical and population health research. When sex makes a difference to the algorithm's accuracy or decision nodes, and contested knowledge about the complex role of gender/sex-related variables is sidestepped, some researchers use this to call for sex-based risk assessment in healthcare. We see this, for instance, in Example 3, Harms et al. (2022), in which the authors conclude that “more data on sex differences could guide future clinical practice, informing choices for ad hoc prevention (knowing sex-specific risk profiles), diagnosis (adjusting diagnostic cut-offs by sex), and treatment options (if sex specific efficacy and safety profiles will be found)” (2022: 310). Here, the sex binary is ossified through the use of a machine learning classifier that groups participants into two sex categories, supporting the idea that quantitative analysis of digital biomarkers reveals and validates maleness and femaleness as discrete, binary categories, informative for clinical practice. The pursuit of sex-stratified algorithms also loops back to the laboratory to substantiate the search for sex differences in biomedical research and validate the legitimacy of binaries in data collection practices, which ultimately provides more data suitable for sex-stratified algorithms. For example, Ferretti et al. (2024), a follow-up to Ang et al. (2019), call for future work to “identify the biological underpinning of such sex-related differences in [neuropsychological test] performance and strategy” (2024: 1113). In this way, the propagation of sex-stratified algorithms could become internally sustaining. If such algorithms are ultimately applied in the clinic, this portends the routinization and naturalization of binary sex categories in predictive tools as part of future medical infrastructure.

Sex-stratified predictive algorithms in brain-related conditions use binarized data that fuses diverse demographic, biomedical, and social measures, capturing signals from a range of contextually situated sexed and gendered variables that systematically vary across gender/sexed bodies, but which cannot always be causally attributed to sex-related biology. Because the distribution of disease as well as lifestyle factors and social and environmental exposures are known to vary considerably across genders/sexes, we can expect that such approaches will likely return models of AD risk that, by some quantitative measure, “work” better for women or for men, and that appear to confirm the hypothesis that women and men carry different vulnerabilities for dementias. For instance, if sex functions as a proxy for occupational hazards that were gender-specific in the context of the cohorts used for testing and validation, adding sex as a variable or stratifying by sex will improve the model's predictive performance for that cohort, even though sex is not mechanistically related to AD diagnosis.
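
The proxy scenario in the preceding paragraph can be made concrete with a small simulation. This is a sketch under stated assumptions: all probabilities are invented for illustration, the outcome is generated from a hypothetical occupational exposure only, and the sex tag influences nothing except the chance of exposure.

```python
import random

def auc(scores, labels):
    """Rank-based AUC (Mann-Whitney U divided by n_pos * n_neg); ties get average ranks."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # average 1-based rank for the tied run
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)

def simulate(n=20000, seed=0):
    """Return (sex, exposed, disease) tuples; disease depends on exposure only."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        sex = rng.randint(0, 1)                # binary sex tag as recorded in a dataset
        p_exposed = 0.7 if sex == 1 else 0.2   # hypothetical gendered occupational exposure
        exposed = 1 if rng.random() < p_exposed else 0
        p_disease = 0.30 if exposed else 0.10  # no direct path from sex to outcome
        disease = 1 if rng.random() < p_disease else 0
        rows.append((sex, exposed, disease))
    return rows

rows = simulate()
overall_auc = auc([r[0] for r in rows], [r[2] for r in rows])
within_auc = {}
for level in (0, 1):
    subset = [r for r in rows if r[1] == level]
    within_auc[level] = auc([r[0] for r in subset], [r[2] for r in subset])

print(f"sex alone, whole cohort: AUC = {overall_auc:.3f}")
for level, value in within_auc.items():
    print(f"sex alone, exposure = {level}: AUC = {value:.3f}")
```

In the whole cohort the sex tag predicts the outcome better than chance, yet within each exposure level it performs at about chance: in this toy setup, all of its predictive value is borrowed from the exposure it stands in for.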

The outcome is a binary sexed model of AD, that is, a model of separate “male” and “female” Alzheimer's diseases, each with different etiologies, courses, and outcomes. Indeed, the vision of a future of precision medicine guided by sex-specific algorithms, exemplified by the WBP, is a drive toward separate testing protocols, diagnosis metrics, and preventative and treatment regimes for males and females, enabled by binary sex stratification in data collection and machine learning analyses. Such protocols and treatments will likely prove less effective not only for many nonbinary, intersex, transgender, and gender expansive individuals, but also for cisgender men and women whose biomarker profiles deviate from the mean of their sex's distribution. In the form of bioessentialist “pink algorithm/blue algorithm” claims, sex-stratified precision medicine exits the laboratory into a world of powerful misogynist beliefs about sex differences in intellectual ability and cognitive strengths, which contribute to stigmatizing stereotypes in a range of arenas including education, career, and economic potential (Fine, 2010; Jordan-Young, 2011).

Everyday encounters with the technologies developed using these algorithms, many of which are intended for use in the clinic as well as at home or in direct-to-consumer devices, may directly help to construct gendered subjectivities in which people understand themselves to be part of a category with particular cognitive-behavioral risks, potentialities, and baselines. These data may be tracked and used in other areas, such as health insurance premiums (Sadowski, 2024), in an unequal way that further disadvantages those with higher predicted risk for AD. Companies that have commercialized digital biomarker data and cognitive performance metrics may profit from targeting women and heightening their anxieties about female risk for AD. When used in interaction with physicians, educational institutions, and other social institutions, these technologies risk contributing to the othering, exclusion, and stigma of nonbinary, trans, and intersex people in medical knowledge and at the clinical interface, for whom binary logics are particularly incoherent. Moreover, binarized algorithms can be co-opted by non-scientific actors, such as lawmakers and policymakers, to lend scientific legitimacy to trans-exclusionary laws and regulations (Sudai et al., 2022). This is all in addition to the potential harm to biomedical knowledge, where sex-binary tools, once deployed and standardized widely in the clinic, are difficult to revise or remove, creating intellectual and infrastructural barriers to pursuing scientific questions outside of this binary (Pape et al., 2024; Richardson, 2022; Richardson et al., 2015).

Conclusion and recommendations

AD leads to significant suffering and its prevalence will likely increase in the future as populations age. While there is no cure for AD, researchers hope that risk prediction algorithms may convince people to make lifestyle changes that might delay symptoms. It is understandable that researchers and policymakers want to predict risk of AD onset, understand the causal mechanisms behind it, quantify and track the burden of the disease, and develop tools that will help support those with the condition and their caretakers. Equally, it is commendable that researchers wish to correct for and avoid repeating histories of sexism in medicine and biomedical research. It is also vital to attend to sex-related biases in datasets and their implications for inaccuracies and error rates for people tagged with a specific sex. But risk prediction involving socially salient categories such as sex and gender can also bring harm.

Sex-stratified approaches to risk prediction in precision medicine are rapidly advancing with minimal social, ethical, conceptual, and methodological dialogue around these practices. The specific case of sex-stratified algorithms in AD research illustrates how bold programs that build binary sex categories into algorithmic approaches to disease risk prediction are moving forward despite significant ongoing uncertainty among health researchers regarding any causal relationship between biological sex and AD—and even over whether gender/sex disparities exist in the disease at all, and if they do, the magnitude of these effects relative to other drivers of AD vulnerability, such as education and occupational history.

AD is but one example of a broader move toward sex-specific metrics, cutoffs, and diagnostics across biomedicine. Sex-stratified algorithms are currently under development across a range of domains and diseases, such as predicting opioid use (Bright et al., 2021) and liver disease (Straw and Wu, 2022), diagnosing cardiac disease (Bermúdez-López et al., 2022), and assessing ACL knee injury risk (Beynnon et al., 2015). Further analysis of this landscape is needed to understand the prevalence of sex variables in precision medicine research designs using machine learning, characterize how sex constructs are operationalized, and examine the specific assumptions that inform sex-stratification using binary sex categories.

This is challenging, especially given that existing data on sexed and gendered social factors is often limited and of poor or biased quality (e.g., Pot et al., 2019). There are, nonetheless, ways to engage in more ethical and precise research when attending to gender/sex variables. For example, D'Ignazio and Klein (2020) have proposed data collection, interpretation, and visualization practices to align data science with principles of gender equity and diversity. Scheuerman and Brubaker (2018) suggest a model of participatory design workshops to develop more trans-inclusionary algorithms. Theoretical developments, such as sex contextualism (Richardson, 2022), offer frameworks for systematically engaging in research on sex-related variation outside of a binary model, acknowledging contextual and pragmatic limitations on interpreting sex-tagged data, and testing plural hypotheses about the structure of sex and gender-related variation in relation to an outcome of interest. In some instances a solution may involve eliminating sex and gender variables, while in others it may mean a more precise, contextualized use of those variables. As discussions on race-related variables in clinical algorithms demonstrate, it is possible for the research community to productively come together around standards for the ethical use of a contested social category in algorithmic research in a way that advances the rigor and precision of the use of these categories in biomedicine.

In summary, we conclude that it is reasonable to anticipate that the current pursuit of sex-stratified precision medicine platforms using AI-informed tools will contribute to the acceleration of unjustified biological sex essentialist assumptions and frameworks in medicine, from data collection, to the laboratory and computational analysis, to the clinic, to health policy and economics, health-adjacent wellness discourses and consumer devices, and ultimately legal and folk understandings of sex as a biological category. In these ways, the naturalization of sex as a predictor of risk in machine learning and related precision medicine approaches is poised to repeat the mistakes of other uncritical uses of social categories, perpetuating crude ontologies of sex and gender that are ill-suited to both precision and health equity.

Supplemental Material

Supplemental material, sj-docx-1-bds-10.1177_20539517251381674 for Sex in the medical machine: How algorithms can entrench bioessentialism in precision medicine by Kelsey Ichikawa, Marion Boulicault, Alex Thinius, Marina DiMarco, Audrey R Murchland, Ben Maldonado, Abigail S Higgins and Sarah S Richardson in Big Data & Society

Footnotes

Acknowledgements

The authors would like to thank the members of the Harvard GenderSci Lab, particularly Rory Brinkmann and Kai Jillson, for their invaluable contributions to this research project. Kendra Albert, Solon Barocas, Seetha Davis, Gillian Einstein, Kadija Ferryman, Nancy Krieger, David Jones, and Brian Liu offered detailed comments on earlier drafts of the paper. We are also grateful to the many Alzheimer’s researchers and computational scientists who shared their time and expertise.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Robert Wood Johnson Foundation and the Nederlandse Organisatie voor Wetenschappelijk Onderzoek (grant number: 79892, 019.221SG.009).

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Supplemental material

Supplemental material for this article is available online.

References

Albert

Delano

(2022a) Algorithmic exclusion. SSRN scholarly paper. Available at: https://papers.ssrn.com/abstract=4122529 (accessed 15 May 2024).

Albert

Delano

(2022b) Sex trouble: Sex/gender slippage, sex confusion, and sex obsession in machine learning using electronic health records. Patterns 3(8): Article 100534.

Altoida . (2022) About Altoida. Available at: https://altoida.com/company/ (accessed 21 August 2024).

Alzheimer’s Association . (n.d. a) Portfolio summaries. Available at: https://alz.org/research/for_researchers/grants/portfolio_summaries (accessed 21 August 2024a).

Alzheimer’s Association . (n.d. b) Women and Alzheimer’s. Available at: https://alz.org/alzheimers-dementia/what-is-alzheimers/women-and-alzheimer-s (accessed 15 May 2024b).

Alzheimer’s Association . (n.d. c) Sex and gender in Alzheimer’s (SAGA 23) award program. Available at: https://alz.org/research/for_researchers/grants/types-of-grants/sex_and_gender_award_program (accessed 15 January 2023c).

Anderson

(2008) The end of theory: The data deluge makes the scientific method obsolete. Wired, 23 June. Available at: https://www.wired.com/2008/06/pb-theory/ (accessed 15 May 2024).

Ang

TFA

Ding

, et al. (2019) Using data science to diagnose and characterize heterogeneity of Alzheimer’s disease. Alzheimer’s & Dementia : Translational Research & Clinical Interventions 5: 264–271.

Anstey

Peters

Mortby

, et al. (2021) Association of sex differences in dementia risk factors with sex differences in memory decline in a population-based cohort spanning 20–76 years. Scientific Reports 11(1): 7710.

10.

Arafah

Khatoon

Rasool

, et al. (2023) The future of precision medicine in the cure of Alzheimer’s disease. Biomedicines 11(2): 335.

11.

Arias

Kochanek

(2019) United States Life tables, 2016. Epub ahead of print 7 May 2019.

12.

Arnegard

Whitten

Hunter

, et al. (2020) Sex as a biological variable: A 5-year progress report and call to action. Journal of Women’s Health 29(6): 858–864.

13.

Behl

Kaur

Sehgal

, et al. (2022) The road to precision medicine: Eliminating the “one size fits all” approach in Alzheimer’s disease. Biomedicine & Pharmacotherapy 153: Article 113337.

14.

Benjamin

(2019) Race after Technology: Abolitionist Tools for the New Jim Code. Cambridge, UK; Medford, MA: Polity.

15.

Bermúdez-López

Martí-Antonio

Castro-Boqué

, et al. (2022) Development and validation of a personalized, sex-specific prediction algorithm of severe atheromatosis in middle-aged asymptomatic individuals: The ILERVAS study. Frontiers in Cardiovascular Medicine 9: 895917.

16.

Beynnon

Sturnick

Argentieri

, et al. (2015) A sex-stratified multivariate risk factor model for anterior cruciate ligament injury. Journal of Athletic Training 50(10): 1094–1096.

17.

Brady

Zheng

Kootar

, et al. (2024) Sex and gender differences in risk scores for dementia and Alzheimer’s disease among cisgender, transgender, and non-binary adults. Alzheimer’s & Dementia 20(1): 5–15.

18.

Braun

Wentz

Baker

, et al. (2021) Racialized algorithms for kidney function: Erasing social experience. Social Science & Medicine 268: Article 113548.

19.

Bright

Langerveld

DeVuyst-Miller

, et al. (2021) Identification of a sex-stratified genetic algorithm for opioid addiction risk. The Pharmacogenomics Journal 21(3): 326–335.

20.

Buckley

Waller

Masters

, et al. (2019) To what extent does age at death account for sex differences in rates of mortality from Alzheimer disease? American Journal of Epidemiology 188(7): 1213–1223.

21.

Castro-Aldrete

Moser

Putignano

, et al. (2023) Sex and gender considerations in Alzheimer’s disease: The women’s brain project contribution. Frontiers in Aging Neuroscience 15: Article 1105620.

22.

Celeste

Ming

Broce

, et al. (2023) Ethnic disparity in diagnosing asymptomatic bacterial vaginosis using machine learning. npj Digital Medicine 6(1): 1–10.

23.

Chun

WHK

(2021) Discriminating Data: Correlation, Neighborhoods, and the New Politics of Recognition. Cambridge, Massachusetts: The MIT Press.

24.

Cirillo

Catuara-Solarz

Morey

, et al. (2020) Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare. npj Digital Medicine 3: 1–11.

25.

Clayton

(2016) Minority Health: A Milestone on the Road to Precision Medicine. Available at: https://orwh.od.nih.gov/about/director/messages/milestone-precision-medicine (accessed 2 November 2021).

26.

Danielsen

Lee

Boulicault

, et al. (2022) Sex disparities in COVID-19 outcomes in the United States: Quantifying and contextualizing variation. Social Science & Medicine 294: Article 114716.

27.

Dekhtyar

Wang

H-X

Scott

, et al. (2015) A life-course study of cognitive reserve in dementia—from childhood to old age. The American Journal of Geriatric Psychiatry 23(9): 885–896.

28.

Delgado

Baweja

Crews

, et al. (2022) A unifying approach for GFR estimation: Recommendations of the NKF-ASN task force on reassessing the inclusion of race in diagnosing kidney disease. American Journal of Kidney Diseases 79(2): 268–288.e1.

29.

Denny

Collins

(2021) Precision medicine in 2030—Seven ways to transform healthcare. Cell 184(6): 1415–1419.

30.

D’Ignazio

Klein

(2020) Data Feminism. Cambridge, MA: The MIT Press. Available at: https://direct.mit.edu/books/book/4660/Data-Feminism (accessed 26 August 2024).

31.

Einstein

(2017) Sex and gender in health: The world writes on the body. In: Legato

Glezerman

(eds) The International Society for Gender Medicine. London: Academic Press, 45–55. Available at: https://www.sciencedirect.com/science/article/pii/B9780128118504000065 (accessed 15 May 2024).

32.

Eneanya

Boulware

Tsai

, et al. (2022) Health inequities and the inappropriate use of race in nephrology. Nature Reviews Nephrology 18(2): 84–94.

33.

Epstein

(2007) Inclusion: The Politics of Difference in Medical Research. Chicago: University of Chicago Press.

34.

Erikainen

Chan

(2019) Contested futures: Envisioning “personalized,” “stratified,” and “precision” medicine. New Genetics and Society 38(3): 308–330.

35.

Fausto-Sterling

(2000) Sexing the Body: Gender Politics and the Construction of Sexuality. New York: Basic Books.

36.

Ferretti

Dimech

Chadha

(2021) Sex and Gender Differences in Alzheimer’s Disease: The Women’s Brain Project. London, England: Academic Press.

37.

Ferretti

Ding

, et al. (2024) Maximizing utility of neuropsychological measures in sex-specific predictive models of incident Alzheimer’s disease in the Framingham Heart Study. Alzheimer’s & Dementia 20(2): 1112–1122.

38.

Ferretti

Iulita

Cavedo

, et al. (2018) Sex differences in Alzheimer disease—The gateway to precision medicine. Nature Reviews Neurology 14(8): 457–469.

39.

Ferryman

Pitcan

(2018) What is precision medicine? 26 February. Data & Society.

40.

Fillit

(1986) Might estrogen prevent memory loss? - Free Online Library. Saturday Evening Post, 1 December. Available at: https://www.thefreelibrary.com/Might+estrogen+prevent+memory+loss%3F-a04530963 (accessed 21 August 2024).

41.

Fillit

Weinreb

Cholst

, et al. (1986) Observations in a preliminary open trial of estradiol therapy for senile dementia-Alzheimer’s type. Psychoneuroendocrinology 11(3): 337–345.

42.

Fine

(2010) Delusions of Gender: How Our Minds, Society, and Neurosexism Create Difference, 1st ed. New York: W. W. Norton.

43.

Gallie

(1955) Essentially contested concepts. Proceedings of the Aristotelian Society 56: 167–198.

44.

Garibotto

Borroni

Kalbe

, et al. (2008) Education and occupation as proxies for reserve in aMCI converters and AD. Neurology 71(17): 1342–1349.

45.

Geraets

AFJ

Leist

(2023) Sex/gender and socioeconomic differences in modifiable risk factors for dementia. Scientific Reports 13: 80.

46.

Hampel

Vergallo

Giorgi

, et al. (2018) Precision medicine and drug development in Alzheimer’s disease: The importance of sexual dimorphism and patient stratification. Frontiers in Neuroendocrinology 50: 31–51.

47.

Harms

Ferrari

Meier

, et al. (2022) Digital biomarkers and sex impacts in Alzheimer’s disease management — potential utility for innovative 3P medicine approach. EPMA Journal 13(2): 299–313.

48.

Hebert

Weuve

Scherr

, et al. (2013) Alzheimer disease in the United States (2010–2050) estimated using the 2010 census. Neurology 80(19): 1778–1783.

49.

Henderson

Paganini-Hill

Emanuel

, et al. (1994) Estrogen replacement therapy in older women: Comparisons between Alzheimer’s disease cases and nondemented control subjects. Archives of Neurology 51(9): 896–900.

50.

Henderson

Watt

Galen Buckwalter

(1996) Cognitive skills associated with estrogen replacement in women with Alzheimer’s disease. Psychoneuroendocrinology 21(4): 421–430.

51.

Kohler-Hausmann

(2020) What’s Sex Got to Do With Fair Machine Learning? In: Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (FAT* ‘20), 2020.

52.

ISTAART Community . (n.d.) Group Home. Available at: https://istaart.alz.org/groups/home/68 (accessed 22 August 2024).

53.

Jahnke

(2020) Algorithm beats experts in Alzheimer’s diagnosis. In: Futurity. Available at: https://www.futurity.org/algorithm-alzheimers-disease-diagnosis-2358332/ (accessed 15 May 2024).

54.

Joel

Kaiser

Richardson

, et al. (2015) A discussion on experiments and experimentation: NIH to balance sex in cell and animal studies. Catalyst: Feminism, Theory, Technoscience 1(1): 1–13.

55.

Jordan-Young

(2011) Brain Storm: The Flaws in the Science of Sex Differences. Cambridge, MA: Harvard University Press.

56.

Keyes

Peil

Williams

, et al. (2020) Reimagining (women’s) health: HCI, gender and essentialised embodiment. ACM Transactions on Computer-Human Interaction 27(4): 1–42.

57.

Lee

KMN

Rushovich

Gompers

, et al. (2023) A gender hypothesis of sex disparities in adverse drug events. Social Science & Medicine 339: Article 116385.

58.

Lim

Wang

Park

S-Y

, et al. (2022) Risk of Alzheimer’s disease and related dementia by sex and race/ethnicity: The multiethnic cohort study. Alzheimer’s & Dementia 18(9): 1625–1634.

59.

Lock

(2013) The Alzheimer Conundrum: Entanglements of Dementia and Aging. Princeton: Princeton University Press.

60.

Manson

Crandall

Rossouw

, et al. (2024) The women’s health initiative randomized trials and clinical practice: A review. JAMA 331(20): 1748–1760.

61.

Mayeda

(2019) Invited commentary: Examining sex/gender differences in risk of Alzheimer disease and related dementias—Challenges and future directions. American Journal of Epidemiology 188(7): 1224–1227.

62.

Mayeda

Glymour

Quesenberry

, et al. (2016) Inequalities in dementia incidence between six racial and ethnic groups over 14 years. Alzheimer’s & Dementia : The Journal of the Alzheimer’s Association 12(3): 216–224.

63.

Mayer-Schönberger

Cukier

(2013) Big Data: A Revolution That Will Transform How We Live, Work, and Think. Boston: Houghton Mifflin Harcourt.

64. McCradden, Joshi, Mazwi, et al. (2020) Ethical limitations of algorithmic fairness solutions in health care machine learning. The Lancet Digital Health 2(5): e221–e223.

65. Mielke, Vemuri, Rocca (2014) Clinical epidemiology of Alzheimer’s disease: Assessing sex and gender differences. Clinical Epidemiology 6: 37–48.

66. Miller, Rocca, Faubion (2015) Sex differences research, precision medicine, and the future of women’s health. Journal of Women’s Health 24(12): 969–971.

67. Morris, Clark, Vissel (2014) Inconsistencies and controversies surrounding the amyloid hypothesis of Alzheimer’s disease. Acta Neuropathologica Communications 2: 135.

68. Moutinho (2022) The long road to a cure for Alzheimer’s disease is paved with failures. Nature Medicine 28(11): 2228–2231.

69. National Institutes of Health (NIH). (2020) The Promise of Precision Medicine. Available at: https://www.nih.gov/about-nih/what-we-do/nih-turning-discovery-into-health/promise-precision-medicine (accessed 21 August 2024).

70. Nebel, Aggarwal, Barnes, et al. (2018) Understanding the impact of sex and gender in Alzheimer’s disease: A call to action. Alzheimer’s & Dementia 14(9): 1171–1183.

71. NIA-AA Symposium Enabling Precision Medicine for Alzheimer’s Disease Through Open Science. (2021) Virtual conference. Available at: https://alz.org/alzheimers-precision-medicine/overview.asp (accessed 21 August 2024).

72. Pape (2021) Co-production, multiplied: Enactments of sex as a biological variable in US biomedicine. Social Studies of Science 51(3): 339–363.

73. Pape, Miyagi, Ritz, et al. (2024) Sex contextualism in laboratory research: Enhancing rigor and precision in the study of sex-related variables. Cell 187(6): 1316–1326.

74. Pierson (2024) Accuracy and equity in clinical risk prediction. The New England Journal of Medicine 390(2): 100–102.

75. Pike (2017) Sex and the development of Alzheimer’s disease. Journal of Neuroscience Research 95(1–2): 671–680.

76. Piller (2025) Doctored: Fraud, Arrogance and Tragedy in the Quest to Cure Alzheimer’s. New York: One Signal Publishers.

77. Pot, Spahl, Prainsack (2019) The gender of biomedical data: Challenges for personalised and precision medicine. Somatechnics 9(2–3): 170–187.

78. Prainsack, et al. (2018) Personalised and precision medicine: What kind of society does it take? In: Meloni, Cromby, Fitzgerald (eds) The Palgrave Handbook of Biology and Society. London: Palgrave Macmillan UK, 683–701. Available at: http://link.springer.com/10.1057/978-1-137-52879-7_29 (accessed 17 November 2023).

79. Qiu, Joshi, Miller, et al. (2020) Development and validation of an interpretable deep learning framework for Alzheimer’s disease classification. Brain 143(6): 1920–1933.

80. Richardson (2013) Sex Itself: The Search for Male and Female in the Human Genome. Chicago: The University of Chicago Press.

81. Richardson (2021) The Maternal Imprint: The Contested Science of Maternal-Fetal Effects. New York: University of Chicago Press.

82. Richardson (2022) Sex contextualism. Philosophy, Theory, and Practice in Biology 14(2). https://journals.publishing.umich.edu/ptpbio/article/id/2096/.

83. Richardson, Reiches, Shattuck-Heidorn, et al. (2015) Focus on preclinical sex differences will not address women’s and men’s health disparities. Proceedings of the National Academy of Sciences 112(44): 13419–13420.

84. Riedel, Thompson, Brinton (2016) Age, APOE and sex: Triad of risk of Alzheimer’s disease. The Journal of Steroid Biochemistry and Molecular Biology 160: 134–147.

85. Rushovich, Gompers, Lockhart, et al. (2023) Adverse drug events by sex after adjusting for baseline rates of drug use. JAMA Network Open 6(8): Article e2329074.

86. Sadowski (2024) Total life insurance: Logics of anticipatory control and actuarial governance in insurance technology. Social Studies of Science 54(2): 231–256.

87. Schaefer, Tai, Sun (2019) Precision medicine and big data: The application of an ethics framework for big data in health and research. Asian Bioethics Review 11(3): 275.

88. Scheuerman, Brubaker (2018) Gender is not a Boolean: Towards Designing Algorithms to Understand Complex Human Identities. In: Participation + Algorithms Workshop at CSCW 2018, 2018.

89. Scheuerman, Pape, Hanna (2021) Auto-essentialization: Gender in automated facial analysis as extended colonial project. Big Data & Society 8(2): Article 20539517211053712.

90. Scheyer, Rahman, Hristov, et al. (2018) Female sex and Alzheimer’s risk: The menopause connection. The Journal of Prevention of Alzheimer’s Disease 5(4): 225–230.

91. Schmidt, Fazekas, Reinhart, et al. (1996) Estrogen replacement therapy in older women: A neuropsychological and brain MRI study. Journal of the American Geriatrics Society 44(11): 1307–1313.

92. Selbst, Barocas (2018) The intuitive appeal of explainable machines. Fordham Law Review 87(3): 1085–1139.

93. Severs, James, Letrondo, et al. (2023) Traumatic life events and risk for dementia: A systematic review and meta-analysis. BMC Geriatrics 23(1): 587.

94. Shen, Rolls, Cheng, et al. (2022) Associations of social isolation and loneliness with later dementia. Neurology 99(2): e164–e175.

95. Shumaker, Legault, Kuller, et al. (2004) Conjugated equine estrogens and incidence of probable dementia and mild cognitive impairment in postmenopausal women: Women’s health initiative memory study. JAMA 291(24): 2947–2958.

96. Shumaker, Legault, Rapp, et al. (2003) Estrogen plus progestin and the incidence of dementia and mild cognitive impairment in postmenopausal women: The women’s health initiative memory study: A randomized controlled trial. JAMA 289(20): 2651–2662.

97. Silverberg, Elliott, Ryan, et al. (2018) NIA Commentary on the NIA-AA research framework: Towards a biological definition of Alzheimer’s disease. Alzheimer’s & Dementia 14(4): 576–578.

98. Sindi, Kåreholt, Ngandu, et al. (2021) Sex differences in dementia and response to a lifestyle intervention: Evidence from Nordic population-based studies and a prevention trial. Alzheimer’s & Dementia 17(7): 1166–1178.

99. Society for Women’s Health Research. (n.d.) Women’s health dashboard. Available at: https://swhr.org/programs/womens-health-dashboard/ (accessed 15 May 2024).

100. Stachenfeld, Mazure (2022) Precision medicine requires understanding how both sex and gender influence health. Cell 185(10): 1619–1622.

101. Straw (2022) Investigating for bias in healthcare algorithms: A sex-stratified analysis of supervised machine learning models in liver disease prediction. BMJ Health & Care Informatics 29(1): Article e100457.

102. Sudai, Borsa, Ichikawa, et al. (2022) Law, policy, biology, and sex: Critical issues for researchers. Science 376(6595): 802–804.

103. The Women’s Alzheimer’s Movement. (n.d.) The Women’s Alzheimer’s Movement. Available at: https://thewomensalzheimersmovement.org/ (accessed 22 August 2024).

104. U.S. Dept of Health and Human Services. (n.d.) What is Alzheimer’s Disease and Related Dementias. Available at: https://aspe.hhs.gov/collaborations-committees-advisory-groups/napa/what-ad-adrd (accessed 15 May 2024).

105. van Anders (2015) Beyond sexual orientation: Integrating gender/sex and diverse sexualities via sexual configurations theory. Archives of Sexual Behavior 44(5): 1177–1213.

106. Vemuri, Lesnick, Przybelski, et al. (2014) Association of lifetime intellectual enrichment with cognitive decline in the older population. JAMA Neurology 71(8): 1017–1024.

107. Vyas, Eisenstein, Jones (2020) Hidden in plain sight—Reconsidering the use of race correction in clinical algorithms. New England Journal of Medicine 383(9): 874–882.

108. Wang, Ahmed, Leufer (2023) Bodily harms: Mapping the risks of emerging biometric tech. AccessNow.

109. WHO. (2021) Gender and health. Available at: https://www.who.int/news-room/questions-and-answers/item/gender-and-health (accessed 14 August 2024).

110. Why is dementia different for women? (2024) Available at: https://www.alzheimers.org.uk/blog/why-dementia-different-women (accessed 15 May 2024).

111. Wolters, Chibnik, Waziry, et al. (2020) Twenty-seven-year time trends in dementia incidence in Europe and the United States. Neurology 95(5): e519–e531.

112. Writing Group for the Women’s Health Initiative Investigators and Rossouw. (2002) Risks and benefits of estrogen plus progestin in healthy postmenopausal women: Principal results from the women’s health initiative randomized controlled trial. JAMA 288(3): 321–333.

113. Yang, Kantor, Chiba-Falek (2021) APOE: The new frontier in the development of a therapeutic target towards precision medicine in late-onset Alzheimer’s. International Journal of Molecular Sciences 22(3): 1244.
