Abstract
Introduction:
Estimating the effects of comorbidities on risk of all-cause dementia (ACD) could potentially better inform prevention strategies and identify novel risk factors compared to more common post-hoc analyses from predictive modeling.
Methods:
In a retrospective cohort study of patients with mild cognitive impairment (MCI) from US Veterans Affairs Medical Centers between 2009 and 2021, we used machine learning techniques from the treatment effect estimation literature to estimate individualized effects of 25 comorbidities (e.g., hypertension) on ACD risk within 10 years of MCI diagnosis. Age and healthcare utilization were adjusted for using exact matching.
Results:
After matching, of 19,797 MCI patients, 6,767 (34.18%) experienced ACD onset. Dyslipidemia (percentage point increase of ACD risk range across different treatment effect estimation techniques = 0.009–0.044), hypertension (range = 0.007–0.043), and diabetes (range = 0.007–0.191) consistently had non-zero average effects.
Discussion:
Our findings support known associations between dyslipidemia, hypertension, and diabetes that increase the risk of ACD in MCI patients, demonstrating the potential for these approaches to identify novel risk factors.
Introduction
All-cause dementia (ACD) is a leading cause of death among individuals 65 years and older, and understanding what contributes to ACD onset in patients with mild cognitive impairment (MCI) could inform treatment and prevention (Alzheimer’s Association, 2024). Past work has shown that intervening on modifiable lifestyle factors, such as diet and exercise, may slow cognitive decline (Rosenberg et al., 2018). Identifying novel risk factors related to ACD using machine learning (ML) can further stimulate hypothesis generation on the disease mechanism, which can aid in designing interventions to reduce ACD risk. However, current work using ML to identify these factors does so by performing post-hoc analyses on models trained to predict ACD onset (Irwin et al., 2024; Jo et al., 2019; Tang et al., 2024; Tjandra et al., 2020). For example, past studies have used datasets such as the Alzheimer’s Disease Neuroimaging Initiative (Devarakonda et al., 2019; Jo et al., 2019) and institutional electronic health records (EHRs; Irwin et al., 2024; Tang et al., 2024; Tjandra et al., 2020) to predict onset of Alzheimer’s disease and then identified risk factors as the features whose contribution to predictive performance was significant. In contrast, we aim to directly estimate the individual effects of common comorbidities on the risk of ACD rather than measuring post-hoc feature importance. If the effect is strong, then treating the comorbidity could reduce ACD risk.
Ideally, we aim to identify comorbidities that cause ACD onset. However, verifying whether the relationship between a comorbidity (e.g., hypertension) and ACD onset is causal requires a randomized controlled trial (RCT), and for many potential risk factors, an RCT is infeasible. In light of this, we investigate the applicability of ML approaches to an observational EHR cohort to estimate the effects of comorbidities identified by the literature as risk factors for ACD onset (e.g., hypertension and hearing loss). The approaches we use generally aim to quantify the effect of a feature (e.g., hypertension) on an outcome (e.g., ACD onset) from observational data. Under a set of standard assumptions, outlined below, the estimated effects can be interpreted as the amount by which the feature changes the outcome or risk of the outcome. Such knowledge can provide a focused set of hypotheses for future work in the clinical space to test. In past work, these approaches have been used in assessing treatment effects (Xu, Ignatiadis et al., 2023). In the context of ACD onset, we aim to quantify how much known risk factors change the risk of ACD onset over a 10-year horizon. Here, we measure the change in risk of ACD onset (i.e., the estimated effect) as the percentage point increase of having the risk factor compared to not having the risk factor. In theory, measuring the estimated effect allows us to come closer to a causal investigation than post-hoc analyses from predictive modeling, which aim to identify factors that are correlated with ACD onset.
Cardiovascular diseases, including hypertension and cerebrovascular disease, are among the most commonly studied risk factors for ACD onset (Alzheimer’s Association, 2024). Since adequate heart health is required to deliver oxygen to the brain, researchers hypothesize that comorbidities adversely affecting the heart also adversely affect the brain (Kuźma et al., 2018; Mergenthaler et al., 2013). Similarly, factors like smoking can have a negative effect on heart health (Wells, 1994), thus affecting ACD progression in similar ways. Though we cannot test these hypotheses directly, we can check whether our findings are consistent with the clinical literature across different approaches from the treatment effect estimation literature.
In this paper, we identified a cohort of patients with MCI and used ML to estimate the effects of recognized risk factors on the risk of conversion from MCI to ACD at a 10-year horizon as a proof of concept (Beydoun et al., 2008; Choi et al., 2018; Dunietz et al., 2021; Hulse et al., 2005; Newman et al., 2005; Stefanidis et al., 2018; Tamura & Yaffe, 2011; Thomson et al., 2017). Based on our results, we suggest potential mechanisms for how these factors could contribute to ACD conversion. Going forward, these approaches can potentially be used by researchers in dementia to guide clinical research by investigating the effects of novel risk factors on dementia risk in observational data, leading to novel hypotheses that can be used to inform future work and interventions.
Methods
Study Cohort
We included patients with MCI, as defined by the VINCI (VA Informatics and Computing Infrastructure) CIPHER (Centralized Interactive Phenomics Resource) criteria (Honerlaw et al., 2023), from the Veterans Affairs’ (VA) Cerner EHR instance (Cerner Corporation, North Kansas City, MO; VA Informatics and Computing Infrastructure [VINCI], 2008) who had an encounter with any of the 172 VA facilities in the United States between January 1, 2009 and December 31, 2021. Patient timelines were aligned at the first diagnosis of MCI (i.e., MCI onset). We excluded patients with an MCI or ACD diagnosis before 50 years of age, patients who converted to ACD less than 6 months after their MCI diagnosis, and patients with less than 1 year of historical data prior to MCI diagnosis. ACD diagnoses were also defined as described by the VINCI CIPHER criteria (Honerlaw et al., 2023; more detail in Appendix A1). For both conditions, the VINCI CIPHER criteria identified diagnoses based whether the patient was billed for specific diagnostic codes in the EHR. To control for the effects of age at MCI diagnosis and healthcare utilization on the risk of ACD conversion, we matched patients across each time to conversion (e.g., conversion after 1 year) and time of censoring (e.g., lost to follow-up after 1 year) by age of MCI diagnosis and number of BMI measurements within the 5 years leading up to MCI diagnosis (more detail in Appendix A2). Here, the number of BMI measurements acted as a surrogate for healthcare utilization since we assumed that patients generally have their BMI measured during routine clinical encounters. We controlled for these so that our predictions would not be dominated by these factors (e.g., the model mainly uses age at MCI onset to predict ACD risk). This study was carried out between August 2023 and August 2024 and was approved by the Institutional Review Board of the Phoenix Veterans Affairs Health Care Center (protocol Migrino 1593816).
Risk Factors Considered
We considered comorbidities that were diagnosed before MCI onset. Since cardiovascular comorbidities have been identified by the literature as risk factors for ACD, we considered comorbidities like hypertension and cerebrovascular disease (Alzheimer’s Association, 2024). Similarly, since smoking can adversely effect heart health (Wells, 1994), we also considered it as a risk factor. Outside of cardiovascular comorbidities, TBI (traumatic brain injury) is often studied in the context of ACD onset (Logue et al., 2023; Vincent et al., 2014), where TBI has been shown to be associated with increased risk (Alzheimer’s Association, 2024). Additionally, comorbidities affecting mental health (e.g., anxiety and depression) and psychological trauma (e.g., post-traumatic stress disorder [PTSD]) (Logue et al., 2023; Prieto et al., 2023; Yaffe et al., 2009) have been suggested to be associated with increased dementia risk (Byers & Yaffe, 2011; Desmarais et al., 2020; Gardner et al., 2014; Kwak et al., 2017), and thus, were included in our study.
We limited our focus to mid- to late-life modifiable risk factors that can be identified in the EHR to highlight comorbidities that could guide future research for designing risk-reducing interventions. Thus, we did not consider factors like genetics, demographics, education, and socioeconomic status (SES) since they are either (1) not intervenable or (2) not observable in the EHR. However, since they are potential confounders to our study, we considered them as features during model training to account for their effects on the risk of ACD onset. Only demographics were observable in the EHR, so we relied on downstream variables to capture the effects of features like SES as described below.
We assumed that these risk factors are related to ACD onset as shown in Figure 1, where the risk factors we considered are highlighted in blue. Figure 1 (described more in Appendix A3) was constructed based on our literature search of ACD risk factors and aims to explicitly state which risk factors we are considering and what we assume the potential confounders are. Since we considered comorbidities identified with diagnostic billing codes, they are likely confounded by variables like genetics and SES. These variables, in turn, affect related vital sign measurements and laboratory test results. Thus, we included them as confounders. For unobservable variables like education, genetics, and SES, we indirectly accounted for them by relying on downstream variables to capture their effects, such as healthcare utilization, ZIP codes, vital sign measurements, and laboratory test results. If we assume that these relationships hold, then we can apply the approaches outlined below.

Assumed feature relations. We assume that the conditions we examine are related to each other and ACD onset as shown in the figure in patients with MCI. Arrows show the assumption that the source feature (or feature group) has a causal contribution to the destination feature (or feature group). Blue boxes: comorbidities assumed to contribute to ACD onset. Green boxes with dashed borders: observed confounders. Yellow boxes with dotted borders: potential unobserved confounders whose downstream effects were assumed to be captured by other variables. Dark gray boxes: feature groups. Arrows pointing at a dark gray box point to all features in the box. Arrows coming from a dark gray box are such that there is an arrow coming from each feature in the box.
Data Preprocessing
We extracted 114 covariates relating to the comorbidities mentioned above as well as potential confounders such as demographics, medications, vital signs, laboratory tests, and healthcare utilization from up to 5 years before MCI onset (more detail in Table 1 and Appendix A4).
Cohort characteristics and feature breakdown. We show the characteristics at alignment (i.e., MCI onset). Some patients did not have a race or ethnicity recorded. We report race and ethnicity categories as they are recorded.
N = number; IQR = interquartile range.
Comorbidity Effect Estimation
Given the comorbidities and patient covariates, we used ML techniques from the treatment effect estimation literature to estimate the effect of each comorbidity on the risk of ACD onset. We considered the probability of ACD conversion within 10 years of MCI onset as our outcome, and we trained ML models to predict time to ACD onset at the time of MCI onset given patient covariates (Wang et al., 2019). To estimate effects, we estimated the difference in probability of ACD onset within 10 years of MCI onset in the presence and absence of each comorbidity within each patient and then averaged over all patients (more detail in Appendix A5; Rubin, 2005). In summary, the average effect can be interpreted as the percentage point change in the probability of ACD onset within 10 years of MCI onset resulting from the comorbidity.
Model Training
For each comorbidity, we estimated the effects using common approaches from the treatment effect estimation literature, such as the X, R, and DR metalearners (see Appendix A6 for more detail; Funk et al., 2011; Xu, Ignatiadis et al., 2023). To use these approaches, we make the following three assumptions, as is standard in the treatment effect estimation literature (VanderWeele, 2009; Xu, Ignatiadis et al., 2023). The first is overlap: for a comorbidity of interest, the probability of any patient in the dataset having that comorbidity is non-zero. The second is unconfoundedness: the expected outcome given the presence or absence of the comorbidity (e.g., the expected probability of ACD onset in the presence of hypertension) is independent of whether the comorbidity is present, conditioned on patient covariates (i.e., all confounders are included in the covariate set). The third is stable treatment units and consistency: a patient’s observed outcome does not depend on another patient’s outcome, and for all patients, the observed outcome is the outcome directly resulting from the given set of features and whether the comorbidity is present. In making these assumptions, we can use treatment effect estimation approaches to train ML models using observational data and still recover the average effects. This is because the assumptions ensure that what the models learn from patients without the comorbidity, generalize to patients with the comorbidity and vice versa. We test for overlap as described in the preliminary analysis. Unconfoundedness holds if the assumed relationships outlined in Figure 1 hold and there are no additional confounders not shown in Figure 1. We cannot explicitly test for consistency or stable treatment units, but it remains a reasonable assumption given our current understanding of the disease process.
Model Evaluation
We first conducted a preliminary analysis to investigate the overlap assumption and whether the approaches perform as expected in a controlled environment. Then, we conducted the main analysis, where we compared the identified predictors of ACD onset from applying post-hoc analyses to a predictive model to the identified predictors from the treatment effect estimation approaches described earlier.
The preliminary analysis is described in Appendix A7. Our main analysis consisted of two parts. In the first, we trained a standard predictive model, reporting its discriminative performance using the time-dependent AUROC (area under the receiver operating characteristic curve; Lambert & Chevret, 2016). Potential predictors were identified with permutation importance (Breiman, 2001) on the standard model using the held-out test set. In the second part, we used the treatment effect estimation approaches to measure the effects of the comorbidities on ACD onset. To evaluate the approaches, we began by measuring the discriminative performance of all intermediate models trained for each approach and each comorbidity using the time-dependent AUROC. Comorbidities whose 95% CI overlapped with 0.5 (i.e., random performance) for at least one model on at least one approach were excluded from further analysis. For each comorbidity that remained, we measured the average effect of each comorbidity on ACD onset using each approach by estimating the effect for all individuals in the test set and then taking the average. Here, we do not know what the ground truth effect of each comorbidity is so we can only evaluate whether the results among approaches are consistent (i.e., all of the approaches indicate that the comorbidity increases the risk of ACD onset). Inconsistencies among approaches, such as the effects having different signs (i.e., some approaches indicate that the risk factor increases ACD risk while others indicate a decrease in ACD risk), would indicate that the predicted average effects are more likely to be false discoveries resulting from methodological differences among approaches (Xu, Bechler et al., 2023). For example, the X and R learners may be more sensitive to the accuracy of the estimated propensity scores, an intermediate model output describing the probabilty of having the comorbidity that is used for downstream effect estimation. As such, for each comorbidity, we plotted the average effect for each approach and highlighted which comorbidities had consistent predictions, with error bars representing 95% CIs from 1,000 bootstrapped samples. Note that the features identified by consistent average effects and permutation importance are not guaranteed to be the same (see Appendix A8 for more detail).
Results
Cohort Characteristics
After applying our inclusion/exclusion criteria and matching, our cohort contained 19,797 MCI patients (Figure 2). About 6,767 (34.18%) experienced ACD onset within 10 years, 1,320 (6.67%) did not experience ACD onset within 10 years, and the remaining were right censored (i.e., lost to follow-up less than 10 years after MCI onset without meeting the ACD criteria). The median age of MCI onset was 70 years [IQR (interquartile range) 65–78], 774 (3.91%) were female, 15,307 (77.32%) were White, and the median number of outpatient encounters prior to MCI onset was 4 [IQR 2–6]. The most common comorbidity was hypertension, covering 57.72% of the cohort (11,427 patients). More details are in Table 1.

Inclusion/exclusion criteria. We begin with all patients with an encounter at any VHA facility between January 1 2009 and December 2021. Numbers in each box correspond to the number of patients included or excluded.
Preliminary Analysis
We investigated the overlap assumption in Appendix A9 by plotting the distributions of propensity scores for each comorbidity. Although the means of the distributions differed among patients who did and did not have the comorbidity (i.e., positives and negatives), the range of scores between positive and negative patients had a considerable amount of overlap for all comorbidities, and the majority of scores were within the range [0.1, 0.9]. We investigated the approaches in a controlled environment through a global null analysis in Appendix A10. The environment was controlled such that a synthetic comorbidity was introduced to the dataset whose average effect was 0. In the results, the estimated effects were close to zero, showing that the approaches performed as expected.
Main Analysis
Part 1
Model performance, as measured by the time-dependent AUROC, was 0.61 (95% CI [0.59, 0.64]). The results from running permutation importance on a standard model using the held-out test set are shown in Appendix A11, where the only comorbidity identified as having a significant effect on performance was anxiety and related disorders, which resulted in a drop of 0.003 (95% CI [0.0003, 0.007]).
Part 2
The discriminative performance of the intermediate models trained for each comorbidity and approach is shown in Appendix A12. For each comorbidity, the performances of the models trained on the negative patients were all significantly better than random (time-dependent AUROC range across comorbidities = 0.61–0.68). For the models trained on the positive patients for each comorbidity, some were not significantly better than random (time-dependent AUROC range = 0.41–0.61). In Figure 3, we show the estimated average effects for the comorbidities whose performance on all models trained for each approach were better than random. Dyslipidemia, hypertension, and diabetes were consistently identified as risk factors by all approaches.

Average estimated effects. We show the average of the estimated effects of each condition on ACD onset. Error bars represent bootstrapped 95% confidence intervals.
Discussion
In our study, we identified hypertension, dyslipidemia, and diabetes as risk factors for ACD using approaches from the treatment effect estimation literature. While EHR data have been used by previous work to identify ACD risk factors, many focus on post-hoc analyses from predictive modeling instead of directly estimating the individual effect of each comorbidity (Irwin et al., 2024; Jo et al., 2019; Tang et al., 2024; Tjandra et al., 2020). For example, these studies identify potential risk factors by measuring feature importance post-hoc using approaches like permutation importance. While these approaches may indicate which features the model uses to make its predictions (i.e., which features may have a higher correlation with the outcome), they do not necessarily indicate which features could inform prevention. From our analysis using permutation importance, we found that the features identified differed from those identified by consistent average effects across approaches. There are many reasons why this could occur. For example, the effects may not be large enough to significantly affect discriminative performance. This may be why hypertension, dyslipidemia, and diabetes were identified by the treatment effect estimation approaches and not by permutation importance. It is also possible that the effects among individuals within a comorbidity cancel each other out at the population level but can significantly change discriminative performance.
Identifying features that inform prevention requires identifying causal relationships. Verifying causal relationships requires an RCT. However, RCTs cannot be used to investigate the effect of comorbidities on the onset of ACD. We address this gap, in part, through retrospective analyses on observational data using ML techniques. With these techniques, we directly estimate how the presence of a comorbidity could change a patient’s probability of ACD onset within some prediction horizon. While these analyses cannot replace RCTs, we have shown that they can identify risk factors that are consistent with the literature, and thus, have the potential to guide clinical research by suggesting avenues for future research and intervention. By observing similar trends across multiple approaches, we can strengthen our claim on whether the directions of our estimates (i.e., risk or protective) hold.
Approaches from the treatment effect estimation literature require a stricter set of assumptions, but these assumptions allow us to have greater confidence in the accuracy of the predictions of the outcomes in the presence and absence of the comorbidity. This is because these assumptions mean that the model can accurately learn the relationship between the comorbidity and outcome (due to consistency) while accounting for confounding (due to unconfoundedness), and that the learned relationship among patients with the comorbidity will generalize to patients without the comorbidity and vice versa (due to overlap). As a result, we can estimate effects by taking the difference between the predictions of the outcome with and without the comorbidity.
Our finding that dyslipidemia, hypertension, and diabetes are risk factors of ACD onset aligns with the literature (Biessels et al., 2006; Walker et al., 2017; Wee et al., 2023). Dyslipidemia increases the chance of cholesterol buildup in the arteries (Kopin & Lowenstein, 2017), which could limit blood flow to the brain. Hypertension increases the chance of heart disease and stroke (Wajngarten & Silva, 2019), both of which can affect the heart’s ability to supply oxygen to the brain. Insulin resistance from type 2 diabetes has been shown to lower insulin levels in the brain, which may contribute to cognitive decline (Gasparini et al., 2001). Further studies into the mechanism through which these comorbidities contribute to ACD progression could shed light on results from current work (Rosenberg et al., 2018) showing that medication or intervening on modifiable risk factors, such as diet and exercise, may slow the rate of cognitive decline. Notably, it has been proposed that drugs for diabetes and hypertension, two risk factors identified in our analyses, could potentially lower the risk of Alzheimer’s disease, the most common form of dementia (Michailidis et al., 2022; Yasar et al., 2013).
In contrast to hypertension, dyslipidemia, and diabetes, some factors from the literature were not consistently identified by the metalearners. For example, smoking and cerebrovascular disease are associated with ACD onset in similar ways to hypertension, but the approaches did not consistently indicate that they increased the probability of ACD onset over 10 years. This may be because some of the comorbidities could have been affected by unobserved confounding. For example, comorbidities like smoking have been shown to be associated with SES (Hitchman et al., 2014). While we included ZIP codes in our feature set, they only serve as a proxy and may not fully capture SES. Despite hypertension, dyslipidemia, and diabetes also being associated with SES (Blok et al., 2022; Espírito Santo et al., 2022), we hypothesize that including direct measures relating to vital signs and laboratory tests (e.g., blood pressure) could more effectively capture the downstream effects of SES.
Our study has several limitations. First, we relied on EHR-based phenotyping tools to identify MCI, ACD, and the comorbidities we included, which may not always be accurate (Tjandra et al., 2020). Second, since the ground truth average effects are unknown, we could not quantitatively assess whether our estimated effects were correct. Third, we were unable to verify whether all of the assumptions required for the approaches held, such as unconfoundedness. Due to the complicated dynamics of ACD progression, it is likely that there are additional confounders that Figure 1 did not include. Despite our inability to check the correctness of the main analysis, our results support well accepted, plausible hypotheses on what contributes to ACD onset. Fourth, our study only considered one comorbidity at a time and not how comorbidities act in combination. Additionally, the findings from our VA cohort should be validated in the general population. Past work has shown that veterans have a higher prevalence of mental health conditions, thus potentially putting them at higher risk for dementia (Veitch et al., 2013). In addition, even among veterans, these conditions may affect male and female patients differently (Yaffe et al., 2019). In our cohort, the prevalence of PTSD and depression were 17% and 31% respectively, and patients were mostly male (96.09%). Despite these differences in our cohort compared to the general population, a previous study (Tjandra et al., 2022) showed that the performance of using machine learning to predict Alzheimer’s disease onset using blood pressure trajectories trained using VA EHR data was similar when applied to EHR data from another institution, even though the demographic compositions were different. However, it is important to note that generalizability to other demographics should be empirically established. Finally, when controlling for age of MCI onset and healthcare utilization, we used matching, which excluded many patients from our final cohort.
Conclusion
We demonstrated the potential of approaches for estimating treatment effects from observational data in the context of directly estimating the effects of comorbidities on the risk of ACD onset. Results from analyses like ours can be used to inform future work in clinical research on identifying novel risk factors in settings where RCTs are infeasible.
Supplemental Material
sj-docx-1-ggm-10.1177_30495334251347053 – Supplemental material for Estimated Effects of Comorbidities on Risk of All-cause Dementia in Patients with Mild Cognitive Impairment
Supplemental material, sj-docx-1-ggm-10.1177_30495334251347053 for Estimated Effects of Comorbidities on Risk of All-cause Dementia in Patients with Mild Cognitive Impairment by Donna Tjandra, Chase Irwin, Raymond Q. Migrino, Bruno Giordani and Jenna Wiens in Sage Open Aging
Footnotes
Consent Statement
The IRB at the Phoenix Veterans Affairs approved a waiver of informed consent. Patient consent was waived because, in our retrospective analysis of electronic health record data, there were too many patients to feasibly obtain consent given the size of the study team.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research program is supported by the NIH/NIA funded Michigan Alzheimer’s Disease Center (5P30AG053760), the National Science Foundation (IIS 2124127), and the Phoenix Veterans Affairs Office of Research. The views and conclusions in this document are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of the NSF, the NIH, the VA, or the United States Government.
Data Availability Statement
The data underlying this article for the Veterans Affairs cannot be shared due to current policies.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
