Abstract
Screening for adult Attention-Deficit/Hyperactivity Disorder (ADHD) and differentiating ADHD from comorbid mental health disorders remains to be clinically challenging. A screening tool for ADHD and comorbid mental health disorders is essential, as most adult ADHD is comorbid with several mental health disorders. The current pilot study enrolled 955 consecutive patients attending a tertiary mental health center in Canada and who completed EarlyDetect assessment, with 45.2% of patients diagnosed with ADHD. The best ADHD classification model using composite scoring achieved a balanced accuracy of 0.788, showing a 2.1% increase compared to standalone ADHD screening, detecting four more patients with ADHD per 100 patients. The classification model including ADHD with comorbidity was also successful (balanced accuracy = 0.712). The results suggest the novel screening method can improve ADHD detection accuracy and inform the risk of ADHD with comorbidity, and may further inform specific comorbidity including MDD and BD.
Attention-Deficit/Hyperactivity Disorder (ADHD) is a neuro-developmental disorder characterized by inattentive or hyperactive/impulsive behavior with a variety of functionally impairing symptoms (American Psychiatric Association, 2013). ADHD is one of the most common disorders in children with a worldwide prevalence of 5% to 12% (Polanczyk et al., 2007), and remains highly prevalent in adults at 4.4% (Kessler et al., 2006), costing $74 billion in the United States, and $6 to $11 billion annually in Canada (Centre for ADHD Awareness Canada, 2017).
Although a significant portion of childhood ADHD persists into adulthood (Simon et al., 2009; Thomas et al., 2015), clinicians do not routinely screen for adult ADHD. Furthermore, complicating the diagnosis of adult ADHD, are symptom overlap with many other mental disorders, and the common phenotype of adult ADHD and comorbid mental health disorders (Adler et al., 2017; Klassen et al., 2010), such as major depressive disorder (MDD), bipolar disorder (BD), personality disorder, and substance abuse (Watters et al., 2018; Wilens & Biederman, 2006). Lack of appropriate screening for adult ADHD leads to missed, inappropriate, and delayed diagnosis of ADHD and comorbid mental health disorders (De Graaf et al., 2013; Ginsberg et al., 2014; Hankey et al., 2020; Hoyle et al., 2015). Since ADHD with comorbidity also confers poor functional outcomes (Hankey et al., 2020), it is crucial to confirm ADHD and comorbid disorders as soon as possible to choose the appropriate treatments to improve functional outcomes. In addition, adult ADHD symptoms may present differently from childhood ADHD symptoms (Adler et al., 2017; Agnew-Blais et al., 2016; Moffitt et al., 2015; Thomas et al., 2015) further contributing to missed and delayed diagnoses in adult patients. The detrimental outcomes are similar between adult and childhood ADHD (Ginsberg et al., 2014), including negative physical health consequences such as obesity, smoking, asthma, migraines, emergency room visits, and injury-related insurance claims and accidental deaths (Agnew-Blais et al., 2016; Catalá-López et al., 2022; Dalsgaard et al., 2015; Lundervold et al., 2019; Sadeghi-Bazargani et al., 2019); and poor socio-economical outcomes such as higher chance to drop out of school (Fleming & McMahon, 2012), financial problems from ADHD-associated impulsivity at work (Asherson et al., 2012; Pinna et al., 2019) and increased chance to involve in criminal activities or live in detention centers (Baggio et al., 2018). Despite the diagnostic challenges, accurate early diagnoses and appropriate treatment of ADHD could be effective to curb the negative outcomes (Oliva et al., 2021).
Currently, ADHD screening involves multiple aspects such as the DSM-5 criteria, self-report questionnaires, neurocognitive computerized tests, and clinical histories (Gualtieri & Johnson, 2005), where no single test is completely indicative of ADHD, and physicians’ judgment remains the gold standard. The World Health Organization Adult ADHD Self-Report Scale (ASRS) (Kessler et al., 2005) is a short self-report screener with six questions derived from the DSM-IV ADHD to screen for adult ADHD. ASRS has been widely used by physicians and validated in a variety of health care settings (Gray et al., 2014; Silverstein et al., 2018). However, the interpretation of ASRS results offers little insight on the presence of comorbid diagnoses. Thus, we developed EarlyDetect, a digital screening system incorporating the ASRS as a primary screening tool for ADHD, while simultaneously screening for the more common comorbid mental health disorders including Major Depression Disorder (MDD), Bipolar Disorder (BD), Generalized Anxiety Disorder (GAD), and Alcohol Use Disorder (AUD) (Y. Liu et al., 2021; Y. S. Liu et al., 2021). The EarlyDetect screening tool has been shown to improve screening of MDD and BD when compared to existing screening instruments such as PHQ-9 and MDQ respectively (Y. Liu et al., 2021; Y. S. Liu et al., 2021). Compared with a single-disease-focused screening approach, the EarlyDetect has been shown to reap benefits from machine-learning algorithms, including enhanced screening performance (Y. Liu et al., 2021; Y. S. Liu et al., 2021). and insights on reducing screening times (Y. S. Liu et al., 2021).
In this study, we aim to develop a machine learning model to screen and identify ADHD based on EarlyDetect, by utilizing individual questions from the ASRS in conjunction with questions from other clinical questionnaires. We also aim to examine whether EarlyDetect could offer additional insight on determining comorbid diagnoses of ADHD, specifically MDD and BD.
Methods
Sample and Study Population
This single-center, retrospective, naturalistic study, approved by the University of Alberta Health Research Ethics Board, included 955 participants (56.4% female, 1% other; mean age = 35.35, SD = 12.83); 45.2% subsequently diagnosed with ADHD, see Table 1 for subsample summary). Between June 2016 and October 2018, patients who agreed to participate provided written consent upon arrival at their first visit to an interdisciplinary, referral-based tertiary mental health center in western Canada. Participants were not pre-selected based on inclusion or exclusion criteria. Participation was voluntary, with no monetary incentives. Participants were asked to complete the EarlyDetect questionnaire, after which they were seen by one of the center’s certified psychiatrists, who was blinded to the results of the screening questionnaires, for face-to-face assessment, diagnosis, and treatment as usual. Physician diagnosis of ADHD was based on DSM-5 criteria (American Psychiatric Association, 2013).
Study Sample Summary.
Note. Due to some patients having multiple comorbidities, the total proportion will not equal 100%. The p values were based on paired sample t-test on age, and two by two χ2 tests for all frequency-based variables.
EarlyDetect Questionnaires
The EarlyDetect is a set of self-report clinical questionnaires distilled to approximate the clinical interview process. The current version of EarlyDetect (Y. Liu et al., 2021; Y. S. Liu et al., 2021) consists of a proprietary Life History Questionnaire (LHQ) probing a patient’s mental health history, two proprietary ADHD screening questions for initial ADHD screening, and ASRS-v1.1—Part A for ADHD-specific symptom screening, and the Sheehan Disability Scale (SDS) (Sheehan et al., 1996) as a proxy indicator of self-reported functional impairments. In addition to ADHD-specific screening, EarlyDetect also screens for MDD, BD, GAD, and AUD (Y. Liu et al., 2021; Y. S. Liu et al., 2021).
Statistical Analysis
For evaluation and comparison of screening performance, we adapted a machine learning approach from earlier studies (Y. Liu et al., 2021; Y. S. Liu et al., 2021). Machine learning analyses were conducted using Python 3.6 with Scikit-Learn 0.22.1. The individual questions for each clinical questionnaire, including yes/no answers to MINI Screening questions, were used as model features. Missing values in continuous variables were imputed using the mean of the non-missing cases. Missing values in categorical variables were coded as a new category. Questions using Likert-scales or representing numerical values (e.g., age) were treated as continuous variables and rescaled using the StandardScaler function, questions with a binary yes/no answer or representing categories were dummy encoded, resulting in a total of 134 features. For all analyses, selected features were used to predict the diagnosis of ADHD using Elastic Net (Zou & Hastie, 2005) with a five-fold internal cross-validation for hyperparameter tuning and a fixed L1 ratio of 0.5. The models were optimized for balanced accuracy (Brodersen et al., 2010), defined as an average of sensitivity and specificity. We used a leave-one-out cross-validation procedure (LOOCV) for external cross-validation, where data from each patient was reserved to test a learned model’s performance trained with the remaining sample, iterating through all patients. Thus, classification performances of the LOOCV were literally a collection of individual-level model predictions. The use of cross-validation for hyperparameter tuning combined with external cross-validation to evaluate model performances is also known as nested-cross-validation, a technique widely used in machine-learning studies to evaluate a model’s generalizability (Kassraian-Fard et al., 2016; Wong, 2015). We present the cross-validation results by evaluating the model predicted label against the actual diagnosis label of ADHD. As baseline comparisons, we fitted two additional models using the same LOOCV procedure: Baseline A is using only the original ASRS scores (Kessler et al., 2005). Baseline B is using all six individual questions from the ASRS as features. Because cross-validation results are representing the model’s performance on data withheld from the computer during model building, it represents the model’s ability to generalize to new data, where the numerical difference in cross-validation performance alone is commonly used to select the best model. To further evaluate the numerical difference statistically, McNamar’s test (Dietterich, 1998) was conducted to evaluate the LOOCV performance of the best model versus other models.
Predictive features were selected if the feature’s coefficients were consistently non-zero across the LOOCV iterations. Then the selected features were ranked based on the magnitude of the averaged coefficient value of the Elastic Net models across all LOOCV iterations.
To explore model performance on differentiating ADHD and ADHD with comorbid conditions. We further conducted a 3-way classification, dividing the ADHD label into a group with only ADHD diagnosis (ADHD_pure, n = 163) and a group with ADHD and one or more comorbid diagnoses of MDD, BP, GAD, or AUD into separate classes (ADHD_comorbid, n = 269), in addition to the class of patients without an ADHD diagnosis (no ADHD, n = 523).
Results
Classification Performance
The LOOCV performance of the Elastic Net model using answers to individual questions as features reached a balanced accuracy of 0.788 for classifying ADHD, with a sensitivity of 0.822 and a specificity of 0.753, with positive predictive value and negative predictive value for ADHD at 0.733 and 0.837 respectively (see Figure 1). Thus, 82.2% of patients with ADHD can be identified by the model, whereas 75.3% of patients with no ADHD can be identified by the model. If a patient is screened positive, there is a 73.3% chance the patient has ADHD, and if a patient is screened negative, there is an 83.7% chance the patient does not have ADHD. All baseline models had lower balanced accuracy than the model based on all individual questions, 0.767 for model A, 0.780 for model B (Table 2) with the model using all individual questions outperforming baseline models when considering both sensitivity and specificity, by 2.1% to the model based on ASRS original score (χ2 = 12.4, p < .001), and 0.8% to the model based on individual questions of ASRS (χ2 = 4.4, p < .05).

Model performance on ADHD classification.
Model Comparisons.
Note. PPV denotes positive predictive value, and NPV denotes negative predictive value. The model with the highest balanced accuracy is in bold.
Predictive Features
Our machine-learning algorithm selected 30 predictive features. Among the top 10 ranked predictive features, the most predictive features are the first and second stem ADHD screening questions probing ADHD symptoms of childhood and whether symptoms continue as an adult, where experiencing ADHD symptoms and continued experience of the symptoms in adulthood are positively associated with an ADHD diagnosis. Other top predictive features where an increase in score is associated with an increased chance of ADHD diagnoses are Mini International Neuropsychiatric Interview-(Hypo)Manic Episode (MINI-HME) (Sheehan et al., 1998), question 6 on ASRS probing for hyperactivity (Kessler et al., 2005). The higher age of the patient, higher age of first feeling unwell, higher SDS score on experiencing disability of social life and leisure activities (Sheehan et al., 1996), children not having diagnosed mental health issues, being female, experienced other (not listed) stressful life events are associated with lower risk of ADHD diagnoses (Figure 2).

Top 10 features on ADHD classification.
Comorbidity
In the 3-way classification task, the model based only on ASRS questions failed to differentiate between comorbid ADHD and pure ADHD patients, the cell prediction is performing close to the chance level (Figure 3a). The model based on individual questions from EarlyDetect performed adequately on the 3-way classification to differentiate the pure ADHD, comorbid ADHD, and no ADHD cases (Figure 3b). Overall, the kappa coefficient (Landis & Koch, 1977), representing a chance adjusted agreement between predicted and actual label, is higher for the individual-question-based model than the ASRS-only model, with a fair agreement (κ = .336) and moderate agreement (κ = .484) respectively. For identifying pure ADHD in the 3-way classification model, the classification performance is comparable to the binary classification models, at a balanced accuracy of 77.3%, with a sensitivity of 69.9% and a specificity of 84.7%. The classification of comorbid ADHD cases is at a balanced accuracy of 71.2%, with a sensitivity of 61.3% and a specificity of 81.0%.

LOOCV results on the classification of ADHD with comorbidity table.
Discussions
Adult ADHD is most often associated with comorbid mental health disorders, such as major depressive disorder and bipolar disorder. Missed diagnosis of comorbid mental health disorders associated with ADHD confers significant functional impairment. Thus, it is of paramount importance to screen for ADHD and comorbid disorders from the onset of a clinical evaluation.
This study presents preliminary evidence that a machine learning approach integrating multiple sources of mental health assessment scales may offer superior performance in screening for adult ADHD and comorbid disorders such as MDD and BD, in line with similar approaches in screening for MDD (Y. Liu et al., 2021) and BD (Y. S. Liu et al., 2021). To the best of our knowledge, this machine learning approach using EarlyDetect is the first screening instrument that has a high predictive value screening for ADHD and comorbid MDD and BD. A set of top predictive features underly the machine-learning model’s performance were identified, consistent with known ADHD risk factors. The study also demonstrates that harvesting multiple sources of patients’ self-report data can offer additional benefits in identifying ADHD patients with comorbid conditions, which may facilitate diagnostic clarification. This current study demonstrates the successful identification of ADHD and ADHD with comorbidity. Although the current model was not built to facilitate screening of specific comorbidity, it should prompt clinicians to probe for comorbidity.
Reassuringly, EarlyDetect has been validated on screening for MDD and BD, and thus provides confidence to screen for these disorders in patients with comorbid ADHD (Y. Liu et al., 2021; Y. S. Liu et al., 2021). When an ADHD patient was flagged for potential comorbidity, clinicians can use the MDD and BD screening module to rule in and rule out MDD and BD, the most common and functional impairing comorbid disorders of ADHD.
The best-performing model using all questions collected from EarlyDetect as features achieved a 2.1% increase in balanced accuracy compared to the threshold-cut-off-based scoring of ASRS. Albeit a smaller numerical lead in balanced accuracy at 0.8%, the best-fitting model performs significantly better than a model based on individual questions from ASRS. The improved performance may be explained by the model’s access to more features from other screening tests. Because balanced accuracy is an average of sensitivity and specificity, a 2.1% increase in balanced accuracy may translate to a sensitivity increase of 4.2% if specificity can be held as a constant, detecting four more patients with ADHD per 100 patients.
The top 10 features underly ADHD classification echo known factors associated with ADHD. The ADHD screening questions probing for patients’ ADHD symptoms from childhood (question 1) to adulthood (question 2) were the top two contributing features respectively for ADHD classification. Other risk factors associated with an increased chance of ADHD are ASRS’s question 6 probing for hyperactivity symptoms, and the MINI-HME screening for mania symptoms with can also be present in ADHD (Klassen et al., 2010). Demographics information including higher age, higher age when first felt mentally unwell, and being female are utilized by the model contributing to reduced risk of ADHD, this may be attributed to ADHD’s earlier onset than other common mental health disorders screened by EarlyDetect and its disproportionally higher diagnosed rates in male than female (Hankey et al., 2020; Kessler et al., 2006). The best-fitting model also associates lower social life disability compared to other common mental health disorders with ADHD patients, as well as reporting no mental health conditions for children and experiencing non-typical stressful life events. Those results need to be interpreted with caution. Lower social life disability in ADHD patients needs to be interpreted in the context that other psychiatric diagnoses such as MDD and BD may be associated with more severe disabilities, this does not suggest ADHD patients have no impairment in social function; further, ADHD is a developmental disorder, thus maybe less associated with stressful life events, yet we cannot interpret the results with confidence as the “other” option of stressful life events were not logged. Also due to the chronic and developmental nature of ADHD, unlike MDD and BD that are episodic in nature where patients are more aware of the contrasts between well and unwell periods, ADHD patients may not realize their social functioning has been affected because they have no experience of a normal or well social function as a comparison reference. Moreover, the lack of report on children’s mental health history may be arbitrarily associated with the younger age of ADHD patients in our sample, they may either not have children or has a smaller time window to identify children’s mental health history.
Comorbidity
Up to 70% of adult ADHD is associated with comorbidities (Kessler et al., 2006). Our sample echoes Kessler et al.’s (2006) results, finding that 63.3% (N = 269) of patients diagnosed with ADHD had at least one comorbidity amongst MDD, GAD, BD, and AUD. When each comorbid diagnosis was considered individually, we found that 22.0% of ADHD patients have comorbid MDD, 31.5% have comorbid GAD, and 24.8% have comorbid BD (Table 1). The function to detect risk of comorbid mental health conditions with ADHD is an important property for a screening tool and may have a significant impact on treatment options selection and positively impact treatment outcomes. Our analysis demonstrates that a single-purpose screening tool such as the ASRS lacks the ability to detect comorbidity risks, where comorbidity risks could be evaluated by using multiple screening tools. ADHD patients with comorbid psychiatric diagnoses are generally suffering from more functional disabilities than ADHD patients without comorbidities (Hankey et al., 2020). Thus, early identification of the comorbidity risk of ADHD may enable appropriate intervention to prevent severe disability. For example, for ADHD patients with comorbid bipolar disorder, if the bipolar disorder is undiagnosed, prescribing psychostimulant medication may trigger manic episodes and worsen the patient’s mental health condition (Viktorin et al., 2017). For ADHD patients with a comorbid MDD, treatment options may include medications that may benefit both MDD and ADHD symptoms such as Bupropion (Bond et al., 2012).
Limitations
The study results need to be interpreted with caution, as data were collected under a naturalistic setting, where the diagnoses and self-report assessments both contain an element of bias (Y. Liu et al., 2021). The study also did not look at identifying specific comorbidities with ADHD, for example, ADHD comorbid with GAD. Another limitation is the use of a single-blinded rater. However, this is a pilot project needing replication of the EarlyDetect screening tool with large a sample size, multiple raters, and centers.
Conclusion
The high comorbidity rates of adult ADHD drive the demand for an ADHD screening tool to inform comorbidity risks to facilitate more accurate clinical diagnosis and may lead to better downstream treatment outcomes. The current pilot study implemented a machine-learning-based screening of ADHD, and ADHD with comorbidity by utilizing multiple clinical questionnaires in addition to ASRS. The results showed preliminary evidence this novel screening method can improve ADHD detection accuracy and inform the risk of ADHD with comorbidity at a tertiary mental health center, warranting future validation studies.
Footnotes
Acknowledgements
We acknowledge the Chokka center for Integrative Health and MITACS Accelerate Scholarship for funding support, and The Chokka Center for Integrative Health for research site provision and onsite support. We thank Kevin Zhan for his contribution on background literature search.
Author Contributions
Yang S. Liu designed the study, processed, quality-controlled and analyzed the data, drafted the manuscript, and critically edited the manuscript. Bo Cao and Pratap Chokka designed the study, supervised the data processing, provided instrumental support, and critically edited the manuscript draft.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: P. Chokka reports grants from Pfizer, Janssen, Lundbeck, Purdue, Shire, and Allergan. These are all unrelated to the submitted work.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by a MITACS Accelerate Industrial Postdoc Scholarship (IT13648). MITACS had no role in study design, data collection, analysis, or interpretation, report writing, or the decision to submit the article for publication. MITACS provided salary support for Y. S. Liu.
