Sage Journals: Discover world-class research

Abstract

Eating disorders (EDs) are complex and debilitating conditions. Prior efforts to predict outcomes (onset, prognosis, treatment response) have yielded inconsistent findings. Machine-learning (ML) techniques have shown promise to improve outcome prediction, but a systematic literature synthesis is missing. We conducted a systematic scoping review to summarize extant literature on ML applications in ED-outcome-prediction research, identifying 75 studies. ML has mostly been used to predict ED diagnostic status (k = 45); other studies have predicted escalation of ED risk and symptoms (k = 13), treatment outcomes (k = 12), and ED onset (k = 6). Decision trees, random forest, and support-vector machines were the most common models used. Although many studies reported moderate to high predictive performance, the benefits of ML over traditional statistical techniques remains unclear in light of inconsistent findings. We make several recommendations for future research (i.e., integrating multiple data types, external validation) to encourage continued progress in this developing field.

Keywords

scoping review eating disorder machine learning outcome prediction

Eating disorders (EDs) are complex, chronic conditions associated with a number of adverse outcomes, including high comorbidity and mortality, impairment in psychological and social functioning, and poor quality of life (Klump et al., 2009; van Hoeken & Hoek, 2020). A host of barriers (i.e., cost, limited therapist availability, stigma) contribute to low uptake of treatment; approximately three quarters of people with EDs are not accessing appropriate intervention (Ali et al., 2017) or are reporting significant delays (5 years on average) between symptom onset and treatment seeking (Hamilton et al., 2021). This is concerning given that recovery without treatment is uncommon and that the likelihood of full recovery is reduced the longer that symptoms persist (Fernández-Aranda et al., 2021; Wonderlich et al., 2012). Despite the development of numerous evidence-based treatments, treatment outcomes also remain suboptimal and variable; fewer than 50% achieve full symptom remission (Linardon & Wade, 2018). Many studies have sought to improve ED outcomes through exploring factors associated with ED risk, onset, and treatment outcomes, yet accurate prediction of outcomes has proven challenging; studies have often yielded unreliable predictors that lack clinical utility (Linardon et al., 2017; McClure et al., 2024).

Increasingly, machine-learning (ML) techniques are being applied to optimize diagnosis, prognosis, and treatment-outcome predictions in the field of psychiatry, and promising results have emerged (Dwyer et al., 2018). Although the term “machine learning” was coined by Arthur Samuel in the 1950s, the use of ML in psychiatric research has been steadily growing over the past 2 decades as data availability and computational resources have increased (Z. S. Chen et al., 2022). ML is a collection of data-driven computational techniques that learn patterns in data, enabling underlying data structures to be identified and accurate predictions of future outcomes to be made (Jordan & Mitchell, 2015). Unlike traditional statistical techniques (i.e., linear regression, logistic regression), which perform most optimally when only a small number of linearly related variables are used, ML techniques can model more complex (nonlinear, interactions), high-dimensional (i.e., large number of variables) data while enabling several different data types (i.e., survey data, imaging data, social media data) to be considered in a single model. These capabilities provide greater opportunity to capture the diversity and complexity inherent in psychiatric phenomena, potentially leading to predictions that are more accurate, parsimonious, and generalizable. ML techniques are often more suited to modelling relationships that are generalizable to single cases, potentially allowing more accurate predictions at the individual patient level (Bzdok & Meyer-Lindenberg, 2018; Dwyer et al., 2018). This stands in contrast with traditional techniques that often examine group-level effects, producing findings that can have limited validity when applied to individuals (Dwyer et al., 2018). Consequently, researchers have touted the potential for ML to usher in an era of precision psychiatry, in which prevention, diagnosis, and treatment of mental-health conditions are based on the unique biological, psychological, and social characteristics of each individual (Bzdok & Meyer-Lindenberg, 2018).

Although ML offers significant potential for improving outcome prediction, there are several challenges associated with ML applications that are worth noting. These include the need for large data sets to adequately train and validate models, difficulties with interpretation of complex models involving many predictors, and the computational resources and expertise required to perform ML analyses (Bzdok & Meyer-Lindenberg, 2018; Dwyer et al., 2018). These complexities can not only make the development of robust models challenging but can also cause difficulties with translation of ML models into clinical practice (Steyerberg et al., 2013). Thus, it is crucial that researchers consider the potential limitations of applying ML techniques alongside its anticipated benefits and the clinical utility of models (i.e., data availability, generalizability, clinical interpretability). To help facilitate this, adherence to best practice guidelines on ML model development and reporting is recommended (i.e., TRIPOD statement, Collins et al., 2024; PROGRESS, Steyerberg et al., 2013).

Broadly, ML techniques can be grouped into two main categories: supervised learning and unsupervised learning. Supervised-learning techniques are applied to data sets in which the input data (i.e., predictors, features) and output data (i.e., outcomes, response variable) are labeled (Wang, 2021). These techniques learn patterns and relationships between variables to accurately classify or predict a prespecified outcome (i.e., predicting ED onset). In contrast, unsupervised-learning techniques identify patterns and relationships in unlabeled data (i.e., input data are provided without corresponding output data). They are commonly used to identify meaningful clusters or groups in data in which the underlying structure may be previously unknown (i.e., identifying clusters of ED subtypes in data; Wang, 2021).

ML techniques have been applied across various mental-health conditions, demonstrating promise in areas including detection and diagnosis, treatment and support, and clinical and research administration (Shatte et al., 2019). For example, supervised-ML techniques have been used with data derived from thousands of electronic health records (EHRs) to predict future suicide attempts with high accuracy (Walsh et al., 2017). The use of neuroimaging data with ML has shown promise in enhancing early detection and diagnosis of schizophrenia (Koutsouleris et al., 2021; Skåtun et al., 2017), and wearable-sensor data have been used to accurately predict the presence and severity of depression (Tazawa et al., 2020). Unsupervised techniques have also been used to improve understanding of diagnostic heterogeneity—that is, variability in symptoms and treatment responses in a specific diagnosis—by uncovering clinical subtypes of diagnoses that may show similar etiologies, prognoses, and treatment responses (i.e., Amoretti et al., 2021; Chekroud et al., 2017; Kung et al., 2022; Pelin et al., 2021). Several studies have also compared ML and traditional techniques; ML has demonstrated enhanced performance for predicting treatment outcomes in depression (Kessler et al., 2016), obsessive compulsive disorder (Lenhard et al., 2018), and psychosis (Koutsouleris et al., 2016). Together, these studies provide preliminary evidence of the potential utility of using ML to enhance psychiatric research.

Although ML techniques have been applied more widely in the fields of depression and schizophrenia, there has been a growing number of studies testing ML in ED research that have yielded encouraging results (Fardouly et al., 2022). For example, ML has shown promise in predicting ED diagnoses using data from surveys (i.e., Linardon et al., 2020), social media posts (i.e., Abuhassan et al., 2023), and neuroimaging scans (i.e., Cerasa et al., 2015). Preliminary evidence has also been found for predicting ED risk and onset (Krug et al., 2021; Mitchison et al., 2023), illness course (Haynos et al., 2021), and treatment response (Espel-Huynh et al., 2021), and ML models have achieved high predictive performance. Several studies have also investigated whether ML outperforms traditional techniques by comparing approaches; some have found that ML enhanced predictive performance (Haynos et al., 2021), and others have found comparable results (Espel-Huynh et al., 2021; Krug et al., 2021). Thus, enhancing understanding of the conditions under which ML performs optimally in this context is critical for maximizing its potential benefits and, ultimately, improving understanding of the complexities of EDs.

Furthermore, the clinical benefits of constructing accurate and robust ML models are substantial. By integrating ML into clinical workflows, health-care providers could offer more precise, timely, and individualized care, ultimately improving patient outcomes and optimizing resource allocation (Lee et al., 2021). For example, ML models processing large amounts of patient intake data may aid in generating differential diagnoses, enabling clinicians to promptly identify conditions aligned with a patient’s symptoms and history. Likewise, ML may be used to predict treatment responsiveness, guiding the selection of appropriate therapies. However, before ML models can be integrated into clinical practice, they must undergo rigorous testing and external validation to ensure they reliably contribute to improved patient outcomes (Steyerberg et al., 2013). Ongoing monitoring and regular updates to these models are also essential to ensure ongoing adherence to current clinical standards and adaptation to emerging data trends (Steyerberg et al., 2013).

Considering the exciting potential ML offers for enhancing diagnosis, prognosis, and treatment-outcome prediction and the increasing number of studies using ML in this field, a systematic scoping review mapping this emerging body of literature is timely. Although previous narrative reviews have provided a preliminary overview of ML applications in the ED field (Fardouly et al., 2022; Ghosh et al., 2024), narrative reviews do not follow predefined, systematic methods and may be subject to selective reporting, making their findings less comprehensive and more prone to bias (Sarkar & Bhatia, 2021). In contrast, a systematic scoping review follows more rigorous, transparent, and replicable methods to comprehensively identify, map, and synthesize the available evidence, ensuring minimized bias and enhanced reproducibility (Peterson et al., 2017). This approach allows for a more structured synthesis of trends, helping to identify common methodologies, gaps in the literature, and directions for future research, which is essential for advancing the field.

Although a systematic review has been conducted on ML and natural language processing for detecting EDs (Merhbene et al., 2024), its focus was limited to a specific application of ML in the field (text-based detection of EDs) rather than the broader landscape of ML applications for ED-outcome prediction. In contrast, our review provides a more comprehensive examination of how ML has been used to predict ED outcomes, encompassing a wider range of methodologies, data types, and clinical applications. This broader scope allows for a more complete understanding of ML’s potential and limitations in ED research, offering insights that are critical to guiding future work in the field.

Thus, we conducted a systematic scoping review aiming to locate, examine, and summarize the existing literature on the application of ML in ED-outcome-prediction research. Specifically, we aimed to understand (a) which ML techniques have been used and in what context, (b) whether there is evidence of ML demonstrating superior performance over traditional statistical techniques, and (c) whether models have been externally validated in new samples.

Method

A systematic-scoping-review methodology was selected given the purpose was to explore, locate, and summarize how ML has been applied in ED-outcome-prediction research. Scoping reviews aim to map the available literature on a given topic, providing an overview of the research conducted to date while allowing key concepts and gaps in knowledge to be identified (Arksey & O’Malley, 2005). A scoping review was also selected given anticipated heterogeneity across study designs, samples, assessment instruments, and so on. The Arksey and O’Malley (2005) methodological framework for scoping reviews and the PRISMA Extension for Scoping Reviews (Tricco et al., 2018) were used to guide the methodology of this review.

Information sources and search strategy

Six online databases (Medline, PsycINFO, Web of Science, EMBASE, Cochrane Database, and ProQuest Dissertations and Theses Global) were searched in February 2023 (updated in April 2024). Terms related to EDs and ML were combined and searched for in the title and abstract (for search terms, see the Supplemental Material available online). Terms related to ML were adapted from prior reviews conducted in the broader health field (Christodoulou et al., 2019; Morgenstern et al., 2020). The reference list of a relevant narrative review (Fardouly et al., 2022) was hand searched to identify any additional studies.

Selection of sources of evidence

Studies were included if they (a) reported on a method or application of an ML technique; (b) evaluated the ML technique’s performance for predicting ED risk, diagnosis, and/or prognosis either naturally or in the context of an intervention; (c) made an original contribution to the literature (i.e., reviews were excluded); and (d) was available in English language. Studies that applied only logistic regression were excluded because this was not regarded as a ML technique (see Christodoulou et al., 2019). Given that we were interested in the prediction of ED risk, onset, and diagnosis, no restrictions were placed on sample type (i.e., nonclinical and clinical samples were included). All study designs were considered for inclusion. Titles and abstracts were screened for relevance by Z. McClure. Full texts were sourced for potentially relevant studies and were reviewed to determine whether full inclusion criteria were met (performed by Z. McClure, J. Linardon, M. Fuller-Tyszkiewicz).

Data extraction

Data were extracted using a template developed for this study. Data extracted included authors; design; study aim; outcome; sample characteristics; data type; predictor variables/features; name of ML techniques used, including key details to help ascertain the predictive performance of the techniques, such as comparison with more traditional techniques (i.e., logistic regression, linear regression); number of outcome events in training set; validation method used; and key findings. Data were extracted by Z. McClure in consultation with J. Linardon.

Risk of bias

For studies comparing ML models with a traditional technique, we assessed for risk of bias using a tool developed by Christodoulou et al. (2019). Each criteria assessed methodological issues of model development and aspects that may compromise comparisons between approaches. Five signaling items are used to indicate potential bias: (a) unclear or biased validation of model performance, (b) difference in whether data-driven variables selection was performed (yes/no) before applying traditional and ML algorithms, (c) difference in handling of continuous variables before applying traditional and ML algorithms, (d) different predictors considered for traditional and ML algorithms, and (e) whether corrections for imbalanced outcomes were used only for traditional algorithms or only for ML algorithms. Each item was scored as “no” (not present), “unclear,” or “yes” (present). Risk of bias was considered high if at least one item was scored unclear or yes. For further detail on items, see Table S1 in the Supplemental Material.

Synthesis of results

Studies were synthesized according to four broad themes that were developed through discussion among authors following the data-extraction process. The themes are (a) “predicting ED onset,” studies that predicted onset of an ED in previously asymptomatic or at-risk samples; (b) “predicting ED risk and symptoms,” studies that predicted ED-related behaviors and cognitions (i.e., binge-eating episodes, weight/shape concerns, severity) in clinical and nonclinical samples (in the absence of predicting diagnostic categories); (c) “predicting ED diagnosis,” studies that aimed to predict the presence versus absence of an ED or to classify different ED subtypes (i.e., anorexia nervosa [AN] vs. bulimia nervosa [BN]); and (d) “prediction of treatment outcome,” studies that aimed to predict responsiveness to a particular intervention program.

Results

Search results

Seventy-five studies were included (for PRISMA flowchart, see Fig. 1; for studies, see Tables S2–S5 in the Supplemental Material). Publication year ranged from 1998 to 2024; 84% were published from 2015 onward (see Fig. S1 in the Supplemental Material). A synthesis of findings is presented for each theme below.

Fig. 1.

PRISMA flowchart.

Predicting ED onset

Study characteristics

Six studies used ML models to predict ED onset (see Table 1). Sample size ranged from 206 to 1,297 (Mdn = 920). Follow-up time ranged from 10 months to 8 years. All but one (Mitchison et al., 2023) used a classification and regression tree (CART) to detect interactions between variables and identify important predictors. None reported conducting external validation of models.

Table 1.

Aggregated Study Characteristics of Studies Predicting ED Onset (k = 6)

	N studies
Diagnosis
AN	3
BN	4
BED	5
Purging disorder	4
Diagnosis unclear	1
Study design
Prospective	4
Pooled data (intervention trials)	2
Follow-up time, range	10 months to 8 years
Data type
Self-reported	6
Interview	3
ML model
CART	5
Random forest	2
Elastic net	1
Traditional technique
Logistic regression	2
No comparison	4
Validation technique
10-fold cross-validation	2
None reported	4
N predictors
1–10	3
11–20	2
21–30	1

Note: ED = eating disorder; AN = anorexia nervosa; BN = bulimia nervosa; BED = binge eating disorder; CART = classification and regression tree; ML = machine learning.

Study findings

Four studies reported classification accuracy (76%–98.2%; Mehl et al., 2019; Mitchison et al., 2023; Stice & Desjardins, 2018; Stice et al., 2002), and one reported the aggregated out-of-bag error rate (Allen et al., 2016; 0.4% error rate for classifying onset in boys, 13.8% error in girls). Two implemented a traditional approach alongside an ML technique, enabling comparison (for studies comparing approaches, see Table S6 in the Supplemental Material). Allen et al. (2016) tested CART and logistic regression. Using six predictors, CART found sex was the most important predictor of ED onset such that adolescent girls displayed higher risk than boys (n = 1,297). A nonlinear interaction was also found between weight and eating concerns and onset for boys and girls, and an interaction between moderate weight and eating concerns and externalizing problems was found for girls. The logistic-regression model failed to identify any two-way interactions. In contrast, Mitchison et al. (2023) found comparable performance between an ML technique and traditional technique (n = 687). Using six predictors, elastic net and logistic regression performed similarly at predicting ED onset over 1 year (both models obtained an area under the curve [AUC] of .75), demonstrating moderate predictive performance.

Predicting ED risk and symptoms

Study characteristics

Thirteen studies used ML models to predict ED risk and symptoms (see Table 2). Studies aimed to predict the occurrence of specific ED behaviors, cognitions, or general ED risk. Sample size ranged from 13 to 11,620 (Mdn = 371). Most recruited nonclinical samples and used self-reported data as input. Several ML models were tested; CART and random forest were the most common. Only one reported conducting external validation (X. Chen et al., 2023). For ease of readability, findings were synthesized according to sample type and by data type for nonclinical samples.

Table 2.

Aggregated Study Characteristics of Studies Predicting ED Risk and Symptoms (k = 13)

	N studies
Study design
Prospective	6
Cross-sectional	7
Sample type
Nonclinical	9
Clinical	4
Data type
Self-reported	10
Self-reported + cognitive tasks	1
Brain imaging	2
Physiological	1
ML model
CART	3
Random forest	4
Elastic net	2
Deep neural network	1
LASSO	1
SVM	1
Shallow neural network	1
BISCUIT	1
Hybrid (neural network + SEM)	1
Bayesian networks	1
Traditional technique
Linear regression	1
Logistic regression	3
SEM	1
No comparison	8
Validation technique
10-fold cross-validation	7
Leave 1 out cross-validation	1
None reported	5
N predictors
1–10	6
11–20	2
21–30
31–40	2
60–70	1
Unclear	2

Note: Clinical samples confirmed an ED diagnosis via diagnostic interview. ED = eating disorder; ML = machine learning; CART = classification and regression tree; LASSO = least absolute shrinkage and selection operator; SVM = support-vector machine; BISCUIT = best item scales that are cross-validated, unit weighted, informative and transparent; SEM = structural equation modeling.

Clinical samples

Four studies (all prospective) predicted symptoms in a clinical sample. Three used self-reported ecological momentary assessments (EMAs) as input obtained via smartphone (Arend et al., 2023; Levinson et al., 2022) or a blood-glucose-monitoring device (Presseller et al., 2024). Presseller et al. (2024) found a random-forest model could distinguish eating and noneating episodes in binge eating disorder (BED) and BN with high accuracy (82%), and Arend et al. (2023) reported good model performance for predicting binge eating in women with BN and BED (AUC = .80, best items scale that is cross-validated, unit-weighted, informative, and transparent algorithm; for explanation of algorithm, see Table S3 in the Supplemental Material). Only Levinson et al. (2022) employed ML models (CART, random forest, support-vector machine [SVM], shallow neural network) alongside a traditional technique (logistic regression; n = 60). A shallow-neural-network model performed best, predicting behavioral symptoms with high accuracy (> 75%) across several time points. However, performance metrics were reported only for the shallow-neural-network model, so the degree to which it outperformed logistic regression is unclear.

Haynos et al. (2021) demonstrated superior performance of ML models (elastic net and random forest) compared with logistic regression for predicting symptom trajectories over 2 years in transdiagnostic women (n = 415). When researchers used more than 30 baseline predictors (self-report and interview), ML models achieved higher mean AUCs across outcomes, including ED diagnostic status, presence of binge eating, compensatory behaviors, and underweight body mass index (elastic net: range = 0.61–0.93; random forest: range = 0.62–0.92; logistic regression: range = 0.47–0.83).

Nonclinical sample

Self-reported data

Seven studies recruiting a nonclinical sample used self-reported data as input to predict symptoms. Least absolute shrinkage and selection operator (LASSO) or Bayesian networks were used to identify ED risk factors (Bercht & Costa, 2023; Han & Zhang, 2021). Two used CART to predict orthorexia nervosa risk (Dell’Osso et al., 2018) or general ED risk (Ren et al., 2022), correctly classifying 67% to 88% of high-risk participants. Three other studies predicting ED symptoms compared ML with a traditional technique (Kheirollahpour et al., 2020; Liang et al., 2022; Mitchison et al., 2023). Kheirollahpour et al. (2020) found a hybrid ML model (R² = .552) outperformed structural equation modeling (SEM; R² increased by 27%) for predicting eating-behavior patterns using three predictors (n = 340). In contrast, Liang et al. (2022) found no performance benefit between random forest, deep neural network, and linear regression for predicting body-image disturbances (n = 11,620) using four predictors. Mitchison et al. (2023) also reported similar performance between elastic net (AUC = .64) and logistic regression (AUC = .62) for predicting symptom persistence using three predictors (n = 276).

Brain-imaging data

Two studies used ML models to explore brain structure and function as predictors of symptom variation. X. Chen et al. (2023) sampled Chinese students and used connectome-based predictive modeling (n = 660). Although models failed to predict binge/purge behavior, connectivity between networks involved in cognitive control, reward sensitivity, and visual perception predicted increased body-image concerns (significant correlations between actual and predicted scores). Findings were replicated in an independent sample (n = 821). X. Chen et al. (2022) also reported good model performance of a linear SVM for predicting binge eating using brain structure and function indexes in primary school children (n = 76).

Predicting ED diagnosis

Study characteristics

Forty-five articles applied ML to classify individuals with an ED or differentiate ED subtypes (see Table 3). Many studies focused on classifying AN versus healthy control subjects (k = 18). Sample size ranged from 30 to 1,165,000 (Mdn = 423). Several data types were used as input; text from social media profiles were the most common. Most studies undertaking text classification implemented more than one ML technique, proposing custom models combining natural-language-processing techniques (i.e., feature extraction) with supervised-learning models (i.e., SVM, random forest) or deep-learning models (i.e., neural networks, transformer). Studies using other data types mostly implemented one ML technique; SVM was the most common. External validation was reported in one study (Burstein et al., 2023). Findings are synthesized below according to the data type used as model input, with emphasis on noteworthy studies (i.e., large sample size) and/or those that implemented ML alongside traditional techniques.

Table 3.

Aggregated Study Characteristics of Studies Predicting ED Diagnostic Categories (k = 45)

	N studies
Outcome
ED cases vs. healthy control subjects	38
Distinguish ED subtypes	10
Distinguish ED from other mental-health conditions	2
Study design
Text analysis	16
Cross-sectional	22
Retrospective	4
Prospective + case control	1
Experimental	1
Case study	1
Data type
Text	16
Self-report	11
Brain imaging	8
Physiological	8
Internet browsing	1
Electronic health record	1
Cognitive tasks	1
ML model
Text classification
Natural language processing	11
Deep learning	9
Custom	5
Naive bayes	4
CART	3
Random forest	5
Multilayer perceptron	4
SVM	7
AdaBoost	2
K nearest neighbor	1
XGBoost	1
Other data types
SVM	11
CART	6
Neural network	4
LASSO	4
Random forest	4
K-means clustering	1
Multilayered perceptron	2
Naive Bayes	3
Prediction rule ensemble	1
Linear relevance vector machine	1
Partial least squares	1
Custom	3
Elastic net	1
Traditional technique
Logistic regression	10
Latent profile analysis	1
No comparison	33
Validation technique
5-fold cross-validation	8
10-fold cross-validation	12
20-fold cross-validation	1
Leave 1 out cross-validation	8
Silhouette method	1
v-fold cross validation^a	1
Not reported	15
Number of predictors
1–10	10
11–20	4
21–30	1
31–40	1
41–50	3
51–60	2
61–70	1
71–80
81–90
90–100	3
100+	2
Unclear	18

Note: ED = eating disorder; ML = machine learning; CART = classification and regression tree; LASSO = least absolute shrinkage and selection operator; SVM = support-vector machine.

The value of v was not specified.

Textual data

Sixteen studies used textual data as input. Most used data from posts on social media platforms Twitter and Reddit (k = 14) and aimed to classify ED cases versus healthy control subjects. Most reported F1 scores, ranging from poor to excellent performance across studies (F1s = 0.43–0.98). One noteworthy study is Abuhassan et al. (2023), who collected a large sample of Twitter biographies from 1,165,000 users and classified them into five groups: individuals with an ED, health-care professionals, communicators, health-care professional communicators, and other. They implemented a deep-learning model based on bidirectional encoder representations (BERT) and long short-term memory (LSTM), achieving a high classification accuracy (98.37%). Four studies compared the performance of ML models with logistic regression (Fano et al., 2019; Noguero et al., 2023; Ramirez-Cifuentes et al., 2018; Uban et al., 2021; ns = 177–1,288). All but one (Noguero et al., 2023) found that an ML model performed better. The best performing models were a multilayer perceptron with GloVe vectors (F1 = 0.78; logistic regression: F1 = 0.56), SVM (F1 = 0.85; logistic regression: F1 = 0.76), and LSTM hierarchical attention network (F1 = 0.61; logistic regression: F1 = 0.49).

Self-reported data

Studies using self-reported data as input were cross-sectional (k = 8) or retrospective (k = 2) or used data from a longitudinal and case-control study design (k = 1). ML models generally achieved high accuracy (> 70%) for classifying ED cases or differentiating subtypes. Three studies compared ML models with a traditional approach (Krug et al., 2021; Orru et al., 2021; Sandoval-Araujo et al., 2024). Orru et al. (2021) used four predictors, and the others used 43 to 51 predictors. One reported comparable performance between ML models and logistic regression (Krug et al., 2021), although they noted that ML achieved similar performance using fewer predictors, leading to a more parsimonious model (AUCs = .69–.82). In contrast, Sandoval-Araujo et al. (2024) found ML models (CART: AUC = .81; random forest: AUC = .79) outperformed logistic regression (AUC = .62) for classifying AN versus atypical AN, whereas Orru et al. (2021) found ML models (naive Bayes, SVM, random forest; AUCs = .81–.90) were superior to logistic regression (AUC = .77) for classifying ED cases versus healthy control subjects. Overall, ML models tended to perform better at distinguishing between ED cases and healthy control subjects compared with distinguishing between ED subtypes (i.e., AN vs BN).

Brain-imaging data

Seven studies investigated using ML (SVM: k = 5; LASSO: k = 1; linear relevance vector: k = 1) with brain-imaging data to identify biomarkers of EDs that may distinguish EDs from healthy control subjects (ns = 30–658; Arold et al., 2023; Cerasa et al., 2015; Cyr et al., 2018; Lavagnino et al., 2015, 2018; Weygandt et al., 2012; Zheng et al., 2023). Studies were primarily interested in exploring whether patterns in brain structure or function could be used to classify people at the individual patient level, and most reported high accuracy (> 70%). One study that examined brain-activation patterns involved in food-cue processing was also able to distinguish BN from BED cases with high accuracy (84%; Weygandt et al., 2012). Another found that features extracted from diffusion tensor imaging classified AN versus BN cases with good performance (AUC = .79; Zheng et al., 2023).

A case study (Strigo et al., 2017) used brain-activation patterns during anticipation and experience of high pain conditions to recommend a diagnosis of either AN, gastrointestinal problems (GIP), or depression. An SVM classifier trained on samples with recovered AN, GIP, or depression was implemented with a 15-year-old female with overlapping ED, depressive, and gastrointestinal symptoms. The model (accuracy of 56% when trained on diagnostic samples) classified the subject into the gastrointestinal group. These findings were corroborated by a second model using participant self-reported behavioral measures (84% when trained on diagnostic samples).

Physiological data

Studies using physiological data as input categorized individuals with AN versus healthy control subjects using eye-gazing tracking in response to a body-image-related visual-scanning task (Liu et al., 2021), concentrations of several trace elements (Zhao et al., 2004), and genotypic and phenotypic data (Guo et al., 2016). One classified AN versus BN cases with moderate performance (AUC = .72) using several metabolic indices (Dönmez et al., 2023). Two studies used electroencephalography data to accurately classify people with BED versus without BED (Raab et al., 2020; accuracy = 81.25%) and AN cases versus healthy control subjects (Karavia et al., 2024; accuracy = 75%–85%). Guo et al. (2016) was the only study to compare ML with a traditional approach, finding no benefit of SVM (AUC = .69) over logistic regression (AUC = .69) for classifying AN cases (n = 4,402).

Predicting treatment outcome

Study characteristics

Twelve studies predicted response to ED treatment (see Table 4). Most were predicting treatment outcomes to inpatient treatment and used self-reported data as input. A range of ML models were implemented; random forest was the most common. Sample size ranged from 36 to 826 (Mdn = 262). No study reported conducting external validation.

Table 4.

Aggregated Study Characteristics of Studies Predicting ED Treatment Outcome (k = 12)

	N studies
Study design
Prospective	8
RCT (pooled data)	3
Treatment approach
Inpatient	6
Digital intervention	2
Pharmacological	1
Outpatient	1
Behavioral weight loss and stepped care	1
Physical activity	1
Data type
Self-report	10
Physiological data	1
Physical activity count data	1
ML model
SVM	4
Elastic net	2
Random forest	6
CART	3
Naive bayes	3
K nearest neighbor	3
LASSO	2
Bayesian quadratic discriminant analysis	1
AdaBoost	1
XGBoost	1
Traditional technique
Logistic regression	4
Linear regression	2
No comparison	7
Validation technique
10-fold cross-validation	6
5-fold cross-validation	2
50-fold cross-validation	2
Leave 1 out cross-validation	1
None reported	1
N predictors
1–10	2
11–20	3
21–30	1
31–40	1
41–50	1
80	2
Unclear	2

Note: ED = eating disorder; ML = machine learning; RCT = randomized controlled trial; CART = classification and regression tree; LASSO = least absolute shrinkage and selection operator; SVM = support-vector machine.

Inpatient treatment

Studies show promise for using ML models to predict inpatient treatment response. CART and random forest were useful for identifying important predictors of good treatment outcome versus poor treatment outcome in BN (Hannöver et al., 2002; n = 630, model misclassification rate = 18%) and treatment dropout in a transdiagnostic sample (Todisco et al., 2023; n = 420). Four studies used supervised-learning techniques to classify people into different treatment-response categories; studies reported good performance (accuracy > 75%; Espel-Huynh et al., 2021; Espel-Huynh & Lowe, 2019; Ioannidis et al., 2020). Only one compared ML (SVM, k nearest neighbor) with a traditional approach (Espel-Huynh et al., 2021; n = 333), classifying participants into three different response trajectories (rapid, gradual, low symptom static). All models were trained using either 80 predictors or three predictors. SVM (radial) with three predictors was the best performing ML model (AUC = .94) but did not significantly outperform logistic regression (AUC = .93).

Digital intervention

Two studies applied ML models to predict response to ED digital interventions. von Brachel et al. (2014) used LASSO to identify important predictors of dropout to an online motivational program for women with an ED. Linardon et al. (2022) compared predictive performance of several ML models with linear regression for predicting engagement and symptom-level change (n = 826). ML models did not significantly outperform linear regression using 36 self-reported baseline predictors (across models for engagement outcomes: AUCs = .48–.52; for symptom change: R²s = .15–.40). However, predictive performance improved considerably across models (except SVM radial) for dropout (AUCs = .92–.99) and adherence outcomes (AUCs = .62–.93) when intervention-usage-pattern variables were added as input (although ML had comparable performance with traditional regression).

Other treatment types

Other treatment types included general outpatient treatment for EDs (Svendsen et al., 2023), pharmacological treatment for BED (Goyal et al., 2022), and behavioral weight loss and stepped care treatment for BED (Forrest et al., 2021). Although Svendsen et al. (2023) did not compare ML performance with a traditional approach, their model performed better than chance at predicting treatment nonresponse, demonstrating high precision (positive predictive value = 70%–71%) and sensitivity (78%–95%). Note that a synthetic data set was used to train the model. Two other studies predicting likelihood of placebo response (Goyal et al., 2022; n = 189) and reduction in binge eating, ED psychopathology, and weight loss (Forrest et al., 2021; n = 191) in adults with BED compared ML techniques with traditional techniques. Only Goyal et al. (2022) found better performance of an ML model (Gaussian naive Bayes; accuracy = 72%, specificity = 88%, sensitivity = 63%) compared with logistic regression (accuracy = 66%, specificity = 53%, sensitivity = 73%).

Risk of bias assessment

Of 19 studies that implemented a traditional technique alongside an ML model, seven were assessed as having a high risk of bias (see Table S7 in the Supplemental Material). Five of these reported greater predictive performance of ML models. The most frequently unmet criterion was unclear or biased validation performance (Criterion 1; 7/19 studies scored yes or unclear), often because of insufficient reporting on whether all model-building steps, such as feature selection and hyperparameter tuning, were repeated in each validation fold.

Discussion

This scoping review located, examined, and summarized existing literature on the application of ML in ED-outcome-prediction research. Specifically, we aimed to investigate (a) which ML techniques have been used and the contexts in which they have been applied, (b) whether there is evidence of ML outperforming traditional techniques, and (c) the extent to which models have been externally validated. We included 75 studies. Below, we summarize several key findings and trends in the literature pertaining to the three key aims of the review.

Summary of key findings

Synthesis of studies identified four broad domains in which ML techniques have been applied. These included predicting (a) onset of an ED (k = 6), (b) presence or severity of symptoms and risk factors (k = 13), (c) ED diagnostic categories (i.e., ED vs. healthy control subjects, AN vs. BN; k = 45), and (d) outcomes during treatment/intervention (k = 12).

We found that there were several different types of data used as input to predict outcomes, yet studies mostly relied on one type of data to generate prediction models. The most common data type used was self-reported data; most studies using this data type reported moderate to high predictive performance across the four domains. Other types of data used for predictive models—particularly when the goal was to predict diagnostic categories—were textual data, brain-imaging data, and physiological data. For example, studies implemented natural-language-processing techniques with ML models to analyze and classify text; many demonstrated the potential to detect ED cases through content posted on an individual’s social media profile (e.g., Aragón et al., 2020; Benitez-Andrades et al., 2023; Fano et al., 2019). Several studies also showed preliminary evidence for using brain-imaging data with ML models to identify biomarkers that can accurately classify ED cases or predict symptom variation (e.g., Cerasa et al., 2015; X. Chen et al., 2023; Lavagnino et al., 2015, 2018).

Although a number of ML techniques were tested across studies, the most common were CART, random forest, and SVM. Several studies implementing CART suggested that interpretability may be a key advantage of this model; studies highlighted its utility for identifying cut points associated with ED-risk/diagnostic categories and interactions between predictors (e.g., Allen et al., 2016; Hannöver et al., 2002; Linardon et al., 2020; Ren et al., 2022; Stice et al., 2011). Although authors were overall optimistic about the performance of ML models, external validation of findings was reported in only two studies (Burstein et al., 2023; X. Chen et al., 2023), which limits understanding of model robustness and out-of-sample generalizability.

There is emerging evidence supporting the potential for ML to outperform traditional techniques, although findings are mixed. In particular, 12 of 19 (63%) studies comparing approaches reported higher predictive performance of ML models compared with traditional techniques. However, five of these studies were assessed as being high risk of bias, indicating that findings should be interpreted with caution given that some reported advantages may be influenced by methodological limitations. Although most of these indicated enhanced performance through observing greater predictive accuracy or classification ability, one noted superior performance because of ML producing a more parsimonious model (but similar AUC to logistic regression; Krug et al., 2021). Likewise, another found that a ML model (CART) was more useful for detecting significant, nonlinear interactions compared with logistic regression (Allen et al., 2016). It appears that ML may improve text-classification tasks given that three studies found ML performed better at classifying ED cases based on social media content (Fano et al., 2019; Ramirez-Cifuentes et al., 2018; Uban et al., 2021). However, note that the traditional approach still performed relatively well at text-classification tasks, indicating that it may be worth considering whether the incremental benefits of ML outweigh the potential complexities that may come with implementing ML models in this context. This consideration is particularly relevant for text-classification tasks, which often rely on more complex ensemble or deep-learning methods (i.e., LSTM). These models can introduce additional challenges related to interpretability, transparency, computational demand, and clinical implementation, which must be weighed against their potential performance benefits (Miotto et al., 2018).

In contrast, studies finding comparable performance suggest there are circumstances when simpler traditional techniques perform just as well. This may be the case in studies with smaller samples, limited number and type of predictors, and predictors that are either weakly or linearly related to the outcome (Chekroud et al., 2021). Note, however, that some studies that had a large number of predictors (i.e., 80 predictors; Espel-Huynh et al., 2021) or a large sample size (Liang et al., 2022) found minimal benefits, suggesting that multiple factors may determine the relative benefits of ML. However, given that studies comparing approaches were limited and heterogenous, it is difficult to know for certain which specific contexts ML techniques show superior performance in this field. Further studies comparing approaches in similar contexts (i.e., similar samples, number and type of predictors, and outcomes) and under conditions hypothesized to optimize ML performance (i.e., diverse data types, larger samples) are needed to provide a more robust understanding of when ML techniques should be used over traditional approaches.

Literature gaps and future directions

Although the application of ML in predicting ED outcomes is growing and has been met with great enthusiasm, this synthesis identified a number of important literature gaps and directions for future research. We focus on four key gaps.

First, given that research has primarily applied ML to predicting ED diagnosis, knowledge of how ML techniques may be used to facilitate more personalized support, predict the evolving nature and complexities of EDs over time, and forecast responsiveness to treatment is limited. This is an unfortunate oversight and may explain why even the field’s best treatments (i.e., cognitive-behavior therapy) produce modest outcomes (Linardon & Wade, 2018). Future research should seek to explore how ML techniques can be leveraged to deliver more personalized treatments plans that are tailored to an individual’s unique symptom profile and are administered at critical moments. For example, future studies may investigate how ML can be used to deliver such support through just-in-time adaptive interventions (JITAIs). JITAIs offer timely support through technological means (i.e., smartphones) using real-time analysis of passive smartphone or sensor data (i.e., usage patterns, social media interaction, location, heart rate, movement) or low-burden self-reporting (i.e., EMA; Juarascio et al., 2018; Nahum-Shani et al., 2018). ML may enhance JITAIs by predicting the optimal timing and type of intervention to deliver to individuals in moments of need (i.e., high risk for displaying ED behaviors). ML-enhanced JITAIs are emerging across the health field, demonstrating promise in areas including weight-loss interventions (Forman et al., 2019) and alcohol-consumption reduction (Bae et al., 2018). Although some studies used ML models with EMA (Arend et al., 2023; Levinson et al., 2022) and sensor data (Lekkas et al., 2023) to predict ED symptoms, it may be beneficial to also explore how predictive models can be used with these data to deliver JITAIs, providing timely support to people during high-risk moments.

Second, although across studies there were several different types of data used as input (i.e., self-report, neuroimaging, text, physiological), studies rarely implemented more than one data type in a single model. The ability to integrate and process complex and varied data sources is a key advantage of ML and may improve prediction by enabling novel interactions and associations to be discovered between a diverse set of input features (Iniesta et al., 2016). An example of this was seen in Linardon et al. (2022) when ML model performance significantly improved after including usage-pattern variables alongside an initial set of self-reported baseline predictors for predicting responsiveness to ED digital interventions. In clinical settings, it may also be useful to explore whether combining routinely collected clinical data with health-service-use data improves prediction of therapeutic outcomes given that this information is readily accessible in clinical contexts. However, it is important to consider how increasing model complexity may also limit interpretability. Future research should prioritize investigating how diverse types of data sources may be integrated in models to enhance prediction while also evaluating the feasibility of applying these models in clinical settings.

Third, many studies had relatively small sample sizes in the context of ML (n < 1,000) and did not conduct external validation of models on new, independent samples. ML models generally require larger samples to optimize model performance and mitigate the risk of overfitting (Yarkoni & Westfall, 2017). Overfitting occurs when the model captures noise in the training data rather than the true underlying patterns, causing the model to perform well on the training data but fail to generalize to new data (Yarkoni & Westfall, 2017). Prior recommendations suggest avoiding using ML models with fewer than several hundred observations (Poldrack et al., 2020). Furthermore, it is also critical that future studies conduct external validation of their models on new, independent samples because only two studies reported doing so (Burstein et al., 2023; X. Chen et al., 2023). External validation is crucial for mitigating the risk of overfitting and determining whether a model is generalizable to new data that span different clinical settings, populations, and subgroups (Riley et al., 2016). Without this, it remains unclear whether results would replicate in conditions outside the samples in which the models were developed. This is particularly pertinent given that many studies developed models on samples consisting of predominantly White, educated female participants. The potential for racial, ethnic, and gender biases in ML models is a well-recognized concern in the development, validation, and implementation of ML models (Hooker, 2021). Training and testing of models in populations that are traditionally underrepresented in ED research is critical to establish more robust and generalizable models.

To facilitate the collection of large, heterogeneous (e.g., self-report, neuroimaging, social media) data sets, we recommend cross-institutional and international collaborations among researchers in the ED field. An example of cross-country collaboration can be seen by Krug et al. (2021), who collaborated to collect a large clinical sample (n = 1,402) across six different centers in Europe. Data collection may also be facilitated by establishing clinical registries. For example, in Australia, the TrEAT registry—a clinical registry for EDs—includes data from more than 10 clinics and provides a valuable resource for researchers (University of Technology Sydney, n.d.). Such registries may enable the application of ML techniques to address complex questions and understand clinical variability in real-world settings. In addition, EHRs and passive data collected via smartphones (i.e., social media interactions, communication patterns, usage patterns) are increasingly being used to develop risk signatures across other psychiatric disorders (Chekroud et al., 2021). Such data also provide the opportunity to collect large amounts of information across a variety of domains and time points while being low effort for participants.

Fourth, once models have undergone external validation to establish model generalizability, the clinical application of models should be assessed. Specifically, it is important to understand the feasibility of implementing ML models into real-world settings and their actual clinical value. This includes evaluating the usability of the model by clinicians, the incremental utility of the model compared with current clinical practice (i.e., whether the accuracy of ML models for identifying ED cases improved on clinical diagnostic accuracy), and the cost-effectiveness of implementing the model into clinical workflows (Cearns et al., 2019; Steyerberg et al., 2013). To achieve this, external validation should be followed by clinical-impact studies that evaluate the real-world effectiveness of ML-based tools for improving patient outcomes and optimizing management of care (Steyerberg et al., 2013). It is also crucial to conduct research aimed at understanding clinicians’ perspectives, concerns, and potential barriers to using ML in practice. Ongoing collaboration between researchers and clinicians is required to facilitate acceptability of ML-based systems and ensure compatibility with clinical practice and guidelines (Dwyer et al., 2018).

Limitations of this review

A limitation of this review is the lack of preregistration. Because of the broad and exploratory nature of this scoping review, it was difficult to predetermine rigid parameters for the research questions, methodology, and data synthesis. Although this approach allowed for greater flexibility, enabling our methods to evolve with data, preregistration may have enhanced transparency and reduced the risk of bias. However, we sought to mitigate this by adhering to established scoping-review frameworks and ensuring a systematic and replicable approach to study selection and data extraction.

Conclusion

In conclusion, research applying ML to ED-outcome prediction is rapidly emerging, with 75 studies identified in this scoping review. Although the utility of ML is promising in this field, the available research largely contains proof-of-concept studies that simply demonstrate feasibility of these advanced computational approaches. It is evident from this review that larger-scale validation studies are required to establish more robust and generalizable findings.

Supplemental Material

sj-docx-1-cpx-10.1177_21677026251340348 – Supplemental material for Machine-Learning Applications in Eating-Disorder-Outcome Prediction: A Systematic Scoping Review

Supplemental material, sj-docx-1-cpx-10.1177_21677026251340348 for Machine-Learning Applications in Eating-Disorder-Outcome Prediction: A Systematic Scoping Review by Zoe McClure, Matthew Fuller-Tyszkiewicz, Mariel Messer and Jake Linardon in Clinical Psychological Science

Footnotes

Acknowledgements

Z. McClure wishes to acknowledge the support of the Australian government provided through an Australian Government Research Training Program Scholarship.

Transparency

Action Editor: Kelsie T. Forbush

Editor: Jennifer L. Tackett

Author Contributions

Zoe McClure: Conceptualization; Formal analysis; Investigation; Methodology; Project administration; Visualization; Writing – original draft; Writing – review & editing.

Matthew Fuller-Tyszkiewicz: Conceptualization; Supervision; Writing – original draft; Writing – review & editing.

Mariel Messer: Conceptualization; Supervision; Writing – review & editing.

Jake Linardon: Conceptualization; Formal analysis; Methodology; Supervision; Visualization; Writing – original draft; Writing – review & editing.

ORCID iD

Zoe McClure

Supplemental Material

Additional supporting information can be found at

References

Abuhassan

Anwar

Fuller-Tyszkiewicz

Jarman

H. K.

Shatte

Liu

C. F.

Sukunesan

(2023). Classification of Twitter users with eating disorder engagement: Learning from the biographies. Computers in Human Behaviour, 140, Article 107519. https://doi.org/10.1016/j.chb.2022.107519

Ali

Farrer

Fassnacht

D. B.

Gulliver

Bauer

Griffiths

K. M.

(2017). Perceived barriers and facilitators towards help-seeking for eating disorders: A systematic review. International Journal of Eating Disorders, 50(1), 9–21. https://doi.org/10.1002/eat.22598

Allen

K. L.

Byrne

S. M.

Crosby

R. D.

Stice

(2016). Testing for interactive and non-linear effects of risk factors for binge eating and purging eating disorders. Behaviour Research and Therapy, 87, 40–47. https://doi.org/10.1016/j.brat.2016.08.019

Amoretti

Verdolini

Mezquida

Rabelo-da-Ponte

F. D.

Cuesta

M. J.

Pina-Camacho

Gomez-Ramiro

De-la-Camara

Gonzalez-Pinto

Díaz-Caneja

C. M.

(2021). Identifying clinical clusters with distinct trajectories in first-episode psychosis through an unsupervised machine learning technique. European Neuropsychopharmacology, 47, 112–129. https://doi.org/10.1016/j.euroneuro.2021.01.095

Aragón

M. E.

López-Monroy

A. P.

González

L. C.

Montes-y-Gómez

(2020, September 8–11). Attention to emotions: Detecting mental disorders in social media [Conference session]. Text, Speech, and Dialogue: 23rd International Conference, TSD 2020, Brno, Czech Republic.

Arend

A.-K.

Kaiser

Pannicke

Reichenberger

Naab

Voderholzer

Blechert

(2023). Toward individualized prediction of binge-eating episodes based on ecological momentary assessment data: Item development and pilot study in patients with bulimia nervosa and binge-eating disorder. JMIR Medical Informatics, 11, Article e41513. https://doi.org/10.2196/41513

Arksey

O’Malley

(2005). Scoping studies: Towards a methodological framework. International Journal of Social Research Methodology, 8(1), 19–32. https://doi.org/10.1080/1364557032000119616

Arold

Bernardoni

Geisler

Doose

Uen

Boehm

Roessner

King

J. A.

Ehrlich

(2023). Predicting long-term outcome in anorexia nervosa: A machine learning analysis of brain structure at different stages of weight recovery. Psychological Medicine, 53(16), 7827–7836. https://doi.org/10.1017/S0033291723001861

Bae

Chung

Ferreira

Dey

A. K.

Suffoletto

(2018). Mobile phone sensors and supervised machine learning to identify alcohol use events in young adults: Implications for just-in-time adaptive interventions. Addictive Behaviors, 83, 42–47. https://doi.org/10.1016/j.addbeh.2017.11.039

10.

Benitez-Andrades

J. A.

Garcia-Ordas

M. T.

Russo

Sakor

Rotger

L. D. F.

Vidal

M. E.

(2023). Empowering machine learning models with contextual knowledge for enhancing the detection of eating disorders in social media posts. Semantic Web, 14(5), 873–892. https://doi.org/10.3233/SW-223269

11.

Bercht

A. M.

Costa

A. B.

(2023). Objectification theory: Applicability in a sample of Rio Grande do Sul/Brazil. Psicologia: Teoria e Pesquisa, 39. https://doi.org/10.1590/0102.3772e39412.en

12.

Burstein

Griffen

T. C.

Therrien

Bendl

Venkatesh

Dong

Modabbernia

Zeng

Mathur

Hoffman

Sysko

Hildebrandt

Voloudakis

Roussos

(2023). Genome-wide analysis of a model-derived binge eating disorder phenotype identifies risk loci and implicates iron metabolism. Nature Genetics, 55(9), 1462–1470. https://doi.org/10.1038/s41588-023-01464-1

13.

Bzdok

Meyer-Lindenberg

(2018). Machine learning for precision psychiatry: Opportunities and challenges. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 3(3), 223–230. https://doi.org/10.1016/j.bpsc.2017.11.007

14.

Cearns

Hahn

Baune

B. T.

(2019). Recommendations and future directions for supervised machine learning in psychiatry. Translational Psychiatry, 9(1), Article 271. https://doi.org/10.1038/s41398-019-0607-2

15.

Cerasa

Castiglioni

Salvatore

Funaro

Martino

Alfano

Donzuso

Perrotta

Gioia

M. C.

Gilardi

M. C.

Quattrone

(2015). Biomarkers of eating disorders using support vector machine analysis of structural neuroimaging data: Preliminary results. Behavioural Neurology, 2015, Article 924814. https://doi.org/10.1155/2015/924814

16.

Chekroud

A. M.

Bondar

Delgadillo

Doherty

Wasil

Fokkema

Cohen

Belgrave

DeRubeis

Iniesta

Dwyer

Choi

(2021). The promise of machine learning in predicting treatment outcomes in psychiatry. World Psychiatry, 20(2), 154–170. https://doi.org/10.1002/wps.20882

17.

Chekroud

A. M.

Gueorguieva

Krumholz

H. M.

Trivedi

M. H.

Krystal

J. H.

McCarthy

(2017). Reevaluating the efficacy and predictability of antidepressant treatments: A symptom clustering approach. JAMA Psychiatry, 74(4), 370–378. https://doi.org/10.1001/jamapsychiatry.2017.0025

18.

Chen

Dong

Zhou

Gao

Liu

Wang

Qin

Tian

Xiao

Qiu

Feng

Lei

Chen

(2023). Connectome-based prediction of eating disorder-associated symptomatology. Psychological Medicine, 53(12), 5786–5799. https://doi.org/10.1017/S0033291722003026

19.

Chen

Qin

Gao

Liu

Song

Huang

Chen

(2022). Gray matter volume and functional connectivity underlying binge eating in healthy children. Eating and Weight Disorders, 27(8), 3469–3478. https://doi.org/10.1007/s40519-022-01483-7

20.

Chen

Z. S.

Kulkarni

Galatzer-Levy

I. R.

Bigio

Nasca

Zhang

(2022). Modern views of machine learning for precision psychiatry. Patterns, 3(11), Article 100602. https://doi.org/10.1016/j.patter.2022.100602

21.

Christodoulou

Collins

G. S.

Steyerberg

E. W.

Verbakel

J. Y.

Van Calster

(2019). A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. Journal of Clinical Epidemiology, 110, 12–22. https://doi.org/10.1016/j.jclinepi.2019.02.004

22.

Collins

G. S.

Moons

K. G. M.

Dhiman

Riley

R. D.

Beam

A. L.

Ben Van

Ghassemi

Liu

Reitsma

J. B.

Maarten van

Boulesteix

A.-L.

Camaradou

J. C.

Celi

L. A.

Denaxas

Denniston

A. K.

Glocker

Golub

R. M.

Harvey

Heinze

. . . Logullo

(2024). TRIPOD+AI statement: Updated guidance for reporting clinical prediction models that use regression or machine learning methods. The BMJ, 385, Article q902. https://doi.org/10.1136/bmj-2023-078378

23.

Cyr

Yang

Horga

Marsh

(2018). Abnormal fronto-striatal activation as a marker of threshold and subthreshold bulimia nervosa. Human Brain Mapping, 39(4), 1796–1804. https://doi.org/10.1002/hbm.23955

24.

Dell’Osso

Carpita

Muti

Cremone

I. M.

Massimetti

Diadema

Gesi

Carmassi

(2018). Prevalence and characteristics of orthorexia nervosa in a sample of university students in Italy. Eating and Weight Disorders, 23(1), 55–65. https://doi.org/10.1007/s40519-017-0460-3

25.

Dönmez

R. B.

Demirel

T. N.

Bilgin

Tarhan

Örkçü

Ö.

Ceylan

Guleken

(2023). Comparative and predictive analysis of clinical and metabolic features of anorexia nervosa and bulimia nervosa. Addiction & Health, 15(4), 230–239. https://doi.org/10.34172/ahj.2023.1466

26.

Dwyer

D. B.

Falkai

Koutsouleris

(2018). Machine learning approaches for clinical psychology and psychiatry. Annual Review of Clinical Psychology, 14, 98–118. https://doi.org/10.1146/annurev-clinpsy-032816-045037

27.

Espel-Huynh

Zhang

Thomas

J. G.

Boswell

J. F.

Thompson-Brenner

Juarascio

A. S.

Lowe

M. R.

(2021). Prediction of eating disorder treatment response trajectories via machine learning does not improve performance versus a simpler regression approach. International Journal of Eating Disorders, 54(7), 1250–1259. https://doi.org/10.1002/eat.23510

28.

Espel-Huynh

H. M.

Lowe

M. R.

(2019). Measurement and prediction of treatment outcome among residential eating disorder patients using a novel progress monitoring measure (Publication No. 13896746) [Doctoral dissertation, Drexel University]. ProQuest Dissertations & Theses Global.

29.

Fano

Nivre

Karlgren

(2019). A comparative study of word embedding methods for early risk prediction on the Internet (Publication No. 27812338) [Master’s thesis, Uppsala Universitet]. ProQuest Dissertations & Theses Global.

30.

Fardouly

Crosby

R. D.

Sukunesan

(2022). Potential benefits and limitations of machine learning in the field of eating disorders: Current research and future directions. Journal of Eating Disorders, 10(1), Article 66. https://doi.org/10.1186/s40337-022-00581-2

31.

Fernández-Aranda

Treasure

Paslakis

Agüera

Giménez

Granero

Sánchez

Serrano-Troncoso

Gorwood

Herpertz-Dahlmann

(2021). The impact of duration of illness on treatment nonresponse and drop-out: Exploring the relevance of enduring eating disorder concept. European Eating Disorders Review, 29(3), 499–513. https://doi.org/10.1002/erv.2822

32.

Forman

E. M.

Goldstein

S. P.

Crochiere

R. J.

Butryn

M. L.

Juarascio

A. S.

Zhang

Foster

G. D.

(2019). Randomized controlled trial of OnTrack, a just-in-time adaptive intervention designed to enhance weight loss. Translational Behavioral Medicine, 9(6), 989–1001. https://doi.org/10.1093/tbm/ibz137

33.

Forrest

L. N.

Ivezaj

Grilo

C. M.

(2021). Machine learning v. traditional regression models predicting treatment outcomes for binge-eating disorder from a randomized controlled trial. Psychological Medicine, 53(7), 2777–2788. https://doi.org/10.1017/S0033291721004748

34.

Ghosh

Burger

Simeunovic-Ostojic

Maas

Petković

(2024). Review of machine learning solutions for eating disorders. International Journal of Medical Informatics, 189, Article 105526. https://doi.org/10.1016/j.ijmedinf.2024.105526

35.

Goyal

R. K.

Kalaria

S. N.

McElroy

S. L.

Gopalakrishnan

(2022). An exploratory machine learning approach to identify placebo responders in pharmacological binge eating disorder trials. Clinical and Translational Science, 15(12), 2878–2887. https://doi.org/10.1111/cts.13406

36.

Guo

Wei

Keating

B. J.

Hakonarson

(2016). Machine learning derived risk prediction of anorexia nervosa. BMC Medical Genomics, 9, Article 4. https://doi.org/10.1186/s12920-016-0165-x

37.

Hamilton

Mitchison

Basten

Byrne

Goldstein

Hay

Heruc

Thornton

Touyz

(2021). Understanding treatment delay: Perceived barriers preventing treatment-seeking for eating disorders. Australian & New Zealand Journal of Psychiatry, 56(3), 248–259. https://doi.org/10.1177/00048674211020102

38.

Han

Zhang

(2021). Three essays in health economics (Publication No. 28492034) [Doctoral dissertation, Clark University]. ProQuest Dissertations & Theses Global.

39.

Hannöver

Richard

Hansen

N. B.

Martinovich

Kordy

(2002). A classification tree model for decision-making in clinical practice: An application based on the data of the German multicenter study on eating disorders, project TR-EAT. Psychotherapy Research, 12(4), 445–461. https://doi.org/10.1093/ptr/12.4.445

40.

Haynos

A. F.

Wang

S. B.

Lipson

Peterson

C. B.

Mitchell

J. E.

Halmi

K. A.

Agras

W. S.

Crow

S. J.

(2021). Machine learning enhances prediction of illness course: A longitudinal study in eating disorders. Psychological Medicine, 51(8), 1392–1402. https://doi.org/10.1017/S0033291720000227

41.

Hooker

(2021). Moving beyond “algorithmic bias is a data problem”. Patterns, 2(4), Article 100241. https://doi.org/10.1016/j.patter.2021.100241

42.

Iniesta

Stahl

McGuffin

(2016). Machine learning, statistical learning and the future of biological research in psychiatry. Psychological Medicine, 46(12), 2455–2465. https://doi.org/10.1017/S0033291716001367

43.

Ioannidis

Serfontein

Deakin

Bruneau

Ciobanca

Holt

Snelson

Stochl

(2020). Early warning systems in inpatient anorexia nervosa: A validation of the MARSIPAN-based modified early warning system. European Eating Disorders Review, 28(5), 551–558. https://doi.org/10.1002/erv.2753

44.

Jordan

M. I.

Mitchell

T. M.

(2015). Machine learning: Trends, perspectives, and prospects. Science, 349(6245), 255–260. https://doi.org/10.1126/science.aaa8415

45.

Juarascio

A. S.

Parker

M. N.

Lagacey

M. A.

Godfrey

K. M.

(2018). Just-in-time adaptive interventions: A novel approach for enhancing skill utilization and acquisition in cognitive behavioral therapy for eating disorders. International Journal of Eating Disorders, 51(8), 826–830. https://doi.org/10.1002/eat.22924

46.

Karavia

Papaioannou

Michopoulos

Papageorgiou

P. C.

Papaioannou

Gonidakis

Papageorgiou

C. C.

(2024). Using electroencephalogram-extracted nonlinear complexity and wavelet-extracted power rhythm features during the performance of demanding cognitive tasks (Aristotle’s syllogisms) in optimally classifying patients with anorexia nervosa. Brain Sciences, 14(3), Article 251. https://doi.org/10.3390/brainsci14030251

47.

Kessler

R. C.

van Loo

H. M.

Wardenaar

K. J.

Bossarte

R. M.

Brenner

L. A.

Cai

Ebert

D. D.

Hwang

de Jonge

(2016). Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports. Molecular Psychiatry, 21(10), 1366–1371. https://doi.org/10.1038/mp.2015.198

48.

Kheirollahpour

M. M.

Danaee

M. M.

Merican

A. F. A. F.

Shariff

A. A. A. A

. (2020). Prediction of the influential factors on eating behaviors: A hybrid model of structural equation modelling-artificial neural networks. The Scientific World Journal, 2020, Article 4194293. https://doi.org/10.1155/2020/4194293

49.

Klump

K. L.

Bulik

C. M.

Kaye

W. H.

Treasure

Tyson

(2009). Academy for eating disorders position paper: Eating disorders are serious mental illnesses. International Journal of Eating Disorders, 42(2), 97–103. https://doi.org/10.1002/eat.20589

50.

Koutsouleris

Dwyer

D. B.

Degenhardt

Maj

Urquijo-Castro

M. F.

Sanfelici

Popovic

Oeztuerk

Haas

S. S.

Weiske

(2021). Multimodal machine learning workflows for prediction of psychosis in patients with clinical high-risk syndromes and recent-onset depression. JAMA Psychiatry, 78(2), 195–209. https://doi.org/10.1001/jamapsychiatry.2020.3604

51.

Koutsouleris

Kahn

R. S.

Chekroud

A. M.

Leucht

Falkai

Wobrock

Derks

E. M.

Fleischhacker

W. W.

Hasan

(2016). Multisite prediction of 4-week and 52-week treatment outcomes in patients with first-episode psychosis: A machine learning approach. The Lancet Psychiatry, 3(10), 935–946. https://doi.org/10.1016/S2215-0366(16)30171-7

52.

Krug

Linardon

Greenwood

Youssef

Treasure

Fernandez-Aranda

Karwautz

Wagner

Collier

Anderluh

Tchanturia

Ricca

Sorbi

Nacmias

Bellodi

Fuller-Tyszkiewicz

(2021). A proof-of-concept study applying machine learning methods to putative risk factors for eating disorders: Results from the multi-centre European project on healthy eating. Psychological Medicine, 53(7), 2913–2922. https://doi.org/10.1017/S003329172100489X

53.

Kung

Chiang

Perera

Pritchard

Stewart

(2022). Unsupervised machine learning to identify depressive subtypes. Healthcare Informatics Research, 28(3), 256–266. https://doi.org/10.4258/hir.2022.28.3.256

54.

Lavagnino

Amianto

Mwangi

D’Agata

Spalatro

Zunta-Soares

G. B.

Abbate Daga

Mortara

Fassino

Soares

J. C.

(2015). Identifying neuroanatomical signatures of anorexia nervosa: A multivariate machine learning approach. Psychological Medicine, 45(13), 2805–2812. https://doi.org/10.1017/S0033291715000768

55.

Lavagnino

Mwangi

Cao

Shott

M. E.

Soares

J. C.

Frank

G. K. W.

(2018). Cortical thickness patterns as state biomarker of anorexia nervosa. International Journal of Eating Disorders, 51(3), 241–249. https://doi.org/10.1002/eat.22828

56.

Lee

E. E.

Torous

De Choudhury

Depp

C. A.

Graham

S. A.

Kim

H. C.

Paulus

M. P.

Krystal

J. H.

Jeste

D. V.

(2021). Artificial intelligence for mental health care: Clinical applications, barriers, facilitators, and artificial wisdom. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 6(9), 856–864. https://doi.org/10.1016/j.bpsc.2021.02.001

57.

Lekkas

Gyorda

J. A.

Jacobson

N. C.

(2023). A machine learning investigation into the temporal dynamics of physical activity-mediated emotional regulation in adolescents with anorexia nervosa and healthy controls. European Eating Disorders Review, 31(1), 147–165. https://doi.org/10.1002/erv.2949

58.

Lenhard

Sauer

Andersson

Månsson

K. N.

Mataix-Cols

Rück

Serlachius

(2018). Prediction of outcome in internet-delivered cognitive behaviour therapy for paediatric obsessive-compulsive disorder: A machine learning approach. International Journal of Methods in Psychiatric Research, 27(1), Article e1576. https://doi.org/10.1002/mpr.1576

59.

Levinson

C. A.

Trombley

C. M.

Brosof

L. C.

Williams

B. M.

Hunt

R. A.

(2022). Binge eating, purging, and restriction symptoms: Increasing accuracy of prediction using machine learning. Behavior Therapy, 54(2), 247–259. https://doi.org/10.1016/j.beth.2022.08.006

60.

Liang

Frederick

D. A.

Lledo

E. E.

Rosenfield

Berardi

Linstead

Maoz

(2022). Examining the utility of nonlinear machine learning approaches versus linear regression for predicting body image outcomes: The U.S. Body Project I. Body Image, 41, 32–45. https://doi.org/10.1016/j.bodyim.2022.01.013

61.

Linardon

de la Piedad Garcia

Brennan

(2017). Predictors, moderators, and mediators of treatment outcome following manualised cognitive-behavioural therapy for eating disorders: A systematic review. European Eating Disorders Review, 25(1), 3–12. https://doi.org/10.1002/erv.2492

62.

Linardon

Fuller-Tyszkiewicz

Shatte

Greenwood

C. J.

(2022). An exploratory application of machine learning methods to optimize prediction of responsiveness to digital interventions for eating disorder symptoms. International Journal of Eating Disorders, 55(6), 845–850. https://doi.org/10.1002/eat.23733

63.

Linardon

Messer

Helms

E. R.

McLean

Incerti

Fuller-Tyszkiewicz

(2020). Interactions between different eating patterns on recurrent binge-eating behavior: A machine learning approach. International Journal of Eating Disorders, 53(4), 533–540. https://doi.org/10.1002/eat.23232

64.

Linardon

Wade

T. D.

(2018). How many individuals achieve symptom abstinence following psychological treatments for bulimia nervosa? A meta-analytic review. International Journal of Eating Disorders, 51(4), 287–294. https://doi.org/10.1002/eat.22838

65.

Liu

H. Y. S.

Chung

Eizenman

Ieee Comp

S. O. C.

(2021). A general end-to-end method for characterizing neuropsychiatric disorders using free-viewing visual scanning tasks. https://ieeexplore.ieee.org/document/9412857/

66.

McClure

Fuller-Tyszkiewicz

Messer

Linardon

(2024). Predictors, mediators, and moderators of response to digital interventions for eating disorders: A systematic review. International Journal of Eating Disorders, 57(5), 1034–1048. https://doi.org/10.1002/eat.24078

67.

Mehl

Rohde

Gau

J. M.

Stice

(2019). Disaggregating the predictive effects of impaired psychosocial functioning on future DSM-5 eating disorder onset in high-risk female adolescents. International Journal of Eating Disorders, 52(7), 817–824. https://doi.org/10.1002/eat.23082

68.

Merhbene

Puttick

Kurpicz-Briki

(2024). Investigating machine learning and natural language processing techniques applied for detecting eating disorders: A systematic literature review. Frontiers in Psychiatry, 15, Article 1319522. https://doi.org/10.3389/fpsyt.2024.1319522

69.

Miotto

Wang

Jiang

Dudley

J. T.

(2018). Deep learning for healthcare: Review, opportunities and challenges. Briefings in Bioinformatics, 19(6), 1236–1246. https://doi.org/10.1093/bib/bbx044

70.

Mitchison

Wang

S. B.

Wade

Haynos

A. F.

Bussey

Trompeter

Lonergan

Tame

Hay

(2023). Development of transdiagnostic clinical risk prediction models for 12-month onset and course of eating disorders among adolescents in the community. International Journal of Eating Disorders, 56(7), 1406–1416.

71.

Morgenstern

J. D.

Buajitti

O’Neill

Piggott

Goel

Fridman

Kornas

Rosella

L. C.

(2020). Predicting population health with machine learning: A scoping review. BMJ Open, 10(10), Article e037860. https://doi.org/10.1136/bmjopen-2020-037860

72.

Nahum-Shani

Smith

S. N.

Spring

B. J.

Collins

L. M.

Witkiewitz

Tewari

Murphy

S. A.

(2018). Just-in-time adaptive interventions (JITAIs) in mobile health: Key components and design principles for ongoing health behavior support. Annals of Behavioral Medicine, 52(6), 446–462. https://doi.org/10.1007/s12160-016-9830-8

73.

Noguero

D. S.

Ramírez-Cifuentes

Ríssola

E. A.

Freire

(2023). Gender bias when using artificial intelligence to assess anorexia nervosa on social media: Data-driven study. Journal of Medical Internet Research, 25, Article e45184. https://doi.org/10.2196/45184

74.

Orru

Miniati

Conversano

Ciacchini

Palagini

Mauri

Gemignani

(2021). A machine learning analysis of psychopathological features of eating disorders: A retrospective study. Mediterranean Journal of Clinical Psychology, 9(1). https://doi.org/10.6092/2282-1619/mjcp-2670

75.

Pelin

Ising

Stein

Meinert

Meller

Brosch

Winter

N. R.

Krug

Leenings

Lemke

(2021). Identification of transdiagnostic psychiatric disorder subtypes using unsupervised learning. Neuropsychopharmacology, 46(11), 1895–1905. https://doi.org/10.1038/s41386-021-01051-0

76.

Peterson

Pearce

P. F.

Ferguson

L. A.

Langford

C. A.

(2017). Understanding scoping reviews: Definition, purpose, and process. Journal of the American Association of Nurse Practitioners, 29(1), 12–16. https://doi.org/10.1002/2327-6924.12380

77.

Poldrack

R. A.

Huckins

Varoquaux

(2020). Establishment of best practices for evidence for prediction: A review. JAMA Psychiatry, 77(5), 534–540. https://doi.org/doi:10.1001/jamapsychiatry.2019.3671

78.

Presseller

E. K.

Parker

M. N.

Zhang

Manasse

Juarascio

A. S.

(2024). Continuous glucose monitoring as an objective measure of meal consumption in individuals with binge-spectrum eating disorders: A proof-of-concept study. European Eating Disorders Review, 32(4), 828–837. https://doi.org/10.1002/erv.3094

79.

Raab

Baumgartl

Buettner

(2020). Machine learning based diagnosis of binge eating disorder using EEG recordings. PACIS 2020 Proceedings, 97. https://aisel.aisnet.org/pacis2020/97

80.

Ramirez-Cifuentes

Mayans

Freire

(2018). Early risk detection of anorexia on social media. In Bodrunova

(Ed.), Internet Science. INSCI 2018. Lecture Notes in Computer Science (Vol. 11193, pp. 3–14). Springer. https://doi.org/10.1007/978-3-030-01437-7_1

81.

Ren

Yang

Barnhart

W. R.

Zhou

(2022). Using machine learning to explore core risk factors associated with the risk of eating disorders among non-clinical young women in China: A decision-tree classification analysis. Journal of Eating Disorders, 10, Article 19. https://doi.org/10.1186/s40337-022-00545-6

82.

Riley

R. D.

Ensor

Snell

K. I.

Debray

T. P.

Altman

D. G.

Moons

K. G.

Collins

G. S.

(2016). External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: Opportunities and challenges. The BMJ, 353, Article i3140. https://doi.org/10.1136/bmj.i3140

83.

Sandoval-Araujo

L. E.

Cusack

C. E.

Ralph-Nearman

Glatt

Han

Bryan

Hooper

M. A.

Karem

Levinson

C. A.

(2024). Differentiation between atypical anorexia nervosa and anorexia nervosa using machine learning. International Journal of Eating Disorders, 57(4), 937–950. https://doi.org/10.1002/eat.24160

84.

Sarkar

Bhatia

(2021). Writing and appraising narrative reviews. Journal of Clinical and Scientific Research, 10(3), 169–172. https://doi.org/10.4103/jcsr.jcsr_1_21

85.

Shatte

A. B. R.

Hutchinson

D. M.

Teague

S. J.

(2019). Machine learning in mental health: A scoping review of methods and applications. Psychological Medicine, 49(9), 1426–1448. https://doi.org/10.1017/S0033291719000151

86.

Skåtun

K. C.

Kaufmann

Doan

N. T.

Alnæs

Córdova-Palomera

Jönsson

E. G.

Fatouros-Bergman

Flyckt

KaSP Melle

(2017). Consistent functional connectivity alterations in schizophrenia spectrum disorder: A multisite study. Schizophrenia Bulletin, 43(4), 914–924. https://doi.org/10.1093/schbul/sbw145

87.

Steyerberg

E. W.

Moons

K. G. M.

van der Windt

Hayden

J. A.

Perel

Schroter

Riley

R. D.

Hemingway

Altman

D. G.

, & for the PROGRESS Group. (2013). Prognosis Research Strategy (PROGRESS) 3: Prognostic model research. PLOS Medicine, 10(2), Article e1001381. https://doi.org/10.1371/journal.pmed.1001381

88.

Stice

Desjardins

C. D.

(2018). Interactions between risk factors in the prediction of onset of eating disorders: Exploratory hypothesis generating analyses. Behaviour Research and Therapy, 105, 52–62. https://doi.org/10.1016/j.brat.2018.03.005

89.

Stice

Marti

C. N.

Durant

(2011). Risk factors for onset of eating disorders: evidence of multiple risk pathways from an 8-year prospective study. Behaviour Research and Therapy, 49(10), 622–627. https://doi.org/10.1016/j.brat.2011.06.009

90.

Stice

Presnell

Spangler

(2002). Risk factors for binge eating onset in adolescent girls: A 2-year prospective investigation. Health Psychology, 21(2), 131–138. https://doi.org/10.1037/0278-6133.21.2.131

91.

Strigo

I. A.

Murray

S. B.

Simmons

A. N.

Bernard

R. S.

Huang

J. S.

Kaye

W. H.

(2017). The clinical application of fMRI data in a single-patient diagnostic conundrum: Classifying brain response to experimental pain to distinguish between gastrointestinal, depressive and eating disorder symptoms. Journal of Clinical Neuroscience, 45, 149–153. https://doi.org/10.1016/j.jocn.2017.07.023

92.

Svendsen

V. G.

Wijnen

B. F. M.

De Vos

J. A.

Veenstra

Evers

S. M. A. A.

Lokkerbol

(2023). A roadmap for applying machine learning when working with privacy-sensitive data: Predicting non-response to treatment for eating disorders. Expert Review of Pharmacoeconomics & Outcomes Research, 23(8), 933–949. https://doi.org/10.1080/14737167.2023.2230368

93.

Tazawa

Liang

K.-c.

Yoshimura

Kitazawa

Kaise

Takamiya

Kishi

Horigome

Mitsukura

Mimura

(2020). Evaluating depression with multimodal wristband-type wearable device: Screening and assessing patient severity utilizing machine-learning. Heliyon, 6(2), Article e03274. https://doi.org/10.1016/j.heliyon.2020.e03274

94.

Todisco

Meneguzzo

Garolla

Diomidous

Antoniades

Vogazianos

Tozzi

(2023). Understanding dropout and non-participation in follow-up evaluation for the benefit of patients and research: Evidence from a longitudinal observational study on patients with eating disorders. Eating Disorders 31(4), 337–352. https://doi.org/10.1080/10640266.2022.2135738

95.

Tricco

A. C.

Lillie

Zarin

O’Brien

K. K.

Colquhoun

Levac

Moher

Peters

M. D.

Horsley

Weeks

(2018). PRISMA extension for scoping reviews (PRISMA-ScR): Checklist and explanation. Annals of Internal Medicine, 169(7), 467–473. https://doi.org/10.7326/M18-0850

96.

Uban

A. S.

Chulvi

Rosso

(2021). An emotion and cognitive based analysis of mental health disorders from social media data. Future Generation Computer Systems- The International Journal of eScience, 124, 480–494. https://doi.org/10.1016/j.future.2021.05.032

97.

University of Technology Sydney. (n.d). The TrEAT Registry. https://www.uts.edu.au/about/faculty-health/faculty-health-research/treat-registry

98.

van Hoeken

Hoek

H. W

. (2020). Review of the burden of eating disorders: Mortality, disability, costs, quality of life, and family burden. Current Opinion in Psychiatry, 33(6), 521–527. https://doi.org/10.1097/YCO.0000000000000641

99.

von Brachel

Hötzel

Hirschfeld

Rieger

Schmidt

Kosfelder

Hechler

Schulte

Vocks

. (2014). Internet-based motivation program for women with eating disorders: Eating disorder pathology and depressive mood predict dropout. Journal of Medical Internet Research, 16(3), Article e92. https://doi.org/10.2196/jmir.3104

100.

Walsh

C. G.

Ribeiro

J. D.

Franklin

J. C.

(2017). Predicting risk of suicide attempts over time through machine learning. Clinical Psychological Science, 5(3), 457–469. https://doi.org/10.1177/2167702617691560

101.

Wang

S. B.

(2021). Machine learning to advance the prediction, prevention and treatment of eating disorders. European Eating Disorders Review, 29(5), 683–691. https://doi.org/10.1002/erv.2850

102.

Weygandt

Schaefer

Schienle

Haynes

J. D.

(2012). Diagnosing different binge-eating disorders based on reward-related brain activation patterns. Human Brain Mapping, 33(9), 2135–2146. https://doi.org/10.1002/hbm.21345

103.

Wonderlich

Mitchell

J. E.

Crosby

R. D.

Myers

T. C.

Kadlec

LaHaise

Swan-Kremeier

Dokken

Lange

Dinkel

(2012). Minimizing and treating chronicity in the eating disorders: A clinical overview. International Journal of Eating Disorders, 45(4), 467–475. https://doi.org/10.1002/eat.20978

104.

Yarkoni

Westfall

(2017). Choosing prediction over explanation in psychology: Lessons from machine learning. Perspectives on Psychological Science, 12(6), 1100–1122. https://doi.org/10.1177/1745691617693393

105.

Zhao

C. Y.

Zhang

R. S.

Liu

H. X.

Xue

C. X.

Zhao

S. G.

Zhou

X. F.

Liu

M. C.

Fan

B. T.

(2004). Diagnosing anorexia based on partial least squares, back propagation neural network, and support vector machines. Journal of Chemical Information and Computer Sciences, 44(6), 2040–2046. https://doi.org/10.1021/ci049877y

106.

Zheng

Wang

Liu

Zhang

(2023). Machine learning research based on diffusion tensor images to distinguish between anorexia nervosa and bulimia nervosa. Frontiers in Psychiatry, 14, Article 1326271. https://doi.org/10.3389/fpsyt.2023.1326271

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.12 MB