Abstract
This scoping review aims to summarize the existing literature on how machine learning can be used to impact systemic sclerosis diagnosis, management, and treatment. Following Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) reporting guidelines, Embase, Web of Science, Medline (PubMed), IEEE Xplore, and ACM Digital Library were searched from inception to 3 March 2024, for primary literature reporting on machine learning models in any capacity regarding scleroderma. Following robust triaging, 11 retrospective studies were included in this scoping review. Three studies focused on the diagnosis of scleroderma to influence preferred management and nine studies on treatment and predicting treatment response to scleroderma. Nine studies used supervision in their machine learning model training; two used supervised and unsupervised training and one used solely unsupervised training. A total of 817 patients were included in the data sets. Seven of the included articles used patients from the United States, one from Belgium, two from Japan, and two from China. Although currently limited to retrospective studies, the results indicate that machine learning modeling may have a role in early diagnosis, management, therapeutic decision-making, and in the development of future therapies for systemic sclerosis. Prospective studies examining the use of machine learning in clinical practice are recommended to confirm the utility of machine learning in patients with systemic sclerosis.
Background
Systemic sclerosis (SSc) is an autoimmune connective tissue condition resulting in the fibrosis of skin and internal organs including the gastrointestinal tract, heart, lungs, and kidneys. 1
The etiology of SSc is not known, with a complex triad of vasculopathy, immune dysregulation, and fibrosis, causing damage in skin and multiple organs. No curative treatments exist. 2 Therapies aim to relieve symptoms and slow disease progression in the affected organs. However, even recent advances such as immunomodulatory therapies are not curative, and treatment effects are limited once fibrosis is established. Hematopoietic stem cell transplantation (HSCT) has been used in severe cases. This treatment seeks to halt disease progression by transfusing undifferentiated stem cells allowing regeneration of the affected fibrotic tissue and improvement in functional outcomes. 3 While this treatment has proven to be effective in treating severe disease, it is not recommended for mild cases due to the invasive nature of the therapy and the risk of serious adverse events. 3
Artificial intelligence (AI) is defined as a machine-based system that aims to make predictions, recommendations, and decisions emblematic of the ability to learn (i.e. service knowledge from data). 4 In the field of medicine, it is expected that AI will work alongside electronic medical records (EMRs) to assist in predictions about health outcomes as the technology develops. AI technology “learns” by training with thorough data sets that contain a vast amount of patient information like vitals, genetics, and environment. 4 Machine learning (ML) is a subtype of AI that “learns” using prediction models and algorithms to analyze complex data patterns. In the healthcare field, this technology is most often seen in precision medicine, in which treatment decisions are based on predictions made by an ML model that has examined a variety of patient attributes and the context behind the treatment. 4
With the complexity of SSc, ML can be a useful approach in understanding the manifestations of the disease and its management. While tools like composite diagnostic criteria such as ACR/EULAR 2012 classification criteria have been used to confirm diagnosis and the modified Rodnan Skin Score (mRSS) and Composite Response Index in Systemic Sclerosis (CRISS) scores used to determine activity and extent of disease have been used historically to determine extent and activity of SSc, advances in ML tools have the potential to improve accuracy in diagnostics, prognostication, and analysis of treatment outcomes.5–8 Furthermore, ML could identify potential new therapies in this disease. The extent to which ML has been used in SSc diagnosis, management and outcomes research is not known. It is also unclear which areas of SSc research would benefit most from ML techniques. An improved understanding of which ML approaches have been used in SSc is needed to improve the understanding and management of this disease. The objective of the current review is to summarize the existing literature on how ML can be used to impact SSc management with applications such as in assessing treatment response, along with SSc diagnosis and severity in influencing preferred management.
Methods
The Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) checklist for scoping reviews was used to inform the reporting of this scoping review (Supplementary File 1). 9
Inclusion and exclusion criteria
English primary peer-reviewed literature of any type that applied ML models to SSc treatment was included. Articles were excluded if they did not involve SSc treatment, or if ML was not included in the study methods. Conference abstracts were excluded due to their brevity.
Information sources
Embase, Web of Science, Medline (PubMed), Scopus, IEEE Xplore, and ACM Digital Library were searched from inception to 3 March 2024. Forward and backward citation analysis of relevant records was performed using Web of Science to identify potentially eligible studies as well.
Search strategy
The search terms “scleroderma,” “systemic sclerosis,” and a variety of ML-related search terms were used. We did not use the search terms, “treatment,” or “therapeutic” as it would unnecessarily narrow our search (Table S1, Supplementary File 1). Duplicates were removed, and all types of primary literature aligning with the above inclusion and exclusion criteria were eligible in initial title and abstract screening. Relevant articles moved on to full-text review, and citation chaining of included articles was performed. Articles were imported into the systematic review software, Covidence. 10
Two reviewers (R.S.G. and K.S.) completed the initial title and abstract screening and full-text screening independently and in duplicate. Any disagreements were settled through discussion with a third author (E.P.M.). Data were also extracted independently and in duplicate. Conflicts were settled through review by E.P.M.
Data charting and result reporting
Data were extracted from the included articles into a data extraction table on an Excel spreadsheet, piloted with the first 10 studies to ensure that the categories matched the goals of extraction. The extraction table was created by E.P.M. and reviewed by R.S.G. The following study-related variables were collected: author(s), year of publication, country, study aim, type of ML model used, specific ML model used, reference test used, population (n), treatment used, conclusions, and reported study outcomes.
Results
The initial electronic search returned 935 articles. After duplicates were removed, 635 articles advanced to title and abstract screening. 621 were excluded, leaving 14 to move on to full-text review. Reasons for exclusion at the full-text stage are reported (Figure 1). Overall, a total of 11 articles fulfilled the search criteria and were included in our scoping review (Figure 1). The most frequently used ML models were GLMnet, random forest (RF), and support vector machine (SVM) (Table 2).

Flow diagram of literature screening according to the Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) guidelines.
ML applications on SSc diagnosis and severity to influence preferred management
Four studies focused on attempting to classify and diagnose SSc (Table 1).5,11–13 Between the studies, the accuracy in comparison to conventional assessment ranged from 85.4% to 87.7% in validation image sets.11,12 In one study, ML was used to assign histological images of SSc to subsets of disease and correlate them to candidate genetic biomarkers to stratify the severity of disease. 5 They showed that CD34 and aSMA may be relevant biomarkers in predicting both disease severity and improvement over time. 5 One of these studies used a similar method with RF and GLMnet to achieve the same goal of identifying SSc interstitial lung disease progression from a set of high-resolution computed tomography (HR-CT) images. 13
Summary of articles reporting on ML applications of scleroderma diagnosis and severity to influence preferred management.
RCT: randomized controlled trial; CNN: convolutional neural network; SSc: systemic sclerosis; N/A: not applicable; mRSS: modified Rodnan skin score; SVM: support vector machine; RF: random forest; SCOT: scleroderma cyclophosphamide or transplantation; HSCT: hematopoietic stem cell transplant; AI: artificial intelligence; CYC: cyclophosphamide; ILD: interstitial lung disease; MMF: mycophenolate mofetil; DML: debiased machine learning; PFT: pulmonary function tests; SSc-PAH: systemic sclerosis–assisted pulmonary arterial hypertension; XGBOOST: extreme gradient boosting algorithm.
ML applications on SSc treatment
Seven studies reported ML methodology pertaining to the treatment of SSc (Table 2).3,14–19 Although the measured outcomes were different for each of the studies, there was significant overlap in the types of treatments that were examined. The main therapies among the included articles were immunosuppressants such as rituximab and mycophenolate, HSCT for more severe cases, and vasodilators (Table 2).
Summary of findings reporting on ML applications on scleroderma treatment and treatment response.
RCT: randomized controlled trial; CNN: convolutional neural network; SSc: systemic sclerosis; N/A: not applicable; mRSS: modified Rodnan skin score; SVM: support vector machine; RF: random forest; SCOT: scleroderma cyclophosphamide or transplantation; HSCT: hematopoietic stem cell transplant; AI: artificial intelligence; CYC: cyclophosphamide; ILD: interstitial lung disease; MMF: mycophenolate mofetil; DML: debiased machine learning; PFT: pulmonary function tests; SSc-PAH: systemic sclerosis–assisted pulmonary arterial hypertension; XGBOOST: extreme gradient boosting algorithm.
A total of five articles directly studied the use of immunosuppressive and immunomodulatory medication in the treatment of SSc. 3 ,12–15 Due to the high level of variability in the performance metrics of the treatment response studied, it is difficult to draw direct comparisons between the performance of the ML models used in each study. Regardless, the results of immunosuppressive therapies in improving SSc demonstrated effectiveness in improving disease outcomes. One study indicated that patients with high B-cell counts respond most positively to rituximab therapy, with those with low baseline B-cell counts not responding. 14 This efficacy of B-cell depletion therapy was replicated in another study of SSc-induced pulmonary arterial hypertension. 19 A significant improvement in 6-min walk distance after 24 weeks of 25.5 ± 8.8 m for the rituximab group and only 0.4 ± 7.4 m for the placebo group was reported. 19
One study examined the use of vasodilators in the treatment of SSc pulmonary hypertension and through the use of a convolutional neural network (CNN) found that calculated abnormal lung volume was superior to forced vital capacity for predicting response to vasodilators (area-under-the-curve (AUC) = 0.74). 16
Two recent studies from 2023, examined SSc treatment, through the lens of genetic analysis.17,18 Examining the compound–gene relationship in subtypes of SSc, eight feature genes along with associated potential treatments (methotrexate, resveratrol, paclitaxel, trichloroethylene, formaldehyde, silicon dioxide, benzene, and tetrachloroethylene) were identified. 17 Another study identified hub genes, and other diagnostic markers. Seven hub genes that may play a role in SSc pathways and development were identified including THY1 and SULF1 as prognostic markers. 18 Ten small-molecule drugs with potential therapeutic effects were also identified, mainly including phosphodiesterase inhibitors (BRL-50481, dipyridamole) and transforming growth factor beta (TGF-β) receptor inhibitors (SB-525334). 18
Accuracy of included ML models
Specific data points surrounding the effectiveness of each ML methodology in the included studies were sparse. One study provided an AUC value for their ML methodologies, finding, and AUCs = 0.74. 16 Extreme gradient boosting algorithm (XGBOOST) was the most reliable methodology compared to the three others (RFs, SVM polynomial, and SVM radial) trained, validated, and tested. 19
Discussion
The results of this review demonstrate the various positive applications of ML in clinical practice relating to SSc. ML tools have the potential to be implemented alongside existing tools such as the composite measures to diagnose patients with SSc. Improving the speed and accuracy of diagnoses may improve patient prognosis. Furthermore, since SSc has variable systemic manifestations, early detection will prompt improved monitoring for critical organ damage and inform earlier treatment decisions. All of the studies in this review were able to utilize ML in promising ways. While most of the models assisted in the measurement of treatment outcomes, others achieved a high level of accuracy in identifying SSc subtypes. ML modeling may have a role in the identification and development of future therapies for SSc and may be an effective tool for clinicians to predict treatment outcomes.
There was significant variation in the chosen performance metrics among the included articles, and for this reason, they could not be analyzed. Due to the limited body of research, more studies are needed to determine which models are best for unique purposes. Only one of the included studies directly compared multiple ML models in carrying out their research. On comparing four different ML models against each other, XGBOOST outperformed the other models by a significant margin. 19
While the results of this review are promising, the review is not without limitations. Including only peer-reviewed, articles written in English, likely led to an overrepresentation of research from English-speaking countries. Moreover, it is likely that many of the studied populations were lacking in ethnic diversity. While most of the included articles opted not to report statistics on the ethnicities of the participants, white individuals are highly overrepresented in those that did. For example, in one study, white individuals comprised 76.7% of the study participants. 3 There is some evidence to suggest that individuals of different ethnic backgrounds experience autoimmune diseases, particularly SSc, in varying degrees of severity. 20 Thus, a study design that examines a more diverse population would likely provide more comprehensive results.
There was a high degree of variability between the objectives and outcomes of each included study. This could be attributed to the novelty of the field, with the majority of research on this topic published in the last few years. As more studies emerge, the heterogeneity will likely decrease.
Finally, none of the reviewed ML models were implemented prospectively in clinical practice. This is a major limitation in assessing the utility of ML in diagnosis, prognosis, and optimal management of SSc. However, the majority of these studies reported their goal of implementing ML models to improve clinical practice in the future.
Conclusion
Research examining the applications of ML in SSc is rapidly developing and has received significant attention in recent years. This scoping review shows that in supervised settings, several ML models have demonstrated a high level of accuracy in identifying SSc at early stages, and in classifying severity and measuring disease progression over time. Response to treatment and identification of molecules that may improve disease outcome have also been shown to be reasonable outcomes from these studies; however, these studies are retrospective in design. As the literature grows, ML can gradually be implemented into clinical practice for the analysis of empirical data to augment clinicians’ ability to diagnose, assess, manage, and predict treatment response. Future research should focus on investigating the utility of ML modeling compared to traditional methods in managing SSc cases. Despite the promising results, the lack of homogeneity among existing studies creates difficulties in drawing informed conclusions about the current applications. Further research with larger data sets comparing a variety of ML models would provide a greater body of evidence to draw from and is recommended for a clearer assessment.
Supplemental Material
sj-pdf-1-jso-10.1177_23971983241253718 – Supplemental material for Can machine learning assist in systemic sclerosis diagnosis and management? A scoping review
Supplemental material, sj-pdf-1-jso-10.1177_23971983241253718 for Can machine learning assist in systemic sclerosis diagnosis and management? A scoping review by Eric P McMullen, Rajan S Grewal, Kyle Storm, Lawrence Mbuagbaw, Maxine R Maretzki and Maggie J Larché in Journal of Scleroderma and Related Disorders
Footnotes
Author contributions
All authors contributed to manuscript writing. E.P.M. was involved in project ideation and writing the manuscript first draft. L.M. and M.J.L. supervised the project.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Ethics approval and consent to participate
Not applicable.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
