Abstract
Background:
Persons with multiple sclerosis (pwMS) might be particularly well suited to benefit from digital health applications because they are, on average, younger and less severely disabled than patients with many other chronic diseases. Many digital health applications for pwMS have been developed.
Objectives:
Analysis of the evidence of digital health applications to improve health outcomes from a patient perspective.
Methods:
A systematic review was performed on all randomized controlled trials (RCTs) that have studied mobile health interventions for pwMS, that is, which can be applied with a smartphone, tablet, or laptop to improve patient-reported outcomes.
Results:
Of the 1127 articles identified in the literature search, 13 RCTs fit the inclusion criteria. Two trials studied messaging systems, two depression interventions, one addressed MS fatigue, five cognition, and three mobility issues, of which two focused on spasticity management. One trial aimed to enhance physical activity. Most were pilot studies that cannot yield definitive conclusions regarding efficacy. One depression intervention and one fatigue intervention showed significant results across several outcomes.
Conclusion:
Several mobile self-guided digital health applications for pwMS have been tested in RCTs, and two interventions targeting depression and fatigue have demonstrated significant effects. Challenges remain regarding implementation into routine care.
Introduction
Digitalization is among the greatest contemporary societal challenges and has a profound impact on medicine and healthcare delivery. The term eHealth is often invoked in this context, referring to the use of information and communication technologies to support health and healthcare services. The term mHealth (mobile health) refers to a specific delivery mode, such as using short messaging services (SMS) and smartphone applications (apps) to convey health information or interventions to patients. mHealth interventions, delivered via the Internet, might be particularly relevant for persons with multiple sclerosis (pwMS), for several reasons.1–3 Young age, complex presentation, and evolution of MS, and numerous decisions to be made needing decision support tools are major arguments to develop and deliver mHealth tools.
Despite the opportunities of easy access and scalability, mHealth applications also have limitations. For example, older age and low education can function as barriers to the uptake of mHealth applications. A wealth of digital information of uncertain quality can also be experienced as overwhelming, trigger fears, 4 or lead to unrealistic, inflated hopes, incorrect self-diagnoses, and inappropriate therapies. In addition, the attitudes of physicians and other care providers play a key role in the use of mHealth technologies, such that patients may be more likely to engage with mHealth tools if these are endorsed by their clinicians.5,6 The recently established Digital Delivery of Care Act (DVG) in Germany 7 (https://www.bgbl.de) provides criteria to evaluate digital health applications in terms of their efficacy, as shown in methodologically sound trials, and their data security. Once approved and listed in the digital health application directory, these mHealth tools are eligible for reimbursement by statutory health insurance companies in Germany, which opens a pathway for the implementation of digital health applications in the German healthcare system.
Although several reviews have been performed on digital health interventions in MS,3,8 no systematic review has focused specifically on self-guided mHealth interventions in MS. Following the idea that digital interventions that do not require clinician support could be disseminated widely and cost-effectively, this review includes randomized controlled trials (RCTs) of self-guided digital interventions that target any stage of MS and that do not require personal support beyond that which may be offered in the context of usual treatment. Only studies with patient-reported outcomes were included to estimate the extent to which patients experience benefit from the interventions.
Methods
Inclusion criteria
We included all RCTs as pilot studies or fully powered trials in which pwMS were the target population or were investigated as a defined subgroup. The intervention needed to be a program (application, app) to support the self-management of the target group, which can be used on common mobile (location-independent) devices (smartphone, tablet, and laptop). “Self-management” includes all interventions implemented by the patient himself or herself to support coping with the disease and its consequences. Endpoints of interest are patient-relevant outcomes, such as morbidity parameters, quality of life, or adverse effects. Studies only examining performance-based tests, such as cognitive tests, mobility tests, or biomarkers, such as magnetic resonance imaging (MRI) or blood markers, were not considered. At least one patient-based outcome as a secondary or tertiary outcome had to have been reported to qualify for inclusion in this systematic review. Programs that require additional equipment beyond a smartphone or tablet for their implementation were excluded (example: a wheelchair that can be controlled via a smartphone or virtual reality training is not subject of the evaluation). Corresponding to the guidelines for use of the Digital Delivery of Care Act, 9 only interventions that do not require additional support from healthcare providers or caregivers were examined. The mHealth applications were used in addition to standard care. Comparison interventions were (1) usual care (treatment as usual, TAU) or waitlist or (2) use of active controls, that is, electronic or paper-based applications that do not contain the component presumed to be effective and are ideally indistinguishable from the intervention under investigation.
Database search
A systematic literature search for studies (Figure 1), initial search on May 5, 2020 and updated on May 11, 2022, was performed in the MEDLINE, Embase, and the Cochrane Central Register of Controlled Trials databases (see search strategy in Supplemental Appendix 1). In parallel, a search for relevant systematic reviews was conducted in MEDLINE, Embase, Cochrane Database of Systematic Reviews, and HTA Database. The selection of relevant studies was performed independently by two persons (C.H. and D.L.). Discrepancies were resolved by discussion between the two. Data extraction was performed into standardized tables.

Flowchart of included studies.
Risk of bias assessment
We applied Risk of bias (ROB) 2 from the Cochrane Collaboration to assess the qualitative certainty of results. 10 Cross-endpoint and endpoint-specific criteria of bias potential were assessed. The endpoint-specific assessment focused on patient-reported outcomes as a group. The bias potential was classified as “low,” “some concerns,” or “high” in each case.
Estimates of efficacy
If not given in the publications, clinical relevance of reported statistically significant between-group differences was determined by calculating standardized mean differences (SDM, Cohen’s d) to estimate clinical relevance. 11
Funding source
This work is an update of a health technology assessment under the auspices of the Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen (IQWIG) in 2020 and 2021, published in 2022 on the website of the IQWIG. 12
Results
In total, 13 RCTs (see Table 1) could be identified. Two studies addressed messaging systems, two depression interventions, one study was performed to ameliorate MS fatigue, five studies worked on interventions to improve cognition, and three studies addressed mobility issues, of which were two spasticity management interventions and one aimed to enhance physical activity.
mHealth intervention randomised-controlled trials in MS—overview of study design and findings.
key significant outcome with data and effect size estimate.
ROB ratings
Most studies were rated as having two or more areas of concern (Table 2). Outcome assessment (D4) was a key domain because five studies only used Patient-Reported Outcome Measures (PROMS). Furthermore, only four studies13,19,20,22 provided active control interventions which were indistinguishable from the experimental intervention for pwMS. Therefore, participant blinding and blinded outcome assessment were not ensured in most studies (D2). Another area of concern is the lack of published protocols or analysis plans, so that selective reporting cannot be ruled out (D5). In addition, the randomization procedure was often insufficiently described (D1). Some studies also had high dropout rates (D3). In addition, treatment fidelity and adherence reflected in frequency and usage time were seldomly reported in detail (for overview of adherence definitions applied and results, see Supplemental Appendix 2).
ROB ratings of mHealth intervention RCTs focusing PROMs.
Low risk.
Some concerns.
High risk.
PROMs = Patient-reported outcome measures. This ROB assessment focuses study evidence based on PROMs even if they had not been primary outcomes. Therefore, overall ROB is also driven by the quality of PROM assessment.
Messaging systems
Two studies investigated the effectiveness of messaging systems for self-organization.13,14 Miller et al. 13 compared an established messaging concept system with an enhanced version with self-management tools in one of the largest studies (n = 206) running over 12 months. Goodwin et al. 14 performed a small (n = 38) and short-term pilot crossover study with a similar intervention approach comparing the messaging system NeuroPage® with an enhanced version with individualized goals. Primary endpoints in the study by Miller were validated PROMS, such as the perceived sickness impact, functioning in three key disability domains (multiple sclerosis functional composite, MSFC), and self-efficacy. Goodwin et al. chose an everyday memory questionnaire as primary outcome with other PROMS as secondary endpoints. Both studies did not show an effect on their primary endpoints. However, Goodwin reported a significant effect on the General Health Questionnaire with its clinical relevance remaining uncertain (Cohen’s d = 0.84, CI: 0.03 to 1.10). In addition, some evidence can be derived for a relevant greater benefit of the NeuroPage intervention compared to the active control intervention with regard to the endpoint “forgotten target behaviors.”
Depression management
A study with 90 participants 15 by Fischer et al. and a pilot study by Cooper et al. 16 with 24 participants investigated the effectiveness of online programs based on cognitive behavioral therapy (CBT; Fischer: deprexis®, 9 weeks; Cooper: Beating the Blues®, 8 weeks) for the treatment of depressive symptoms in comparison to a waitlist control group. Neither program was specifically designed for patients with MS. While any self-reported depressive symptoms were sufficient for inclusion in the Fischer study, the Cooper study required a Beck Depression Inventory (BDI) score of at least 14 points (at least mild depressive symptoms) for inclusion, patients with BDI scores above 28 were excluded. Results were collected at 9 weeks (post-treatment, primary endpoint) and 6 months after the end of the intervention (follow-up assessment) in the study by Fischer and at 8 weeks (post-treatment) and 21 weeks (follow-up; 13 weeks after the end of the intervention) in the study by Cooper. In the later, the primary goal was to analyze recruitment, and the clinical endpoint was self-reported depression measured by the BDI. Clinical endpoints in this study were measured as a basis for a valid sample size calculation for a larger trial. The study had a dropout rate of 25% (3 out of 12 participants at 8 weeks) in the intervention group and none in the control group. The study by Fischer measured the severity of depression symptoms by BDI-2 as the primary endpoint to evaluate intervention effectiveness. Here, the dropout rate was 22% at 9 weeks in the intervention group and 20% in the control group.
The pilot study by Cooper et al. 16 was able to show the feasibility of the intervention. Regarding the endpoint depression, the study demonstrated numerically favorable results at the end of intervention and at follow-up for the intervention group. Given the small sample size of n = 12 per group, no conclusion on efficacy is possible. Fischer et al. 15 reported a statistically significant effect in favor of the intervention group compared to the waitlist control group in the German study. 15 The effect size (Cohen’s d = 0.53, CI: 0.11 to 0.95) indicates at least a small and, based on the effect size, medium and clinically relevant effect in favor of deprexis. Regarding the other endpoints considered in the study by Fischer et al., statistically significant differences between the study groups could be shown in two secondary outcomes (psychological well-being and motor fatigue). In terms of potential adverse effects, no evidence of new occurrences of suicidal ideation was observed in either group in the German study. In the British study, no results were reported for the protocol-reported endpoints of anxiety and generic quality of life.
Fatigue management
In an RCT with 275 participants, a 12-week digital health application based on CBT techniques (elevida) was tested against a waitlist control group to reduce fatigue. 17 The key inclusion criterion was a total fatigue score of more than 43 points on the Fatigue Scale for Motor and Cognitive Functioning (FSMC). 26 Dropout rates after 12 and 24 weeks were 22% and 30% in the intervention group versus 11% and 15% in the control group, respectively. Significant intervention effects on the primary outcome (Chalder Fatigue Scale) 27 and for the endpoint fatigue, measured with the FSMC scale, were observed at post-intervention (12 weeks) and follow-up (24 weeks). The point estimate for the effect size (Cohen’s d) after 12 weeks (primary endpoint) was 0.53 (CI: 0.22 to 0.84). The confidence interval indicates at least a small clinically relevant effect; in fact, the point estimate suggests that the effect is medium in magnitude. Statistically significant effects in favor of the intervention group were also found with regard to the endpoints reduction of anxiety, increase in activity level, and in three out of six subscales of the applied quality of life scale.
Cognition management
A total of five studies investigated the effects of interventions to support cognitive functioning. All studies, except the Charvet et al.’s 19 pilot study, recruited MS patients with documented cognitive deficits in at least one dimension. All studies excluded patients with pre-existing psychiatric and dementia conditions. An Italian study with 35 participants 18 compared the effects of an 8-week computer-based training program (Dr Kawashima’s Brain Training®) against a waitlist control group. They addressed three cognitive tests, fatigue, and quality of life, as endpoints, without a defined primary endpoint. De Giglio et al. 18 found significant but possibly not relevant differences in favor of the intervention group in one (Stroop test) of the three functional tests for the endpoint cognitive function. Also significant but possibly not certainly, relevant results were found for a subscale of MS-specific quality of life. A pilot study with 20 participants and a confirmatory study with 135 participants, both published by Charvet et al.,19,20 investigated the effects of adaptive cognitive reeducation programs over 12 weeks. In the confirmatory study, the training time per day was doubled to 60 minutes each on 5 days/week. In both studies, the control groups trained for the same amount of time with non-adaptive computer games as an active control group. In the study by Charvet et al., 19 feasibility was addressed as the primary outcome, measured as compliance defined as 50% or more use of the intervention per patient. In the full study, 20 a composite cognitive outcome parameter of six cognitive tests was used as the primary outcome. This battery was also applied in the pilot but has not been validated prior to the studies. In addition, both studies used simple, non-validated three- and two-item self-assessment and peer-assessment scales. Charvet et al.19,20 were able to demonstrate the feasibility of the intervention in their pilot study (primary endpoint), which was confirmed in the confirmatory study. Both studies showed statistically significant changes in the self-developed composite cognitive outcome measure. However, while in the pilot study, the composite cognitive outcome indicates a clinically meaningful change the magnitude of the effect could not be confirmed in the larger confirmatory study.
In another study by Campbell et al. 21 with 38 participants, the intervention group received a 6-week computer-based rehabilitation program (RehaCom®), while the control group watched natural history DVDs for the same amount of time. In this study, outcomes were measured after a follow-up at 6 weeks after completion of the intervention and at 12 weeks. Authors defined BICAMS (Brief International Cognitive Assessment in Multiple Sclerosis, composite score of cognitive function tests), and two quality-of-life measures as primary outcomes in the protocol. In the publication, only the BICAMS is mentioned as a primary outcome. In addition, they assessed perceived cognitive function and self-efficacy. Only in one of the three cognitive tests of the BICAMS, authors report a significant result in favor of the intervention group after 6 weeks. Data for calculating an effect size are not available. The two other cognitive measurement parameters, and all results after 12 weeks, showed no significant differences between the study groups. No significant effects could be obtained for perceived cognitive functioning self-efficacy and patient empowerment.
Bove et al. 22 investigated two different computer games one (intervention group) with combined sensory and motor tasks adapting to the individual performance level for 6 weeks. Primary outcome was change in information processing speed as measured by the symbol digit modalities test but also other BICAMS measures were assessed and patient-perceived deficits, depression, stress, and fatigue. The study did not find a differential effect of the adaptive intervention while both of the gaming groups improved. Effects persisted to some extent to Month 3.
In summary, no conclusive evidence is derived for a benefit of cognitive training programs compared to active control interventions or care as usual (CAU) with regard to cognitive functions. However, four of the five studies were just pilot studies.
Mobility
Three studies investigated the effects of mobility performance support programs. A pilot study by Ehling et al. 23 with 20 participants evaluated the effects of an app-controlled therapy program as a follow-up to inpatient rehabilitation treatment in MS patients with proven moderate spasticity of the lower extremities. In 2022, Ehling et al.23,24 published a full RCT with the same approach but only selected patients had gained benefits during the rehabilitation stay. The intervention included individually composed exercises for mobility, strength, and coordination of the lower extremities and aimed at reducing spasticity. Outcome measurement was obtained 3 months after the start of the intervention in both studies. In the 2017 study, Ehling et al. 23 did not define a primary outcome in their pilot study. In the full RCT, the Neurological Rating Scale for spasticity 21 was used as a primary outcome measure. A couple of other mobility measures and measures of fatigue and depression were applied. In the pilot study, Campbell et al. 21 could not demonstrate an effect of the Neurological-Rating-Scale (NRS) 25 or any other measure. The full RCT, however, showed a significant effect on the NRS, while all other measures did not show significant effects.
In the pilot study by Nasseri et al., 25 the effectiveness of a smartphone app including a self-programmed accelerometry (including feedback) and evidence-based patient information on physical activity was tested against a leaflet with general information on physical activity and health. Patients with primary or secondary progressive MS and an EDSS score < 6.5 were included. The outcome was measured after 3 months. An increase in step count of at least 20% was chosen as the primary outcome. In addition, mobility tests and measures of daily physical activity and activity in general were included. Nasseri et al. 25 could not demonstrate a 20% increase in step count or any change in other measures.
In summary, data indicate a possible effect of a spasticity app in ameliorating spasticity. No effect of an educative digital intervention combined with a smartphone accelerometry to increase step counts in progressive MS could be shown.
Discussion
Based on the high potential of digital health in managing MS, we performed a systematic review on self-guided digital interventions, that is, applications without additional human support studied in RCTs.
Reviews on digital health in MS have already been performed. Nasseri et al. 25 and Lavorgna et al. 8 summarized mHealth in MS, Fricke and Apolinario-Hagen 28 MS apps, and Yeroushalmi et al. 3 telemedicine in MS. These papers address digital apps in MS in a very general way and especially also in the context of rehabilitative interventions with additional interactive parts. The results point to their potential but conclude that most mHealth interventions are not yet implementable due to a lack of data on the benefit-risk profile. In telemedical and especially telerehabilitative studies, digital aids are combined with interactive components, which was not the subject of the current systematic review.
Based on our aim to focus on self-guided digital interventions, we identified 13 RCTs, 8 of which were feasibility or pilot RCTs. Statistically significant and clinically relevant benefits were observed in the area of neuropsychiatric functioning (i.e. depression and fatigue). The study by Fischer et al. 15 on depression examined the deprexis program and reached its primary endpoint with a likely clinically relevant effect size. A recent large multinational study of an MS-adapted version of deprexis, called amiria, funded by the National MS Society of the United States has shown efficacy among n = 279 pwMS with mild to moderate depression, with effect sizes on depression reduction of around d = 0.95. 29 In addition, a recent meta-analysis of 12 RCTs reported robust evidence for the efficacy of deprexis in mild to severe depression, with somatic comorbidity present in 2 of the 12 trials. 30 Another depression intervention, the Beating the Blues program, was examined in a small feasibility study by Cooper et al., 16 also suggesting a potential benefit.
Also, the fatigue trial by Pöttgen et al. 17 with the digital application elevida reached its primary endpoint, with a moderate clinical-relevant effect size. Like deprexis, the elevida intervention has been approved by the German Federal Institute for Drugs and Medical Devices and is, therefore, the first German “DIGA” for MS that is permanently included in the directory of reimbursable digital health applications. Given the largely disappointing efforts to treat MS fatigue with medication, CBT-based interventions and exercise currently seem to be the most effective strategies. Therefore, a further development for elevida with more focus on increasing physical activity and possibly including sensoring technology might be an approach to enhance efficacy. However, while sensoring and multimodal monitoring is strongly advocated in MS, 31 we are not aware that this approach yet has shown to lead to better health outcomes in MS.
Approaches to enhance cognitive functioning show feasibility and some studies show improved cognitive performance tests but the only larger RCT 20 did not reach is primary endpoint. As the active control group received an intervention similar to the key intervention, the barrier to show a differential effect was high in this study. This may apply also for other studies with active control groups as, for example, the studies on messaging systems. Especially the large RCT by Miller et al. 13 might have failed because of the similarity of the interventions. In addition, studies to improve cognitive functioning are hampered by the open question of clinical relevance of changes in cognitive tests, which is not clear for all applied tests. Inclusion of established perceived deficits questionnaire, although showing other limitations, should be more encouraged in these interventions.
Interestingly, little effort has been made to enhance general physical activity through self-guided mHealth interventions in MS. The only study in this area 25 could not show an increase in step counts. However, as the intervention in this study only included evidence-based patient information and did not provide a behavior change program, it might have been too ambitious to change walking habits in progressive pwMS just through providing a smartphone accelerometry.32,33 Much more comprehensive behavior change interventions should be studied here.
Most studies from this review had relevant methodological problems as in very few protocol data or detailed study registration data were available, making conclusions on selective reporting impossible. Randomization was often poorly described without documented concealed allocation. Although this was not the case for all trials examined here, a general recommendation for future research in this area is to improve the methodological rigor, in line with current standards, as delineated in the CONSORT E-Health Statement, among others. 34
Reported adherence rates in the 13 trials examined here differed. However, as some studies were advertised, recruited, and organized purely online without any personal contact, the question should be discussed what could be considered a minimum dosage for different interventions and target conditions. Methodological concepts are needed to account for the possible selection bias of higher dropouts, especially in studies conducted unguided and completely online. However, comparative data of patient characteristics in online studies compared with the general population have shown substantial representativeness. 35
This leads to one of the most critical questions in studying behavioral interventions: blinding. Patients can only be blinded when an active control intervention is very similar to the intervention to be studied. In this case, also the control intervention can have a treatment effect or even a negative impact leading to an under- or overestimation of the benefits of the intervention. Especially in smaller studies, it will be therefore very difficult to show differential effects, which may have happened to some of the RCTs summarized in this review. An NIH expert panel in 2019 presented a concept to handle the methodological problem of controls in behavioral interventions. 36 The recommendation here is to start with a treatment as usual or waitlist control group. Having gathered some effectiveness data, further studies with active control conditions can attempt to disentangle the mechanisms and mediators by which treatment effects unfold over time.
This review is limited as it addresses only self-guided interventions. Guided interventions show a broad range from single email contacts to regular telephone–video–live contacts. However, scalability is substantially much more challenging. Furthermore, ROB assessment focused on self-reported outcomes, which were not always used as primary study endpoints. Therefore, ROB information on the study in the domain PROMS and the overall rating are to some extent selective. However, it remains an open question whether the studies with significant outcomes in performance-based measures but missing or non-significant PROMs have a relevant impact on the activity and participation of pwMS.
Searching for registered but not yet published studies, we found additional RCTs indicating that the currently limited evidence might improve in the near future.
Conclusion
Although the evidence on the effectiveness of self-guided digital interventions in MS is limited, studies especially in the area of neuropsychiatric symptoms (depression and fatigue) have demonstrated the utility of such interventions.
Supplemental Material
sj-docx-1-msj-10.1177_13524585231201089 – Supplemental material for Mobile health interventions in multiple sclerosis: A systematic review
Supplemental material, sj-docx-1-msj-10.1177_13524585231201089 for Mobile health interventions in multiple sclerosis: A systematic review by Christoph Heesen, Thomas Berger, Karin Riemann-Lorenz, Nicole Krause, Tim Friede, Jana Pöttgen, Björn Meyer and Dagmar Lühmann in Multiple Sclerosis Journal
Supplemental Material
sj-docx-2-msj-10.1177_13524585231201089 – Supplemental material for Mobile health interventions in multiple sclerosis: A systematic review
Supplemental material, sj-docx-2-msj-10.1177_13524585231201089 for Mobile health interventions in multiple sclerosis: A systematic review by Christoph Heesen, Thomas Berger, Karin Riemann-Lorenz, Nicole Krause, Tim Friede, Jana Pöttgen, Björn Meyer and Dagmar Lühmann in Multiple Sclerosis Journal
Footnotes
Author Contributions
C.H. provided the concept, was involved in the screening process and data interpretation, and wrote the initial paper draft. D.L. provided analysis methodology and supervised and performed data extraction and interpretation. She commented and revised the paper. K.R.-L. was involved in data extraction and interpretation. J.P. and T.F. were involved in effect size calculations and interpretation. T.B., K.R.-L., N.K., T.F., J.P., and B.M. provided substantial comments to the paper draft.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: B.M. is employed as the Research Director by GAIA, the company that has developed, owns, and operates deprexis, amiria, and elevida. C.H. and J.P. also contributed to the trials of these applications. K.R.-L. and N.K. are involved in another digital health intervention development with GAIA. T.B. and D.L. declare no conflicts of interest. T.F. has no conflicts of interest. C.H. received research support from Bristol Myers Squibb, Novartis, and Merck. He received honoraria from Roche for digital health presentations.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the German Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen based on a call for a systematic review on the value of mHealth solutions for self-management for pwMS.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
