Abstract
Background:
The Multiple Sclerosis Severity Score (MSSS) is obtained by normalising the Expanded Disability Status Scale (EDSS) score for disease duration and has been a valuable tool in cross-sectional studies.
Objective:
To assess whether use of age rather than the inherently ambiguous disease duration was a feasible approach.
Method:
We pooled disability data from three population-based cohorts and developed an Age Related Multiple Sclerosis Severity (ARMSS) score by ranking EDSS scores based on the patient’s age at the time of assessment. We established the power to detect a difference between groups afforded by the ARMSS score and assessed its relative consistency over time.
Results:
The study population included 26058 patients from Sweden (
Conclusion:
Since age is typically unbiased and readily obtained, and the ARMSS and MSSS were comparable, the ARMSS may provide a more versatile tool and could minimise study biases and loss of statistical power caused by inaccurate or missing onset dates.
Introduction
Kurtzke’s Expanded Disability Status Scale (EDSS) score 1 is the most widely used measure of disability in multiple sclerosis (MS). The EDSS is a standard outcome in MS clinical trials and commonly used in natural history studies and often applied in clinical practice to monitor patient’s progression over time.
In 2005, Roxburgh et al. 2 proposed the MS Severity Score (MSSS), a useful method of using and comparing cross-sectional patient-level disability (EDSS) data. An MSSS score was assigned according to how a patient’s EDSS ranked in comparison to all those patients with similar disease duration (±2 years). This offered advantages over other more crude approaches, such as the progression index (PI), which is estimated by dividing an individual’s EDSS by disease duration. The MSSS has been widely used in different settings,3–5 can be used in a variety of models and was shown to have improved statistical power, when compared to other available measures of disease progression, to detect differences in disability between groups of patients. One notable drawback of the approach is its reliance on the date of disease onset, which is generally assigned retrospectively and is necessarily imprecise. More importantly, an onset date is frequently missing or unobtainable in some data sets, resulting in loss of data and subsequently loss of statistical power.
Natural history studies have shown that many aspects of the clinical course of MS, including clinical symptoms, 6 occurrence of relapses 7 and disability progression, are associated with age.6,8–10 In light of these observations, we examined the suitability of using chronological age rather than years since date of onset in compensating for the effects of time on disability. To do this, we adapted the MSSS algorithm and developed an Age Related Multiple Sclerosis Severity (ARMSS) score by ranking EDSS scores based on the patient’s age at the time of assessment. Using age for the calculation of a severity score offers several advantages, not least of which are its availability, ease of measurement and absence of bias. In the work presented here, we have compared the proposed ARMSS score with its forerunner, the MSSS, and with other previously considered methods in terms of power to detect a difference in disability between groups. We also provided an updated version of the global MSSS and a new global ARMSS matrix by accessing three large population-based cohorts of MS patients.
Methods
Patients and data source
We used data from cohorts in Sweden, Canada (from the British Columbia MS database) and the United Kingdom (Cambridge MS cohort). These cohorts have been described before.6,7,11–13 Sex, date of birth, date of disease (first symptom) onset (recorded retrospectively through the neurologist–patient encounter), clinical course at onset (relapsing vs progressive), EDSS score and date of EDSS examination were obtained. From these cohorts, two data sets were constructed: cross-sectional and longitudinal. The cross-sectional data set was used for construction of the global ARMSS and the updated global MSSS matrices and included the most recent EDSS scores for all individuals in the three cohorts to ensure that individuals with varying rates of progression and different numbers of clinic visits contributed equally. The longitudinal data set comprised the Swedish and Canadian cohorts and contained serial EDSS scores recorded at different time points; this was used for testing the stability of the ARMSS.
Statistical analyses
As for derivation of the MSSS, the ARMSS was created using Weibull plotting positions, which is calculated as follows:
We also considered alternate versions of the ARMSS including the following:
Creation of the global ARMSS matrix
A global ARMSS matrix was constructed using the cross-sectional data set. This matrix included the ARMSS scores obtained for EDSS scores recorded between ages of 18 and 75 years.
Update of the global MSSS matrix
We also calculated an updated version of the global MSSS matrix using the same (original) approach as reported previously. 2
Comparison of power to detect a 0.5 EDSS score difference between two groups
The power of a study refers to the probability of correctly rejecting the null hypothesis of no difference between groups where there is a true effect. Comparisons of the power of the global ARMSS, the local ARMSS (i.e. scores calculated using a specific data set), the global and the local MSSS, PI, and the EDSS score were made in a simulation study. Since all of the scenarios proposed by Roxburgh et al.
2
yielded similar results, we replicated only their first simulation scenario on one subset of randomly selected patients (
To test whether the power was influenced by the age distribution, we compared the power of the global scores in three patient samples where the mean ages were 30 (
Stability over time of the ARMSS and MSSS measurements
We measured the correlation between two succeeding scores to assess the long-term stability of the measurement. Spearman’s rank correlation coefficient between assessments at ages of 20, 25, 30, 35, 40, 45 and 50 and assessment at 5, 10 and 15 years later (±1 year) were calculated using the pooled longitudinal data set. Comparisons were made between the correlation coefficients for the global ARMSS score, updated and original MSSS and the double ranked score.
Statistical analyses were performed using Stata (StataCorp, 2009,
Ethical approval
Ethical approval was obtained from the respective regional ethical committees.
Results
Patient population
The final population included 26058 patients with clinically definite MS from Sweden (
Demographic and clinical characteristics of the 26,058 included MS patient at the last recorded EDSS.
EDSS: Expanded Disability Status Scale; MS: multiple sclerosis; CI: confidence Interval; MSSS: Multiple Sclerosis Severity Score.
Proportion of MS patients in each EDSS category by disease duration.
EDSS: Expanded Disability Status Scale; MS: multiple sclerosis.
Percentages from Roxburgh et al. 2
There was a moderate correlation between EDSS and disease duration (
The global ARMSS and updated global MSSS matrices
The global ARMSS (
Software packages were created for Stata and R to calculate the global/local ARMSS and MSSS. An interactive online tool has also been developed to obtain scores for individual patients and data sets (https://aliman.shinyapps.io/ARMSS/). 14
Comparison of power to detect a 0.5 EDSS score difference between two groups
The results of the simulation study are shown in Figure 1. The global ARMSS, the updated global MSSS and the original global MSSS scores showed very similar power to capture EDSS changes which was considerably better than that of the other scales. For example, when half of the patients in the exposed group (

Comparison of power to detect 0.5 EDSS score difference between two patients groups in 500 randomly selected patients from the pooled data set. The power curve for the three Global scores overlap . Local scores are calculated within the same sample.
Stability over time of the ARMSS and MSSS measurement
The stability analysis included 68,240 observations from the Swedish data and 38,977 observations from the Canadian data. All scores showed relative stability after 5, 10 and 15 years and, for any age at first assessment, the 95% CIs of the correlation estimates for the different severity scores all overlapped (Supplementary Table 1). It should be noted that results of the stability analyses are based on average scores of patient groups. Results from the stability analyses demonstrate similar rate of progression by age; however, individual fluctuations are still significantly present. The individual fluctuation specifically at younger ages limits the use of ARMSS as a predictive tool. Ranking EDSS scores based on both disease duration and age (double ranked) did not provide any advantage.
Application as a patient monitoring/comparison tool
The MSSS has been used in several registers as a patient monitoring/comparison tool15,16 giving clinicians the ability to determine where a patient is relative to other patients of similar disease duration. Such an application is also possible for the ARMSS to define relative severity and rate of progression over time in an individual patient. Similar to MSSS, a patient having an ARMSS score of
Figure 2 demonstrates the longitudinal assessments of the global ARMSS scores in two patients with consecutive EDSS measurements over time. While the disease course can be longitudinally assessed using EDSS scores, the global ARMSS score allows cross-sectional and longitudinal comparisons of disease severity in each individual compared to the patients of similar age.

Longitudinal assessments of the global ARMSS scores in two patients with consecutive EDSS measurements.
Discussion
Here, we demonstrate that the ARMSS score obtained by ranking of EDSS scores based on patients’ age offers a powerful method for measuring relative severity of disability in MS. The ARMSS score presents an outcome measure that offers reasonable stability over time and is able to capture an effect contributing to a change in disability scores.
The ARMSS score offers a major advantage over the MSSS by its use of a patient’s age, which is typically readily and accurately available as opposed to the date of disease onset, and has been consistently shown to be associated with disease progression. 17 Date of symptom onset in MS is generally based on a patient’s ability to recall, date and articulate past events and might be influenced by factors such as type and nature of the first symptom(s), sex (or gender) and the initial MS clinical phenotype. The accuracy of the symptom onset date can also depend on the history taking skills of the evaluating physician and the consultation time available, and the record can be biased by factors such as an assessor’s (the patient or clinician) knowledge of the average age at MS onset. 18 Using age not only increases accuracy when comparing between patients and across cohorts, but also increases the sample size by preventing case exclusions due to an unknown or undetermined date of onset. As shown here, as the proportion of the cohort with a missing onset date increased, the power of the MSSS significantly decreased as a result of the reduced sample size. Unlike MSSS, the ARMSS scores would not be influenced by the combined MS phenotypes in a single global cohort since patients with different MS phenotype (relapse- and progressive-onset) reach disability milestones at almost similar ages (but significantly different disease durations).
There are several applications of the ARMSS score. In a clinical setting, the ARMSS score enables practicing clinicians to compare a patient’s disease severity to that of a large global (as well as local) patient population to get an overall picture of the patient’s performance. When recorded at several time points, the ARMSS scores can offer a rather comprehensive overview of a patient’s relative disease severity with the impact of age already taken into account. This might be helpful when patient is being monitored for clinical purposes 19 as a change in EDSS score is better reflected in the ARMSS score. In the context of research, the ARMSS scores offer an outcome measure that can make the best use of sparse clinical data or cross-sectional EDSS scores and detect small differences between groups when even a limited number of patients have experienced a change in the EDSS score. Examples of such research applications would be the genome-wide association studies (GWAS) of disease severity,20,21 studies of association between environmental factors and disease severity22,23 or studies investigating MS clinical course.3,4,24–26
A major strength of our study is that the data used for construction of the global ARMSS and the updated global MSSS were obtained from three large cohorts: two in Europe and one in North America. Although there are some population-based differences, the pooled disability data enabled us to include MS patients with a wide spectrum of age, disease duration and EDSS scores. As a result, compared with the original global MSSS, the updated global MSSS assigns a higher score to the same combination of EDSS and disease duration in the majority of the cases. Nevertheless, if patients with very severe MS had died before any EDSS assessment could be made, or very benign patents were never assessed, these extreme groups would be underrepresented in the data sets such that the global scores might either under or overestimate the actual MS severity. The Swedish, Canadian and UK MS cohorts are predominantly comprised of patients of Northern European descent. It would be of value to assess the ARMSS in other ethnic groups and country-specific data sets.
Our sample included patients with varying lengths of exposure to different DMTs. We did not include treatment data in the calculation of global scores as this was impractical and we did not have comprehensive access to this data across all sites. We believe that the data from our cohorts are representative of the real-world setting which includes patients with varying lengths of exposure to DMTs. Hence, the global scores obtained from these cohorts are generalisable to many contemporary clinical settings. Furthermore, the three global scores showed significantly better performance than the local ARMSS and MSSS, EDSS and PI in our power analysis of patients who have been exposed to a DMTs for less than a year (the group perceived as being benign (not needing/qualifying for treatment) or least influenced by DMTs that was available to us), implying the advantage of a big data set. It should be noted that no impact of DMTs on the MSSS has been reported 24 and that any long-term influence of DMTs on EDSS progression remains hopeful but still uncertain.13,27
Useful applications of the global ARMSS might include baseline or cross-sectional comparisons of disease severity between two or more groups of MS patients, comparisons when disability data are sparse, or for matching groups of patients within or between studies. It should be noted that, regardless of its performance, the ARMSS score has limitations which are mainly due to its reliance on the EDSS score with its well-defined shortcomings such as its bimodal distribution and marked inter- and intra-rater variability. 28 Nevertheless, the EDSS is the most widely used outcome measure in MS, it is of relatively low cost to obtain, neurologists are familiar with it worldwide, and it can reasonably capture disease worsening over the long term.
In MS, age might be a better proxy of the cumulative effect of environmental and related exposures (including comorbidity) than disease duration. The burden of comorbidity in MS may increase with age and may also impact subsequent disease progression.29,30 Alternatively, the comparable performance of the ARMSS score to that of the MSSS in capturing effects on disability and its stability over time may indicate that irreversible disability in MS is just as much a function of chronologic age as it is of disease duration. These findings are in line with the reports on the effect of current age on reaching disability milestones.6,9 All said and as described before, 31 significant heterogeneity in disability scores in individuals within the same age is still present, particularly in younger age groups. One might expect that the precision obtained using age lessens the variability in the EDSS scores. While EDSS scores at older ages were slightly more stable over time (Supplementary Table 1), the overall correlation between EDSS and age was only moderate and the power of global ARMSS score was comparable in samples with different age distributions. Hence, part of the correlation between age and disability levels may have resulted from the analyses of groups which can potentially obscure individual or subgroup heterogeneity. Nevertheless, judging by its effect, age is one of the most important factors in accumulation of disability in MS.
In conclusion, disability in MS as assessed by the EDSS correlates with age at a similar magnitude as with duration of disease. Since a patient’s age is nearly always available, and since the ARMSS and MSSS are comparable even when the onset date is known, the ARMSS offers a more versatile tool for comparing EDSS-based severity in MS between groups.
Footnotes
Acknowledgements
The authors would like to thank all of the neurologists, nurses and MS patients in Sweden, Canada and the United Kingdom for providing data for this study. They gratefully acknowledge the BC MS Clinic neurologists who contributed to the study through patient examination and data collection (current members listed here by primary clinic): UBC MS Clinic: A. Traboulsee, MD, FRCPC (UBC Hospital MS Clinic Director and Head of the UBC MS Programs); A.-L. Sayao, MD, FRCPC; V. Devonshire, MD, FRCPC; S. Hashimoto, MD, FRCPC (UBC and Victoria MS Clinics); J. Hooge, MD, FRCPC (UBC and Prince George MS Clinic); L. Kastrukoff, MD, FRCPC (UBC and Prince George MS Clinic); J. Oger, MD, FRCPC; Kelowna MS Clinic: D. Adams, MD, FRCPC; D. Craig, MD, FRCPC; S. Meckling, MD, FRCPC; Prince George MS Clinic: L. Daly, MD, FRCPC; Victoria MS Clinic: O. Hrebicek, MD, FRCPC; D. Parton, MD, FRCPC; K. Atwell-Pope, MD, FRCPC. The views expressed in this paper do not necessarily reflect the views of each individual acknowledged.
Author contribution
A.M. designed the study, analysed and interpreted data, wrote and revised the manuscript. H.W. analysed and interpreted data, wrote and revised the manuscript. E.K. analysed and interpreted data and revised the manuscript. F.Z. analysed and interpreted data and revised the manuscript. R.R. interpreted data and revised the manuscript. A.G. interpreted data and revised the manuscript. R.C. interpreted data and revised the manuscript. S.S. designed the study, interpreted data and revised the manuscript. M.B. interpreted data and revised the manuscript. H.T. facilitated data interpretation and revised the manuscript. J.H. designed the study, interpreted data and revised the manuscript.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship and/or publication of this article: H.W., R.R., S.S., M.B., E.K. and F.Z. declare no conflict of interests. A.M. reports grants from Neuro Sweden (Neuroförbundet) during the conduct of the study. A.G. is receiving research support from Biogen. J.H. received honoraria for serving on advisory boards for Biogen, Sanofi-Genzyme and Novartis and speaker’s fees from Biogen, Merck-Serono, Bayer-Schering, Teva and Sanofi-Genzyme. He has served as P.I. for projects sponsored by, or received unrestricted research support from, Biogen, Merck-Serono, TEVA, Novartis, Sanofi-Genzyme and Bayer-Schering. H.T. is the Canada Research Chair for Neuroepidemiology and Multiple Sclerosis. She has received research support from the National Multiple Sclerosis Society, the Canadian Institutes of Health Research, the Multiple Sclerosis Society of Canada (Don Paty Career Development Award); the Michael Smith Foundation for Health Research (Scholar Award) and the UK MS Trust; speaker honoraria and/or travel expenses to attend conferences from the Consortium of MS Centres (2013), the National MS Society (2012, 2014), Bayer Pharmaceuticals (2010), Teva Pharmaceuticals (2011), ECTRIMS (2011, 2012, 2013, 2014, 2015), UK MS Trust (2011), the Chesapeake Health Education Program, US Veterans Affairs (2012), Novartis Canada (2012), Biogen (2014), American Academy of Neurology (2013, 2014, 2015). Unless otherwise stated, all speaker honoraria are either donated to an MS charity or to an unrestricted grant for use by her research group.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The lead site (Sweden) was received support for this study from Biogen and Neuro Sweden (Neuroförbundet). The participating sites (UK and Canada) received no direct funding for this study. The UK site was supported by the Cambridge NIHR Biomedical Research Centre. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Biogen provided a courtesy review and feedback on the paper to the authors. The authors had full editorial control of the paper and provided their final approval of all content.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
