Abstract
Introduction
Early identification of patients at high risk of progression could help with a personalised treatment strategy. Magnetic resonance imaging (MRI) measures have been proposed to predict long-term disability in multiple sclerosis (MS), but a reliable predictor that can be easily implemented clinically is still needed.
Aim
Assess MRI measures during the first 5 years of the MS disease course for the ability to predict progression at 10+ years.
Methods
Eighty-two MS patients (53 females), with ≥10 years of clinical follow-up and having two MRI scans, were included. Clinical data were obtained at baseline, follow-up and at ≥10 years. White matter lesion (WML) counts and volumes, and four linear brain sizes were measured on T2/FLAIR ‘Fluid-Attenuated-Inversion-Recovery’ and T1-weighted images.
Results
Baseline and follow-up inter-caudate diameter (ICD) and third ventricular width (TVW) measures correlated positively with Expanded Disability Status Scale, ≥10 or more of WMLs showed a high sensitivity in predicting progression, at ≥10 years. A steeper rate of lesion volume increase was observed in subjects converting to secondary progressive MS. The sensitivity and specificity of both ICD and TVW, to predict disability at ≥10 years were 60% and 64%, respectively.
Conclusion
Despite advances in brain imaging and computerised volumetric analysis, ICD and TVW remain relevant as they are simple, fast and have the potential in predicting long-term disability. However, in this study, despite the statistical significance of these measures, the clinical utility is still not reliable.
Introduction
Multiple sclerosis (MS) is a chronic inflammatory, demyelinating and neurodegenerative disease of the central nervous system (CNS). 1 The early identification of MS high-risk patients with a greater disability is crucial and it would be effective for determining an early personalised treatment strategy. Studies have shown that more than half of MS patients are likely to develop a significant disability after 15–30 years;2–4 however, some cohorts showed that many maintained a mild disease state.
In a largely untreated cohort of people with clinically isolated syndrome (CIS), it has been found that up to 42% of patients remained fully ambulatory (EDSS ≤3.5) 30 years later, while 58% developed disability and may have benefitted from early treatments. 2 It has also been reported that patients who accumulated lesions over the first five years of their disease were more likely to develop secondary progressive multiple sclerosis (SPMS) 20 years later, 2 especially when 10 or more lesions are present at the baseline scan. 4 It is therefore important to examine if there is an easily applicable measurement that could predict long-term disability in an era the disease-modifying therapies (DMTs) have been used. 5 EDSS 4 and EDSS 6 are routinely used as disease progression cut-offs because EDSS 4 is the first point in the scale at which walking is limited/restricted, and EDSS 6 is the first point at which bilateral walking aids are required.
Despite advances in brain imaging and computerised volumetric analysis, manual brain measurements of the lesions and linear brain atrophy remain relevant as they are simple, fast, and can be performed on non-digitised data. In addition, for brain atrophy measures, linear two-dimensional methods do not require extensive training or expensive and time-intensive computer software necessary for complex quantitative and volumetric analyses. Furthermore, imaging protocols do not always include three-dimensional brain imaging (required for voxel-based morphometry) due to time constraints while two-dimensional scans have shorter acquisition times and are easily available. For these reasons, simple linear measurements are applicable to clinical practice, especially in regions with no access to advanced imaging technology.
Different MRI predictors of MS disability which can be easily implemented in clinics have been proposed.6–9 To date, these predictors have not been approved to be implemented in clinical practice. Predictors include white matter lesions (WMLs) accumulation10,11 and linear measurements of brain atrophy. Amongst the many proposed linear measures of brain atrophy, the most widely reported are third ventricular width (TVW), 3 medullary width (MEDW)3,12 corpus callosum index (CCI) 13 and inter-caudate diameter (ICD). 12 These measurements are established and practical techniques that correlate with long-term disability in MS.14,15 However, the reliability and strength to use these measures in clinics are yet to be confirmed in clinical MS cohort.
This study aims to assess the role and reliability of measures, that can feasibly be implemented within a clinical setting in the first 5 years of the disease, for predicting long-term (>10 years) outcomes. This assessment is needed to facilitate and implement reliable and practical MRI metrics in clinics.
Material and methods
Patient selection
This is a retrospective study from the MS clinic database at the Queen’s Medical Centre in Nottingham. In this study, data collected up to January 2021 were included. A total number of 3801 patients with different MS subtypes were registered with the clinic with the date of disease onset ≤2011. Inclusion criteria were (1) baseline brain MRI scan in 2011 or earlier (2) follow-up scan acquired between 4–6 years from the baseline scan (3) both sets of scans had a T2-FLAIR (fluid-attenuated inversion recovery) and T1 sequences of brain MRI (4) Patients with confirmed diagnosis of MS on the McDonald criteria 2010 (5) consented to have their MRI and clinical data used for research purposes (6) Primary progressive MS (PPMS) patients were excluded due to the different nature of the disease.
Patients’ demographic data were captured at baseline and follow-up MRI, and clinical data were extracted from the clinical notes. Data include age, sex, date of MS onset, date of MS diagnosis, MS subtypes, EDSS, date reached EDSS 4 and 6 if applicable and details of DMTs used. The number and volume of T2-FlAIR lesions at baseline and follow-up scans were measured by a researcher blind to their clinical characteristics.
Disability was evaluated according to the EDSS at three times; baseline, follow-up (4–6 years interval), and last visit (≥10 years). Disability milestones were defined as reaching an EDSS score
We divided treatments in high-efficacy treatments (HET): Tysabri (natalizumab), Lemtrada (alemtuzumab, Campath), Ocrevus (ocrelizumab); and non-HET: beta-Interferons, Copaxone (glatiramer acetate), Aubagio (teriflunomide), Tecfidera (dimethyl fumarate), Gilenya (Fingolimod), Mavenclad (Cladribine). 16
MRI protocol
Clinical brain MRI scans were performed as part of the service provided at the MS clinic at the Queen’s Medical Centre using 1.5T and 3T MRI. These scans included sagittal T2-FLAIR and T1-weighted spin-echo. All the sequences were obtained using a contiguous 3–5 mm slice thickness covering the whole brain. Having a normal brain MRI scan was not an exclusion criterion, since patients were included based solely on their clinical diagnosis (i.e., patients with normal baseline brain scans were included).
Image analysis
Lesions and atrophy
All measurements were performed using 3D slicer version 4.13.0 (https://www.slicer.org). Inter-rater and intra-rater reliability statistics were provided in (S.1). Lesions with long axis diameter
Statistical analysis
Pearson’s correlation coefficient was used to assess the relationship between EDSS at > 10 years/change in EDSS and the six brain MRI metrics (lesion count, lesion volume, TVW, CCI, ICD, MEDW) at baseline, follow-up and changes in MRI parameters were computed as an annualized measure (e.g., CCI change/y). To assess the extent to which MRI brain metrics can be used to predict worsening disability over 10 years, we performed a simple linear regression analysis. The EDSS scores at 10 years were set as the dependent variable while MRI predictors (lesion count, lesion volume, TVW, CCI, ICD, MEDW) at baseline or follow-up were set as the independent explanatory variable, including a factor variable with three levels (HET, non-HET and untreated) in the model. Linear regression is particularly interpretable; however, EDSS is an ordered categorical variable with a scale (0–10). Therefore, ordinal regression analysis was also performed to check similar findings. The t-test tests were used to compare EDSS and MRI predictors between baseline, follow-up, and yearly change.
For the binary logistic regression, the EDSS models were EDSS
Results
Patient characteristics
Of the 3801 MS patients that were recorded, 82 RRMS patients were included based on the inclusion criteria (Figure 1). MRI brain scans of the 82 patients were 29 male and 53 female, with a mean age (±SD) of 35.4 (±10.3) years. Flowchart illustrates the process of patient selection for this study.
The mean clinical follow-up was 12.1 (±1.31) years, and the mean time between scans was 5.33 (±1) years. The baseline MRI scan was acquired with a median of 3 months (IQR 6) after disease onset. A wide range of physical disabilities encompassing all levels of the EDSS was seen across the cohort (Figure 2). EDSS worsened from a mean of 1.95 (±1.59) at baseline to 4.48 (±2.17) after 10 years (p <0.001). Out of 82 patients, 70 remained RRMS and 12 progressed to SPMS. During the follow-up, 51 RRMS patients reached EDSS 4 at the last visit while 31 patients reached EDSS 6. Two RRMS patients died and only one was due to MS. Expanding Disability status scale (EDSS) scores after more than 10 years of follow-up. EDSS were obtained from 82 patients at the last visit >10 years. An EDSS of 10 was assigned to those where Multiple sclerosis (MS) was known to contribute to death. *RRMS: Relapsing remitting multiple sclerosis; SPMS: Secondary progressive multiple sclerosis.
Demographic characteristics, clinical classification and data availability at each follow-up time points.
All data presented as medians and ranges, unless otherwise stated, EDSS: Expanded Disease Duration Status Scale, DMTs: disease-modifying treatments.
The numbers of slices and slice thickness were variable, both between patients and between the first and second scans. Of the 164 scans analysed (first and follow-up MRIs), the number of slices ranged from 15 to 60 and slice thickness ranged from 3 to 5 mm. The mean number of slices was 26.7 (±5.94) and the mean slice thickness was 3.91 mm (±0.76).
Baseline brain MRI and new lesions
New lesions and the change rate in lesions on the follow-up MRI scan.
MRI predictors and long-term physical disability
Correlations between MRI predictors and clinical disability
No correlations were found between WML counts and volume with clinical disability at both baseline and follow-up. Alternatively, some linear brain measurements showed a positive correlation with EDSS at the last visit. This included ICD (r = 0.25, r = 0.27, p < 0.01 at baseline and follow-up) and TVW (r = 0.28, r = 0.24, p < 0.01 at baseline and follow-up).
Correlations between MRI predictors and disease severity (EDSS).
BL: Baseline, FU: Follow-up, EDSS: Expanded Disability Status Score, CCI: Corpus callosum index, MEDW: medulla width, TVW: Third ventricular width.
aSignificance at p-value of <0.05.
T2 lesions and brain changes
As expected, the lesion count at the follow-up scan was greater than baseline (25 versus 15 lesions, p < 0.001), with an annual rate of new lesions of 1.11 lesion/year (p < 0.001). Similarly, the lesion volume was greater in the follow-up scan (1.48 versus 1.44 cm3, p < 0.001), with an annual lesion growth rate of 0.16 lesion/year (p < 0.001). Only 3 patients; 2 RRMS, and 1 SPMS did not have any new lesions, they had fewer lesions at the follow-up scan by 1 to 4 lesions.
Patients who remained RRMS or progressed to SPMS at the end of the observational period had similar medians of the baseline T2 lesion volume see Table 4. However, observing the pattern of T2 lesion volume during the follow-up scan showed a steeper rate of lesion volume increase in SPMS compared to RRMS over the first 4-6 years of the disease.
White matter lesions counts and volume in relapsing-remitting and secondary-progressive MS patients.
All data presented as medians and ranges RRMS: Relapsing-Remitting Multiple Sclerosis, SPMS: Secondary Progressive Multiple Sclerosis.
EDSS at last visit (≥10 years)
The results from the linear regression showed that most of the predictors have positive associations, apart from CCI and MEDW which were not statistically significant (Table 5). Two predictors, TVW and ICD at both baseline and follow-up had statistically significant associations with long-term disability. For every unit increase in TVW at baseline, the value of disability score at the last visit was estimated to increase by a value of 0.45 mm and this association was highly statistically significant (p = 0.001). Linear regression may not be ideal for ordinal categorical variables such as EDSS, although it is easy to interpret, therefore we also performed ordinal regression and similar results were found (S.3).
EDSS of 4.0 and 6.0 at 10 years
Linear regression for the predictors and EDSS.
BL: Baseline, FU: Follow-up, EDSS: Expanded disability status score, CC: Corpus callosum, MEDW: Medulla width, ICD: Inter-caudate diameter, TVW: Third ventricular width.
aSignificance at p-value of <0.05.
Baseline MRI lesion number and clinical status at 15 years.
Binary logistic regression for the predictors and EDSS ≥4 or EDSS ≥6.
BL: baseline, FU: follow-up, EDSS: Expanded Disability Status Score, CC: corpus callosum, MEDW: medulla width, TVW: third ventricular width.
aSignificance at p-value of <0.05.
Aggressiveness MS: EDSS of 6.0 at 10 years
As stated, aggressiveness was set as EDSS ≥6. The risk of having aggressive MS in the group of more than 10 lesions at baseline was 18% higher compared to patients with less than 10 lesions and this difference was highly statistically significant (p = 0.004). Furthermore, the median (IQR) number of baseline T2 lesions was 18 (0–58) in the aggressive group compared to 15 (1–38) in the non-aggressive group (Figure 3). In addition, the area under the curve was 0.60 (95% CI 0.48–0.72) which means poor prediction; the best cut point was 4 lesions at baseline with a high sensitivity of 88% but low specificity of 20%. A cut-point of ten lesions had better specificity of 43% but sensitivity was reduced by 74%. Number of T2 lesions at baseline in two MS groups, aggressive MS patients who reached EDSS ≥6 and EDSS <6.
ICD and TVW had similar values. A cut-point of 0.46 mm had the best specificity of ∼64% and sensitivity of ∼60 for both ICD and TVW, accuracy of 62% and AUC was 0.70 (95% CI 0.16–0.94) which is consider as acceptable predictors.
Discussion
This study assesses the role and reliability of different MRI measures during the first 5 years of the disease, which can be easily used in routine clinics and could reliably predict MS disability.
The current study demonstrated that linear brain atrophy metrics related to ICD and TVW have an independent impact in predicting disability after 10 years but WML counts and volume showed no association with disability. However, the presence of 10 or more WML showed a high sensitivity in predicting disability.
More than 37% of our patients developed an aggressive MS (reaching an EDSS of 6 or more at 10 years). It is worth highlighting that in patients with an aggressive phenotype, baseline characteristics such as age and sex did not contribute in identifying patients at risk.
Contrary to previous findings,10,18,19 in our study, the counts and volume of WML were not associated with disability either at baseline or during the follow-up period. This could be due to the small sample size compared to other studies. Having said that, the correlation coefficient (R-value) was smaller compared to other studies and the should not be affected by study size.
It is more likely that our results are to the patients included, which had definite MS,rather than CIS and by definition a more active disease with different lesions evolution characteristics. In a recent study including 548 placebo-treated RRMS patients, the multivariable analysis indicated EDSS score and T2 lesion load as factors that independently predict clinical progression. Nevertheless, these two variables taken together were able to account for only 3% of the probability to have an EDSS increase over follow-up time. Thus, confirming the limited value of these metrics in predicting disability changes in RRMS. 20 Such a result is in line with several previous cross-sectional and longitudinal studies conducted on a smaller group of patients with different clinical characteristics, which have shown only a modest correlation between T2 and changes in disability.21,22 The presence of 10 or more of WMLs at baseline scan has attracted attention since the Barcelona group reported as being very predictive of how agressive the disease appears to be 10 years later. 23 In our cohort, we did not replicate this finding. Although in the recent Barcelona study 4 the median number of baseline T2 lesions in the aggressive group (EDSS 6 at 10 years) was higher compared to our cohort (17 vs 18). Again, the most likely explanation is that that they included CIS patients whereas we only included MS patients showed some merit to the use of this measure in progression prediction, which also has been reported previously. 24 Furthermore, lesion load continues to increase in RRMS patients and the rate of lesion growth in those who develop SPMS is higher than those who remain RRMS, in line with a previous finding. 19
Several reasons might explain the weak or absent association of WML and/or regional atrophy with clinical changes. This might be related to some technical limitations such as the difference in slice thickness and noise as manual measurements are generally more susceptible to it.
There are no previous studies investigating critical cut-off values for MRI volumetric predictors of disability, such as TVW. Therefore, a direct comparison of our findings with the existing literature’s cut-off values was not possible.
In the current study, the MEDW was measured as part of our dataset as a previous study showed that brainstem measures are sensitive to atrophy in MS 25 and may act as a replacement for cervical spinal cord volume in predictive diagnostics. 3 In another stud, MEDW measures were linearly correlated with disease severity in MS patients. 26 However, in the current study, the similar metric of MEDW did not produce any statistical significance as a prognostic predictor. These findings could suggest that there is some independence of the MS pathology in these regions, as this lack of correlation between the brainstem and spinal cord measurements has been reported previously. 24
Similarly, CCI was demonstrated in a previous study to be correlated to disease progression in MS patients but was not itself an independent predictor. 27 Accordingly, CCI showed an annual decrease in patients following MS diagnosis, and the severity of CCI decrease was double in SPMS compared to RRMS patients with identifying a trend for a slower rate of CCI in patients using DMTs. Consistent with this line of thinking, the use of CCI as a measure of disease progression may not have been as statistically robust.
A high proportion of our patients received DMTs during the follow-up, and for the majority, there was a short time between the first clinical event and treatment initiation. Although only 62% of patients were using DMTs at clinical follow-up, there may have been a tendency for use of DMTs to be more common in patients with mild to aggressive symptoms. It was found that the treatment effect on disability progression was independently correlated with brain atrophy and the presence of active MRI lesions. 28
This study has limitations that include the exclusion of a large subset of patients when patients with MS were not routinely scanned annually during the first years of their disease. Also, due to the retrospective study design, we were not able to accurately record all previous DMTs data. A further caveat is the estimated odds ratios, although statistically significant should be interpreted with caution, particularly where all confidence intervals are very wide. The lack of data on spinal cord lesions and atrophy is another major limitation, in knowing the association of spinal lesions with long-term disability. 29
Conclusion
Brain atrophy of ICD and TVW early in the cohort of MS could predict progressive disease and disability over 10 years of clinical follow-up, as measured by simple, fast linear measurements which are applicable to clinical practice. The current clinical monitoring relies on T2 lesions development, but this study suggests that simply counting the number of lesions does not have a direct effect on clinical disability 10 years later in the current DMTs treated cohorts. This study also shows that the predictive value of brain lesions and atrophy alone from routine MRI scans may be not enough if used as the sole predictor of outcome. The use of more advanced MRI biomarkers, and especially the integration of these measures in prediction modelling using artificial intelligence platforms, either by way of the use of machine learning (vector) programs with pre-defined features or deep learning techniques, could possibly improve in the future this prediction ability.
Supplemental Material
Supplemental Material - Predictors of long-term disability in multiple sclerosis patients using routine magnetic resonance imaging data: A 15-year retrospective study
Supplemental Material for Predictors of long-term disability in multiple sclerosis patients using routine magnetic resonance imaging data: A 15-year retrospective study by Amjad Altokhis, Abdulmajeed Alotaibi, Paul Morgan, Radu Tanasescu and Nikos Evangelou in The Neuroradiology Journal
Footnotes
Acknowledgements
We thank Andre Venn and Anna Podlasek, Medical Statisticians at the University of Nottingham for their advice on statistical analysis. Thanks to Anwar A. Sayed for his support and feedback.
Declaration of conflicting interest
The author(s) declared the following potential conflicts of interest with respect to the research, authorship and/or publication of this article: A.I.A., A.A., and P.M. nothing to declare. R.T. received support from the UK MRC (CARP MR/T024402/1). N.E. is a member of the advisory board for Biogen, Merck, Novartis and Roche; he has received grant income from the MS Society, MRC, PCORI and NIHR.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported in part by Nottingham University Hospital and Princess Nourah bint Abdulrahman University for Health and Rehabilitation Sciences in Saudi Arabia.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
