Abstract
Study design
Cross-sectional retrospective observational study.
Objective
To evaluate the reliability and clinical utility of the Modic changes (MC) grading score.
Method
Patients from the Danish national spine registry, DaneSpine, scheduled for lumbar discectomy were identified. MRI of patients with MC were graded based on vertical height involvement: Grade A (<25%), Grade B (25%-50%), and Grade C (>50%). All MRIs were reviewed by 2 physicians to evaluate the reliability of the MC grade.
Results
Of 213 patients included, 142 patients had MC, 71 with MC-1 and 71 with MC-2; 34% were Grade A, 45% were Grade B, and 21% were Grade C. MC grade demonstrated substantial intra-rater (κ = .68) and inter-rater (κ = .61) reliability. A significantly higher proportion (n = 40, 57%) of patients with MC-1 had a severe MC grade compared to patients with MC-2 (n = 30, 43%, P < .001). Severe MC grade was associated with the presence of severe lumbar disc degeneration (DD) (Pfirrmann grade = V, P = .024), worse preoperative ODI (52.49 vs 44.17, P = .021) and EQ-5D scores (.26 vs .46, P = .053). MC alone including type was not associated with a significant difference in patient-reported outcomes (P > .05).
Conclusion
The MC grade score was demonstrated to have substantial intra- and inter-observer reliability. Severe MC grade was associated with both severe DD and MC type, being more prevalent in patients with MC-1. The MC grade was also significantly associated with worse disability and reduced health-related quality of life. Results from the study suggest that MC grade is more clinically important than MC type.
Introduction
Low back pain (LBP) is one of the most common causes of reduced health-related quality of life and disability.1,2 Most episodes of LBP are self-limiting with a short duration, and are classified as non-specific.3,4 Unfortunately, for many patients, LBP is a recurrent or chronic condition associated with both increased opioid use, physical disability, depression and increased mortality.1,3,5,6 A great deal of effort including government funded initiatives have focused on reducing the global burden of the back pain pandemic.3,7-11 This effort has also included extensive research in risk factors for developing LBP such as genetic disposition, disc degeneration (DD) and Modic changes (MC).12-15
Degenerative changes in the spine are unavoidable with advanced age and include changes in the sagittal/frontal plane like HIZ-lesions and disc protrusion/herniation; and in the transverse plane such as endplate changes (EC) including Schmorl’s nodes (SN) and MC.16-23 In a clinical setting it is of importance to differentiate between and understand the significance and association of these potential causes of LBP.24,25
MC are non-neoplastic subchondral bone marrow changes within the vertebral body and endplates visible on magnetic resonance imaging (MRI).26-29 First described by Assheuer,
26
de Roos,
27
and Modic et al.
28
MC are now classified into 3 types- Modic changes type 1-3 (MC-1/MC-2/MC-3 – see Figure 1).
29
MC in the low back, are found in 5%-22% the background population.15,30 There is an increased presence of MC in patients with LBP and lumbar disc herniation (LDH).15,31,32 LDH can cause endplate-junction failure which can possibly explain the increased prevalence of MC and the increased risk of LBP in sciatica patients with MC.33–36 MRI of Modic changes.
MC-1 is characterized by a reactive change in response to biomechanical stressors, reparative change, and/or inflammation; MC-2 is characterized by fatty infiltration, and MC-3 is associated with bony sclerosis of trabeculae and crowding out of marrow signal.28,29
In systematic reviews, the prevalence of MC in LBP patients and its association with patient-reported outcomes (PROs) demonstrated large between-study variations,37,38 specifically in the interpretation of the actual term “MC”. 37 This variation and the heterogeneity of MC nomenclature has been proposed as a possible cause of phenotype variation across studies.36,38,39 In addition, the extent of MC involvement has not generally been taken into account.40-42 The clinical relevance of the size of the MC have been investigated by Määttä43,44 and Weishaupt, 42 who found that increased vertical extent of MC (proximal or distal) was associated with the type of MC (MC-1 > MC-2). Severe degenerative changes at the functional spinal unit (FSU) were also associated with an increased risk of a positive discography and worse PROs.42,44 Based on these results, previous studies have suggested that MC grade, based on the vertical extent of MC in the vertebral body, should be included in the description and analysis of MC.36,37,40,43,45 The purpose of this study was to evaluate the reliability and clinical utility of the MC grading score in a clinically relevant population.
Methods
Ethical Considerations, Approvals, and Registration
Approval for data collection was obtained from the Danish National Data Protection Agency. The study was approved by The Regional Committees on Health Research Ethics (S-20192000-112). All patients gave informed consent for the use of their data including questionnaires for research purposes before registration in the registry.
This is a registry-based comparative cohort study on patients who underwent primary elective discectomy for LDH at L4-L5 or L5-S1 at a single institution between January 2014 and July 2017. 46 Patients were consecutively included based on the preoperative MRI findings. As patients with isolated MC-3 were few, these cases were excluded.
Inclusion criteria were patients who had surgery for LDH due to bilateral or unilateral radiculopathy with MRI findings concordant with the level and side of the radiculopathy. All patients had an MRI of the lumbar spine within 6 months before their discectomy. All patients underwent lumbar discectomy for persistent radiculopathy (at least 6 weeks) and after failure of nonsurgical treatment. Exclusion criteria were incomplete preoperative data and preoperative lumbar MRIs unavailable for review. Patients with motor deficits (motor grade <3/5), cauda equina, suspicion of malignancy, fractures, disc rupture due to trauma, and a history of previous spine surgery were also excluded.
Based on previous studies,38,47,48 it was estimated that approximately 45% of included patients would have MC on the preoperative MRI. Three predefined groups of similar size were selected for inclusion: MC-1, MC-2, and patients without MC (no-MC). The total number of patients within each cohort would depend on the least common MC finding in regards to these 3 predefined groups
Study participants completed questionnaires within 2 weeks before planned surgery including the Oswestry Disability Index (ODI), 49 European Quality of Life – 5 Dimensions (EQ-5D), Visual Analogue Scale (VAS) 0 (no pain)-100 (maximal pain) for back pain (VAS-BP), and leg pain (VAS-LP). 50
MRI Technique
The MRI system used was a Philips 1.5 Tesla (Philips Healthcare, The Netherlands). The standard imaging protocol consisted of a sagittal T1-weighted turbo spin-echo (TSE), and axial T2-weighted TSE of the lumbar segmental levels including S1. Sagittal short-tau inversion recovery (STIR) imaging was not included in the analysis. Imaging parameters for the sagittal images used in the study were: T1: repetition time 728 milliseconds (ms), echo time 10 ms, matrix 212 × 413, echo train length 6; T2: repetition time 3000 ms, echo time 90 ms, matrix 244 × 216, echo train length 24. Field of view was 196 mm × 196 mm and slice thickness/spacing was 4.0 mm/.4 mm for T1 and 3.0 mm/.3 mm for T2.
MRI Assessment
Modic Change
Modic changes were classified in accordance with the original studies by de Roos 27 and Modic et al.28,29 and the Modic grading score by Udby et al. 36 MC types were defined by T1/T2 weighted characteristics alone; type 1 as clearly hypo-intense on T1 and hyper-intense on T2, type 2 as hyperintense on T1, and iso- or hyperintense on T2 (Figure 1). If mixed MC or several types were present, the MC at the index level with the lowest classification, MC-1 > MC-2 > MC-3, was described. 36 If not present at the index level, MC was described as above, since adjacent segment MC could be associated with clinical symptoms. This was done in order to avoid a type-2 error, failing to find an existing association, as previous studies have shown a consistent association between MC-1 and LBP.37,38
MRI Grade
Modic change grade was defined as Grade A - MC <25% vertebral vertical height involvement, Grade B - MC with 25%-50% vertebral vertical height involvement, and Grade C - MC >50% vertebral vertical height involvement (Figure 2).
36
The grading score.
All MRIs were evaluated twice by 2 physicians, and thrice by a third physician if inconsistencies existed after the second evaluation. The evaluating physicians were blinded as to the clinical status and discectomy level. Lumbar MRI evaluation included: Presence, type (MC-1/2/3), and level of MC, MC grade, DD (Pfirrmann classification, Grade I (no disc degeneration) - V (most severe degree of disc degeneration)), 51 and level of disc herniation (L1-S1).
Statistical Analyses
All statistical analysis was performed using IBM SPSS v27.0 (Armonk, New York). Continuous data from the groups were compared using one-way ANOVA and the unpaired student t-tests. Categorical data were analyzed using Fisher’s exact test. Reliability tests were evaluated by weighted Cohen’s kappa (values ≤0 as indicating no agreement and .01-.20 as none to slight, .21-.40 as fair, .41- .60 as moderate, .61-.80 as substantial, and .81-1.00 as almost perfect agreement). 52 The threshold for statistical significance was established at P ≤ .05.
Results
A total of 620 patients met the criteria for inclusion in the study. 46 Of these, 290 (47%) had MC and 330 (53%) did not have MC on the preoperative MRI scan. According to the study protocol, MC-1 (being the least common finding in regards to the 3 predefined groups) would be the limiting factor for the number of patients included in each cohort for this study. The final cohort included 71 pts with MC-1, 71 pts with MC-2, and 71 pts without MC, a total of 213 patients.
Comparison of the Three Groups
Preoperative Characteristics for All Groups.
Plus-minus values are means±standard deviation.
Years: yr; Number: no; Kilo: kg; meter: m.
aScores on the Oswestry Disability Index (ODI) range from 0 to 100, with higher scores indicating more severe disability.
bScores on the European Quality of Life–5 Dimensions (EQ-5D) range from 0 to 1, with higher scores indicating better quality of life.
cScores on the visual-analog scales (VASs) for back pain and leg pain range from 0 to 100, with higher scores indicating more severe pain.
Impact of the MC Grade
A scatter-plot showed substantial overlap in PRO-scores between patients with MC Grade B and Grade C. Thus, patients with MC were stratified into 2 groups – Grade A patients (non-severe MC grade) vs Grade B-C patients (severe MC grade). Patients with a severe MC grade had worse ODI scores, 52.49 vs 44.17 (P = .021), and worse EQ-5D scores of .26 vs .46 (P = .053). A similar but non-significant finding was observed for VAS-BP, 55.91 vs 44.85 (P = .102). A more severe MC grade was seen in patients with MC-1 (40, 57%) compared to MC-2 (30, 43%, P > .001) (Table 2).
Reliability of the Modic Change Grading Score
The results showed substantial reliability in the radiological MRI evaluation of the MC grade with an intra-observer Kappa of .68 (95%CI: .52-.76) and inter-observer Kappa of .61 (95%CI: .47-.69).
In patients with MC, severe disc degeneration in the lumbar spine (defined as Pfirrmann grade 5 at any lumbar level) was predominantly found in those with a severe MC Grade, 15 patients (83%) with a MC Grade ≥B vs 3 patients (17%) with a MC Grade A (P = .024).
Discussion
Low back pain is the most common cause of disability worldwide. 1 Millions of spine MRIs are performed annually in order to evaluate patients with back pain. 53 Typically, pathological findings, including those found on MRI, are described and graded according to structure, size, and extent, of involvement. 54 For example, DD grade on MRI is typically graded using the Pfirrmann classification, with grade V indicating maximum amount of DD, and is associated with the severity of DD.51,55
Preoperative Characteristics for MC Patients ± Severe MC Grade.
Plus-minus values are means ± standard deviation.
Years : yr; Number: no; Kilo: kg; meter: m.
aScores on the Oswestry Disability Index (ODI) range from 0 to 100, with higher scores indicating more severe disability.
bScores on the European Quality of Life–5 Dimensions (EQ-5D) range from 0 to 1, with higher scores indicating better quality of life.
cScores on the visual analog scales (VASs) for back pain and leg pain range from 0 to 100, with higher scores indicating more severe pain.
The extent/volumne of MC, have been associated with increased biomechanical strain in the area of involvement, degenerative sagittal/coronal plane changes (facet joint degeneration, disc protrusion, high-intensity zones, disc degeneration), and PROs.18,42,43,46 Weishaupt et al. 42 found that in normal discs (no/minimal DD) a provocative discography was in general not painful. However, in patients with MC (MC-1 or MC-2) and a vertical extent of >25% a positive provocative discography was present in all patients. Määttä et al.43,44,56 found that a vertical extent of the MC >25% was associated with MC-1, overall DD, a worse total disc displacement score, and an increased number of SN in the lumbar spine overall, and worse disability scores. Recently, a subgroup analysis of patients with MC and LBP treated with antibiotics, demonstrated that a larger intervertebral MC size on MRI STIR is associated with decreased disability after treatment. 45 These findings have established the foundation and need for implementing a useable MC grading score that can be applied when reviewing MRIs.
The current study found that it is possible to grade MC according to the vertical extent (Figure 2). The grading score is reproducible and reliable in a clinically relevant population. A severe MC grade was associated with worse disability scores, reduced health-related quality of life, and the presence of severe lumbar disc degeneration. Overall, the results from this study are consistent and in agreement with the results from previous studies.42,44,57
These findings may indicate a paradigm shift in terms of understanding spine phenotypes. Previous studies have primarily viewed MC type as the significant spine phenotype. Results from this study suggest that the MC grade is a better clinical predictor of disability and spine degeneration than the MC type.
As with any clinical study, limitations and strengths existed with our study. Limitations of this study include the lack of a follow-up MRI. Also, as the patient cohort consisted of patients scheduled for discectomy due to LDH, some patients might have very little or no back pain. A selection bias is possible since only patients with complete datasets available in DaneSpine were included in the study. The evaluation was performed by experienced clinical physicians with a research background, no musculoskeletal radiologists participated in the MRI evaluation.
The strengths of this study include a sizeable relevant study population of patients scheduled for discectomy due to LDH at the same institution. Relevant confounders such as smoking and BMI were comparable between the 2 groups. Only patients with L4-L5 or L5-S1 discectomy and MC at these levels were included, in order to avoid an interpatient comparison bias. All MRIs were reviewed twice by 2 experienced physicians adhering to a verified classification system and a predefined protocol on MC definition and description.
The results from this study indicate that the adherence to a strict definition and grading of MC on MRI can improve future studies making these less heterogeneous and the results comparable in systematic reviews. It is essential that we proceed with a valid, reproducible, and systematic classification and grading of MC. Overall, future studies on the etiology, natural course, and clinical association of MC in patients with LBP are needed.
Conclusion
This is the first study to investigate the clinical relevance and reliability of the MC grading score. The intra- and interobserver reliability for the MC grading score is substantial in a clinically relevant population. A severe MC grade is associated with both severe DD and MC-1. The MC grade is also statistically significantly associated with increased disability and reduced health-related quality of life. Results from this study suggest that MC grade is more clinically important than MC type. We advocate for future studies on MC to implement the MC grading score in order to advance the current knowledge in this field.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
