Machine Learning Algorithm to Predict Change in the Decision-Making for Thoracolumbar Fractures Without Neurological Deficit After MRI: A Multicenter Study

Abstract

Study Design

A multicenter study.

Objective

To develop a machine learning algorithm to predict when magnetic resonance imaging (MRI) may change the thoracolumbar AO Spine injury severity score (TLAOSIS) treatment recommendation for thoracolumbar fractures (TLFs) without neurological deficits.

Methods

Three trauma centers recruited 619 neurologically intact TLFs (AO Spine A-fractures) who underwent computed tomography (CT) and MRI. CT findings indicating posterior ligamentous complex (PLC) injury were defined as facet malalignment, horizontal laminar fracture, spinous process fracture, and interspinous widening ≥4 mm. A single positive CT finding indicated an M1 modifier. The primary outcome was any change in the TLAOSIS treatment recommendation among conservative (≤3), grey zone (4-5), and surgical (>5) groups after MRI. The derivation and validation sets utilized 80% and 20% of the samples, respectively. A classification and regression tree (CART) was developed using the M1 modifier, AO fracture subtype (A1-A4), and spine level. Model discrimination was quantified using the area under the receiver operating curve (AUC).

Results

MRI altered TLAOSIS recommendations in 82 (13.2%) cases. The CART used the M1 modifier, A subtype, and spine level (importance = 0.914, 0.055, and 0.031, respectively). The model achieved an AUC of 0.93, sensitivity of 87.5%, specificity of 96.3%, and mean accuracy of 92.9% (±12.0%) in cross-validation in predicting TLAOSIS recommendation change.

Conclusion

The CART model accurately predicted changes in the TLAOSIS recommendation after MRI. This algorithm provides cost-effective indications for MRI in neurologically intact AO A-type fractures, ensuring accurate PLC assessment while minimizing unnecessary imaging.

Keywords

posterior ligamentous complex thoracolumbar fractures decision-making magnetic resonance imaging machine learning classification regression tree thoracolumbar AO spine injury severity score

Introduction

The Thoracolumbar Injury Classification System (TLICS) and the new AO Spine classification both cite the integrity of the posterior ligamentous complex (PLC) as a major determinant of instability in thoracolumbar fractures (TLFs).^1,2 Nevertheless, the assessment of PLC integrity remains elusive.^3,4 Computed tomography (CT) is the most practical modality for TLF evaluation; however, it can miss up to 30% of PLC injuries.^5-9 Although MRI is considered the reference standard, its routine use is limited by its high cost, inaccessibility in numerous trauma centers, and potential treatment delay. Evidence-based indications for MRI that achieve the practicality and accuracy of PLC assessments are lacking.^3,5 This lack of clear guidance often leads to either overuse of MRI, incurring unnecessary costs and delays, or, conversely, missed clinically significant PLC injuries, potentially resulting in suboptimal patient outcomes.⁵

Ideally, MRI should be restricted to cases with an indeterminate PLC assessment on CT.⁴ The “Indeterminate PLC’’ category was allocated by TLICS and AO Spine classification (designated as M1 Modifier) based on personal uncertainty of PLC status; hence, it is a poor guide for MRI indications.^1,2,10,11 A CT criterion for PLC injury was recently validated based on the following: facet malalignment, horizontal laminar fracture, spinous process fracture, and interspinous widening ≥4 mm.^8,12,13 At least two positive CT findings or no positive findings could reliably confirm or rule out PLC injury, as shown by MRI.¹³ Conversely, a single positive finding was insufficiently accurate to confirm or exclude PLC injury and indicated an M1 modifier.¹³

A single-center study has shown that the M1 modifier, defined as a single CT finding, could predict most of the changes in classification or decision-making along with the AO fracture subtype and spinal regions.¹⁴ Aly et al proposed that the influence of MRI on TLFs’ management is better measured by the change in thoracolumbar AO Spine injury severity scores (TLAOSIS) recommendations than by PLC status.^4,15 Notably, changing the PLC injury status following MRI does not necessarily alter the treatment decision, as in the case of a highly comminuted burst fracture.^14,16 While promising, this single-center approach may limit generalizability, and a more robust, multicenter, data-driven model is warranted to enhance predictive accuracy and clinical applicability.

To overcome these challenges and provide more objective guidance for MRI utilization, innovative predictive models are needed.¹⁷ Classification and regression trees (CART) are machine learning algorithms that provide evidence-based diagnostic algorithms, particularly for binary outcomes.¹⁸ CART is a regression method that splits data into increasingly homogeneous subgroups at each step based on its importance for the target outcome.¹⁹ We propose to train CART using a multicenter dataset to predict when MRI would change TLAOSIS treatment recommendations in TLFs without neurological deficits.

Methods

Study Design and Setting

Three tertiary trauma hospitals participated in this ambispective multicenter study between January 2014 and April 2024; all hospitals performed MRI routinely for most TLFs, even without neurological deficits (King Saud Medical City Riyadh, Saudi Arabia [center 1]; Prince Mohammed Bin Abdulaziz Hospital, Riyadh , Saudi Arabia [center 2]; and University Hospital Puerta de Hierro Majadahonda, Madrid, Spain [center 3]). Each participating center obtained institutional review board permission, and informed consent was waived. The methods were based on the Standards for Reporting Diagnostic Accuracy.²⁰

Inclusion & Exclusion Criteria

All consecutive traumatic TLFs meeting the following inclusion criteria were identified through a search of the radiology information system: (1) AO Spine A-fracture based on CT (T1-L5); isolated spinous or transverse process fractures were excluded, as they are considered minor injuries²; (2) fracture with intact or indeterminate PLC, 0 or 1 CT findings, respectively, based on the proposed CT criteria for PLC injury; (3) CT and MRI performed within 10 days of injury, as MRI sensitivity for PLC injury may diminish over time²¹; (4) neurologically intact patients, because patients with neurological deficits are universally treated operatively, irrespective of the fracture classification; (5) the most severe level of multi-level fractures²; ages between 18 and 65 years.

The following fractures were excluded¹³: (1) osteoporotic fractures, which are associated with low-energy trauma, such as a fall from standing; (2) AO C-type injuries, since they are universally treated operatively; (3) fractures with ≥2 CT findings suggestive of definitive PLC injury;^5,13 (4) incomplete or low-quality images due to motion artifacts; (5) cervicothoracic junction fractures, as they typically require MRI; (6) pathological fractures due to tumors or infections; and (7) prior trauma or surgery to the thoracolumbar spine.

Data Collection

Demographic and injury characteristic data were collected from the patients’ records (Table 1).

Table 1.

Baseline Demographic and Injury Characteristics of the 619 Patients Included in the Study

Characteristic	Total N (%)
Total samples, n (%)	619 (100%)
Age (years, [median, IQR])	37 (26, 51)
Male gender	444 (71.4%)
Level of fracture
Low lumbar (L3-L5)	95 (15.3%)
Thoracic (T1-T9)	99 (16%)
Thoracolumbar (T10-L2)	425 (68.7%)
Mechanism of injury	614
Road traffic accident	346 (55.8%)
Fall from a height ≥3m	247 (40%)
Sucidal jump	6 (1%)
Others	20 (3.2%)
Number of fractures
Single-level	488 (78.9%)
Multi-level contiguous	85 (13.7%)
Multi-level non-contiguous	46 (7.4%)
AO spine classification on CT
A1	196 (31.7%)
M0	177 (28.7%)
M1	19 (3 %)
A2	51 (8.2%)
M0	48 (7.7%)
M1	3 (0.5%)
A3	194 (31.3%)
M0	159 (25.6%)
M1	35 (5.7%)
A4	178 (28.8%)
M0	139 (22.5%)
M1	39 (6.3%)
M1 modifier	96 (15.5%)
Polytrauma (yes)	237 (38.2%)
Timing of MRI
0-3 days	469 (75.7%)
3-10 days	150 (24.3%)
MRI strength
1.5 tesla	456 (76%)
3.0 tesla	163 (24%)
Treatment
Conservative	296 (47.8%)
Surgical	310 (50%)
Other^a	13 (2.2%)
TLAOSIS ≤3	371 (60%)
TLAOSIS 4-5	209 (33.7%)
TLAOSIS ≥6	39 (6.3%)
TLAOSIS reclassification after MRI	82 (13.2)

Abbreviations: IQR, interquartile range; ASIA, CT, computed tomography; TL AOSIS, Thoracolumbar AOSpine injury severity score. Note. Data are the number of patients, with the percentage in parentheses.

^aThis number includes patients who were transferred to other hospitals before definitive treatment.

Image Acquisition Protocol

MRIs was performed on 1.5-tesla (T) or 3-T scanners and included axial and sagittal T1-weighted images (T1-WI), T2-weighted images (T2-WI), and sagittal short tau inversion recovery (STIR) images.^22,23 CTs were performed according to a previously described protocol.¹³

Standardized Interpretation Protocol

Three reviewers from each center independently analyzed the images using a picture archiving and communication system (INFINITT PACS; INFINITT Healthcare Co. Ltd., Seoul, Korea). All nine evaluators (five neurosurgeons, one orthopedic surgeon, and three fellowship-trained radiologists) had more than 5 years of experience analyzing TLF images. Reviewers were denied access to clinical data and any additional images.²⁴ Anonymized CT and MRI images were shown in two separate sessions, four weeks apart, to reduce the recollection bias.²⁴ All reviewers received consensus pre-training based on a standardized image interpretation protocol.²²

Assessment of AO Spine Classification and TLAOSIS Using CT and MRI

We classified all fractures in CT into A1-A4 according to the AOSpine Classification.² The fractures were classified as M1 or M0 based on the presence of a single vs no findings of the following: interspinous widening ≥4 mm^25,26 spinous process fracture, horizontal laminar fracture,²⁷ and facet malalignment (Figure 1).²⁵

Figure 1.

CT findings for detecting thoracolumbar posterior ligamentous complex injury.

We classified fractures on MRI as B-injuries if they were associated with a black stripe discontinuity due to supraspinous ligament (SSL), ligamentum flavum (LF), or facet capsule (FC) disruption. A-injuries were classified as having high-signal intensity on T2-WI due to interspinous ligament (ISL) edema or facet joint effusion, or no high signal intensity (no M1 modifier was assigned after MRI, Figure 2).²⁸

Figure 2.

MRI signs of thoracolumbar posterior ligamentous complex injury.

Outcome Measures and Candidate Predictor Selection

CT and MR images were used to calculate the TLAOSIS score (Table 2) and determine treatment recommendations: TLAOSIS ≤3 (conservative treatment), 4-5 (grey zone), and >5 (surgical treatment).¹⁵ Changes in management of TLAOSIS treatment recommendations in any direction after evaluation of MRI, compared to CT only, were recorded.¹⁵

Table 2.

The Thoracolumbar AOSpine Injury Score (TL AOSIS)¹⁶

Fracture pattern		Points
Type A-compression fractures
A0	Minimal injuries such as transverse process fractures	0
A1	Wedge compression	1
A2	Pincer compression injury	2
A3	Incomplete burst fracture: fracture that only involves a single endplate	3
A4	Complete burst fracture: fracture that involves both endplates	5
Type B-Tension band injuries
B1	Osseous disruption of the tension band	5
B2	Posterior tension band injury including ligamentous injury	6
B3	Anterior tension band injury	7
Type C
C	Translation/rotation injuries	8
Neurological status
N0	Neurologically intact patient	0
N1	Resolved transient neurological symptoms	1
N2	Persistent radicular symptom	2
N3	Incomplete spinal cord or cauda equina injury	4
N4	Complete spinal cord injury	4
NX	Neurologic exam is unobtainable	3
Case specific modifiers
M1	Fractures in which the status of the posterior ligamentous complex is unclear	1
M2	Patient-specific morbidities affect the treatment algorithm such as ankylosing spondylitis, polytrauma, etc	0
Treatment recommendation	Conservative treatment	<4
	Conservative or surgical treatment	4,5
	Surgical treatment	>5

We prespecified three candidate predictors for model development based on their documented influence on MRI indications in the previous literature.¹⁴ These include the M1 modifier as defined by a single CT finding (binary),^14,15 the level of the fracture (low lumbar [L3-L5],^25,29,30 thoracic [T1-T9],^31–33 or thoracolumbar junction [T10-L2]), and the AO Spine A-injury subtype (A1-A4).^14,19

Sample Size Calculation

We calculated the total sample size based on the approach by Riley et al, as it ensures more robust statistical power and precision compared to the rule of thumb (1 variable per 10 events).^33,34 The optimal sample size was 488 based on the following assumptions: (1) a margin of error <0.05; (2) a mean absolute prediction error is assumed to be 0.07; (3) a shrinkage factor <10% to avoid model overfitting; (4) the difference between the R² and the adjusted unbiased “R² Nagelkerke” ≤80% in the target population; and (5) the management change proportion was 22% based on a previous meta-analysis and three candidate predictors.^14,33

Statistical Analysis

We used Python version 3.11 and scikit-learn 1.5 for analysis and considered P-values ≤.05 as statistically significant. The dataset was randomly partitioned and stratified by event status into a derivation set (495 patients, 80%) and a validation set (124 patients, 20%). A Gini-impurity classification tree was fitted with the following hyperparameters: maximum depth of 4, class-balanced weighting to counteract outcome imbalance (13.2% management change rate), minimum samples per leaf of 5, minimum samples for split of 5, and a random splitter with square root feature selection.¹⁸

Model discrimination was summarized using the area under the receiver operating characteristic curve (AUC-ROC), sensitivity, specificity, positive and negative predictive values (PPV and NPV, respectively), and overall accuracy. Five-fold cross-validation was performed to assess the stability of the model.¹⁸ We utilized Fleiss' kappa to determine the inter-reader reliability among the three reviewers from each center regarding the PLC status on MRI and CT.^13,35

Results

Baseline Patient Characteristics

Among the 619 patients included, 280 were from Center 1, 234 from Center 2, and 105 from Center 3. The median age was 37 years (IQR 26-51), and 444 (71.4%) were male. Thoracolumbar junction fractures were the most common (425, 68.7%), followed by thoracic (99, 16%), and low lumbar fractures (95, 15.3%). The most frequent AO Spine fracture subtypes were A1 (31.7%) and A3 (31.3%), followed by A4 (28.8%) and A2 (8.2%). The M1 modifier was present in 96 patients (15.5%) (Table 1).

Change in the Thoracolumbar AOSpine Injury Score after MRI

The TLAOSIS score on CT was ≤3 in 406 patients (65.6%), 4-5 in 174 patients (28.1%), and >5 in 39 patients (6.3%; Table 3). The addition of MRI changed the three treatment recommendation groups in 82 cases (13.2%). The addition of MRI changed TLAOSIS from ≤3 to >5 in 13/406 patients (3.2%), from >5 to 4-5 in 25/39 patients (64%), from 4-5 to >5 in 13/174 patients (7.4%), and from 4-5 to ≤3 in 31/174 patients (17.8%).

Table 3.

Change in TLAOSIS Scores After MRI Compared to CT for 619 Cases Included

	CT classification			MRI classification
	AOSpine	TLAOSIS	Change in AO classification, N (%)	Reason of change
TL AOSIS≤ 3 (non-conservative treatment)	A1/M0 (n = 177)	≤3	4 (2.2%)	Switch to TLAOSIS> 5 (surgical) due to dtetction of PLC injury by MRI.
	A1/M1 (n = 19)	≤3	7 (36.8%)
	A2/M0 (n = 48)	≤3	0 (0%)
	A2/M1 (n = 3)	≤3	2 (66%)
	A3/M0 (n = 159)	≤3	0 (0%)
	Total (n = 406)		13 (3.2 %)
TL AOSIS 4-5 (grey zone	A4/M0 (n = 139)	4-5	9 (6.4%)	Switch to TLAOSIS> 5 (surgical)
	A3/M1 (n = 35)	4-5	35 (100%)	Switch to TLAOSIS >5 (n = 4,11.4%),Switch to <3 (n = 31, 88.6%)
	Total (n = 174)		44 (25.2%)
TL AOSIS >5 (operative treatment)	A4/M1 (n = 39)	>5	25 (64%)	Switch to “grey zone'' due to rulling out PLC injury (n = 25, 64%)
	Total		82 (13.2%)

TLAOSIS, Thoracolumbar AO Spine injury severity score; MRI, magnetic resonance imaging, CT, computed tomography; M1, modifier 1 for indeterminate posterior ligamentous complex (PLC) status on CT, M0, no M1 modifier.

Model Description

The algorithm encompasses nine terminal nodes. It begins by evaluating the M1 modifier, which discriminates between a 71% vs 3.2% chance of ’management change.’ The management change probability for M1 A-fractures was further stratified by AO fracture type: A3 (100%), A4 (62%), and A1/A2 (44%). When M1 is absent (M0), the algorithm further stratifies the probability of management change by spine level and AO fracture type (Figures 3 and 4). Management change was negligible, except for thoracic A4 (21%) and thoracic A1 and thoracolumbar junction A4 (4% each).

Figure 3.

Final decision tree algorithm for predicting changes in TLOASIS recommendations after MRI.

Figure 4.

Simplified illustration showing the summarization of the decision tree algorithm for predicting changes in TLOASIS recommendations after MRI

Model Performance

CART achieved excellent discriminative performance, with an AUC of 0.931 and an area under the precision-recall curve of 0.864 (Figure 5A). At the optimal threshold, the model demonstrated 87.5% sensitivity, 96.3% specificity, 77.8% positive predictive value (PPV), 98.1% negative predictive value (NPV, and 95.2% overall accuracy. Cross-validation confirmed robust accuracy [mean 92.9% ± 12.0%, Figure 5B]. The M1 modifier was the dominant predictor (importance = 0.914), followed by the AO fracture subtype (0.055) and spine level (0.031) (Figure 5C).

Figure 5.

Figure 1. Model Performance Evaluation of Decision Tree Classification.

Inter-Reader Agreement on PLC Status in CT and MRI

Inter-observer agreement M1 vs M0 on CT was substantial to moderate among centers (κ = 0.418-0.682) and excellent for MRI (κ, range 95.6%-97.2%).

Discussion

Key Findings and Interpretations

To our knowledge, this multicenter study is the first to develop and validate a machine-learning algorithm for predicting the need for MRI in neurologically intact patients with TLFs.⁵ MRI altered the TLASOSIS recommendation in 13.2% of patients compared to CT alone. A model that includes the M1 modifier, A-fracture subtypes, and spinal level could accurately predict changes in management after MRI. The algorithm provides cost-effective indications for MRI in neurologically intact TLFs, ensuring accurate PLC assessment while minimizing the burden of MRI. The merits of this study lie in its large sample size, robust methodology, and multicenter design, all of which contributed to its high precision.⁴

Methodological Challenges in Evaluating the Impact of MRI on TLFs Management

This study addressed most of the potential biases encountered while assessing the influence of MRI on TLFs’ management.⁴ We excluded patients with neurological deficits and type C fractures, as they are typically treated surgically, irrespective of MRI results.⁴ Additionally, osteoporotic and low-energy fractures were excluded because of their low probability of associated PLC injuries.⁴ We followed a standardized protocol for MRI acquisition, including sagittal STIR and axial T2-WI, and image interpretation, which explains the high inter-observer reliability.²²

Traditionally, the impact of MRI on TLFs management has been assessed by detecting PLC disruption and upgrading type A-type to B-type injuries.³⁶ However, this approach overlooks vertebral fracture severity, which plays a key role in treatment decisions.¹⁵ For example, identifying a PLC injury in a severely comminuted A4 fracture may not alter management, as such cases already warrant surgery. In contrast, the TLAOSIS framework integrates both PLC integrity and fracture subtype (A1–A4), allowing for a more nuanced assessment.

Furthermore, measuring MRI impacts by solely relying on upgrades from A to B-injuries disregards the implications of ruling out PLC injury on decision-making.⁴ Ruling out PLC injury shifted the decision from surgery to the gray zone in 64% of A4M1 and from the gray zone to conservative in 17.8% of A3M1. Conversely, the TLAOSIS is a three-tier system that considers the switch from and to the “grey zone'', thus reducing treatment bias.¹⁵

While TLAOSIS provides a structured framework, its clinical application—particularly for A3/A4 fractures—varies due to the lack of high-quality evidence favoring surgical or non-surgical treatment.³⁷The higher weighting of A4 than A3 suggests a surgical approach, yet practice patterns remain inconsistent across regions.³⁸ Our study predicts changes relative to an ideal TLAOSIS-guided algorithm, identifying when MRI offers critical, decision-altering information regardless of individual variability.⁴

Potential Variables Included in the Algorithm

The M1 modifier was critical in distinguishing between a high and low likelihood of management change. The risk of management change for M1 was further stratified based on AO fracture types A3 (100%), A4 (75%), and A1/A2 (45%). A3/A4 fractures have a higher chance of management change than A1/A2 fractures, which aligns with the hierarchical severity and risk of PLC injury.² The chance of management change for AM0 fractures was negligible for low lumbar fractures, A2, and A3 fractures; the highest was for thoracic A4 (21%), followed by thoracic A1 and thoracolumbar junction A4 (4% each).

The impact of the spine region on the decision-making of thoracolumbar fractures may be attributed to their variable anatomical and biomechanical characteristics.^30,31 Because of lordosis, PLC injury is uncommon in the low lumbar spine. However, when it does occur, it usually leads to a severe type B or C pattern.^29,30 The kyphotic curvature of the thoracic spine results in flexion forces leading to proportional distraction of the SSL/ISL complex (the inflection point).^31,32 This may lead to localized disruption of the SSL, causing isolated ISW, which can be readily missed on a supine CT. This elucidates why management alterations following MRI are more prevalent for thoracic A4M0 and A1M0 fractures than for other fractures.

Study Implications

Though MRI proved to significantly influence thoracolumbar fracture management, there is a need for objective MRI indications that reduce its burden while ensuring accurate PLC assessment.^4,5 The proposed algorithm offers utility in both resource-limited and resource-rich settings. In resource-limited areas where MRI is unavailable, the algorithm can help identify cases that are manageable with CT alone vs those needing referral, minimizing treatment delays caused by transfers. Additionally, the model could predict when a standing X-ray might serve as a feasible alternative to detect occult PLC injury.³⁷

In settings with MRI availability, the algorithm supports selective MRI use, improving cost-effectiveness, time, and resource allocation. Notably, the M1 modifier predicted 71% of management changes while applying to only 15.5% of cases. While MRI remains the gold standard for PLC evaluation, its moderate inter-observer reliability highlights the value of using predictive tools to identify high-probability cases, prompting detailed assessment or expert radiological consultation.³⁹

Uncertainty about PLC status on CT places most of burst fractures in the grey zone, contributing to the inconsistent treatment.¹¹ To address this clinical dilemma, our approach provides two complementary solutions: first, the use of validated CT-based criteria to assess PLC integrity when findings are definitive and accurate; and second, the selective use of MRI in cases where PLC status remains uncertain. “It may reduce the number of cases ultimately classified in the gray zone, thereby minimizing treatment variability.

This algorithm can be integrated into clinical practice through a simple platform that calculates the probability of TLAOSIS classification change, offering real-time support for MRI decisions. It could also be embedded in CT systems to flag cases with high likelihood of PLC injury.⁴⁰ When combined with an image-recognition tool that detects fracture morphology and assesses PLC integrity, this AI-driven model could form a comprehensive decision-support system—enhancing diagnostic accuracy, reducing variability, and guiding selective MRI use.¹⁷

Strengths and Limitations

Retrospective data collection may introduce recall bias; however, imaging data are less susceptible to such biases.¹³ More severe fractures and those with neurological deficits are more likely to be verified by MRI (verification bias) or recruited from level 1 trauma centers (spectrum bias). However, focusing on neurologically intact patients with A fractures and including 50% treated non-surgically minimizes these biases.^13,24

The retrospective design precluded the assessment of the impact of changing PLC status on clinical outcomes. However, the black stripe discontinuity is regarded as a surrogate marker of instability based on the findings that the LF and SSL are the key biomechanical contributors to PLC stability.^41,42 Black stripe discontinuity showed high specificity in identifying PLC disruption during surgery and can be applied to both surgical and non-surgical cases, contrary to intraoperative findings.²⁸

Management change was defined as any shift among the three TLAOSIS categories. While a switch from conservative treatment to surgery is likely more clinically impactful than transitions involving the gray zone, categorizing multiple outcome groups would have reduced statistical power and model precision. However, future prospective studies are required to independently validate the proposed algorithm in independent populations by independent reviewers.

The moderate inter-observer reliability reported should be interpreted in light of the fact that we excluded the most obvious PLC injuries and focused on those with intact or undetermined PLC injuries. However, the proposed standardized imaging interpretation protocol could improve the reliability and increase generalizability in different settings.²² This secondary analysis used multicenter data focused on PLC injury assessment. Inter-observer reliability was evaluated for major types (A vs B) and M1, while a single reviewer assessed A-fracture subtyping.¹³

We excluded B injuries with ≥2 CT findings due to their low likelihood of downgrading to type A after MRI.¹³ Predictors of downgrade differ from those of upgrade, and including such cases would confound the analysis.

Conclusion

The developed machine learning algorithm, based on the M1 modifier, spine level, and AO fracture subtype, demonstrated high accuracy in predicting changes in the TLAOSIS treatment recommendations after MRI in neurologically intact patients with A-type thoracolumbar fractures. By accurately identifying cases where MRI would alter clinical decision-making, this model offers a cost-effective tool to guide MRI utilization, ensuring precise assessment of posterior ligamentous complex integrity while minimizing unnecessary imaging and associated burdens. The multicenter design, large sample size, and robust methodology underpin the model’s generalizability and clinical applicability. Future prospective validation studies are warranted to confirm these findings and further refine MRI indication criteria in this patient population.

Footnotes

Acknowledgments

The authors wish to thank Dr. Ahmad Mashaly for the illustrations included in this article. We also acknowledge the research department and grant management department in King Fahd Medical City, for their help.

ORCID iDs

Mohamed M. Aly

Xavier A. Santander

Raquel Gutiérrez González

Ethical Considerations

Institutional board review obtained from King Fahd Medical IRB approval number KACST, KSA, H-01-R-012.

Author Contributions

A.M.M. helped with design and methodology. A.M.M., M.A., F.A.A., and X.A.S. helped with data extraction. A.M. M., A.A.M., WA, M.A., F.A. A., R.A.R., X.A.S., G.T.K., and G.R.G. helped with image interpretation. A. F. and A. M. M. did the statistical analysis. All authors helped compose the manuscript and approved the final version.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: the authors declare that this research was supported by a research grant from the research center, King Fahd Medical City, Riyadh, second cluster (#IRF 024-016).

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

AI Disclosure Statement

Figures and visual summaries (eg, infographics) in this manuscript were created with the assistance of generative AI platforms (eg, Notebook LM). These tools were used to support the visual design and layout only. All scientific content was provided, reviewed, and verified by the authors, who took full responsibility for the accuracy and integrity of the final submission. Additionally, AI tools like ChatGPT and Paperpall were used for drafting the manuscript.

Appendix

References

Vaccaro

Lehman

Hurlbert

, et al. A new classification of thoracolumbar injuries: the importance of injury morphology, the integrity of the posterior ligamentous complex, and neurologic status. Spine. 2005;30(20):2325-2333. doi:10.1097/01.brs.0000182986.43345.cb

Vaccaro

Oner

Kepler

, et al. AOSpine thoracolumbar spine injury classification system: fracture description, neurological status, and key modifiers. Spine. 2013;38(23):2028-2037. doi:10.1097/BRS.0b013e3182a8a381

Qureshi

Dhall

Anderson

, et al. Congress of neurological surgeons systematic review and evidence-based guidelines on the evaluation and treatment of patients with thoracolumbar spine trauma: radiological evaluation. Clin Neurosurg. 2019;84(1):E28-E31. doi:10.1093/neuros/nyy373

Aly

Bigdon

Speigl

UJA

Camino-Willhuber

Baeesa

Schnake

. Towards a standardized reporting of the impact of magnetic resonance imaging on the decision-making of thoracolumbar fractures without neurological deficit: conceptual framework and proposed methodology. Brain Spine. 2024;4:102787. doi:10.1016/J.BAS.2024.102787

Aly

Soliman

Elemam

Pizones

Alzahrani

Elwatidy

. How frequently MRI modifies thoracolumbar fractures’ classification or decision-making? A systematic review and meta-analysis. Eur Spine J. 2024;33(4):1540-1549. doi:10.1007/s00586-023-08087-4

Durmaz

İlter

Tuzlali

. Is magnetic resonance imaging needed for decision making diagnosis and treatment of thoracic and lumbar vertebral fractures? Eur J Orthop Surg Traumatol Orthop Traumatol. 2021;33:107-115. doi:10.1007/S00590-021-03165-Z. Published online 24 November.

Leferink

Veldhuis

Zimmerman

Ten Vergert

Ten Duis

. Classificational problems in ligamentary distraction type vertebral fractures: 30% of all B-type fractures are initially unrecognised. Eur Spine J. 2002;11(3):246-250. doi:10.1007/s00586-001-0366-6

Khurana

Prevedello

Bono

, et al.

CT for thoracic and lumbar spine fractures: can CT findings accurately predict posterior ligament complex injury?

Eur Spine J. 2018;27(12):3007-3015. doi:10.1007/s00586-018-5712-z

Pizones

Izquierdo

Álvarez

, et al. Impact of magnetic resonance imaging on decision making for thoracolumbar traumatic fracture diagnosis and treatment. Eur Spine J Off Publ Eur Spine Soc Eur Spinal Deform Soc Eur Sect Cerv Spine Res Soc. 2011;20(Suppl 3):390-396. doi:10.1007/s00586-011-1913-4

10.

Schroeder

Kepler

Koerner

, et al. A worldwide analysis of the reliability and perceived importance of an injury to the posterior ligamentous complex in AO type A fractures. Glob Spine J. 2015;5(5):378-382. doi:10.1055/s-0035-1549034

11.

Canseco

Paziuk

Schroeder

, et al. Interobserver reliability in the classification of thoracolumbar fractures using the AO spine TL injury classification System among 22 clinical experts in spine trauma care. Glob Spine J. 2024;14(1_suppl):17S-24S. doi:10.1177/21925682231202371

12.

Azevedo

Martins

Oliveira

Varanda

Direito-Santos

. Inter-observational analysis of computed tomography parameters to predict nonobvious posterior ligament complex injury in neurologically intact patients with thoracolumbar trauma. Brain Spine. 2024;4:102855. doi:10.1016/J.BAS.2024.102855

13.

Aly

Abdelaziz

Alfaisal

, et al. Multicenter external validation of the accuracy of computed tomography criteria for detecting thoracolumbar posterior ligamentous complex injury. Neurosurgery. 2024;96:1236-1248. doi:10.1227/neu.0000000000003263. Published online 5 December.

14.

Aly

Al-Shoaibi

Abduraba Ali

Al Fattani

Eldawoody

. How often would MRI change the thoracolumbar fracture classification or decision-making compared to CT alone? Glob Spine J. 2024;14(1):11-24. doi:10.1177/21925682221089579

15.

Vaccaro

Schroeder

Kepler

, et al. The surgical algorithm for the AOSpine thoracolumbar spine injury classification system. Eur Spine J. 2016;25(4):1087-1094. doi:10.1007/s00586-015-3982-2

16.

Aly

Dandurand

Dvorak

, et al.

The influence of comminution and posterior ligamentous complex integrity on treatment decision making in thoracolumbar burst fractures without neurologic deficit?

Glob Spine J. 2024;14(1_suppl):41S-48S. doi:10.1177/21925682231196452

17.

Maki

Furuya

Inoue

, et al. Machine learning and deep learning in spinal injury: a narrative review of algorithms in diagnosis and prognosis. J Clin Med. 2024;13(3):705. doi:10.3390/jcm13030705

18.

Hastie

Tibshirani

Friedman

. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer; 2009.

19.

Dandurand

Fallah

Öner

, et al. Predictive algorithm for surgery recommendation in thoracolumbar burst fractures without neurological deficits. Glob Spine J. 2024;14(1_suppl):56S-61S. doi:10.1177/21925682231203491

20.

Bossuyt

Reitsma

Bruns

, et al. Stard 2015: an updated list of essential items for reporting diagnostic accuracy studies. The BMJ. 2015;351(October):1-9. doi:10.1136/bmj.h5527

21.

Benedetti

Fahr

Kuhns

Hayman

. MR imaging findings in spinal ligamentous injury. Am J Roentgenol. 2000;175(3):661-665. doi:10.2214/ajr.175.3.1750661

22.

Aly

Al-Shoaibi

Aljuzair

Issa

Vaccaro

. A proposal for a standardized imaging algorithm to improve the accuracy and reliability for the diagnosis of thoracolumbar posterior ligamentous complex injury in computed tomography and magnetic resonance imaging. Glob Spine J. 2023;13(3):873-896. doi:10.1177/21925682221129220

23.

Lee

Vaccaro

Schweitzer

, et al. Assessment of injury to the thoracolumbar posterior ligamentous complex in the setting of normal-appearing plain radiography. Spine J. 2007;7(4):422-427. doi:10.1016/j.spinee.2006.07.014

24.

Pavlou

Kurtz

Song

. Diagnostic accuracy studies in radiology: how to recognize and address potential sources of bias. Radiol Res Pract. 2021;2021:5801662. doi:10.1155/2021/5801662. Published online.

25.

Aly

Al-Shoaibi

Abduraba

, et al.

Which morphological features of facet diastasis predict thoracolumbar posterior ligamentous complex injury as defined by magnetic resonance imaging?

World Neurosurg. 2023;171:e276-e285. doi:10.1016/j.wneu.2022.12.010

26.

Rajasekaran

Maheswaran

Aiyer

Kanna

Dumpa

Shetty

. Prediction of posterior ligamentous complex injury in thoracolumbar fractures using non-MRI imaging techniques. Int Orthop. 2016;40(6):1075-1081. doi:10.1007/s00264-016-3151-1

27.

Aly

Al-Shoaibi

Al Fattani

AlJuzair

. Diagnostic value of various morphological features of horizontal and vertical laminar fractures for posterior ligamentous complex injury of the thoracolumbar spine as defined by magnetic resonance imaging. World Neurosurg. 2021;153:e290-e299. doi:10.1016/j.wneu.2021.06.109

28.

Pizones

Izquierdo

Sánchez-Mariscal

Zúñiga

Álvarez

Gómez-Rice

. Sequential damage assessment of the different components of the posterior ligamentous complex after magnetic resonance imaging interpretation: prospective study 74 traumatic fractures. Spine. 2012;37(11):E662-E667. doi:10.1097/BRS.0b013e3182422b2b

29.

Aly

Al-Shoaibi

Abduraba

Alzahrani

Eldawoody

. Traumatic low lumbar fractures: how often MRI changes the fracture classification or clinical decision-making compared to CT alone? Eur Spine J. 2022;31(1):37-45. doi:10.1007/s00586-021-06987-x

30.

Moore

Bransford

France

, et al.

Low lumbar fractures: does thoracolumbar injury classification and severity score work?

Spine. 2014;39(17):1021-1025. doi:10.1097/BRS.0000000000000415

31.

AlRaddadi

Al-Shoaibi

Alnaqeep

, et al.

Traumatic thoracic spine fracture: can we predict when MRI would modify the fracture classification or decision-making compared to CT alone?

Eur Spine J Off Publ Eur Spine Soc Eur Spinal Deform Soc Eur Sect Cerv Spine Res Soc. 2024;33:3685-3694. doi:10.1007/s00586-024-08196-8. Published online April 3.

32.

Johnson

Zhang

. Regional differences within the human supraspinous and interspinous ligaments: a sheet plastination study. Eur Spine J. 2002;11(4):382-388. doi:10.1007/s00586-001-0378-2

33.

Riley

Ensor

Snell

KIE

, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020;368:m441, Published online 2020. doi:10.1136/bmj.m441

34.

Peduzzi

Concato

Feinstein

Holford

. Importance of events per independent variable in proportional hazards regression analysis II. J Clin Epidemiol. 1995;48(12):1503-1510. doi:10.1016/0895-4356(95)00048-8

35.

Landis

Koch

. Landis amd Koch1977_agreement of categorical data. Biometrics. 1977;33(1):159-174. doi:10.2307/2529310

36.

Winklhofer

Thekkumthala-Sommer

Schmidt

, et al. Magnetic resonance imaging frequently changes classification of acute traumatic thoracolumbar spine injuries. Skelet Radiol. 2013;42(6):779-786. doi:10.1007/s00256-012-1551-x

37.

Aly

El-Sharkawi

Joaquim

, et al. Toward identifying and resolving the challenges to the prognostic validation of the classifications for thoracolumbar burst fractures: a narrative review. Clin Spine Surg. 2025;38(6):266-277. doi:10.1097/BSD.0000000000001764

38.

Camino-Willhuber

Bigdon

Dandurand

, et al.

Expert opinion, real-world classification, and decision-making in thoracolumbar burst fractures without neurologic deficits?

Glob Spine J. 2024;14(1_suppl):49S-55S. doi:10.1177/21925682231194456

39.

Crosby

Even

Song

Block

Devin

. Diagnostic abilities of magnetic resonance imaging in traumatic injury to the posterior ligamentous complex: the effect of years in training. Spine J. 2011;11(8):747-753. doi:10.1016/j.spinee.2011.07.005

40.

Toh

Berg

Han

QYC

, et al. Clinical decision support system used in spinal disorders: scoping review. J Med Internet Res. 2024;26:e53951. doi:10.2196/53951

41.

Jin

Yan

, et al. Biomechanical role of the thoracolumbar ligaments of the posterior ligamentous complex: a finite element study. World Neurosurg. 2018;112(2018):e125-e133. doi:10.1016/j.wneu.2017.12.171

42.

Merter

Karaca

Yazar

. Biomechanical effects of sequential resection of the posterior ligamentous complex on intradiscal pressure and resistance to compression forces. Acta Orthop Traumatol Turcica. 2019;53(6):502-506. doi:10.1016/j.aott.2019.08.016