Abstract
Background:
Magnetic resonance enterography (MRE) is a noninvasive and radiation-free option for managing Crohn’s disease (CD), for which various predictors have been reported to show significant prognostic value.
Objectives:
The aim of this review was to summarize MRE predictors that indicated poor prognosis, including surgeries, complications, relapses, therapeutic escalations, and persistent pathological changes in patients with CD.
Design:
Systematic review with meta-analysis.
Data sources and methods:
Web of Science and PubMed were systematically searched for articles exploring the predictive performance of MRE in patients with luminal CD, with the latest search performed on July 21, 2025. Pooled analyses of odds ratios (OR) and hazard ratios for predicting poor prognosis were conducted using a random-effects model when available.
Results:
Thirty articles were included in this study. Meta-analysis identified five significant MRE predictors for predicting CD poor prognosis over 12 months, including inflammation (OR 3.40, 95% confidence interval (CI 1.96–5.89)), bowel wall thickening (OR 2.88, 95% CI (1.95–4.25)), strictures (OR 4.95, 95% CI (1.99–12.34)), edema (OR 2.67, 95% CI (1.25–5.69)), and fistula (OR 5.15, 95% CI (2.72–9.76)). Furthermore, scoring systems, such as the Magnetic Resonance Index of Activity (MaRIA) and Clermont scores, showed significant predictive values.
Conclusion:
MRE is effective in predicting the prognosis of patients with CD, with significant predictors including inflammation, bowel wall thickening, strictures, edema, fistula, and MaRIA score.
Trial registration:
The study protocol was registered in the International Prospective Register of Systematic Reviews (registration number: CRD42022365965).
Plain language summary
This systematic review demonstrated that MRE is effective in predicting the prognosis of CD, with significant predictors including inflammation, bowel wall thickening, strictures, edema, fistula, MaRIA, and Clermont scores.
Introduction
Crohn’s disease (CD) is a subtype of chronic inflammatory bowel disease thought to arise from a complex interaction among genetic susceptibility, environmental factors, gut microbiota, and immune responses. 1 CD is characterized by abdominal pain, diarrhea, and weight loss, with skipping lesions under endoscopy.1,2 CD usually follows a prolonged course, requiring lifelong management, which significantly influences patients’ quality of life and imposes a heavy burden on families, society, and the healthcare system. 3 Notably, significant heterogeneity was observed in the prognosis of patients with CD, with over 30% developing stricturing or penetrating complications and 28% requiring surgery within 5 years.4–6 With the advancement of personalized treatment and precision medicine, identifying predictors for disease prognosis has become increasingly critical for both doctors and patients. 7
Predicting prognosis plays a crucial role in CD management, particularly in risk stratification, therapy selection, and therapeutic efficacy monitoring. Currently, various methods are available for predicting disease prognosis, including clinical symptoms, biomarkers, endoscopic performances, radiological examinations, and so on. 1 Among them, magnetic resonance enterography (MRE) is a radiation-free and noninvasive method that is gaining popularity, especially for those with small intestine involvement or with intestinal strictures where colonoscopies cannot proceed. 8 In this way, MRE is well received by patients, with enhanced compliance and tolerance.9,10
Recent studies have identified various MRE predictors of CD prognosis. Jauregui-Amezaga et al. 11 evaluated the prognostic value of MRE in predicting intestinal surgery, in which strictures and fistulas were observed as predictors of surgery within 2 years. Additionally, a retrospective cohort of 136 patients showed that the number and length of strictures were predictive of abdominal surgeries. 12 Furthermore, MRE-based scoring systems, such as the Magnetic Resonance Index of Activity (MaRIA) score, are associated with a decreased risk of surgery. 13 These MRE findings and scoring systems have been utilized in CD management in recent years; however, a comprehensive review summarizing these predictors is still needed.
Therefore, the aim of this review was to summarize MRE predictors associated with poor prognosis in patients with CD.
Materials and methods
Search strategy and registration
This review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses statement. 14 A comprehensive literature search was performed using Web of Science and PubMed databases to identify all studies published before July 21, 2025. Keywords focusing on “Crohn’s disease,” “magnetic resonance,” and their synonyms were used to formulate search queries in both databases. Studies categorized as meta-analyses, reviews, or case reports were excluded. The detailed search strategies are shown in Table S1.
The study protocol was registered in the International Prospective Register of Systematic Reviews (registration number: CRD42022365965).
Selection criteria
Studies meeting the following inclusion criteria were included: (1) patients were diagnosed with luminal CD; (2) MRE prognostic factors were used as predictors; (3) outcomes were related to any poor prognosis in CD, including surgical or endoscopic interventions, complications, hospitalizations, relapses, therapeutic escalations, and persistent pathological changes; (4) the follow-up time reached at least 12 months; (5) the studies were original articles, such as cohort or case-control studies.
The exclusion criteria were as follows: (1) studies that focused on pediatric CD; (2) studies that focused on perianal CD; (3) editorials, letters, case reports, and review articles; and (4) studies in languages other than English or without sufficient data to calculate.
Two reviewers (D.W. and R.C.) independently screened the titles and abstracts of all the articles. Full-text screening was then conducted to determine which studies would be included in the final analysis. Discrepancies were resolved through discussion, and if necessary, a third reviewer (Y.T.) was consulted.
Data extraction and quality assessment
The following data were collected from the eligible studies: authors, country, publication year, study design, sample size, patient age, follow-up time, outcomes, MRE predictors (e.g., MRE findings or scoring systems), and magnetic field strength. Odds ratios (ORs) and hazard ratios (HRs) for each prognostic factor were also collected for pooling analysis.
Two reviewers (D.W. and Z.F.) independently assessed the quality of the included studies using the Newcastle-Ottawa Scale. Any discrepancies were discussed to reach a consensus. The Newcastle-Ottawa Scale score ranged from 0 to 9, with scores of 7–9 and 4–6 indicating high and moderate quality, respectively. 15
Prognostic factors and disease outcomes
All available MRE prognostic predictors for poor prognosis were included and divided into MRE findings and MRE scoring systems. Prognostic factors reported more than once were recorded for pooled analysis. Predictors of surgery were discussed separately. Other outcomes of interest included complications, hospitalizations, endoscopic interventions, relapses, therapy escalations, and persistent pathological changes. Prognostic factors reported only once are summarized in Table S4.
Disease outcomes included surgeries, complications, relapses, therapeutic escalations, and persistent pathological changes. Surgery was defined as an abdominal resection related to CD. Complications were defined as newly developed fistulas, pseudotumors, strictures, or abscesses. According to the definitions in eligible articles, relapse was defined as the need for therapy escalation, hospitalization, or surgery due to abdominal symptoms worsening (CD activity index >150 or 17516,17 or Harvey–Bradshaw index >4 with an increase of at least 3 points compared to baseline 18 ). Therapeutic escalation was defined as a change in medication dosage or type, and persistent pathological changes were defined as persistent creeping fat or bowel wall thickening.
Data analysis
The ORs/HRs and their 95% confidence intervals (CIs) were collected to evaluate the prognostic value of each MRE finding when available. When direct acquisition was not feasible, these values were calculated based on the data from the original texts. ORs were calculated by constructing a fourfold table based on exposures and outcomes, and the corresponding 95% CIs were obtained using the Woolf approximation.
When two or more studies were available, the ORs and corresponding 95% CIs were pooled and analyzed using a random-effects model. Heterogeneity was assessed by I2-test, where I2 > 50% indicating high heterogeneity. 19 To evaluate publication bias, funnel plots and Egger’s tests were conducted for each predictor. All analyses were performed using Stata MP 16.0 (StataCorp LLC., College Station, TX, USA). Statistical significance was set at p < 0.05.
Results
Literature search
The screening and selection process is illustrated in Figure 1. A total of 5841 studies were identified through PubMed (3099 studies) and Web of Science (2742 studies). After removing duplicates, 4758 studies were reviewed through title and abstract screening, of which 30 articles were included and 18 were eligible for meta-analysis.

Flowchart for the study selection.
Characteristics of included studies
The characteristics of the 30 studies are shown in Table 1. Ten studies had a prospective design, and the others were retrospective. Among them, three studies had prospective recruitment, but the MRE observations were derived from a retrospective analysis of data.13,20,21 There were no overlapping study groups or subjects among the included studies. Sample sizes ranged from 21 to 223 among different studies. Among these studies, 96.7% had a follow-up period of at least 12 months, except for one that reported an unclear follow-up duration. Twenty-three studies reported the prognostic values of MRE in predicting surgery, while six predicted disease relapses. All included studies had overall good quality (Tables S2 and S3).
Study characteristics.
Continuous variables were described as median (interquartile range), median (range)†, mean ± SD‡, or median (95% confidence interval)§.
IQR, interquartile range; MRE, magnetic resonance enterography; NA, not available.
Association between MRE findings and poor prognosis
We analyzed studies focusing on surgery and poor prognosis. The pooled analysis results are shown in Figures 2 and 3. Studies providing predictors not suitable for meta-analysis are described separately in Table 2.

Forest plot showing the predictive value of magnetic resonance enterography predictors for poor prognosis using odds ratio as the effect size.

Forest plot showing the predictive value of magnetic resonance enterography predictors for poor prognosis using hazard ratio as the effect size.
Summary of magnetic resonance enterography predictors for poor prognosis in Crohn’s disease.
aHR, adjusted hazard ratio; CI, confidence interval; HR, hazard ratio; OR, odds ratio.
Inflammation
MRE can detect inflammation not visible via endoscopy.8,44 The inflammation was often defined by a group of heterogeneous indicative signs in radiology, including bowel wall thickening >3 mm, increased contrast uptake, extra-mural hypervascularity, or ulcer. 17 However, another study added bowel wall edema, the combs sign, lymphadenopathy, mesenteric fibrofatty proliferation, and creeping fat as indicative signs. 29 A pooled analysis of three studies showed MRE had stable prognostic performances in predicting surgery or relapse, with a pooled OR of 3.40 (95% CI (1.96–5.89)). No significant heterogeneity was observed (I2 = 0.0%, p = 0.857).12,17,29 A cohort of patients also reported the prognostic value of inflammation for therapeutic escalation (HR (95% CI): 5.133 (1.187–22.029)) and hospitalization (HR (95% CI): 3.171 (1.092–9.204)). 16
Bowel wall thickening
Five studies observed that wall thickening increased the risk of poor prognosis, including surgeries, relapses, and persistent wall thickening with a pooled OR (95% CI) of 2.86 (2.03–4.03) and no significant heterogeneity (I2 = 0.0%, p = 0.849).12,17,20,30,40 Notably, wall thickening was defined as wall thickness >3 mm in four studies, while one study defined it by 10 mm. 12 Considering this heterogeneity, further analysis of the four studies with consistent definitions yielded a stable conclusion (OR (95% CI): 2.88 (1.95–4.25), I2 = 0.0%, p = 0.71). As for other related variables, maximum wall thickness was capable of predicting surgery as a continuous variable, with an OR (95% CI) of 1.31 (1.12–1.55) in a retrospective cohort of 136 patients. 12 In 2023, one study revealed that the ratio between the wall thickness at baseline and at 1 year could predict therapeutic responses, indicating the potential prognostic value of wall thickness change. 41
Stricture
Although specific cutoff values varied between studies, stricture was defined as narrowing of the lumen with or without upstream dilatation.11,17,20,24,32,34 The pooled analysis of six studies showed that patients with stricture had nearly fivefold risk of poor outcomes in 1 year (OR (95% CI): 4.95 (1.99–12.34)), with significant heterogeneity (I2 = 74.7%, p = 0.001).11,17,20,24,32,34 For 1-year surgery, a pooled OR (95% CI) of 6.64 (2.90–15.23) with low heterogeneity (I2 = 31.5%, p = 0.23)11,32,34 and a pooled HR (95% CI) of 8.95 (3.05–26.24) with high heterogeneity (I2 = 60.8%, p = 0.078) were observed.11,22,23
Ulcer
Although ulcers were one of the typical manifestations in CD, the pooled analysis of five studies showed no significance in predicting poor prognosis in 1 year (OR (95% CI): 1.41 (0.91–2.19)), with no heterogeneity(I2 = 0.0%, p = 0.717).11,12,20,34,40 Of the two studies providing HRs, one reported that ulcers were associated with relapse (HR (95% CI): 12.5 (1.13–138)), 18 while the other found no statistical significance 38 (Table 2).
Edema
Patients with edema in the mesentery or bowel wall had a higher risk of poor prognosis, with a pooled OR (95% CI) of 2.67 (1.25–5.69) from five studies.12,30,26,34,40
Fistula
Three studies showed fistula detected by MRE exerted as a prognostic factor for surgery or additional endoscopic dilation in 1 year (OR (95% CI): 5.15 (2.72–9.76)) with no significant heterogeneity (I2 = 0.0%, p = 0.498).11,12,32
Others
Some other MRE findings were reported but not eligible for pooled analysis. These included fat-related findings, myopenia, multisegmental bowel involvement, and extramural hypervascularity (Table S4).
Association between MRE scoring system and poor prognosis
The MaRIA score demonstrated high and stable accuracy in detecting CD activity.45,46 Due to various cutoff values across studies, a pooled analysis was conducted by treating the predictor as a dichotomous variable, categorizing patients into high or low MaRIA score group. Five studies using different thresholds were included (three studies used 7, 11, and 24 for the segmental MaRIA index, and two used 50 and 62 for the global MaRIA score), with no significant heterogeneity observed (Figure 2). The pooled analysis indicated a higher MaRIA score correlated with a higher risk of poor prognosis in 1 year (OR (95% CI): 3.90 (2.24–6.81); HR (95% CI): 3.62 (2.17–6.03)).12,13,21,27,31,35
Other studies not suitable for meta-analysis are summarized in Table 3, where the MaRIA score was used in both continuous and dichotomous forms. One study used the logarithmic MaRIA score as a continuous variable; however, no significance was observed in predicting disease relapse. 18 Recently, a multicenter prospective cohort also reported the simplified MaRIA score showed no predictive value of disabling disease in 5 years with an HR (95% CI) of 1.00 (0.98–1.01). 43 Decreases in MaRIA score were associated with corticosteroid-free remission in a cohort of 46 patients (OR (95% CI): 4.2 (1.3–13.3)). 33 Elements of the MaRIA score, such as relative contrast enhancement (RCE) independently showed statistical significance in predicting relapse (OR (95% CI): 2.56 (1.02–6.45), Table S4). 18 However, in three other studies, RCE failed to predict surgery or relapse.12,17,22
Summary of magnetic resonance enterography-related scoring systems for poor prognosis in Crohn’s disease.
aHR, adjusted hazard ratio; CI, confidence interval; HR, hazard ratio; MaRIA, Magnetic Resonance Index of Activity; MREC, Magnetic Resonance Enterocolonography score; OR, odds ratio; sMaRIA, simplified Magnetic Resonance Index of Activity.
The Clermont score assesses ileal inflammation in patients with CD. It includes the components of the MaRIA score (except RCE, which requires gadolinium injection) and the apparent diffusion coefficient (ADC) derived from diffusion-weighted MRE. 47 Two studies reported the prognostic value of the Clermont score. One reported a decreased risk of surgery when the baseline Clermont score ⩽8.4 for each segment (HR (95% CI): 0.24 (0.07–0.78)). 13 Another study showed that a 25% decrease in the Clermont score at week 12 indicated a lower risk of bowel damage at week 52 (HR (95% CI): 0.21 (0.0–0.9)). 33 Some studies focused on ADC values, indicating a negative correlation between ADC values and surgery risk. One study reported that an ADC value of <1 × 10−3 mm2 s−1 could predict higher risks of treatment failure (therapy escalation, need for surgery, or hospitalization), with an OR of 30.1. 26 In a case-control study of 70 patients, the mean ADC was associated with conservative management (OR (95% CI): 0.0003 (0.00–0.13)), with an area under the curve of 0.717 (0.607–0.810). 34
Others
Other reported scoring systems are listed in Table 3, including the London score, magnetic resonance enterocolonography score, and newly developed scoring systems.
Risk of publications bias
The funnel plot of strictures showed asymmetry (p < 0.05) in the Egger’s test, indicating publication bias (Figure S1). Using “trim-and-fill” analysis for correction, the pooled OR (95% CI) was 2.007 (1.388–2.903) and 2.205 (0.894–5.437) from the fixed and random models, respectively. The funnel plots of other predictors were symmetrical, indicating a low risk of publication bias.
Discussion
MRE has emerged as a valuable tool in CD diagnosis and management, and MRE predictors have been evaluated in multiple recent studies on CD prognosis. This review identified and summarized significant MRE predictors for CD prognosis, including inflammation, bowel wall thickening, strictures, edema, fistulas, and scoring systems. These predictors might provide valuable information for risk stratification and treatment planning. Besides, our results highlight the potential for MRE-derived parameters to noninvasively monitor treatment response, reducing reliance on repeated endoscopic evaluations in CD management. For the radiologists reporting MREs, these predictors may help optimizing MRE reporting protocols and we suggest for establishing standardizing measurement thresholds.
Through meta-analysis, we identified five MRE findings that effectively predicted poor prognosis in 1 year, including the presence of inflammation, bowel wall thickening, stricture, edema, and fistula. Mural inflammation is a typical symptom of CD, however, its definition varies across studies and may overlap with other MRE findings. Despite the pooled analysis indicating statistical significance, the lack of a unified definition may complicate its practical application. In contrast, the other four characteristics are more clearly defined. For example, a wall thickness cutoff value of 3 mm is the most frequently used, aligning with MRE reporting recommendations. 48 Stricture is mostly defined as luminal diameter reduction with prestricture dilation, 48 though some studies have used a distinct bowel wall thickness to define it.20,34 The presence of stricture is typical in luminal CD and has been extensively studied for its predictive value. We did not explore the heterogeneity between studies; however, we suspect that our eligibility criteria did not restrict patient medication. As summarized in Table 2, the number and length of strictures also serve as predictors, showing considerable prognostic value in predicting relapse and surgery within 1 year.12,38 Edema, often observed as a hyperintense signal on fat-saturated T2-weighted MRE, significantly correlated with inflammation severity and showed a strong correlation with poor prognosis in our meta-analysis. 49 Ulcers were not predictive of disease prognosis in our review, and we guess that this may be due to the limited ability of MRE to recognize ulcer, especially shallow ulcers, which may be somewhat restricted its predictive value.
This study also summarized some less frequently reported MRE findings. Although some of these findings remained controversial, they may inspire future research. For example, fat-related findings could reflect the inflammation level or absorption function of the digestive tract, indicating considerable potential for predicting disease prognosis. Two studies reported creeping fat as a predictor for various poor outcomes such as surgery and disabling disease.20,29 Myosteatosis has a poor predictive value for hospitalization according to 2 studies involving 463 patients.37,39 Findings such as fat stranding and myopenia require further investigation due to controversial results among studies.20,37,39,40 Other predictors, like multi-segmental bowel involvement and extra-mural hypervascularity, also warrant further research. Upstream dilation of the intestinal tract on MRE might be a potential predictor, although computed tomography (CT) is used more often. This may partly result from CT reports of prestenotic dilation occurring more often than MRE reports when evaluating strictures. 50
The scoring systems summarized in this review were mostly based on typical MRE findings and parameters such as the ADC value. These scoring systems may reveal different predictive capacities by using different cutoff values or transformations.18,27 The MaRIA score, for instance, was first used to assess ileocolonic CD activity and was based on wall thickness, RCE, edema, and ulcers to calculate, ranging from 3.4 to 35.0. Generally, cutoff values of 7 or 11, which indicated active or severe disease according to the derivation cohort, are most often used. 46 Some studies have divided the gastrointestinal tract to calculate a global MaRIA score and have reported encouraging results.31,35 This may widen the application of the score.
This review had several strengths. We used broad search terms and did not restrict by country or race, ensuring the inclusion of as many MRE prognostic factors as possible. We also included studies that did not report effect estimates but provided sufficient data to calculate them. For predictors where OR/HR could not be calculated, we tabulated other parameters to reflect their predictive power (Table S5, which shows the results of studies ineligible for ORs or HRs). However, this review had some limitations. Firstly, heterogeneity occurred in our analysis due to the small sample size, various interventions, or distinct definitions of predictors. Besides, our review did not systematically assess the inter-rater reliability or the number of readers in the included studies. Variations in these methodological aspects (e.g., single vs multiple readers, inconsistent scoring criteria) may also introduce heterogeneity, potentially affecting the generalizability of our conclusions. In this way, our findings require further validation in cohorts with larger sample sizes. Furthermore, our review focused on the patients with essential disease activity on the MRE, which might neglect some manifestations of mild disease observed under endoscopy, such as aphthous ulcers or erosions. Notably, a previous systematic review has reported that MRE demonstrates moderate-to-excellent accuracy in evaluating disease activities in IBD, which showed superior performance than ultrasonography and CT, especially in proximal small bowel disease. 51 Besides, though MRE might neglect some mild manifestations, mild disease may not need near-term treatment escalation or surgery.51,52
Conclusion
In summary, this review summarized the MRE predictors that are effective for predicting poor prognosis in patients with luminal CD, including inflammation, bowel wall thickening, strictures, edema, fistula, and scoring systems like MaRIA score. This highlights the potential for MRE-derived parameters on monitoring treatment response noninvasively and reducing reliance on repeated endoscopic evaluations in CD management. However, further prospective studies with larger sample sizes and unified definitions are needed to confirm these results and enhance the clinical utility of MRE in predicting CD prognosis.
Supplemental Material
sj-docx-1-tag-10.1177_17562848251386317 – Supplemental material for Magnetic resonance enterography for predicting poor prognosis in Crohn’s disease: a systematic review with meta-analysis
Supplemental material, sj-docx-1-tag-10.1177_17562848251386317 for Magnetic resonance enterography for predicting poor prognosis in Crohn’s disease: a systematic review with meta-analysis by Yizhe Tie, Daiyue Wang, Zinan Fan, Li Li, Minhu Chen, Rirong Chen and Shenghong Zhang in Therapeutic Advances in Gastroenterology
Supplemental Material
sj-docx-2-tag-10.1177_17562848251386317 – Supplemental material for Magnetic resonance enterography for predicting poor prognosis in Crohn’s disease: a systematic review with meta-analysis
Supplemental material, sj-docx-2-tag-10.1177_17562848251386317 for Magnetic resonance enterography for predicting poor prognosis in Crohn’s disease: a systematic review with meta-analysis by Yizhe Tie, Daiyue Wang, Zinan Fan, Li Li, Minhu Chen, Rirong Chen and Shenghong Zhang in Therapeutic Advances in Gastroenterology
Footnotes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
