Sage Journals: Discover world-class research

Abstract

French

Purpose

Rapid identification of hematoma expansion (HE) risk at baseline is a priority in intracerebral hemorrhage (ICH) patients and may impact clinical decision making. Predictive scores using clinical features and Non-Contract Computed Tomography (NCCT)-based features exist, however, the extent to which each feature set contributes to identification is limited. This paper aims to investigate the relative value of clinical, radiological, and radiomics features in HE prediction.

Methods

Original data was retrospectively obtained from three major prospective clinical trials [“Spot Sign” Selection of Intracerebral Hemorrhage to Guide Hemostatic Therapy (SPOTLIGHT)NCT01359202; The Spot Sign for Predicting and Treating ICH Growth Study (STOP-IT)NCT00810888] Patients baseline and follow-up scans following ICH were included. Clinical, NCCT radiological, and radiomics features were extracted, and multivariate modeling was conducted on each feature set.

Results

317 patients from 38 sites met inclusion criteria. Warfarin use (p=0.001) and GCS score (p=0.046) were significant clinical predictors of HE. The best performing model for HE prediction included clinical, radiological, and radiomic features with an area under the curve (AUC) of 87.7%. NCCT radiological features improved upon clinical benchmark model AUC by 6.5% and a clinical & radiomic combination model by 6.4%. Addition of radiomics features improved goodness of fit of both clinical (p=0.012) and clinical & NCCT radiological (p=0.007) models, with marginal improvements on AUC. Inclusion of NCCT radiological signs was best for ruling out HE whereas the radiomic features were best for ruling in HE.

Conclusion

NCCT-based radiological and radiomics features can improve HE prediction when added to clinical features.

Visual Abstract

Keywords

hematoma expansion intracerebral hemorrhage radiomics machine learning non-contrast CT

Introduction

Intracerebral hemorrhage (ICH) accounts for 10-20% of all strokes and is associated with up to 40% mortality after 30 days.¹ Hematoma expansion (HE) occurs in 18% of patients with ICH within 3 hours of presentation and is a marker of poor prognosis. The rapid identification of ICH is critical to ongoing clinical trials of tranexamic acid and recombinant factor VIIa (rFVIIa). There is some optimism that early treatment using mobile stroke units may allow treatment of all patients and reduce HE. However, the therapeutic options are not all benign and it is plausible that a subpopulation may be a more ideal target for these therapies, sparing patients who are unlikely to respond to therapy from potential adverse drug effects. As such, treatment of patients at risk for HE remains an important, although somewhat elusive, target to improve clinical decision-making and outcomes in ICH treatment.²

The computed tomography angiography (CTA) spot sign is validated as a predictor for HE³; however, CTA is frequently not performed hyperacutely in ICH due to systemic barriers in obtaining vascular imaging in non-comprehensive stroke centres, concerns that additional imaging delays time to treatment and to a lesser degree lingering concerns over iodinated contrast induced injury in a rare disease without an available treatment.⁴ Non-contrast CT (NCCT) imaging remains the imaging modality of choice in ICH diagnosis. Multiple NCCT markers of HE have been published, including both density markers—such as swirl sign, hypodensity, black hole sign, blend sign, fluid level—and shape markers, including irregular shape, island sign, satellite sign.⁵ However these features occur with relatively low prevalence, and are subject to inter- and intra-observer bias,⁵ highlighting the need for objective tools such as radiomics. Further, overlapping definitions and criteria exist between signs, adding complexity to clinical use, and few studies have assessed the combination of signs or features to predict ICH. Recent literature has re-explored radiomic and machine learning approaches to the problem of predicting HE, arguing that quantitative analysis may facilitate more complex assessment of ICH features and more accurately predict HE.

Radiomics is an established tool used for extraction of quantifiable features from medical images, including shape, textural, or intensity measurements. Previous studies have evaluated the predictive ability of extracted radiomic features for HE outcome.^6–8 While some prospective multicentre studies have evaluated or validated radiomics-based predictive tools, few have incorporated commonly collected clinical variables directly into their models. In this study, we sought to identify the relative value of clinical, radiological, radiomics, and combination models in HE prediction using a large prospective multicentre ICH dataset.

Methods

Patients

This study is a retrospective analysis of prospectively collected data from the PREDICT, SPOTLIGHT, and STOP-IT trials, three large multicentre studies conducted across multiple countries and was approved by the Institutional Review Board of a large Canadian Research Institute. The PREDICT observational study investigated the potential of the CTA spot sign at baseline for the prediction of HE in follow-up. SPOTLIGHT (NCT01359202) and STOP-IT (NCT00810888) investigated the effect of rFVIIa on HE stratified by CTA spot sign presence. Full inclusion and exclusion criteria of the studies can be found in the supplementary materials section.^3,9 Patients from these studies were excluded if valid baseline and follow-up images were not available; or if significant NCCT degradation was present (Figure 1). HE was defined as any ICH growth exceeding 6 mL or 33% from baseline.¹⁰ ICH volume segmentation derived with the semiautomated ITK-SNAP software (http://www.itksnap.org), was then reviewed, modified and approved by an experienced neuroradiologist (R.I.A, 20 years’ experience). Images were normalized for gantry tilt and intensity (−1024 to 3071 HU) and submitted for feature extraction.

Figure 1.

Distribution of collected baseline imaging data from PREDICT, SPOTLIGHT, and STOP-IT studies.

Data Acquisition and Analysis

Images were obtained from 38 centres across 6 countries. Baseline and follow-up images were acquired at median: 24.92 hours, range: 11-46 hours apart in the SPOTLIGHT/STOP-IT trials and at median: 23.56 hours, range: 1-91 hours apart in the PREDICT. Images were collected using a variety of CT scanners ranging from 4 to 64 slices. Although varying across sites, typical imaging parameters were: 120 kVp; 340 mA; 4 × 5 mm-collimation; 1 second/rotation; and a table speed of 15 mm/rotation.

Clinical features were obtained from original study data, and included age, sex, time to ICH onset, hypertension, diabetes, renal disease, warfarin use, and Glasgow Coma score.

Three neuroradiology fellows with (1-3 years of Neuroradiology experience) independently reviewed all CTs from the dataset for radiological features, blinded to HE outcome. Prior to review a consensus meeting was held and 20 independent ICH cases were reviewed together and discussed in the context of established guidelines of NCCT ICH HE marker description.⁵ The following features were extracted for each blindly reviewed study: Barras density and irregularity, island sign, black hole sign, blend sign, fluid level, any hypodensity and swirl sign.^11–16 Following review, a consensus of sign presence or absence was determined by agreement between at least of 2 of the 3 reviewers. Inter-observer and intra-observer agreement between reviewers were assessed using Cohen’s kappa statistic.

Radiomic features were extracted for each 3-D lesion volume using PyRadiomics open-source software (version 3.0).¹⁷ The features extracted from the raw NCCT volumes included 14 shape and 88 textural features computed from each of three sets of Laplacian of Gaussian (LoG) filtered image volumes derived for fine, medium, and coarse filters (σ = 1, 2, or 3, respectively) for a total of 264 LoG filtered features. These features were also extracted for wavelet filtered images corresponding to each of 8 possible decompositions for a total of 704 wavelet filtered features. Overall, 1070 radiomic features were extracted for each patient. Feature selection was performed in RStudio version 1.3.1093 (RStudio, PBC, Boston, MA), using a method designed to rank features according to P-value and exclude correlated features.¹⁸ Five total radiomic features emerged for further consideration.

The patients were assigned to train/validation/hold-out cohorts using a specific criterion. First patients were evenly divided by site of scan, then by location of hemorrhage (deep, lobar, etc.), and finally by type of ICH (intraparenchymal, subarachnoid, subdural, intraventricular).

Statistical Analysis

The inter- and intra-observer agreement for each NCCT radiological sign was calculated to assess variability between and within reviewers using Cohens Kappa. Kappa of 0-.20, .21-.40, .41-.60, .61-.80 and .81-1 was considered as slight, fair, moderate, substantial, and perfect, respectively. To search for significant clinical, radiological, and radiomics predictive factors for HE, univariate logistic regression analysis was conducted. The P-value, odds ratio, 95% confidence interval (CI) and R² were calculated for each feature using the training cohort.

To evaluate combinations of features for HE prediction, 11 multivariate logistic regression models were created. The models were combinations of various feature sets including radiological and radiomic signs alone and clinical features without and with addition of radiomics and radiological signs. Models a-e (Table 1 served as benchmark models which models 1-7 are compared. Nested comparisons were made using these benchmark models. The area under the curve (AUC) with 95% CI, receiver operating characteristic (ROC), and the P-value for comparing the AUCs between nested models was calculated using DeLong’s method.¹⁹ For each ROC model, the decision threshold was chosen to maximize Youden’s index. The sensitivity (Se), specificity (Sp), positive predictive value (PPV), and negative predictive value (NPV) with 95% CI were estimated using the highest Youden’s index value. Goodness of fit statistics were reported with the R², Akaike Information Criterion, −2 log-likelihood, and likelihood ratio test (G-test), with a P-value for each nested comparison.

Table 1.

ROC Statistics for Nested Comparisons Between Multivariate Models Predicting HE on Hold-Out Validation Data.

Nested Model Comparisons		ROC Statistics
Nested Model Comparisons		AUC % (95% CI)	ROC DeLong Comparison P-value	Se (95% CI)	Sp (95% CI)	PPV (95% CI)	NPV (95% CI)
Model a	Clinical	80.3 (69.6-90.6)	ref.	63.2 (38.4-83.7)	74.6 (61.0-85.3)	46.2 (26.6-66.6)	85.4 (72.2-93.9)
Model b	Clinical + radiomic features	81.3 (71.4-91.3)	ref.	71.4 (47.8-88.7)	79.3 (65.9-89.2)	42.3 (36.9-76.7)	87.5 (74.8-95.3)
Model c	Clinical + radiological signs	86.8 (79.0-94.6)	ref.	68.4 (43.5-87.4)	76.4 (63.0-86.8)	50.0 (29.9-70.1)	87.5 (74.8-95.3)
Model d	Radiomic features	64.1 (50.6-77.6)	ref.	80.0 (28.4-99.5)	68.1 (55.8-78.8)	15.4 (4.4-34.9)	97.9 (88.9-99.9)
Model e	Radiological signs	63.3 (50.3-76.3)	ref.	42.9 (9.9-81.6)	65.7 (53.1-76.9)	11.5 (2.5-30.2)	91.7 (80.0-97.7)
Comparing to model a (clinical)
Model 1	Clinical + radiological signs model	86.8 (79.0-94.6)	.0818	68.4 (43.5-87.4)	76.4 (63.0-86.8)	50.0 (29.9-70.1)	87.5 (74.8-95.3)
Model 2	Clinical + radiomic model	81.3 (71.4-91.3)	.6511	71.4 (47.8-88.7)	79.3 (65.9-89.2)	57.7 (36.9-76.7)	87.5 (74.8-95.3)
Model 3	Clinical + radiological signs + radiomic model	87.7 (80.1-95.2)	.0823	68.2 (45.1-86.1)	78.9 (65.3-88.9)	57.7 (36.9-76.7)	85.4 (72.2-93.9)
Comparing to model b (clinical + radiomic features)
Model 4	Clinical + radiological signs + radiomic model	87.7 (80.1-95.2)	.0742	68.2 (45.1-86.1)	78.9 (65.3-88.9)	57.7 (36.9-76.7)	85.4 (72.2-93.9)
Comparing to model c (clinical + radiological features)
Model 5	Clinical + radiological signs + radiomic model	87.7 (80.1-95.2)	.5863	68.2 (45.1-86.1)	78.9 (65.3-88.9)	57.7 (36.9-76.7)	85.4 (72.2-93.9)
Comparing to model d (radiomic features)
Model 6	Clinical + radiomic model	81.3 (71.4-91.3)	.0236	71.4 (47.8-88.7)	79.3 (65.9-89.2)	57.7 (36.9-76.7)	87.5 (74.8-95.3)
Comparing to model e (radiological signs)
Model 7	Clinical + radiological signs model	86.8 (79.0-94.6)	.0005	68.4 (43.5-87.4)	76.4 (63.0-86.8)	50.0 (29.9-70.1)	87.5 (74.8-95.3)

Sensitivity (Se), specificity (Sp), positive predictive value (PPV) and negative predictive value (NPV) are measured at the point with the highest Youden’s index (or J statistic).

To evaluate the suitability of candidate models for ruling in HE as a condition, we focused on specificity >85% as the region of interest (ROI) and computed the average specificity of models as 95.4%, 94.2%, and 96.3% (i.e., “very high”[22]) for the clinical + radiomics, clinical + radiological signs, and clinical + radiological signs + radiomics models, respectively (Supplementary Table 4).^20,21 The normalized AUC in the ROI²¹ is compared against the overall AUC, to confirm the model is not worse in the ROI for the clinical + radiological signs + radiomics model.

To understand the complexity of the radiomic features for classification, we visualized the data with 3 different dimension reduction and factor analysis techniques: principal components analysis (PCA); t-Stochastic neighborhood embedding (t-SNE) for all combinations of perplexity in {5,15,25,...,55} and learning rates in {10, 110, 210,…, 1010}; and kernel supervised principal components analysis (kSPCA) with a Gaussian Radial Basis Function (RBF) kernel for widths in {1, 2, 5, 7, 20}. Complete case analysis was utilized requiring removal of incomplete cases from analysis. Multiple comparisons were controlled using Bonferroni correction.

Results

Patients

Total cohort comprised 317 patients after exclusion of patients without baseline and follow-up scans. Table 2 provides a summary of the characteristics of the train/validation and hold-out cohorts. Baseline ICH volume was significantly different for HE groups in the training/validation cohort. No significant differences in baseline characteristics were observed between training/validation and hold-out cohorts. We did not detect a difference in HE between participants receiving rfVIIa and those who did not receive the treatment (P = .23, Table 2). Warfarin use and Glasgow Comma Score (GCS) were the only significant clinical predictors of HE (Table 3. No NCCT radiological signs were associated with HE (Supplementary Table 1).

Table 2.

Characteristics of Patients Allocated to Train and Hold-Out Cohorts.

	Train/Validation, n = 236			Hold-Out, n = 81			P-value
	HE (n, % = 86, 42.4)	NHE (n, % = 150, 57.6)	P-value	HE (n, % = 27, 40.7)	NHE (n, % = 54, 59.2)	P-value	P-value
Mean Age, yrs (SD)	67.2 (13.3)	69.0 (13.7)	.32^a	67.8 (13.4)	69.0 (13.5)	.68^a	.87^a
Sex, female	46	57	.12^b	12	25	.17^b	.75^b
Study cohort, n
PREDICT	64	101	.55^b	20	37	.70^b	.94^b
SPOTLIGHT	15	22	.23^b	5	8	.67^b	.94^b
STOP-IT	7	27	.06^c	2	9	.51^c	1.00^c
RfVIIa Treatment, n			.53			.18	.23
No	17	39		3	14
Yes	5	10		4	3
Non-rfVIIa trial	64	101		20	37
Clinical variables, n
ICH location, deep	67	95	.64^b	23	35	.75^b	.62^b
Baseline ICH volume, mL (IQR)	28.1 (10.2-41.3)	21.0 (5.7-28.7)	.01 ^a	28.1 (12.5-41.0)	22.4 (9.3-33.5)	.21^a	.50^b
IVH, present	44	52	.37^b	7	18	.12^b	.12^b
Brainstem ICH, present	1	2	1.00^c	0	0	1.00^c	.57^c
Time from onset to CT, hrs (IQR)	2.6 (1.3-3.8)	2.4 (1.3-3.1)	.22^a	2.6 (1.3-3.5)	2.8 (1.4-3.6)	.70^a	.33^a
Radiological signs, present
Barras density	9	14	.83^c	4	8	.75^c	.22^c
Swirl sign	70	90	.65^b	23	31	.63^b	.07^b
Hypodensities	51	60	.30^b	16	25	.75^b	.62^b
Black hole sign	26	23	.11^c	8	17	.33^c	.09^c
Blend sign	19	17	.20^c	4	5	1.00^c	.46^c
Fluid level	0	3	.26^c	1	1	1.00^c	.61^c
Barras irregularity	56	75	.95^b	17	27	.68^b	.77^b
Island sign	24	26	.41^b	9	11	.68^b	.55^b
Satellite sign	62	80	.62^b	18	29	.60^b	.64^b

^aP-value calculated with t-test.

^bP-value calculated with chi-squared test.

^cP-value calculated with Fisher’s exact test.

HE, hematoma expansion; NHE, no hematoma expansion; rfVIIa, recombinant factor VIIa; CT, computed tomography; ICH, intracerebral hemorrhage; IQR, interquartile range; IVH, intraventricular hemorrhage.

Table 3.

Univariate Analysis of Clinical Variables for HE.

Univariate Analysis	P-value	OR (95% CI)
Age (years)	.3795	.991 (.972-1.011)
Sex (M vs F)	.8704	.956 (.561-1.634)
Time to ICH onset (mins; log)	.5507	1.151 (.728-1.825)
Hypertension (Yes vs No)	.4233	.772 (.414-1.460)
Diabetes (Yes vs No)	.5717	1.225 (.603-2.441)
Renal disease (Yes vs No)	.4238	1.781 (.448-7.088)
Warfarin use (Yes vs. No)	.0012	18.446 (4.373-170.911)
GCS score	.0461	.886 (.786-.995)

Inter- and Intra-Observer Agreement of NCCT Signs

Inter-observer agreement was moderate for most signs, with the exceptions of slight agreement for fluid level, fair agreement for Barras and irregularity, and fair to moderate agreement for island sign (Supplementary Table 2).²² Intra-observer agreement was lowest for Barras irregularity and density and fluid level across 2 readers with slight to fair agreement. The remaining features performed well demonstrating moderate to almost perfect agreement (Supplementary Table 3).

Multivariate Analysis Outcomes

Clinical, radiological, and radiomic model outcomes are summarized in Tables 1 and 4. The clinical benchmark model resulted in good performance in predicting HE with an AUC of .803 (95% CI: .696-.906) and sensitivity of .632 (95% CI: .384-.837). Addition of NCCT radiological signs non-significantly improved the AUC of the benchmark clinical model (P = .08) showing a 6.5% improvement in AUC, and an increased sensitivity to .684 (95% CI: .435-.874). Similarly, the addition of radiological signs to a model of clinical and radiomic features non-significantly altered the AUC to .877 (P = .07). Addition of clinical features to base models NCCT radiological features significantly improved prediction (P < .001). Figure 2 shows the ROC of model performance in comparison to the benchmark clinical model.

Table 4.

Model Fitting Statistics for Nested Comparisons Between Multivariate Models Predicting HE on Hold-Out Validation Data.

Nested Model Comparisons		Model Fitting Statistics
Nested Model Comparisons		R² (%)	AIC	−2 log-likelihood value	Likelihood ratio test G² (P-value)
Model a	Clinical	22.6	73.66	57.66	ref.
Model b	Clinical + radiomic features	23.6	98.16	72.16	ref.
Model c	Clinical + radiological signs	27.6	79.48	47.48	ref.
Model d	Radiomic features	4.2	117.01	105.01	ref.
Model e	Radiological signs	3.9	102.20	84.20	ref.
Comparing to model a (clinical)
Model 1	Clinical + radiological signs model	27.6	79.48	47.48	10.18 (.2526)
Model 2	Clinical + radiomic model	23.6	98.16	72.16	14.50 (.0127)
Model 3	Clinical + radiological signs + radiomic model	27.8	105.61	63.61	5.95 (.9480)
Comparing to model b (clinical + radiomic features)
Model 4	Clinical + radiological signs + radiomic model	27.8	105.61	63.61	8.55 (.3817)
Comparing to model c (clinical + radiological features)
Model 5	Clinical + radiological signs + radiomic model	27.8	105.61	63.61	16.13 (.0065)
Comparing to model d (radiomic features)
Model 6	Clinical + radiomic model	23.6	98.16	72.16	32.85 (<.0001)
Comparing to model e (radiological signs)
Model 7	Clinical + radiological signs model	27.6	79.48	47.48	36.72 (<.0001)

Figure 2.

ROC plots for models regarding hold-out validation data are each compared to the benchmark clinical model (red). A. ROC plot for clinical + radiological signs (green). B. ROC plot for clinical + radiomic features. C. ROC plot for clinical + radiological signs + radiomic features (green).

The addition of radiomic features to models improved model fit of clinical and radiological features (G² 16.13; P = .0065, Table 4). Addition of clinical features to a model of radiomic signs alone markedly improved performance, (P = .0005; Model 7, Table 1).

For ruling in, the clinical and radiomic model was better in average specificity and the AUC in the ROI was better than the overall AUC. For ruling out, the clinical and radiological sign model was better in average sensitivity (Supplementary Table 4); and the AUC in the ROI is better than the overall AUC (Figure 2).

Post-Hoc Radiomic Feature Visualization

Kernel supervised principal components analysis (KSPCA) revealed a potential separation of HE positive from HE negative patients with radiomic features (Figure 3). The complexity of the groups in the plot needed to imperfectly separate the data indicates that the HE classification problem is challenging with radiomics alone.

Figure 3.

Radiomic features in the test set are visualized with kernel supervised principal components analysis (kSPCA) at left with red and blue dots for HE and NHE. Each point represents one feature set for each patient case. A hypothetical yet achievable separation is shown at right.

Discussion

Using iterative additions of sets of features to a benchmark clinical model, we demonstrated the potential value of adding NCCT radiological signs and radiomics features to predict HE. NCCT-based radiological and radiomics features may complement HE prediction when added to clinical features. The best performing model included clinical, radiomic, and radiological feature sets and had an AUC of 87.7%. That is a 7.4% increase from the clinical benchmark AUC of 80.3%, although no statistical significance was observed, we believe these findings highlight the potential benefit of including radiological and radiomic features in models of HE. A model including radiomic features demonstrated better goodness of fit, with statistical significance, compared to models of clinical plus radiological features and increased the AUC by approximately 1%. Inter- and intra-observer agreement for NCCT radiological markers were heterogeneous, demonstrating, on average, fair to moderate agreement emphasizing the utility of radiomic augmentation of clinical features.

The CTA Spot sign remains the most robust imaging biomarker of HE but has limited generalizability in clinical practice because of the need for CTA and contrast administration.²³ This was demonstrated in the slow recruitment from three recent clinical studies.^3,9 While the 2015 acute stroke studies requiring CTA demonstration of a large vessel occlusion have significantly increased CTA access in many centres, the employment of CTA may not be necessary. A recent study demonstrated a modest reduction in ICH volume with tranexamic acid administration based on NCCT diagnosis of ICH.²⁴ A growing body of evidence suggesting that hyperacute administration is required to be effective in preventing HE.^24,25 Whereas earlier studies sought to intervene medically within 6 hours, current ICH trials are focused on hyperacute (<2 hour) medical intervention with either rFVIIa or Tranexamic acid (STOP-MSU NCT03385928 and FASTER NCT00109382 studies). Rapid ICH diagnosis is required, established either in mobile stroke units where CTA is not available or in the emergency room where contrast administration may lengthen the time from ictus to diagnosis. While the safety profile of tranexamic acid may lend itself to “nonselective” administration to patients with and without a risk of HE, ideally intervention would be best given only to patients with risk for HE. There is therefore an increased interest in NCCT markers of HE as potential biomarkers to guide therapeutic intervention. However, the relatively low prevalence of these markers and heterogeneity of inter- and intra-agreement suggests that an autonomous assessment utilizing radiomic features may be a superior approach.

Our data demonstrated that the inclusion of NCCT radiological signs is best for “ruling out” conditions whereas the inclusion of radiomic features is best for “ruling in” conditions.^21,26 The distinction may be important depending on the drug choice to avert HE. rFVIIa is previously shown to be effective in reducing HE however is associated with a significant risk of thrombotic complications including deep vein thrombosis, stroke, and myocardial infarction. rFVIIa use would favor a “rule in” approach to the condition so that only patients with risk of HE is exposed to the potential complications of rFVIIa administration. Tranexamic acid is recently shown to be an alternative promising intervention with a relatively robust safety profile. A rule out approach to conditions would be most appropriate for this intervention given its safety profile. Our data suggest that a combination of radiological and radiomic features appears to provide some compromise between the two approaches.

Scores incorporating clinical and radiological elements for the prediction of HE have been previously published. Inclusion of baseline imaging features, including baseline ICH volume or CTA Spot sign improve outcome prediction in ICH.²⁷ Several scores employing NCCT features have been validated including the 9-point, BRAIN, HEAVN, and BAT scores.^28–31 The AUC for HE ranged from 72% to 81%. The predictive results presented in the current study exceed existing NCCT scores in the literature; however, it should be acknowledged that score assessments of HE has not yet impacted research or clinical decision-making in the absence of effective therapies. The ICH score is a common tool used to guide clinical care by predicting 30-day mortality in patients with ICH.³² External validation of the ICH score at 30 days reported a c-statistic of .86 that is maintained at 3 (c = .88), 6 (c = .87), and 12 months (c = .87).³³ This standard implies that a similar performance will be required for NCCT radiological and radiomics-based tools to guide clinical decision-making in the era of effective interventions for HE.

While our results meet these standards, there are differences in magnitude between our results and existing results in previously published literature. Other papers have reported better performances of up to 92% AUC on datasets of similar size but represent fewer than 4 centres^6–8 (compared to the 38 sites in our study). Optimistic AUC would be expected in studies involving fewer sites because the inter-centre variation in image acquisition protocols is reduced, making data more homogeneous and predictable, but less generalizable. Conversely, the heterogeneity in our data makes HE prediction using radiological and radiomic features more challenging, but yields results that are potentially more generalizable. A larger multicentre study including 112 sites, achieved a lower mean performance of 70%³⁴ compared to 81% in our study. This study and a study by Xie et al. also reported the added value of radiomics to clinical and radiological methods.⁴ Pszczolkowski and Xie respectively report higher increases of 8.4% and 13.3% when adding radiomics features to radiological-based models, and Pszczolkowski reports a 3.6% increase when adding radiomics features to clinical-based models compared to 1% increases in our study.^4,34 Pszczolkowski et al. incorporated clinical features within their model, reporting an AUC of .668 (95% CI: .613-.723). However, their clinical model also included baseline hematoma volume and ultra-early hematoma growth in addition to true clinical features such as age or sex. In contrast, the clinical model in this study, achieved an AUC of .803 (95% CI: 69.6-90.6) driven by anticoagulation status and GCS alone. Despite differences between feature sets our data recapitulate findings of other studies demonstrating the importance of anticoagulation use, and baseline GCS as strong clinical predictors of HE.²⁷

Limitations of this study are inclusion of data from 2 treatment trials utilizing rFVIIa. However, in the absence of a treatment effect demonstrated in these studies it is unlikely that the HE outcome measure would be impacted. We must also acknowledge the processing required to generate radiomics-based features will add significant time to the generation of HE prediction models. Given the critical nature of acute prediction of HE it will be important to develop accessible processing tools capable of providing radiomics information to clinicians in a rapid time frame.

In conclusion, addition of radiological and radiomic signs can improve accuracy in HE prediction. AUC for HE prediction was 87.7%, but further study and external validation is necessary to investigate both the clinical validity and utility of these models.

Conclusion

The best performing model for HE prediction combined all feature sets and reported an AUC of 87.7%, with significant improvement in model fit. This finding suggests there is promise in using these feature sets for predicting HE, and establishes the contributions of clinical, radiological, and radiomics features for future model development or validation. This study importantly highlights the value of including radiological feature analysis as an imaging biomarker to improve prediction of HE.

Supplemental Material

Supplemental Material - Clinical Features, Non-Contrast CT Radiomic and Radiological Signs in Models for the Prediction of Hematoma Expansion in Intracerebral Hemorrhage

Supplemental Material for Clinical Features, Non-Contrast CT Radiomic and Radiological Signs in Models for the Prediction of Hematoma Expansion in Intracerebral Hemorrhage by Zejia Frank Chen, Liying Zhang, André M Carrington, Rebecca Thornhill, Olivier Miguel, Angela M Auriat, Nima O Fard, Shiva Hiremath, Vered Tshemeister Abitbul, Dar Dowlatshahi, Andrew Demchuk, David Gladstone, Andrea Morotti, Ilaria Casetta, Enrico Fainardi, Thien Huynh, Marah Elkabouli, Zoé Talbot, Gerd Melkus, and Richard I Aviv in Canadian Association of Radiologists Journal

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Canada Foundation for Innovation - John R. Evans Leaders Fund (41174).

ORCID iDs

Zejia Frank Chen

André M Carrington

Rebecca Thornhill

Angela M Auriat

Shivaprakash Hiremath

Vered Tshemeister Abitbul

Supplemental Material

Supplemental material for this article is available online.

References

Qureshi

Tuhrim

Broderick

Batjer

Hondo

Hanley

. Spontaneous intracerebral hemorrhage. N Engl J Med. 2001;344(19):1450-1460. doi:10.1056/NEJM200105103441907

Steiner

Bosel

. Options to restrict hematoma expansion after spontaneous intracerebral hemorrhage. Stroke. 2010;41(2):402-409.

Demchuk

Dowlatshahi

Rodriguez-Luna

, et al. Prediction of haematoma growth and outcome in patients with intracerebral haemorrhage using the CT-angiography spot sign (PREDICT): A prospective observational study. Lancet Neurol. 2012;11(4):307-314. doi:10.1016/S1474-4422(12)70038-8

Xie

Wang

Zhang

. Noncontrast computer tomography–based radiomics model for predicting intracerebral hemorrhage expansion: preliminary findings and comparison with conventional radiological model. Eur Radiol. 2020;30(1):87-98. doi:10.1007/s00330-019-06378-3

Morotti

Boulouis

Dowlatshahi

, et al. Standards for detecting, interpreting, and reporting noncontrast computed tomographic markers of intracerebral hemorrhage expansion. Ann Neurol. 2019;86(4):480-492. doi:10.1002/ana.25563

Zhang

Niyazi

, et al. Radiomics for predicting hematoma expansion in patients with hypertensive intraparenchymal hematomas. Eur J Radiol. 2019;115:10-15.

Ding

Shan

, et al. A nomogram model of radiomics and satellite sign number as imaging predictor for intracranial hematoma expansion. Front Neurosci. 2020;14:491. doi:10.3389/fnins.2020.00491

Song

Guo

Tang

, et al. Noncontrast computed tomography-based radiomics analysis in discriminating early hematoma expansion after spontaneous intracerebral hemorrhage. Korean J Radiol. 2021;22(3):415-424. doi:10.3348/kjr.2020.0254

Gladstone

Aviv

Demchuk

, et al. Effect of recombinant activated coagulation factor VII on hemorrhage expansion among patients with spot sign–positive acute intracerebral hemorrhage: The SPOTLIGHT and STOP-IT randomized clinical trials. JAMA Neurol. 2019;76(12):1493-1501. doi:10.1001/jamaneurol.2019.2636

10.

Dowlatshahi

Demchuk

Flaherty

Ali

Lyden

Smith

. Defining hematoma expansion in intracerebral hemorrhage: Relationship with patient outcomes. Neurology. 2011;76(14):1238-1244. doi:10.1212/WNL.0b013e3182143317

11.

Boulouis

Morotti

Brouwers

, et al. Association between hypodensities detected by computed tomography and hematoma expansion in patients with intracerebral hemorrhage. JAMA Neurol. 2016;73(8):961-968. doi:10.1001/jamaneurol.2016.1218

12.

Liu

Yang

, et al. Island sign: an imaging predictor for early hematoma expansion and poor outcome in patients with intracerebral hemorrhage. Stroke. 2017;48(11):3019-3025.

13.

Zhang

Xiong

, et al. Black hole sign: Novel imaging marker that predicts hematoma growth in patients with intracerebral hemorrhage. Stroke. 2016;47(7):1777-1781.

14.

Zhang

Huang

, et al. Blend sign on computed tomography: Novel and reliable predictor for early hematoma growth in patients with intracerebral hemorrhage. Stroke. 2015;46(8):2119-2123.

15.

Churilov

Mitchell

Dowling

Yan

. The CT swirl sign is associated with hematoma expansion in intracerebral hemorrhage. AJNR Am J Neuroradiol. 2018;39(2):232-237. doi:10.3174/ajnr.A5465

16.

Barras

Tress

Christensen

, et al. Density and shape as CT predictors of intracerebral hemorrhage growth. Stroke. 2009;40(4):1325-1331.

17.

van Griethuysen

JJM

Fedorov

Parmar

, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77(21):e104-e107. doi:10.1158/0008-5472.CAN-17-0339

18.

Lim

Abreu-Gomez

Thornhill

, et al. Utility of machine learning of apparent diffusion coefficient (ADC) and T2-weighted (T2W) radiomic features in PI-RADS version 2.1 category 3 lesions to predict prostate cancer diagnosis. Abdom Radiol. 2021;46(12):5647-5658. doi:10.1007/s00261-021-03235-0

19.

DeLong

Clarke-Pearson

. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics. 1988;44(3):837-845. doi:10.2307/2531595

20.

Demler

Pencina

D'Agostino

Sr . Misuse of DeLong test to compare AUCs for nested models. Stat Med. 2012;31(23):2577-2587. doi:10.1002/sim.5328

21.

Carrington

Manuel

Fieguth

, et al. Deep ROC Analysis and AUC as Balanced Average Accuracy to Improve Model Selection, Understanding and Interpretation. arXiv preprint arXiv:2103.11357. 2021.

22.

Viera

Garrett

. Understanding interobserver agreement: The kappa statistic. Fam Med. 2005;4.

23.

Wada

Aviv

Fox

, et al. CT angiography “spot sign” predicts hematoma expansion in acute intracerebral hemorrhage. Stroke. 2007;38(4):1257-1262.

24.

Sprigg

Flaherty

Appleton

, et al. Tranexamic acid for hyperacute primary intracerebral haemorrhage (TICH-2): An international randomised, placebo-controlled, phase 3 superiority trial. Lancet. 2018;391(10135):2107-2115. doi:10.1016/S0140-6736(18)31033-X

25.

CRASH-3 Trial Collaborators . Effects of tranexamic acid on death, disability, vascular occlusive events and other morbidities in patients with acute traumatic brain injury (CRASH-3): A randomised, placebo-controlled trial. Lancet. 2019;394(10210):1713-1723. doi:10.1016/S0140-6736(19)32233-0

26.

Sackett

Straus

. On some clinically useful measures of the accuracy of diagnostic tests. BMJ Evidence-Based Medicine. 1998;3(3):68.

27.

Al-Shahi Salman

Frantzias

Lee

, et al. Absolute risk and predictors of the growth of acute spontaneous intracerebral haemorrhage: a systematic review and meta-analysis of individual patient data. Lancet Neurol. 2018;17(10):885-894. doi:10.1016/S1474-4422(18)30253-9

28.

Brouwers

Chang

Falcone

, et al. Predicting hematoma expansion after primary intracerebral hemorrhage. JAMA Neurol. 2014;71(2):158-164. doi:10.1001/jamaneurol.2013.5433

29.

Yogendrakumar

Moores

Sikora

, et al. Evaluating hematoma expansion scores in acute spontaneous intracerebral hemorrhage: A systematic scoping review. Stroke. 2020;51(4):1305-1308.

30.

Huynh

Aviv

Dowlatshahi

, et al. Validation of the 9-point and 24-point hematoma expansion prediction scores and derivation of the PREDICT A/B scores. Stroke. 2015;46(11):3105-3110. doi:10.1161/STROKEAHA.115.009893

31.

Morotti

Dowlatshahi

Boulouis

, et al. Predicting intracerebral hemorrhage expansion with noncontrast computed tomography: The BAT score. Stroke. 2018;49(5):1163-1169.

32.

Hemphill

Bonovich

Besmertis

Manley

Johnston

. The ICH score: A simple, reliable grading scale for intracerebral hemorrhage. Stroke. 2001;32(4):891-897. doi:10.1161/01.STR.32.4.891

33.

Hemphill

Farrant

Neill

. Prospective validation of the ICH score for 12-month functional outcome. Neurology. 2009;73(14):1088-1094. doi:10.1212/WNL.0b013e3181b8b332

34.

Pszczolkowski

Manzano-Patrón

Law

, et al. Quantitative CT radiomics-based models for prediction of haematoma expansion and poor functional outcome in primary intracerebral haemorrhage. Eur Radiol. 2021;31(10):7945-7959. doi:10.1007/s00330-021-07826-9

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.57 MB

Clinical Features,Non-Contrast CT Radiomic and Radiological Signs in Models for the Prediction of Hematoma Expansion in Intracerebral Hemorrhage

Abstract

Purpose

Methods

Results

Conclusion

Keywords

Introduction

Methods

Patients

Data Acquisition and Analysis

Statistical Analysis

Results

Patients

Inter- and Intra-Observer Agreement of NCCT Signs

Multivariate Analysis Outcomes

Post-Hoc Radiomic Feature Visualization

Discussion

Conclusion

Supplemental Material

Supplemental Material - Clinical Features, Non-Contrast CT Radiomic and Radiological Signs in Models for the Prediction of Hematoma Expansion in Intracerebral Hemorrhage

Footnotes

Declaration of Conflicting Interests

Funding

ORCID iDs

Supplemental Material

References

Supplementary Material