
Abstract

Objective: To develop and validate predictive models based on clinical parameters and radiomic features to distinguish pulmonary pure invasive mucinous adenocarcinoma (pIMA) from mixed mucinous adenocarcinoma (mIMA) before surgery. Method: In this retrospective study, 193 patients with pIMA and 111 patients with mIMA seen at our hospital from January 2017 to December 2022 were analyzed. From contrast-enhanced computed tomography, 1037 radiomic features were extracted. The patients were randomly divided into a training group and a test group (n = 213 and 91, respectively) in a 7:3 ratio. The least absolute shrinkage and selection operator algorithm was used to select radiomic features. Nine machine learning radiomics prediction models were built, and the radiomics score was calculated from the best-performing model. The clinical model was developed using the same machine learning method as the radiomics model. Finally, a combined model based on clinical factors and radiomics features was developed. The area under the receiver operating characteristic curve (AUC) and decision curve analysis (DCA) were used to evaluate the performance and clinical usefulness of the prediction models. Results: The combined model established by the Gaussian Naive Bayes machine learning method exhibited the best performance. The AUCs of the combined model, clinical model, and radiomics model were 0.81, 0.80, and 0.68 in the training group and 0.91, 0.80, and 0.81 in the test group, respectively. The Brier scores of the combined model were 0.171 and 0.112. The DCA also showed that the combined model was beneficial in clinical settings. Conclusion: The combined model integrating radiomics features and clinical parameters may have potential value for the preoperative differentiation of pIMA from mIMA.

Keywords

radiomics, contrast-enhanced computed tomography, invasive mucinous adenocarcinoma, mixed mucinous/nonmucinous adenocarcinoma, model

Introduction

Invasive mucinous adenocarcinoma (IMA), previously referred to as mucinous bronchioloalveolar carcinoma (BAC), is composed histologically of goblet and/or columnar cells containing abundant intracytoplasmic mucin.¹ Because of its unique clinical, radiological, pathological, and genetic features, IMA has been categorized as a variant subtype of lung adenocarcinoma in the 2011 International Association for the Study of Lung Cancer (IASLC)/American Thoracic Society (ATS)/European Respiratory Society (ERS) and 2015 World Health Organization (WHO) classification of lung adenocarcinoma, and it accounts for only 2% to 5% of adenocarcinomas.^2,3 Based on the 2015 WHO criteria, mucinous adenocarcinoma can be classified into 2 distinct categories according to the extent of the invasive mucinous pattern: pure invasive mucinous adenocarcinoma (pIMA, comprising >90% invasive mucinous pattern with a lepidic predominant pattern) and mixed invasive mucinous/nonmucinous adenocarcinoma (mixed mucinous adenocarcinoma [mIMA], consisting of at least 10% of a nonmucinous invasive component).⁴

Multiple studies have shown that the revised classification of lung adenocarcinoma holds significant prognostic value.^5–8 Within this classification, the mIMA and pIMA subtypes exhibit distinct molecular and prognostic features.⁹ Research indicates that mIMA is often associated with poorer survival than pIMA and may present with a higher frequency of anaplastic lymphoma kinase rearrangements, whereas pIMA shows a higher prevalence of Kirsten rat sarcoma viral oncogene homolog mutations. This suggests that mIMA displays greater heterogeneity, potentially leading to an increased likelihood of resistance to targeted therapy. We therefore hypothesized that patients with mIMA, especially those with advanced-stage disease, may derive greater benefit from combination therapy, and that preoperative prediction may help in planning more precise neoadjuvant treatment in the future. To date, there have been limited studies on the computed tomography (CT) characteristics of IMA.^10–12 However, owing to the rarity of IMA, these studies had small sample sizes and either did not consistently differentiate between pIMA and mIMA or focused solely on pIMA. Thus, knowledge regarding mIMA is limited, and the CT characteristics for preoperative differentiation of pIMA from mIMA have not been documented in the medical literature. Currently, the diagnosis of pIMA and mIMA relies on postoperative pathological examination because of tumor cellular heterogeneity. However, postoperative pathology results are delayed, highlighting the need for a predictive method to supplement surgical pathology in distinguishing between pIMA and mIMA.

Radiomics is an emerging technology with significant potential for clinical application in enhancing clinical decision-making through the high-throughput extraction of medical image features from lesions.^13,14 In clinical settings, radiomic models have demonstrated utility in the diagnosis of lung nodules, including differentiation between benign and malignant nodules, preoperative prediction of nodule type, prognostic analysis, surgical outcome forecasting, and assessment of tumor gene expression patterns and microenvironment.^15–17 Given the routine use of imaging examinations in clinical diagnosis, radiomics research is readily accessible. Drawing upon prior investigations, the objective of our study is to undertake an initial assessment of the intricate clinical parameters, radiomic characteristics, and their amalgamation for the preoperative differentiation of pIMA from mIMA.

Materials and Methods

This study was conducted in accordance with the Declaration of Helsinki and was approved by the Ethics Committee of our hospital (reference numbers: 2023(124), dated 2023.11.15, and 2022KS017, dated 2022.06.27). The requirement for informed consent was waived.

Patient Selection

The reporting of this study conforms to Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines.¹⁸ In this retrospective study, we consecutively reviewed and analyzed the clinical characteristics and radiomics features of all patients with pulmonary pIMA and mIMA diagnosed from January 2017 to December 2022 in our hospital. The inclusion criteria for patients with pIMA and mIMA were as follows: (1) confirmation of pIMA with a mucinous adenocarcinoma component exceeding 90% or of mIMA with a nonmucinous invasive component of at least 10%; (2) presence of solitary and solid nodules with peripheral lung cancer characteristics on imaging; (3) availability of complete clinical and pathological data, including analyzable plain and enhanced thin-slice CT images at a thickness of 1.25 mm per slice; and (4) CT images taken within 2 weeks prior to the pathological diagnosis. The exclusion criteria were as follows: (1) multiple pulmonary nodules; (2) anti-tumor therapy (including radiotherapy, chemotherapy, chemoradiotherapy, or molecular targeted therapy) prior to CT examination and pathological diagnosis; and (3) other types of cancer. A total of 193 patients with pIMA and 111 patients with mIMA were included in this study. The retrospective analysis of this study was approved by our hospital's ethical review board, and the need for informed consent was waived. We have de-identified all patient details.

CT Image Acquisition

Contrast-enhanced chest CT scans were conducted on all patients using a 256-multidetector CT scanner (Discovery CT 750 HD Revolution, GE Medical Systems, Milwaukee, Wisconsin, USA). Patients were instructed to practice breathing techniques and hold their breath during inspiration prior to scanning. To minimize scanning artifacts, patients’ arms were elevated in a supine position. The imaging range encompassed the thoracic inlet and lung bases, with the sternoclavicular joint serving as the reference point. The scanning parameters included a tube voltage of 120 kV, tube current of 200 mAs, reconstruction layer thickness of 1.25 mm, matrix of 512 × 512, and pitch of 1.2. The reconstruction algorithm utilized standard soft tissue and lung reconstruction algorithms. Subsequent to the completion of the plain scan, 70 to 90 mL of nonionic contrast agent iohexol (300 mg/mL) was administered via the ulnar vein using a high-pressure syringe at a flow rate of 3 mL/s. Following the contrast agent injection, dual-phase enhanced scans of the arterial phase and venous phase were conducted 30 and 90 s postinjection, respectively. The remaining parameters mirrored those of the standard scans. Subsequent to scanning, the unprocessed data was transferred to a postprocessing terminal for multiplanar reconstruction (MPR).

Two board-certified thoracic radiologists, with 12 and 30 years of experience in chest CT imaging, respectively, conducted the image feature analysis in a blinded fashion, without knowledge of the clinical or histological results. Window settings were configured for the mediastinal window (400 HU width; 40 HU level) and lung window (1500 HU width; −600 HU level). The study examined various characteristics of primary lung tumors, including tumor location (left and right lungs, upper, middle, and lower lobes), tumor size (maximum diameter), mean CT value (plain scan, venous phase), ΔCTV (the difference between the mean CT value of the venous phase and the mean CT value of the plain scan), internal features (such as cavities or vacuoles, angiogram sign), and external features (such as edge characteristics including lobulation, spiculation, and pleural indentation sign). Additionally, the mean CT values of the nodules were measured on both plain and venous scans. To ensure data accuracy, the region of interest (ROI) was defined as the largest slice of the lesion along with 3 adjacent slices above and below, excluding any areas containing calcifications, necrosis, or atelectasis. The ROI encompassed more than half of the interface in cases of uniform density. In instances of uneven density, the slice containing the most solid components was chosen for measurement, with the mean value calculated based on 3 measurements. Cavity and vacuole were characterized as gas-filled spaces with a maximum diameter exceeding or falling below 5 mm, respectively. The CT angiogram sign was described as the visualization of enhancing vascular structures within the lesion during contrast-enhanced scanning.¹⁹ The CT image characteristics were evaluated independently by the 2 radiologists, with any discrepancies resolved by consensus.

Segmentation, Feature Extraction, and Selection

The CT images were imported into the open-source software 3D-Slicer (version 5.0.2, http://www.slicer.org) and interpreted using specific window settings for the lung (width 1500 HU, level −600 HU) and mediastinal (width 400 HU, level 40 HU) regions. Following image acquisition, primary lesions from patients diagnosed with pIMA and mIMA were chosen for tumor segmentation. A radiologist with 12 years of experience in chest CT imaging, blinded to clinical data, outlined the ROIs layer by layer. The tumor ROIs were delineated to encompass the entirety of the lesion, including any cavities or vacuoles present within the nodules, while excluding bronchi, blood vessels, and normal lung tissue. One month after the initial segmentation, a subset of 40 patients was randomly chosen from the overall cohort and segmented again by the same radiologist to evaluate intraobserver agreement. Interobserver agreement was then assessed by a second radiologist with 30 years of experience in chest CT imaging, who independently segmented the same 40 patients. Intraclass correlation coefficients (ICCs) were used to evaluate the consistency of feature extraction within and between observers.

Radiomics features were extracted utilizing Pyradiomics within the 3D-Slicer software platform. A total of 1037 radiomics features were extracted from the images, including original, log-sigma-4-0-mm-3D, log-sigma-5-0-mm-3D, and various wavelet transformations (wavelet-LLH, wavelet-LHL, wavelet-LHH, wavelet-HLL, wavelet-HLH, wavelet-HHL, wavelet-HHH, and wavelet-LLL). The features extracted from the original image encompassed a total of 14 shape factor classes, 18 histogram classes (referred to as first-order), 24 classes derived from the gray-level co-occurrence matrix (GLCM), 16 classes from the gray-level run length matrix, 16 classes from the gray-level size zone matrix, 5 classes from the neighboring gray-tone difference matrix, and 14 classes from the gray-level dependence matrix. The type and quantity of features extracted from images of other types remained consistent with those extracted from the original image, with the exception of the shape factor classes.
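The total of 1037 features can be checked against the class counts listed above. The short sketch below assumes, as the text states, that shape features are computed only once and that every other class is repeated for each of the 11 image types (the original image, 2 Laplacian-of-Gaussian filters, and 8 wavelet sub-bands); the variable names are illustrative only.

```python
# Feature classes extracted from the original image, as listed in the text.
per_image = {
    "shape": 14,
    "first_order": 18,
    "glcm": 24,
    "glrlm": 16,
    "glszm": 16,
    "ngtdm": 5,
    "gldm": 14,
}

# Image types: original, 2 LoG filters (sigma 4 and 5 mm), 8 wavelet sub-bands.
n_image_types = 1 + 2 + 8

# Every class except shape is recomputed on each image type.
non_shape = sum(count for name, count in per_image.items() if name != "shape")

# Shape features are counted once; the rest are multiplied by the image types.
total_features = per_image["shape"] + non_shape * n_image_types

print(non_shape, total_features)  # 93 1037
```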

Three consecutive steps were undertaken to decrease the dimensionality of the radiomic features to a manageable number. Initially, the interobserver agreement of radiomic features was evaluated, and features with an intraclass correlation coefficient (ICC) > 0.75 were selected. Subsequently, radiomic features demonstrating statistical significance between the pIMA and mIMA groups were identified. Finally, to minimize redundancy, the least absolute shrinkage and selection operator (LASSO) logistic regression algorithm was employed to identify the most informative predictive radiomic features for distinguishing between pIMA and mIMA in the training group, with 5-fold cross-validation performed 100 times to avoid overfitting.
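The LASSO step can be illustrated with a minimal sketch, assuming scikit-learn's L1-penalized logistic regression as a stand-in for the authors' implementation, which is not specified. The data are synthetic (generated with make_classification, not study data), the number of cross-validation repeats is reduced from 100 to 3 to keep the example fast, and all variable names are hypothetical. In scikit-learn, C is the inverse of the regularization strength, so a small C corresponds to a large λ.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegressionCV
from sklearn.model_selection import RepeatedStratifiedKFold
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the training-set feature matrix (NOT study data).
X, y = make_classification(n_samples=213, n_features=40, n_informative=6,
                           n_redundant=10, random_state=0)
X = StandardScaler().fit_transform(X)  # z-score each feature

# 5-fold cross-validation, repeated 3 times here (the text reports 100 repeats).
cv = RepeatedStratifiedKFold(n_splits=5, n_repeats=3, random_state=0)

# L1-penalized logistic regression over a grid of 20 candidate C values.
lasso = LogisticRegressionCV(Cs=20, penalty="l1", solver="liblinear", cv=cv,
                             scoring="neg_log_loss", max_iter=1000)
lasso.fit(X, y)

# Features whose coefficients were not shrunk to zero are retained.
selected = np.flatnonzero(lasso.coef_.ravel())
print("chosen C:", lasso.C_[0])
print("retained feature indices:", selected)
```

Features with nonzero coefficients at the selected penalty are the ones carried forward, which mirrors how the 6 radiomics features were retained in the study.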

Model Development

Factors with statistical significance in the multivariate analysis were used to build machine learning models. The specific steps were as follows: (1) the patients were randomly split into a training set and a test set in a 7:3 ratio; (2) based on the factors with statistical significance in the multivariate analysis of the training set, 9 machine learning models were developed in the training set: eXtreme gradient boosting (XGB) classifier, light gradient boosting machine (LGBM) classifier, random forest classifier, AdaBoost classifier, Gaussian Naive Bayes (GNB), logistic regression, multilayer perceptron (MLP) classifier, support vector machine with a polynomial kernel (SVC), and k-nearest neighbor (KNeighbors) classifier. The optimal parameters of the 9 models were identified using 5-fold cross-validation. The XGB classifier was implemented using xgboost 1.2.1, the LGBM classifier using lightgbm 3.2.1, and the others using scikit-learn (sklearn) 0.22.1. The performance of the 9 machine learning models was evaluated using the receiver operating characteristic (ROC) curve. To validate the most efficient machine learning model, 5-fold cross-validation was employed. The primary evaluation metrics included the area under the ROC curve (AUC), accuracy, sensitivity, specificity, positive predictive value, and negative predictive value. Based on the ROC results, the most predictive machine learning classifier was selected, and the radiomics score (Rad-score), representing the likelihood of each patient being diagnosed with pIMA, was calculated using the chosen model. Additionally, a predictive model was constructed based on clinical and CT data, with clinical features encompassing sex, age, and smoking status, while the specific CT features are detailed above.
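The model-comparison procedure described above can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: the data are synthetic, default hyperparameters are used instead of tuned ones, and the XGB and LGBM classifiers are omitted so that the example depends only on scikit-learn. The helper positive_score and all variable names are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score, train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC

# Synthetic stand-in for 304 patients with 6 selected features (NOT study data).
X, y = make_classification(n_samples=304, n_features=6, n_informative=4,
                           n_redundant=0, random_state=0)

# 213 training / 91 test patients, matching the split sizes given in the text.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=91, stratify=y, random_state=0)

models = {
    "RandomForest": RandomForestClassifier(random_state=0),
    "AdaBoost": AdaBoostClassifier(random_state=0),
    "GNB": GaussianNB(),
    "logistic": LogisticRegression(max_iter=1000),
    "MLP": MLPClassifier(max_iter=2000, random_state=0),
    "SVM": SVC(kernel="poly"),
    "KNN": KNeighborsClassifier(),
}

# Mean 5-fold cross-validated AUC of each classifier on the training set.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
cv_auc = {name: cross_val_score(model, X_train, y_train, cv=cv,
                                scoring="roc_auc").mean()
          for name, model in models.items()}

def positive_score(model, X):
    """Continuous score for the positive class (probability when available)."""
    if hasattr(model, "predict_proba"):
        return model.predict_proba(X)[:, 1]
    return model.decision_function(X)

# Refit the best classifier on the full training set and score the test set.
best_name = max(cv_auc, key=cv_auc.get)
best_model = models[best_name].fit(X_train, y_train)
rad_score = positive_score(best_model, X_test)
print(best_name, round(cv_auc[best_name], 3))
```

Selecting the classifier by cross-validated AUC on the training set, rather than by test-set AUC, keeps the test set untouched for the final evaluation.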

Subsequently, 3 models—the clinical model, the radiomics model, and the combined model incorporating both clinical factors and radiomics features—were subjected to statistical comparison in order to determine the model exhibiting the greatest predictive accuracy.

Statistical Analysis

Statistical analyses were conducted using Python version 3.7, with patients randomly allocated in a 7:3 ratio to training and test groups. Radiomic features were normalized using Z scores, and baseline data were subjected to univariate analysis using the Python package statsmodels 0.11.1. Categorical variables were assessed with chi-square tests, while continuous variables were evaluated using t tests or Mann-Whitney U tests. Factors exhibiting significant differences (P < .05) were incorporated into the multivariate logistic regression analysis. The multivariate analysis results identified clinical and CT features with statistically significant differences (P < .05), which were utilized to develop a clinical prediction model.
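As an illustration of these univariate steps, the sketch below uses SciPy rather than statsmodels (an assumption made for brevity). The chi-square examples use the training-cohort counts for gender and pleural indentation sign from Table 1; the continuous data are synthetic. The uncorrected Pearson test gives a P value of approximately .444 for gender, matching Table 1, which suggests but does not confirm that no continuity correction was applied.

```python
import numpy as np
from scipy import stats

# Z-score normalization on synthetic feature values (NOT study data).
rng = np.random.default_rng(0)
features = rng.normal(loc=50.0, scale=10.0, size=(213, 3))
z = (features - features.mean(axis=0)) / features.std(axis=0)

# Chi-square tests on training-cohort counts from Table 1.
# Rows: pNMA, mNMA. Columns: first level, second level of the variable.
gender = np.array([[72, 58], [41, 41]])  # female, male
pli = np.array([[90, 40], [25, 57]])     # sign absent, sign present
_, p_gender, _, _ = stats.chi2_contingency(gender, correction=False)
_, p_pli, _, _ = stats.chi2_contingency(pli, correction=False)

# Mann-Whitney U test for a continuous variable (synthetic values).
group_a = rng.normal(20.0, 15.0, size=130)
group_b = rng.normal(30.0, 15.0, size=82)
_, p_mwu = stats.mannwhitneyu(group_a, group_b, alternative="two-sided")

print(f"gender P = {p_gender:.3f}, PLI P = {p_pli:.2e}, MWU P = {p_mwu:.2e}")
```

In a real pipeline the normalization mean and standard deviation would be estimated on the training group only and then applied to the test group.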

Key performance metrics for evaluating the model included the AUC, accuracy, sensitivity, specificity, positive predictive value, and negative predictive value. Decision curve analysis (DCA) was employed to assess the clinical utility of the 3 models. The methodology outlined in the “Materials and Methods” section allows for reproducibility by fellow researchers.
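Decision curve analysis compares models by their net benefit at a given threshold probability pt, defined as TP/n − (FP/n) × pt/(1 − pt), where TP and FP are the numbers of true and false positives among n patients. A minimal sketch with toy values (not study data; the function name is hypothetical) is shown below.

```python
import numpy as np

def net_benefit(y_true, y_prob, threshold):
    """Net benefit of treating patients whose predicted risk >= threshold."""
    y_true = np.asarray(y_true)
    predicted_positive = np.asarray(y_prob) >= threshold
    n = y_true.size
    tp = np.sum(predicted_positive & (y_true == 1))
    fp = np.sum(predicted_positive & (y_true == 0))
    # False positives are weighted by the odds of the threshold probability.
    return tp / n - fp / n * threshold / (1.0 - threshold)

# Toy labels and predicted probabilities (NOT study data).
y_true = np.array([1, 1, 1, 0, 0, 0, 1, 0])
y_prob = np.array([0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.7, 0.2])

nb_model = net_benefit(y_true, y_prob, threshold=0.5)
nb_treat_all = net_benefit(y_true, np.ones_like(y_prob), threshold=0.5)
print(nb_model, nb_treat_all)  # 0.25 0.0
```

A decision curve plots this quantity over a range of thresholds for each model, together with the "treat all" and "treat none" (net benefit of 0) reference strategies.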

Results

Patient Characteristics

In this retrospective study, a total of 373 patients diagnosed with IMA were initially reviewed. Following the application of specific inclusion and exclusion criteria, 69 patients were excluded from the study. The reasons for exclusion included 21 patients who did not undergo enhanced CT examination at our hospital, 23 patients who had received chemotherapy drugs prior to hospitalization, and 25 patients for whom precise delineation of ROIs proved challenging. Ultimately, a cohort of 304 patients, with ages ranging from 15 to 81, was included in the analysis. Participants were allocated into a training group and a test group in a random manner, with a ratio of 7 : 3. The training group consisted of 213 patients, with 135 patients diagnosed with pIMA and 78 patients diagnosed with mIMA. The test group included 91 patients, with 58 patients diagnosed with pIMA and 33 patients diagnosed with mIMA.

There was no statistically significant difference in age between patients with pIMA and patients with mIMA, with median ages of 62 and 60 years, respectively. The proportion of female patients was higher in the pIMA group compared to the mIMA group (56.48% vs 45.05%, P = .055). There was no significant difference in the proportion of patients with a smoking history between the pIMA and mIMA groups (36.27% vs 37.84%, P = .785).

Regarding CT findings, lesions situated in the lower lobe were significantly more prevalent in patients with pIMA than in those with mIMA (67.36% vs 47.75%, P < .001). The pleural indentation sign on CT lung window images also differed significantly between the groups: according to the counts in Table 1, it was absent in 68.39% of pIMA lesions and 30.63% of mIMA lesions, that is, present in 31.61% and 69.37%, respectively (P < .001). Cavities or vacuoles were more frequently observed in patients with pIMA than in those with mIMA (56.48% vs 27.93%, P < .001). Lobulation was observed more frequently in patients with mIMA than in those with pIMA (92.79% vs 69.95%). The angiogram sign was absent in 81.08% of mIMA lesions and 64.77% of pIMA lesions, that is, it was present more often in pIMA than in mIMA (35.23% vs 18.92%). The mean CT values on plain scan and venous phase were slightly lower in patients with pIMA than in those with mIMA, although these differences did not reach statistical significance. However, there was a statistically significant difference in the mean CT value on the arterial phase between pIMA (14.222 ± 93.167) and mIMA (36.294 ± 88.919), with a P value of .049. Furthermore, there was no significant difference in ΔCTV between the 2 groups (P = .848). The characteristics of the patients in the training and testing cohorts are detailed in Table 1.

Table 1.

Clinical Characteristics of the Patients.

Characters	Training cohort			Testing cohort
	pNMA	mNMA	P	pNMA	mNMA	P
Gender
Female	72	41	.444	26	20	.014
Male	58	41		37	9
Smoking
No	86	53	.821	37	16	.748
Yes	44	29		26	13
PLI
No	90	25	<.001	42	9	.001
Yes	40	57		21	20
Edge clear
No	22	8	.145	10	2	.235
Yes	108	74		53	27
Lobul
No	36	7	<.001	22	1	.001
Yes	94	75		41	28
Spicul
No	89	40	.004	42	13	.047
Yes	41	42		21	16
AS
No	78	66	.002	47	24	.387
Yes	52	16		16	5
Cavity
No	60	59	<.001	24	21	.002
Yes	70	23		39	8
Lower lobe
No	38	45	<.001	25	13	.641
Yes	92	37		38	16
ΔCTA, median [IQR]	19.984 [10.266, 35.895]	15.000 [7.333, 33.000]	.034	19.256 [9.000, 40.000]	29.000 [17.000, 38.000]	.317
ΔCTV, median [IQR]	28.000 [15.000, 46.000]	25.000 [18.000, 43.000]	.749	29.000 [20.731, 51.000]	28.000 [23.260, 37.000]	.983
Venous phase median [IQR]	43.597 [18.000, 61.000]	59.000 [45.000, 72.000]	<.001	47.000 [28.000, 68.000]	55.000 [46.000, 71.000]	.024
Arterial phase median [IQR]	36.000 [13.000, 52.000]	48.000 [35.000, 62.000]	<.001	41.000 [13.000, 57.000]	58.000 [45.000, 65.000]	.002
Plain, median [IQR]	20.000 [−11.716, 31.000]	30.000 [20.000, 39.000]	<.001	21.000 [−21.000, 30.000]	25.000 [20.000, 32.000]	.027
Diameter, median [IQR]	2.400 [1.500, 3.600]	2.500 [2.000, 3.700]	.180	1.900 [1.400, 3.500]	2.100 [1.600, 3.700]	.461
Age, median [IQR]	62.000 [56.000, 67.000]	59.000 [53.000, 65.000]	.032	57.474 ± 12.991	59.657 ± 10.370	.461

Abbreviations: pNMA, pure invasive mucinous adenocarcinoma; mNMA, mixed mucinous/nonmucinous adenocarcinoma; Lobul, lobulation; Spicul, spiculation; Cavity, cavities or vacuoles; AS, angiogram sign; PLI, pleural indentation sign; ΔCTV, venous phase CT value minus plain CT value; ΔCTA, arterial phase CT value minus plain CT value; IQR, interquartile range.

Radiomics Feature Selection and Model Construction

A total of 1037 radiomics features were initially extracted. Of these, 425 features that showed no significant difference between pIMA and mIMA and 255 poorly reproducible features with ICC values below 0.75 were excluded. LASSO screening was then applied; the λ values under the minimum criterion and the 1-standard-error criterion were 0.052 and 0.119, respectively. With λ = 0.052, 6 robust radiomics features (contrast, T10Percentile, median, minimum, root mean squared, and skewness) were retained, as illustrated in Figure 1.

Figure 1.

Radiomics feature selection with the least absolute shrinkage and selection operator (LASSO). (A) Coefficient profiles plotted against log(λ). The dotted line marks the λ value selected under the minimum criterion; features with nonzero coefficients at this λ were retained. (B) The x-axis represents log(λ), and the numbers above the plot represent the average number of predictive variables. The red dots represent the average deviance of each model at a given λ, and the vertical bars represent the upper and lower limits of the deviance. The vertical dotted line marks the log(λ) value corresponding to the best λ under the minimum criterion. By tuning λ, the binomial deviance of the model is minimized and the best-performing feature set is selected. After LASSO screening removed the redundant features, the 6 most robust radiomics features (contrast, T10Percentile, median, minimum, root mean squared, and skewness) were retained, with λ = 0.052.

In the training dataset, nine machine learning radiomics prediction models were developed, including XGB classifier, LGBM classifier, random forest classifier, AdaBoost classifier, GNB, logistic regression, MLP classifier, SVC, and KNeighbors classifier, utilizing 6 radiomics features. The model constructed using the GNB classifier method exhibited superior performance, with AUC values of 0.657 and 0.733 in the training and test datasets, respectively (refer to Tables 2, 3, and Figure 2).

Figure 2.

Receiver operating characteristic (ROC) curves of the 9 machine learning models.

Table 2.

Performance metrics for 9 models in the training dataset.

Model	AUC (SD)	Accuracy (SD)	Sensitivity (SD)	Specificity (SD)	F1 score (SD)
XGBoost	0.999 (0.001)	0.988 (0.006)	0.992 (0.008)	0.991 (0.000)	0.992 (0.008)
logistic	0.680 (0.012)	0.639 (0.036)	0.742 (0.045)	0.583 (0.087)	0.618 (0.009)
LightGBM	0.929 (0.007)	0.873 (0.015)	0.913 (0.024)	0.858 (0.009)	0.848 (0.020)
RandomForest	1.000 (0.000)	0.985 (0.009)	1.000 (0.000)	0.990 (0.010)	0.992 (0.008)
AdaBoost	0.951 (0.003)	0.885 (0.027)	0.866 (0.002)	0.906 (0.045)	0.863 (0.027)
GNB	0.657 (0.003)	0.621 (0.006)	0.679 (0.013)	0.595 (0.001)	0.576 (0.012)
MLP	0.481 (0.073)	0.473 (0.047)	0.899 (0.004)	0.222 (0.063)	0.565 (0.031)
SVM	0.653 (0.004)	0.624 (0.003)	0.697 (0.003)	0.587 (0.001)	0.591 (0.018)
KNN	0.786 (0.006)	0.689 (0.009)	0.761 (0.144)	0.709 (0.133)	0.727 (0.012)

Abbreviations: AUC, area under the curve; SD, standard deviation; XGBoost, eXtreme gradient boosting; SVM, support vector machine (polynomial kernel); LightGBM, light gradient boosting machine; AdaBoost, adaptive boosting; GNB, Gaussian Naive Bayes; MLP, multilayer perceptron; KNN, k-nearest neighbor.

Table 3.

Performance metrics for nine models in the validation dataset.

Model	AUC (SD)	Accuracy (SD)	Sensitivity (SD)	Specificity (SD)	F1 score (SD)
XGBoost	0.594 (0.005)	0.593 (0.012)	0.670 (0.225)	0.588 (0.213)	0.581 (0.096)
logistic	0.661 (0.027)	0.605 (0.023)	0.500 (0.125)	0.815 (0.074)	0.481 (0.074)
LightGBM	0.654 (0.049)	0.628 (0.070)	0.728 (0.061)	0.711 (0.169)	0.630 (0.018)
RandomForest	0.557 (0.078)	0.605 (0.047)	0.506 (0.389)	0.708 (0.292)	0.485 (0.300)
AdaBoost	0.574 (0.105)	0.628 (0.070)	0.455 (0.170)	0.785 (0.008)	0.460 (0.152)
GNB	0.733 (0.005)	0.651 (0.023)	0.868 (0.132)	0.546 (0.162)	0.674 (0.013)
MLP	0.589 (0.035)	0.523 (0.058)	0.783 (0.117)	0.460 (0.112)	0.569 (0.096)
SVM	0.677 (0.022)	0.674 (0.000)	0.717 (0.133)	0.696 (0.175)	0.610 (0.099)
KNN	0.569 (0.010)	0.605 (0.000)	0.461 (0.039)	0.720 (0.030)	0.385 (0.100)

Abbreviations: AUC, area under the curve; SD, standard deviation; XGBoost, eXtreme gradient boosting; SVM, support vector machine (polynomial kernel); LightGBM, light gradient boosting machine; AdaBoost, adaptive boosting; GNB, Gaussian Naive Bayes; MLP, multilayer perceptron; KNN, k-nearest neighbor.

Feature Selection and Clinical Model Construction

In the training dataset, a multivariable analysis revealed significant differences between pIMA and mIMA in lesion location in the lower lobe (OR = 0.395, 95% CI: 0.189-0.810), cavity or vacuole (OR = 0.321, 95% CI: 0.154-0.651), angiogram sign (OR = 0.266, 95% CI: 0.117-0.574), and pleural indentation sign (OR = 4.029, 95% CI: 1.883-8.934). A clinical prediction model was developed using a GNB machine learning approach based on these 4 features. The AUC values of the clinical models for distinguishing between pIMA and mIMA were 0.803 and 0.80 in the training and test groups, respectively. See Table 4 for details.
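The direction of these odds ratios can be checked against crude (unadjusted) odds ratios computed from the training-cohort counts in Table 1. The sketch below treats mNMA as the outcome and uses the Woolf log-scale interval; it is not the multivariable model itself, so the values differ from the adjusted estimates. It assumes that the angiogram sign "Yes" count for mNMA in the training cohort is 16 (82 patients minus the 66 recorded as "No"). The function name is hypothetical.

```python
import math

def crude_odds_ratio(exp_case, unexp_case, exp_ctrl, unexp_ctrl, z=1.96):
    """Crude odds ratio with a Woolf (log-scale) 95% CI from a 2x2 table."""
    odds_ratio = (exp_case * unexp_ctrl) / (unexp_case * exp_ctrl)
    se = math.sqrt(1 / exp_case + 1 / unexp_case + 1 / exp_ctrl + 1 / unexp_ctrl)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper

# Training-cohort counts from Table 1:
# (mNMA with sign, mNMA without sign, pNMA with sign, pNMA without sign)
counts = {
    "PLI": (57, 25, 40, 90),
    "Cavity": (23, 59, 70, 60),
    "Lower lobe": (37, 45, 92, 38),
    "AS": (16, 66, 52, 78),  # assumes 16 mNMA patients with the sign
}

crude = {name: crude_odds_ratio(*cells) for name, cells in counts.items()}
for name, (odds_ratio, lower, upper) in crude.items():
    print(f"{name}: OR = {odds_ratio:.3f} (95% CI {lower:.3f}-{upper:.3f})")
```

The crude odds ratio for the pleural indentation sign is above 1 and those for the other 3 features are below 1, the same directions as the adjusted values, which is consistent with the odds ratios describing the odds of mIMA relative to pIMA.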

Table 4.

Multivariable Analysis to Identify Significant Factors for pNMA and mNMA in the Training Set.

Predictor	P	Odds ratio	95% CI lower	95% CI upper
AS	.001	0.266	0.117	0.574
PLI	<.001	4.029	1.883	8.934
Cavity	.002	0.321	0.154	0.651
Lower lobe	.012	0.395	0.189	0.81

Abbreviations: OR, odds ratio; AS, angiogram sign; PLI, pleural indentation sign.

Combined Model Construction and Validation of Performance

The GNB classifier was used to develop a combined model incorporating 4 clinical features and 6 radiomics features. The combined model outperformed both the clinical and radiomics models: the AUCs of the combined, clinical, and radiomics models were 0.81, 0.80, and 0.68 in the training cohort and 0.91, 0.80, and 0.81 in the test cohort, respectively (Table 5 and Figure 3). Moreover, the combined model achieved a sensitivity of 0.841 and a specificity of 0.708 in the training group and a sensitivity of 0.966 and a specificity of 0.746 in the test group (Table 5 and Figure 3). Based on the decision curve analysis, the combined model for preoperative differentiation between pIMA and mIMA provided a higher net benefit than the clinical and radiomics models in both the training and test cohorts (Figure 4). The Brier scores for the training and test groups were 0.171 and 0.094, respectively, with corresponding calibration plots depicted in Figure 5.
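For reference, the metrics reported here can be computed from predicted probabilities as in the following sketch, which uses toy values rather than study data. The Brier score is the mean squared difference between the predicted probability and the observed outcome, so lower values indicate better calibration; the 0.5 classification threshold and the function names are assumptions made for illustration.

```python
import numpy as np
from sklearn.metrics import roc_auc_score

# Toy labels and predicted probabilities (NOT study data).
y_true = np.array([1, 1, 1, 0, 0, 0, 1, 0])
y_prob = np.array([0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.7, 0.2])

def brier_score(y_true, y_prob):
    """Mean squared error between predicted probability and outcome."""
    return float(np.mean((y_prob - y_true) ** 2))

def sensitivity_specificity(y_true, y_prob, threshold=0.5):
    """Sensitivity and specificity after thresholding the probabilities."""
    predicted = y_prob >= threshold
    tp = np.sum(predicted & (y_true == 1))
    fn = np.sum(~predicted & (y_true == 1))
    tn = np.sum(~predicted & (y_true == 0))
    fp = np.sum(predicted & (y_true == 0))
    return tp / (tp + fn), tn / (tn + fp)

auc = roc_auc_score(y_true, y_prob)
brier = brier_score(y_true, y_prob)
sensitivity, specificity = sensitivity_specificity(y_true, y_prob)
print(auc, brier, sensitivity, specificity)  # 0.9375 0.125 0.75 0.75
```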

Figure 3.

Comparison of ROC curves among the clinical model, radiomics model, and combined model in the training cohorts (A) and testing cohorts (B). The AUC values in the combined model were better than those in the clinical model and radiomics model for the prediction of pNMA.

Figure 4.

Decision curve analyses for the radiomics–clinical model compared with the radiomics model and clinical model in the training cohort (A) and the testing cohort (B). Decision curve analysis showed that the net benefits of the combined model for the prediction of pure invasive mucinous adenocarcinoma (pNMA) were higher than those of the clinical model and radiomics model.

Figure 5.

Calibration plot of the combined model in the training group and testing group.

Table 5.

Diagnostic Performance of the Prediction Models.

	Training cohort			Testing cohort
Model	AUC	SEN	SPE	AUC	SEN	SPE
RADS	0.68	0.695	0.592	0.81	0.862	0.73
Clinical	0.80	0.78	0.723	0.8	0.828	0.73
Comb	0.81	0.841	0.708	0.91	0.966	0.746

Abbreviations: AUC, area under the curve; SEN, sensitivity; SPE, specificity; RADS, radiomics model; Clinical, clinical model; Comb, combined model.

Discussion

This retrospective study identified significant variations between pIMA and mIMA in terms of qualitative and quantitative clinical data as well as radiomics data. A clinical prediction model for distinguishing between pIMA and mIMA was developed using 4 selected clinical variables, including lesion location in the lower lobe, presence of cavity or vacuole, angiogram sign, and pleural indentation sign. Additionally, a radiomic prediction model was constructed using 6 selected radiomic features: contrast, T10Percentile, median, minimum, root mean squared, and skewness. The clinical-radiomics combined model, comprising 4 clinical features and 6 radiomics parameters, exhibited strong predictive capabilities in both the training and test cohorts. Furthermore, statistical analysis revealed significant disparities between the clinical model, radiomics model, and combined clinical–radiomics model, with the latter outperforming the former 2 individual models.

From a histological perspective, pIMA is characterized by the presence of goblet and/or columnar cells that secrete extracellular mucin, with more than 90% of the tumor showing invasive mucinous features. In contrast, mIMA typically displays invasive mucinous patterns ranging from 10% to 90%. Our multivariate logistic regression analysis revealed that pIMAs were more likely than mIMAs to be located in the lower lobe (67.36% vs 47.75%, P < .001). These findings align with previous studies conducted by Nie et al¹² and Xu et al.²⁰ This disparity may be explained by the origin of tumor cells in pIMA, which derive from goblet cells or columnar epithelial cells. These cancer cells are relatively well differentiated and produce increased amounts of mucus, the distribution of which is influenced by gravity. Consequently, pIMA was more frequently observed in the lower lobe compared to mIMA.

CT scanning, particularly CT dynamic contrast-enhanced scanning, plays a crucial role in the diagnosis of lung cancer. There have been some reports on CT features for IMA, but to our knowledge, the CT characteristics of preoperative differentiation of pIMA from mIMA are not described in the medical literature.^12,21–23 This study represents the first investigation into the preoperative differentiation of pIMA from mIMA. Our findings indicate that over half of patients with pIMA exhibited cavity or vacuole formation, whereas only a quarter of patients with mIMA displayed this characteristic. Cavities in IMA are attributed to incomplete obstruction of bronchioles by mucus, leading to alveolar hyperventilation. Conversely, vacuoles may result from internal necrosis of the tumor, with the necrotic material being expelled through the bronchus. Consequently, cavities or vacuoles are frequently observed in IMA. The presence of more invasive mucinous components in pIMA compared to mIMA may account for the higher prevalence of cavities or vacuoles in pIMA.

The CT angiogram sign refers to strongly enhanced pulmonary vascular markings visible within the lesion on contrast-enhanced scanning; its pathological basis is the low malignancy of the tumor and mild damage to blood vessels.¹⁹ Of the 193 patients with pIMA, 68 (35.2%) exhibited the angiogram sign, roughly 1 in 3. By contrast, 21 of the 111 patients with mIMA (18.9%) displayed it, roughly 1 in 5. The prevalence of the angiogram sign was therefore higher in pIMA than in mIMA, consistent with previous research findings. This disparity may be attributed to the lower-grade malignancy of pIMA compared with mIMA, suggesting that pIMA causes less vascular damage. Additionally, CT imaging revealed more vascular signs in pIMA than in mIMA. Our multivariate logistic regression analysis indicated that the pleural indentation sign was observed less frequently in pIMA than in mIMA. This may be because pIMA is composed predominantly of mucus, resulting in less fibrosis, scarring, and traction within the tumor. Conversely, mIMA contains a higher proportion of nonmucinous components, making it more susceptible to fibrosis and subsequent pleural traction.
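
The arithmetic behind the angiogram-sign comparison can be sketched as follows. This is an illustrative calculation only: it uses the counts reported above (68 of 193 and 21 of 111), the function names are hypothetical, and the unadjusted odds ratio it computes is not a result reported by the study, whose estimates come from multivariate logistic regression.

```python
def proportion(events, total):
    """Share of patients in a group who show the sign."""
    return events / total


def odds_ratio(events_a, total_a, events_b, total_b):
    """Unadjusted odds ratio of group A versus group B (illustrative only)."""
    odds_a = events_a / (total_a - events_a)
    odds_b = events_b / (total_b - events_b)
    return odds_a / odds_b


# Counts taken from the text: angiogram sign present / group size.
pima = proportion(68, 193)
mima = proportion(21, 111)
print(f"pIMA: {pima:.1%}, mIMA: {mima:.1%}")
print(f"unadjusted odds ratio: {odds_ratio(68, 193, 21, 111):.2f}")
```

An unadjusted ratio of this kind ignores the other covariates in the clinical model, so it should be read only as a rough summary of the two proportions.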

To explore a more effective method for differentiating between pIMA and mIMA, we identified 6 radiomic features that discriminate between the two types: Contrast, T10Percentile, median, minimum, root mean squared, and skewness. These features fall into two categories, Haralick features and histogram parameters. Contrast, a Haralick feature, is calculated from the gray-level co-occurrence matrix (GLCM) and represents the local contrast variation in an image. The remaining features (T10Percentile, median, minimum, root mean squared, and skewness) are histogram parameters. The median is the median pixel value of the lesion in the image. The minimum is the lowest pixel value of the lesion. The T10Percentile is the value below which 10% of the observed values fall. Skewness reflects the degree of asymmetry of the histogram distribution; if skewness carries predictive value, its absolute values would be expected to be higher. The root mean squared is the square root of the mean of the squared voxel values. All of these features were obtained by converting the images into higher-dimensional data. Quantitative imaging features can be mined from routine medical images at high throughput, enabling automated analysis that aids clinical decision-making. Prior research has demonstrated that radiomics features extracted from CT scans have the potential to differentiate IMA from invasive nonmucinous adenocarcinoma, as well as IMA from pulmonary tuberculoma.²⁴,²⁵ To our knowledge, the current study is the first attempt to differentiate pIMA from mIMA using radiomic characteristics, and the findings suggest that these radiomic features can be used for the preoperative differentiation of pIMA and mIMA.
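
The feature definitions above can be made concrete with a minimal sketch. It is not the feature-extraction software used in the study: the function names are hypothetical, the percentile uses linear interpolation (an assumed convention), and the GLCM is built for a single pixel offset on a small 2D grid of integer gray levels.

```python
import math
import statistics


def first_order_features(voxels):
    """Histogram parameters named in the text, for one lesion's voxel values."""
    values = sorted(float(v) for v in voxels)
    n = len(values)
    mean = sum(values) / n
    # Population standard deviation; skewness is the third standardized moment.
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    skew = 0.0 if std == 0 else sum((v - mean) ** 3 for v in values) / (n * std ** 3)
    # 10th percentile by linear interpolation between order statistics
    # (assumed convention; other tools may use a different rule).
    pos = 0.10 * (n - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, n - 1)
    p10 = values[lo] + (pos - lo) * (values[hi] - values[lo])
    return {
        "minimum": values[0],
        "median": statistics.median(values),
        "t10_percentile": p10,
        # Square root of the mean of the squared values.
        "root_mean_squared": math.sqrt(sum(v * v for v in values) / n),
        "skewness": skew,
    }


def glcm_contrast(image, dx=1, dy=0):
    """Haralick contrast from a symmetric, normalized GLCM at offset (dx, dy)."""
    counts = {}
    total = 0
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                i, j = image[r][c], image[r2][c2]
                # Count each neighbouring pair in both directions (symmetric GLCM).
                counts[(i, j)] = counts.get((i, j), 0) + 1
                counts[(j, i)] = counts.get((j, i), 0) + 1
                total += 2
    # Contrast weights each co-occurrence by the squared gray-level difference.
    return sum((i - j) ** 2 * k / total for (i, j), k in counts.items())


features = first_order_features([1, 2, 3, 4, 10])
contrast = glcm_contrast([[0, 1], [0, 1]])
```

In this toy input, the single high value (10) pulls the histogram's tail to the right, so the skewness is positive, and every horizontal neighbour pair differs by one gray level, so the contrast is nonzero.
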
For the radiomics model, the ROC analysis yielded area under the curve (AUC) values of 0.68 and 0.81 in the training and test cohorts, respectively. The clinical–radiomics combined model achieved higher AUCs (0.81 in the training cohort and 0.91 in the test cohort) than the individual clinical and radiomics models. Furthermore, the decision curve analysis illustrated that the combined model outperformed the clinical and radiomic models in predicting outcomes. Decision curves offer insight beyond conventional performance metrics such as discrimination and calibration, thereby aiding the assessment of clinical usefulness.
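
The three kinds of metric discussed here (discrimination, calibration, and decision-curve net benefit) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the study's analysis code: the function names and the four-patient toy data are hypothetical, AUC is computed by pairwise ranking, calibration is summarized by the Brier score reported in the abstract, and net benefit uses the standard decision-curve formula at a single threshold probability.

```python
def roc_auc(y_true, y_prob):
    """Probability that a random positive scores above a random negative (ties count 0.5)."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome (lower is better)."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def net_benefit(y_true, y_prob, threshold):
    """Decision-curve net benefit of treating everyone with probability >= threshold."""
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    # False positives are weighted by the odds of the threshold probability.
    return tp / n - (fp / n) * threshold / (1 - threshold)


# Hypothetical toy data: 1 = pIMA, 0 = mIMA, with predicted probabilities of pIMA.
y_true = [1, 1, 0, 0]
y_prob = [0.9, 0.6, 0.7, 0.2]
auc = roc_auc(y_true, y_prob)
brier = brier_score(y_true, y_prob)
nb = net_benefit(y_true, y_prob, threshold=0.5)
```

A decision curve is obtained by evaluating `net_benefit` over a range of thresholds and comparing the result with the treat-all and treat-none strategies.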

Our study is subject to several limitations that warrant consideration. Firstly, the sample size was limited and restricted to a single institution due to the rarity of lung IMA, with no power calculation conducted to determine the sample size. Secondly, potential selection bias exists as only patients with pathologic results postsurgery were included. Thirdly, the short follow-up period postsurgery precluded the development of a predictive model for patient survival. In order to inform clinical practice, the model will undergo validation in a multicenter prospective study in the future, with subsequent optimization.

Conclusion

In summary, we developed a model for distinguishing pIMA from mIMA by integrating preoperative clinicopathological, radiomic, and clinical–radiomic features. This novel clinical–radiomic model demonstrated promising predictive capability and holds potential utility in the clinical setting. The reference diagnosis of pIMA and mIMA in this study was based primarily on postoperative pathological examination of the tumors, because pathological diagnosis is currently the most precise method. The potential value of the model is to support more precise treatment planning for patients with advanced-stage disease and for patients who need neoadjuvant therapy in the future.

Furthermore, the study adheres to the STROBE guidelines for reporting observational research.

Footnotes

Acknowledgments

None.

Authors’ Contributions

JJZ and LGH performed the experiments and wrote the manuscript. QX and FXG were responsible for designing the experiments. All authors read and approved the final version of this submitted manuscript.

Availability of Data and Materials

The datasets produced and examined in the present investigation are not accessible to the public at this time due to ongoing analysis for future publications, although they can be obtained from the corresponding author on reasonable request.

Consent for Publication

We confirm that there has been no publication, submission, or acceptance elsewhere of the manuscript other than this journal. All potentially identifiable images or data in this article were published with the written consent of the individuals involved.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Ethical Approval and Consent to Participate

The Xing Tai People's Hospital ethical review board and Ethics Committee of the Fourth Affiliated Hospital of Hebei Medical University approved this retrospective analysis and waived informed consent requirements.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Key development plan of Xingtai (grant number ZC20301).

ORCID iD

Ligang Hao

References

1. Maeda, Tsuchiya, Hao, et al. Kras(G12D) and Nkx2-1 haploinsufficiency induce mucinous adenocarcinoma of the lung. J Clin Invest 2012; 122(12): 4388-4400.

2. Travis, Brambilla, Nicholson, et al. The 2015 World Health Organization classification of lung tumors: impact of genetic, clinical and radiologic advances since the 2004 classification. J Thorac Oncol 2015; 10(9): 1243-1260.

3. Travis, Brambilla, Noguchi, et al. International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society international multidisciplinary classification of lung adenocarcinoma. J Thorac Oncol 2011; 6(2): 244-285.

4. Lantuejoul, Rouquette, Brambilla, Travis. New WHO classification of lung adenocarcinoma and preneoplasia. Ann Pathol 2016; 36(1): 5-14.

5. Wang, Wei, et al. Prognostic value of the new International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society classification in stage IB lung adenocarcinoma. Eur J Surg Oncol 2015; 41(10): 1430-1436.

6. Murakami, Ito, Tsubokawa, et al. Prognostic value of the new IASLC/ATS/ERS classification of clinical stage IA lung adenocarcinoma. Lung Cancer (Amsterdam, Netherlands) 2015; 90(2): 199-204.

7. Song, Zhu, Guo, et al. Prognostic value of the IASLC/ATS/ERS classification in stage I lung adenocarcinoma patients-based on a hospital study in China. Eur J Surg Oncol 2013; 39(11): 1262-1268.

8. Russell, Wainer, Wright, et al. Does lung adenocarcinoma subtype predict patient survival?: a clinicopathologic study based on the new International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society international multidisciplinary lung adenocarcinoma classification. J Thorac Oncol 2011; 6(9): 1496-1504.

9. Boland, Maleszewski, Wampfler, et al. Pulmonary invasive mucinous adenocarcinoma and mixed invasive mucinous/nonmucinous adenocarcinoma—a clinicopathological and molecular genetic study with survival analysis. Hum Pathol 2018; 71: 8-19.

10. Kim, Bae, et al. Radiological and clinical features of screening-detected pulmonary invasive mucinous adenocarcinoma. Interact Cardiovasc Thorac Surg 2022; 34(2): 229-235.

11. Wang, Yang, Liu, et al. Primary invasive mucinous adenocarcinoma of the lung: prognostic value of CT imaging features combined with clinical factors. Korean J Radiol 2021; 22(4): 652-662.

12. Nie, Zhang, Y-X. Comparing clinicopathological features and prognosis of primary pulmonary invasive mucinous adenocarcinoma based on computed tomography findings. Cancer Imaging 2019; 19(1): 47.

13. Wen, Yang, Zhu, et al. Pretreatment CT-based radiomics signature as a potential imaging biomarker for predicting the expression of PD-L1 and CD8+TILs in ESCC. Onco Targets Ther 2020; 13: 12003-12013.

14. Lee, Park, et al. Radiomics and its emerging role in lung cancer research, imaging biomarkers and clinical management: state of the art. Eur J Radiol 2017; 86: 297-307.

15. Woodruff, Sanduleanu, et al. Preoperative CT-based radiomics combined with intraoperative frozen section is predictive of invasive adenocarcinoma in pulmonary nodules: a multicenter study. Eur Radiol 2020; 30(5): 2680-2691.

16. Zhao, Yang, et al. Development and validation of a radiomics nomogram for identifying invasiveness of pulmonary adenocarcinomas appearing as subcentimeter ground-glass opacity nodules. Eur J Radiol 2019; 112: 161-168.

17. She, Zhang, Zhu, et al. The predictive value of CT-based radiomics in differentiating indolent from invasive lung adenocarcinoma in patients with pulmonary nodules. Eur Radiol 2018; 28(12): 5121-5128.

18. von Elm, Altman, Egger, et al. Strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. BMJ (Clinical Research ed) 2007; 335(7624): 806-808.

19. Lee, Kang, Lee, et al. Spread through air spaces (STAS) in invasive mucinous adenocarcinoma of the lung: incidence, prognostic impact, and prediction based on clinicoradiologic factors. Thorac Cancer 2020; 11(11): 3145-3154.

20. Shen, Wang, et al. Cheng G: clinical features and prognosis of resectable pulmonary primary invasive mucinous adenocarcinoma. Transl Lung Cancer Res 2022; 11(3): 420-431.

21. Yin, Y-H, Y-G, Wang. Differential diagnosis of pulmonary nodular mucinous adenocarcinoma and tuberculoma with dynamic CT: a retrospective study. J Thorac Dis 2022; 14(4): 1225-1231.

22. Shen, Wang, et al. Clinical features and prognosis of resectable pulmonary primary invasive mucinous adenocarcinoma. Transl Lung Cancer Res 2022; 11(3): 420-431.

23. Miyata, Endo, Nakajima, et al. High-resolution computed tomography findings of early mucinous adenocarcinomas and their pathologic characteristics in 22 surgically resected cases. Eur J Radiol 2015; 84(5): 993-997.

24. Zhang, Hao, et al. Radiomics nomogram for preoperative differentiation of pulmonary mucinous adenocarcinoma from tuberculoma in solitary pulmonary solid nodules. BMC Cancer 2023; 23(1): 261.

25. Zhang, Hao, et al. CT Radiomics combined with clinicopathological features to predict invasive mucinous adenocarcinoma in patients with lung adenocarcinoma. Technol Cancer Res Treat 2023; 22: 15330338231174306.

Radiomics and Clinical Characters Based Gaussian Naive Bayes (GNB) Model for Preoperative Differentiation of Pulmonary Pure Invasive Mucinous Adenocarcinoma From Mixed Mucinous Adenocarcinoma
