Diagnosis of Benign and Malignant Pulmonary Ground-Glass Nodules Using Computed Tomography Radiomics Parameters

Abstract

Objective: To assess the clinical value of a radiomics model based on low-dose computed tomography (LDCT) in diagnosing benign and malignant pulmonary ground-glass nodules. Methods: A retrospective analysis was performed on 274 patients who underwent LDCT scanning with the identification of pulmonary ground-glass nodules from January 2018 to March 2021. All patients had complete clinical and pathological data. The cases were randomly divided into 191 cases in a training set and 83 cases in a validation set using the random sampling method and a 7:3 ratio. Based on the predictor sources, we established clinical, radiomics, and combined prediction models in the training set. A receiver operating characteristic (ROC) curve was generated for the training and validation sets, the predictive abilities of the different models for benign and malignant nodules were compared according to the area under the curve (AUC), and the model with the best predictive ability was selected. A calibration curve was plotted to test the good-of-fitness of the model in the validation set. Results: Of the 274 patients (84 males and 190 females), 156 had malignant, and 118 had benign nodules. The univariate analysis showed a statistically significant difference in nodule position between benign nodules and lung adenocarcinoma in both data sets (P <.001 and .021). In the training set, when the nodule diameter was >8 mm, the probability of nodule malignancy increased (P < .001). The results showed that the combined model had a higher prediction ability than the other two models. The combined model could distinguish between benign and malignant pulmonary nodules in the training set (AUC: 0.711; 95%CI: 0.634-0.787; ACC: 0.696; sensitivity: 0.617; specificity: 0.816; PPV:0.835; NPV: 0.585). Moreover, this model could predict benign and malignant nodules in the validation set (AUC: 0.695; 95%CI: 0.574-0.816; ACC: 9.747; sensitivity: 0.694; specificity: 0.824; PPV: 0.850; NPV: 0.651). The calibration curve had a P value of 0.775, indicating that in the validation set, there was no difference between the value predicted by the combined model and the actual observed value and that the result was a good fit. Conclusion: The prediction model combining clinical information and radiomics parameters had a good ability to distinguish benign and malignant pulmonary ground-glass nodules.

Keywords

radiomics prediction model low-dose CT pulmonary ground-glass nodule early screening of lung cancer

Introduction

The latest global cancer statistics show that lung cancer ranks first in mortality and second in morbidity.¹ Due to the asymptomatic nature of early lung cancer, most patients present in the advanced stages when treatment opportunities are limited, resulting in reduced survival. Low-dose computed tomography (LDCT) plays an increasingly important role in early lung cancer screening.² Moreover, with the increasingly extensive application of artificial intelligence in medical research, the mining and analysis of medical image data have gained attention. Pulmonary ground-glass nodules are similar to ground glass in appearance.³ Studies have shown that pulmonary ground-glass nodules, especially semi-solid nodules, are associated with a high probability of malignancy. Therefore, timely detection of pulmonary ground-glass nodules and subsequent interventions could reduce the incidence of lung cancer.⁴ At present, the diagnosis of benign and malignant pulmonary ground-glass nodules is mainly based on the density and morphology of the nodules and the experience of radiologists, but sometimes it is still difficult to distinguish between benign and malignant nodules.⁵ With the development of radiomics, medical images have gained unprecedented diagnostic advantages. There have been studies on combining image analysis and artificial intelligence to predict the presence of benign or malignant pulmonary ground-glass nodules, but a consensus has not been reached.^6–9 This study aimed to assess the clinical value of radiomics parameters based on LDCT in the diagnosis of pulmonary ground-glass nodules.

Materials and Methods

Study Design

A retrospective analysis was performed on 274 patients who underwent LDCT scanning identifying pulmonary ground-glass nodules in the Affiliated Cancer Hospital of Chongqing University from January 2018 to March 2021. There were 84 males and 190 females, with an average age of 50 years (ranging from 21 to 81 years). All patients met the following inclusion criteria: 1) the nodule type was based on the pathology assessment, and 2) There was no history of other malignant tumors. The exclusion criteria were as follows: the nodules were too small to extract radiomics parameters. Information on gender, age, nodule position, spiculation sign, diameter, and smoking history were collected and recorded. The 274 cases were randomly divided into training and validation sets in a 7:3 ratio. The patient enrollment flowchart is shown in Figure 1. This study was approved by the Affiliated Chongqing University Cancer Hospital Ethics Committee (Ethical code: CZLS2022054-A).

Figure 1.

Patient enrollment flowchart.

Image Acquisition

A GE Revolution CT scanner was used to obtain all images. CT scanning parameters were as follows: voltage 120 kV, current size 50 mA, scanning thickness 7 mm, and reconstruction thickness 1.5 mm. The reconstruction matrix was 512 × 512, and the pitch and interlayer spacing was 1 mm and 7 mm, respectively.

Extraction of Radiomics Parameters

LIFEx software was used to segment and extract the target lesions. The exported CT images in DICOM format were imported into LIFEx software, and the lesions were segmented manually slice by slice. The segmented regions of interest (ROIs) were examined to ensure that ROIs did not include peripheral blood vessels, the trachea, or normal lung tissue. As an example, an image of lesion tissue segmentation by LIFEx software is shown in Figure 2. The extraction of radiomics parameters was set as follows: image resampling parameters: 2 × 2 × 2 and binWidth: 64. A total of 47 parameters were extracted, including First Order Statistics,¹⁰ Shape,⁵ Gray Level Cooccurrence Matrix,⁷ Gray Level Run Length Matrix,¹¹ Gray Level Size Zone Matrix,¹¹ and Neighboring Gray Tone Difference Matrix.³

Figure 2.

The lesion tissue segmentation by LIFEx software.

Feature Engineering, Modeling, and Selection

Feature engineering and modeling were only completed for the training set. Clinical, radiomics, and combined prediction models were established based on the stepwise regression method of binary logistic regression classifiers.

The same feature selection method was used in all three models. The recursive feature elimination (RFE) method was used for the first step selection. The parameters were selected according to the model accuracy. Then, the logical regression stepwise regression method was used to determine the parameters for establishing the model based on the Akaike information criterion (AIC). The model with the minimum AIC value was selected.

The clinical model included gender, age, nodule position, spiculation sign, diameter, and smoking history. The radiomics model included 47 radiomics parameters. The combined model included all clinical and radiomics variables.

Model Comparison, Optimal Model Selection, and Visualization

The indicators used for model comparison in this study included the AIC value used to establish the model, which reflects the good-of-fitness of model fitting with smaller values corresponding to a better good-of-fitness). The receiver operating characteristic (ROC) curve and the area under the curve (AUC) were also used for model comparison. AUC values closer to 1 correspond to a better ability of the model to distinguish between benign and malignant nodules.

After selecting the optimal model, the calibration curve was plotted to test the good-of-fitness of the model for the validation set. The calibration curve P-value was <.05, indicating no deviation between the predicted and observed values in the validation set. Finally, a nomogram was created to visualize the optimal model.

Statistical Methods

R software (version 3.6.3) was used for statistical analysis. Continuous variables were analyzed using T-tests or the Wilcoxon rank-sum test depending on whether the variables were normally distributed, and categorical variables were analyzed with Fisher's exact test. The “caret” package was used for fitting the logistic regression model and RFE parameter selection, and the “pROC” package was used for plotting the ROC curve. The “rms” package was used to plot the calibration curves and nomograms. P < .05 indicates statistically significant differences.

Results

Analysis of Enrolled Patients and Clinical Data

The 274 enrolled patients included 156 cases of lung adenocarcinoma (51 males and 105 females) and 118 cases of benign nodules (33 males and 85 females) confirmed by pathology. There were 191 patients in the training set (61 males and 130 females), including 109 lung adenocarcinoma and 82 benign nodule cases. There were 83 patients in the validation set (23 males and 60 females), including 47 lung adenocarcinoma and 36 benign nodule cases. Clinical data from 274 patients are shown in Table 1. The data show that nodule diameter and position were closely correlated with benign and malignant nodules (P = .005 and P < .001, respectively).

Table 1.

Comparison of Clinical Data of 274 Patients Enrolled in the Study

Variable	Benign nodule	Lung adenocarcinoma	P
Variable	n = 118	n = 156	P
Gender			.401
Male	33 (28%)	51 (32.7%)
Female	44 (60%)	56 (67%)
Smoking history			.069
Non-smoker	104 (88.1%)	125(80.1%)
Smoker	14(11.9%)	31 (19.9%)
Spiculation sign			.774
Without spiculation	110 (93.2%)	144 (92.3%)
With spiculation	8 (6.8%)	12 (7.7%)
Position			.732
Centric	43 (36.4%)	60 (38.5%)
Peripheral	75 (63.6%)	96 (61.5%)
Age (years)			.005
≤51	71 (60.2%)	67 (42.9%)
>51	47 (39.8%)	89 (57.1%)
Diameter (mm)			＜.001
≤8	79 (66.9%)	59 (37.8%)
>8	39 (33.1%)	97 (62.2%)

Comparison of Clinical Data between the Training and Validation Sets

The univariate analysis showed that patient age and nodule diameter were statistically different between benign and malignant nodules in the training set (P = .023 and P < .001), while only the nodule diameter was statistically different between benign and malignant nodules in the validation set (P = .021; Table 2).

Table 2.

Comparison of Clinical Variables.

	Training set			Validation set
	Benign nodule (n = 82)	Lung adenocarcinoma (n = 109)	P	Benign nodule (n = 19)	Lung adenocarcinoma (n = 27)	P
Gender			.493			.629
Male	24 (29.3%)	37 (33.9%)		9 (25%)	14 (30%)
Female	58 (70.7%)	72 (66.1%)		27 (75%)	33 (70%)
Smoking history			.266			.131
Non-smoker	73 (89%)	91 (83.5%)		31 (86%)	34 (72%)
Smoker	9 (11%)	18 (16.5%)		5 (14%)	13 (28%)
Spiculation sign			.927			1.00
Without spiculation	77 (93.9%)	102 (93.6%)		33 (92%)	42 (89%)
With spiculation	5 (6.1%)	7 (6.4%)		3 (8%)	5 (11%)
Position			.875			.392
Centric	31 (37.8%)	40 (36.7%)		12 (33%)	20 (43%)
Peripheral	51 (62.2%)	69 (63.3%)		24 (67%)	27 (57%)
Age (years)			.023			.094
≤50	49 (59.8%)	47 (43.1%)		22 (61%)	20 (38%)
>50	33 (40.2%)	62 (56.9%)		14 (39%)	27 (62%)
Diameter (mm)			＜.001			.021
≤8	56 (68.3%)	41 (37.6%)		23 (64%)	18 (38%)
>8	26 (31.7%)	68 (62.4%)		13 (36%)	29 (62%)

Selection of Radiomics Parameters

The RFE method was combined with the logistic regression classifiers to select the most significant variables for predicting outcomes. Two radiomics parameters, GLZLM_SZLGE and Shape_Compacity, were considered stable. There were no statistically significant differences in radiomics parameters between benign nodules and lung adenocarcinoma in the two data sets (P > .05; Table 3).

Table 3.

Comparison of Radiomics Parameters in the Training and Validation Data Sets.

	Training set			Validation set
	Benign nodule (n = 82)	Lung adenocarcinoma (n = 109)	P	Benign nodule (n = 36)	Lung adenocarcinoma (n = 47)	P
GLZLM_SZLGE	−0.20 (−0.90, 0.70)	−0.40 (−0.90, 0.90)	.60	0.10 (−0.90, 1.00)	−0.70 (−0.90, 0.80)	.35
Shape_Compacity	−0.10 (−0.60, 0.80)	−0.20 (−0.70, 0.60)	.22	−0.30 (−0.80, 0.20)	−0.40 (−0.60, 0.20)	.96

Comparison and Selection of Clinical Prediction Models

The clinical model included patient age and nodule diameter, the radiomics model included GLZLM_SZLGE and Shape_Compacity, and the combined model included GLZLM_SZLGE, Shape_Compacity, and nodule diameter. The ROC curve was plotted, and the predictive ability of the three models was compared based on their AUC values. In the training set, the ability of the three models to distinguish between benign and malignant pulmonary nodules was as follows: clinical model (AUC: 0.662, 95%CI: 0.588-0.736，ACC：0.634，sensitivity：0.486，specificity：0.829，PPV：0.791，NPV：0.548), radiomics model (AUC: 0.555, 95%CI: 0.471-0.638，ACC：0.576，sensitivity：0.633，specificity：0.500，PPV：0.627，NPV：0.506), and combined model (AUC: 0.711, 95%CI: 0.634-0.787，ACC：0.696，sensitivity：0.617，specificity：0.816，PPV：0.835，NPV：0.585). As the AUC value for the radiomics model is relatively low, in the validation set, the ability of the other two models with higher AUC values (the clinical model and the combined model) to distinguish between benign and malignant pulmonary nodules was investigated and the results are as follows: clinical model (AUC: 0.656, 95%CI: 0.542-0.770，ACC：0.639，sensitivity：0.468，specificity：0.861，PPV：0.815，NPV：0.554) and combined model (AUC: 0.695, 95%CI: 0.574-0.816, ACC: 0.747, sensitivity: 0.694, specificity: 0.824, PPV: 0.850，NPV: 0.651). The difference of AUC between the clinical model and the combined model in the training set and the validation set was significant (p < .05) according to the Delong test. Thus, the prediction ability of the combined model is slightly higher than that of the clinical model. The ROC curves of the combined model for the training and validation set are shown in Figure 3.

Figure 3.

The ROC curves for the combined models (A) training set; (B) validation set.

Model Calibration

The results showed that the clinical and radiomics models had poor prediction ability in the validation set. However, when the clinical and radiomics models were combined, the predictive ability in the validation set was slightly higher than that in the training set. Calibration curves were plotted for the validation set according to the comparison results (Figure 4). The calibration curve showed that the model had good-of-fitness in the validation set (P = .775 > .05).

Figure 4.

The calibration curve of the combined model in the validation set.

Establishing Nomograms

Finally, based on the combined model results, we created a nomogram to visualize the optimal prediction model (Figure 5).

Figure 5.

The nomogram based on the combined model. Notes: cdm is nodule diameter; 0 means ≤8 mm; 1 means >8 mm.

Discussion

With the aggravation of air pollution and other factors like smoking, the incidence of lung cancer has been increasing. Early screening, diagnosis, and treatment play a crucial role in lung cancer prognosis, and early treatment by surgical resection can improve five-year survival rates.¹⁰ During routine medical CT examinations, most patients undergo conventional CT for lung scanning. High radiation doses may be harmful to healthy people. Therefore, LDCT has gradually become the main means of lung disease screening for people undergoing a physical examination. In this study, data from patients with pulmonary ground-glass nodules identified during LDCT scanning and confirmed by a pathology diagnosis were collected. A clinical prediction model was established using artificial intelligence to predict the benign and malignant pulmonary ground-glass nodules.

Many recent studies have compared the value of low-dose and conventional CT in detecting and diagnosing pulmonary ground-glass nodules. Liu¹¹ and Zhu et al¹² used conventional and LDCT to compare the detection rate of pulmonary ground-glass nodules, and the results showed that LDCT had a higher image quality and detection rate of nodules than conventional CT. The higher-order parameters derived from CT images also have predictive value for the pathological results of pulmonary ground-glass nodules. Xu et al¹³ used CT density histograms to study 126 patients with pulmonary ground-glass nodules, and their results showed that the average CT value, maximum CT value, skewness, and kurtosis density difference indexes could predict pathological grade. Feng et al¹⁴ conducted a computer-aided diagnostic study on 121 patients with early ground-glass nodule adenocarcinoma. Their results showed that computer-aided diagnosis could be used to quantitatively evaluate the internal characteristics of pure ground-glass nodules and distinguish between infiltrating lesions and pre-infiltrating/micro-infiltrating lesions.

With the advancements in precision medicine and statistical methods, clinical prediction models are widely used. Further, the wide application of radiomics parameters has also brought unprecedented advantages to medical imaging. Recently, the research focus has gradually turned to combining artificial intelligence and radiomics to predict benign and malignant pulmonary ground-glass nodules. In this study, three prediction models were established to compare the research value of clinical variables with radiomics parameters in predicting benign and malignant nodules. A total of 47 radiomics parameters were extracted to evaluate nodules density, morphology, and internal heterogeneity. After comparing the three models, we found that the combined model, including nodule location, size (diameter), and density had the best predictive ability. Eguchi et al¹⁵ showed that tumor size and tumor density on CT images more accurately predicted the invasiveness of ground-glass nodules. In our model, nodule size (diameter) and density were also important variables for predicting benign and malignant nodules. Similar to our approach, Zhao et al⁹ studied 626 cases of pathologically confirmed ground-glass nodules and included the average CT value in their final prediction model. The tumor density on CT images could distinguish between benign and malignant nodules. Similar research results were also described by Gao et al.⁷

However, the results are different when radiomics parameters are analyzed. The following reasons could account for the differences: (1) The software and settings of extraction parameters were different; (2) Differences in the included cases could lead to different results; (3) Different modeling classifiers and feature selection methods could lead to different results.

The main limitations of our study were the small sample size and the lack of external tests on the model. Also, since only patients with pathologically confirmed lung adenocarcinoma were included, the applicability to other cancer types may have certain limitations.

Footnotes

Abbreviations

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Decision Consultation and Management Innovation Project of Science and Technology Bureau of Shapingba District, Chongqing, Key Project of Chongqing Technology Innovation and Application Development, Key Technology Project for Prevention and Control of Major Diseases in Chongqing, (grant number Jcd202112, CSTC2019JSCX-FXYD0025, 2019ZX002).

ORCID iD

Ling liang

References

Sung

Ferlay

Siegel

, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. 2021;71(3):209–249.

Naidich

Marshall

Gribbin

Arams

McCauley

. Low-dose CT of the lungs: preliminary observations. Radiology. 1990;175(3):729–731.

Xia

. Application of low-dose multi-slice spiral CT in the examination of pulmonary ground glass nodules. China Med Dev. 2014;29(9):122–123+172.

Henschke

Yankelevitz

Mirtcheva

, et al. CT Screening for lung cancer: frequency and significance of part-solid and nonsolid nodules. Am J Roentgenol. 2002;178(5):1053–1057.

Tan

Chu

. Pulmonary benign ground-glass nodules: CT features and pathological findings. Int J Gen Med. 2021;14:581–590.

Mei

Wang

Yang

, et al. Predicting malignancy of pulmonary ground-glass nodules and their invasiveness by random forest. J Thorac Dis. 2018;10(1):458–463.

Gao

Yan

Luo

, et al. The growth trend predictions in pulmonary ground glass nodules based on radiomic CT features. Front Oncol. 2020;10:580809.

, et al. Non-invasive evaluation for benign and malignant subcentimeter pulmonary ground-glass nodules (≤1 cm) based on CT texture analysis. Br J Radiol. 2020;93(1114):20190762.

Zhao

Yang

, et al. Development and validation of a radiomics nomogram for identifying invasiveness of pulmonary adenocarcinomas appearing as subcentimeter ground-glass opacity nodules. Eur J Radiol. 2019;112:161–168.

10.

Chen

Gao

Zhang

. Diagnostic value of 64-slice spiral CT low-dose scan in pulmonary nodules. China Mod Doct. 2018;56(9):127–129.

11.

Liu

. Application value of low dose multi-slice spiral CT in the diagnosis of pulmonary ground glass nodules. Chin Recu Med. 2019;28(10):1067–1069.

12.

Zhu

Huang

. Application of multi-slice spiral CT in pulmonary ground glass nodules. Mod Med Image. 2016;25(3):512–514.

13.

, et al. Predictive value of CT density histogram for pathological grading of pure ground glass nodules in lung. J Med Image. 2021;31(1):37–39,42.

14.

Feng

Shi G

, et al. Application value of computer aided diagnostic quantitative measurement in differentiating early lung adenocarcinoma with pure ground glass nodules. J Appl Radiol. 2021;37(2):216–219.

15.

Eguchi

Yoshizawa

Kawakami

, et al. Tumor size and computed tomography attenuation of pulmonary pure ground-glass nodules are useful for predicting pathological invasiveness. PLoS One. 2014;9(5):e97867.