Abstract
Background
In clinical practice, preoperative identification of mixed ground-glass opacity (mGGO) nodules with micropapillary component (MPC) to facilitate the implementation of individualized therapeutic strategies and avoid unnecessary surgery is increasingly important
Objective
This study aimed to build a predictive model based on clinical and radiological variables for the early identification of MPC in lung adenocarcinoma presenting as mGGO nodules.
Methods
The enrolled 741 lung adenocarcinoma patients were randomly divided into a training cohort and a validation cohort (3:1 ratio). The pathological specimens and preoperative images of malignant mGGO nodules from the study subjects were retrospectively reviewed. Furthermore, in the training cohort, selected clinical and radiological variables were utilized to construct a predictive model for MPC prediction.
Results
The MPC was found in 228 (43.3%) patients in the training cohort and 72 (41.1%) patients in the validation cohort. Based on the predictive nomogram, the air bronchogram was defined as the most dominant independent risk factor for MPC of mGGO nodules, followed by the maximum computed tomography (CT) value (
Conclusions
This study proposed a predictive model for preoperative identification of MPC in known lung adenocarcinomas presenting as mGGO nodules to facilitate individualized therapy. This nomogram model needs to be further externally validated by subsequent multicenter studies.
Keywords
Introduction
The mixed ground-glass opacity (mGGO) nodule is a common imaging manifestation of early-stage lung adenocarcinoma [1]. In clinical practice, assessing the degree of malignancy mGGO nodules during imaging follow-up and finding the appropriate time for operative intervention remain a clinical challenge [2–4]. Some studies observed that lung adenocarcinoma with the micropapillary component (MPC) also showed distinct genomic profiles and immunosuppression [5,6]. Additionally, numerous studies find that such patients also have a higher risk of early metastasis and exhibit a poorer prognosis even when the MPC component is not predominant [7–9]. Therefore, the early identification of mGGO nodule with MPC may help better judge the tumor characteristics in lung adenocarcinoma patients, which in turn can facilitate the implementation of individualized therapeutic strategies [10–12].
Given the diagnosis of MPC in adenocarcinoma requires pathological findings following surgical excision, effective preoperative predictive methods for the presence of MPC in GGO nodules are increasingly needed. Computed tomography (CT) is a non-invasive screening tool for lung cancer manifesting as GGO nodules and is widely used in clinical practice [13]. It was noticed that the internal radiological characteristics of solid-prominent nodules are often obscured and difficult to accurately evaluate [14] and the accuracy of pathological results between solid-predominant and mGGO nodules is also heterogeneous [15]. Therefore, the imaging prediction model established without distinguishing the purely solid nodules and mGGO nodules may reduce the predictive accuracy to a certain extent. Many studies have used imaging features of lung nodules, which include the solid-predominant nodules, to predict pathological subtypes of lung adenocarcinoma [16,17], but little to no research has focused on the mGGO nodules solely.
In this study, we aimed to retrospectively investigate the clinical and pathological features of mGGO nodules with MPC in lung adenocarcinoma patients. Furthermore, this study also constructed a nomogram to effectively predict the probability of MPC in mGGO by a non-invasive method.
Materials and methods
Patients and study design
This retrospective study was approved by the Institutional Review Boards of The First Affiliated Hospital of Soochow University (No.476 in 2022). Patients who underwent resection for lung adenocarcinoma from January 2021 to January 2022 were enrolled in this study. The flow chart of the study subjects is depicted in Fig. 1. The patients must meet all the following inclusion criteria: i, preoperative chest CT examination presenting as mGGO nodules (
Based on the presence or absence of the MPC in pathological findings, the training cohort and the validation cohort were divided into two groups respectively, the MPC group and the non-MPC group. The clinical and pathological data between the two groups in both cohorts were compared respectively. Accordingly, based on the univariate analysis, significant variables were selectly included in the multivariate analysis. Furthermore, a nomogram was constructed by selecting significant variables from the multivariate logistic analysis. And the discriminative and consistent ability of the predictive model was also validated.

Inclusion criteria and screening process of the study. The flow chart showed the process of patient enrollment, including the inclusion criteria and the grouping of study subjects.
We have reviewed the preoperative CT imaging of all enrolled subjects and some of the representative imaging findings are shown in Fig. 2. In particular, the imaging findings of mGGO nodules are standardly described, including the location of the lung lobes where the tumor is present and their relative position with pleura; the size of the tumor (maximum diameter); tumor margins (Lobulation, spiculation); internal features of the tumor (vacuolar sign, air bronchogram sigh, vessel convergence); adjacent to the pleura; and maximum CT values of the solid component.
The immunohistochemistry testing of programmed cell death-Ligand 1 (PD-L1) was conducted by 22C3 assays (Agilent Technologies) using the DakoAutostainer Link 48 platform following its manufacturer’s instructions [18]. According to the tumor proportion score (TPS), cases were categorized into negative (TPS

Representative CT images of the study subjects. (a) A mixed ground ground-glass opacity (mGGO) nodule with homogeneous density in the right upper lobe Few blood vessels are penetrating within the mGGO and the margins appear lobulated; (b) A mGGO nodule in the upper left lobe that depicts vacuolar sign, vessel convergence, and pleural retraction; (c) A mGGO nodule adjacent to the pleura in the right upper lobe with pleural retraction.
Clinical and preoperative imaging characteristics between the enrolled subjects.
Clinical and preoperative imaging characteristics between the enrolled subjects.
The random grouping method was conducted using R language and the enrolled 702 study subjects were randomly divided into two groups in a 3:1 ratio: the training cohort (
Nomogram construction and evaluation
As for the selection of relevant variables, the demographic information such as age, gender, and smoking history were selected as variables based on previous literature. Variables regarding the internal and external characteristics of mGGOs were selected based on not only the literature but also the clinical practice. Based on the independent risk factors from multivariate logistic regression in the training cohort, a nomogram was plotted and validated by using the R programming language and environment (http://www.r-project.org/). In the training and validation cohort, the area under the curve (AUC) was used to evaluate the discriminative ability and the calibration curve was used to assess the agreement between the predicted risks and the actual results. The internal validation was implemented using the bootstrapping method by setting the number of iterations to 1000.
Results
Demographic features
Multivariate analysis of risk factors for differentiating mGGO nodules with and without MPC in the training cohort.
Multivariate analysis of risk factors for differentiating mGGO nodules with and without MPC in the training cohort.
The preoperative imaging characteristics between the MPC group and the non-MPC group are shown in Table 1. The location of the lung lobes for mGGO nodules was comparable between the MPC group and non-MPC group (
In terms of marginal features of mGGO nodules, mGGO nodules with MPC were more likely to manifest as lobulation (83.3% vs. 76.1%,
Independent risk factors for the presence of MPCin mGGO nodules in the training cohort
Firstly, we screened variables with statistical significance from the univariate analysis, including age, gender, adjacent to pleura, lobulation, spiculation, vacuolar sign, air bronchogram, maximum diameter (
The preoperative predictive nomogram construction and validation
As shown in Fig. 3a, based on the five significant variables from multivariate analysis in the training cohort, we constructed a nomogram for the prediction of MPC in lung adenocarcinoma patients presenting as mGGO nodules. Based on the predictive nomogram, the air bronchogram was defined as the most dominant independent risk factor for the presence of MPC in mGGO nodules, followed by the maximum CT value (
Then the discriminative and consistent ability of the nomogram in the training cohort was validated. As shown in Fig. 3b, the cut-off score of the receiver operating characteristic (ROC) curve was 241.4 points and lung adenocarcinoma presenting as mGGO nodules could be further identified as MPC or non-MPC with a sensitivity of 72.4% and a specificity of 71.3%. Based on the ROC curve, favorable discrimination of the predictive ability was shown with an AUC value of 0.783 (95%[CI] 0.744–0.822), which demonstrated the satisfactory discrimination of the model. As for the consistent validation, in the training cohort, the calibration curve of the nomogram also showed high consistencies between the predicted and observed micropapillary-contained probability. Furthermore, the internal validation was implemented using the bootstrapping method by setting the number of iterations to 1000. And this predictive model calculated a corrected AUC of 0.774 (95% CI: 0.770–0.779).

The construction and validation of a predictive nomogram based on clinical and radiological features in the training cohort. (a) The nomogram for the prediction of MPC among lung adenocarcinoma presenting as mGGO nodules. Five categorical variables from the multivariate logistic analysis were included in the model and had different contributions to the probability of MPC among mGGO nodules; (b) A ROC curve for the discriminative validation of the nomogram in the training cohort. The AUC value of the ROC curve was 0.783 (95%[CI] 0.744–0.822); (c) A calibration curve for the estimation of agreement between the predicted mGGO nodules with MPC and those of the nomogram.
Accordingly, we validated the predictive nomogram in the validation cohort (Fig. 4a and b). The predictive model also demonstrated favorable discriminative ability with an AUC value of 0.799 (95%[CI] 0.732–0.866) in the validation cohort. The calibration curve also showed consistencies between the predicted and observed micropapillary-contained probability.

The validation of the nomogram in the validation cohort. (a) A ROC curve for the discriminative validation of the nomogram from the training cohort. The AUC value of the ROC curve was 0.799 (95%[CI] 0.732–0.866) in the validation cohort; (b) A calibration curve for the estimation of agreement between the predicted mGGO nodules with MPC and those of the nomogram in the validation cohort.
Histopathological characteristics of the study subjects.
Histopathological characteristics of the study subjects.
Clinical and preoperative imaging characteristics in the training and validation cohort.
Histopathological characteristics in the training and validation cohort.
Notably, cases that tested with high Ki-67 expression (
To explore the immune microenvironment characteristics among lung adenocarcinoma patients with and without MPC, we then compared the expression of the immune inhibitory molecule PD-L1. According to the immunohistochemistry staining results (Table 3), the proportion of cases that tested positive for PD-L1 was significantly higher in the MPC group compared with their non-MPC counterparts (57.7% vs. 30.8%,
Discussion
In this study, we found that the preoperative assessment of mGGO nodules with MPC implicated by clinical and radiological features could be very helpful for lung adenocarcinoma patients. Furthermore, we proposed an effective predictive nomogram to classify mGGO nodules with MPC in lung adenocarcinoma patients. It is a practical and non-invasive tool to assess the probability of MPC risk in mGGO nodules, thereby enabling timely identification for surgery and appropriate treatment when the samples are unobtainable.
Age and gender are controversial variables associated with pathological subtypes of lung adenocarcinoma [16,17,22]. In the present study, we found that the older (
The current study supported previously reported findings that lung adenocarcinoma with MPC was more likely to be associated with lymphatic and pleural invasion [7,10]. Furthermore, for primary lung adenocarcinoma, a previous study demonstrated clinicopathological characteristics can provide predictive information of unexpected lymph node metastasis [23]. This further illustrated the importance of developing a pathological subtype predictive model, as this can also provide an early indication of the possibility of invasive capacity for primary lung adenocarcinoma.
The size was an important risk for the presence of MPC in lung adenocarcinoma according to previous research [15,24]. Our data showed that the proportion of mGGO nodules with a maximum diameter
Tumor spread through air spaces pattern has been reported as a novel invasive pattern in lung adenocarcinoma [25]. Notably, a study conducted by Yotsukura, M et al. showed that air bronchogram sign was significantly associated with tumor spread through air spaces pattern and vacuole was an independent predictor of this invasive pattern [26]. Interestingly, among the internal imaging features of mGGO nodules, our data also demonstrated that the vacuolar sign and air bronchogram sign were the independent risk factors for the presence of MPC in lung adenocarcinoma. The above information may indicate why the mGGO nodule with MPC has a higher invasion capability.
The CT value inside adenocarcinoma is diverse, reflecting cellularity, density, and heterogeneous histological components within the tumor [27]. Because the mGGO nodules with MPC were also more likely to be found with high invasive ability, the quantitative CT indicators are non-negligible predictive variables. A study demonstrated that among lung adenocarcinoma presenting as GGO nodules, the invasive components were more likely to appear in areas with high CT values while adherent components appear in the site with low CT values [28]. Similarly, a recent study by Koezuka et al. also found that the combined use of mean and maximum CT values was useful for predicting lepidic-predominant lung adenocarcinoma [29]. As expected, the maximum CT value was an important independent predictor for the MPC among mGGO nodules in this study.
This study has some limitations. Firstly, this is a single-center retrospective study and the results obtained need to be further validated by subsequent multicenter studies. Additionally, the study subjects included are more likely to have higher-risk nodules, which may lead to selection bias. Since our data were retrospectively derived from cases in recent years, the lack of longitudinal follow-up information in this study is another limitation that cannot be ignored. The data on genetic mutations in this study were not fully available and their correlation with lung adenocarcinoma with MPC also needs to be further explored. Finally, all the nodules included in this cohort were known adenocarcinoma, the predictive model proposed in this study cannot be used in a GGO nodule that has no tissue diagnosis.
Conclusion
The predictive nomogram based on the multivariate logistic regression analysis proposed in this study showed that the air bronchogram was the most dominant independent risk factor for the presence of MPC in known lung adenocarcinoma presenting as mGGOs, followed by the maximum CT value (
Abbreviations
mGGO: Mixed ground-glass opacity
MPC: Micropapillary component
AUC: Area under the curve
CT: Computed tomography
PD-L1: Programmed cell death-Ligand 1
ROC: Receiver operating characteristic
Footnotes
Acknowledgments
We would like to thank all the nurses and the clinical staff who provided care for the patient. We thank the funders of the National Natural Science Foundation of China (NSFC) grant (grant number 81672280), Jiangsu Provincial Medical Key Discipline (grant number ZDXK202201) and Project of Suzhou City (grant number SYS2021034) for supporting this work.
Author contributions
Conception: Cheng Chen and Weidong Zhu.
Interpretation or analysis of data: Ziwen Zhu and Weizhen Jiang.
Preparation of the manuscript: Danhong Zhou.
Revision for important intellectual content: Ziwen Zhu.
Supervision: Cheng Chen and Weidong Zhu.
Funding
The project was mainly supported by the National Natural Science Foundation of China (NSFC) grant (grant number 81672280), Jiangsu Provincial Medical Key Discipline (grant number ZDXK202201) and Project of Suzhou City (grant number SYS2021034).
Competing interests
The authors have no relevant financial or non-financial interests to disclose.
Ethics approval
This study was performed in line with the principles of the Declaration of Helsinki. This study was approved by the Institutional Review Boards of The First Affiliated Hospital of Soochow University (No.476 in 2022).
