Abstract
Background
To determine the populations who suitable for surgical treatment in elderly patients (age ≥ 75 y) with IA stage.
Methods
The clinical data of NSCLC patients diagnosed from 2010 to 2015 were collected from the SEER database and divided into surgery group (SG) and no-surgery groups (NSG). The confounders were balanced and differences in survival were compared between groups using PSM (Propensity score matching, PSM). Cox regression analysis was used to screen the independent factors that affect the Cancer-specific survival (CSS). The surgery group was defined as the patients who surgery-benefit and surgery-no benefit according to the median CSS of the no-surgery group, and then randomly divided into training and validation groups. A surgical benefit prediction model was constructed in the training and validation group. Finally, the model is evaluated using a variety of methods.
Results
A total of 7297 patients were included. Before PSM (SG: n = 3630; NSG: n = 3665) and after PSM (SG: n = 1725, NSG: n = 1725) confirmed that the CSS of the surgery group was longer than the no-surgery group (before PSM: 82 vs. 31 months, P < .0001; after PSM: 55 vs. 39 months, P < .0001). Independent prognostic factors included age, gender, race, marrital, tumor grade, histology, and surgery. In the surgery cohort after PSM, 1005 patients (58.27%) who survived for more than 39 months were defined as surgery beneficiaries, and the 720 patients (41.73%) were defined surgery-no beneficiaries. The surgery group was divided into training group 1207 (70%) and validation group 518 (30%). Independent prognostic factors were used to construct a prediction model. In training group (AUC = .678) and validation group (AUC = .622). Calibration curve and decision curve prove that the model has better performance.
Conclusions
This predictive model can well identify elderly patients with stage IA NSCLC who would benefit from surgery, thus providing a basis for clinical treatment decisions.
Introduction
Lung cancer is 1 of the most common malignancies worldwide, with high morbidity and mortality. 1 Non-small cell lung cancer (NSCLC) is accountable for more than 80% of lung cancer cases. 2 In the past decades, due to the neglect of health and the constraints of economic and technical conditions, most of lung cancer were found to be in the middle and advanced stage, and the treatment prognosis was poor. In recent years, with the enhancement of the awareness of physical examination and the popularization of multi-slice spiral CT, more and more patients with stage IA lung cancer can be found and diagnosed early.
In the current treatment guidelines, primary tumor resection (PTR) is still the standard operation for the treatment of early lung cancer, 3 which can completely remove the focus, alleviate symptoms and control the disease to a certain extent. Many studies have observed that patients with early cancer can obtain significant survival time after radical resection. 3 This may be the result of surgery4,5 reducing the complications of the primary tumor and benefiting from multiple treatment modalities.6,7 With increasing global aging and increasing life expectancy per capita, the proportion of elderly patients among lung cancer patients is gradually increasing. 8 Treatment options vary significantly for different age groups, for example, radical tumor resection requires high patient lung function, immunity, and baseline physical fitness. 9 Elderly patients with NSCLC have more chronic diseases and decreased body reserve function, resulting in poor surgical tolerance and objectively do not meet the surgical conditions.10,11 However, with continuous improvement of medical level and the improvement of the basic state of the elderly, more and more clinicians actively advocate the preoperative evaluation of elderly patients and give priority to radical surgery. 12 A retrospective study found that age ≥ 70 years old is not a risk factor for postoperative pulmonary complications (PPCs), indicating that age alone should not be used as a contraindication for surgery. 13 In conclusion, the choice of treatment for elderly patients with stage IA NSCLC remains controversial. Therefore, the risk of postoperative morbidity and mortality, as well as cancer recurrence and long-term survival, must be carefully weighed in the treatment of this patient population. 14 A study predicts that elderly patients will account for more than 70% of lung cancer cases in the next 10 years. 15
Therefore, elderly patients over 75 years of age were selected as the study population in this study. A population-based Surveillance, Epidemiology and End Results (SEER) database was used to develop and validate a new predictive model to identify older patients with stage IA NSCLC who could benefit from surgery and to suggest the best treatment option for this population.
Materials and Methods
Data Source and Study Population
Patient Selection in SEER Database
This study used the United States Surveillance, Epidemiology and Outcome Database (SEER), which was established in 1973 and covers 28% of the population in the United States. The information of patients diagnosed with NSCLC ranging from January 2010 to December 2015 was obtained from the SEER database (November 2019 submission).
Patients were included in our study if they met the following inclusion criteria: (1) patients with stage IA NSCLC from January 2010 to December 2015 (n = 38368); (2) Age ≥ 75 years old (n = 13531); (3) There was only 1 malignant primary lesion (n = 7387). Exclusion criteria: (1) Unknown survival time (n = 14). (2) Unknown cause of death (n = 61); (3) Surgical approach is unclear (n = 15).
Patient demographic characteristics (gender, age, marital status, race), tumor characteristics (histology type, tumor location, tumor laterality, grade, stage), CSS, and treatment details (type of surgery) were collected. In this study, histology types of NSCLC were classified as LUAD, LUSC, and other (large cell carcinoma, adenosquamous carcinoma, and sarcomatoid carcinoma). Depending on the type of surgery, surgical patients are further classified as segmentectomy, lobectomy, pneumonectomy.
In this study, cancer specific survival (CSS) was calculated from the date of diagnosis to the date of cancer specific death. Deaths due to lung cancer were regarded as events, and deaths from other causes were regarded as censored observations.
In addition, this study was licensed to access these research data (SEER stat user name: liuyf, 13981-Nov2020) and conducted in accordance with the Helsinki Declaration (revised in 2013). This study was conducted in accordance with the principles of the Helsinki Declaration. It has been approved by the Medical Ethics Committee of Tumor Hospital of Anhui University of Technology (No:2022-KY-FZZX-03). The research process is shown in Fig 1. Flow chart of predictive model construction.
Statistical Analysis
Continuous variables were expressed as mean ± standard deviation and P-values were derived from Kruskal Wallis rank sum test; categorical variables were expressed as percentages and P-values were derived from Fisher’s exact probability test. The study population was divided into surgery group and no-surgery group according to whether they received radical surgery. Age, gender, race, marrital status, tumor location, laterallity, grade, T stage, histology, and surgery were included as independent variables with tumor specific survival time (CSS) as the intended event.
Calculated propensity score matching (PSM) was used to reduce the difference between groups by adjusting all independent variables, setting the caliper value to 0.01 and taking the population of 1:1 as the standard. Kaplan-Meier method and log-rank test were used to estimate CSS of patients before and after PSM.
Univariate and multivariate Cox proportional hazards regression analysis was performed in the population after PSM to determine independent prognostic factors. The risk ratio (HR), 95% confidence interval (CI) and P value were calculated and displayed using forest map. The dashed line perpendicular to the X-axis (with a horizontal coordinate scale of 1) in the forest plot is the invalid line, and several line segments parallel to the X-axis are used to represent the effect size of each study and its 95% confidence interval. In the graph, the blue dots indicate the risk ratio (HR) and the solid black line indicates the 95%CI. When both the upper and lower 95% CI limits of HR values were on the same side of the Y-axis, they were protective or risk factors affecting prognosis (left: protective factors; right: risk factors). When the 95%CI interval of the HR value of the variable is across the Y-axis, it means that the variable is not statistically significant.
All statistical analyses were performed using power stats (version 2.2) and R software (version 4.0.5). All statistical tests were two-sided, P < .05 was considered statistically significant.
Establishment and Validation of the Nomogram
Referring to the median CSS of the no-surgery population, the surgery population was divided into surgery-benefit and surgery-no benefit groups. And patients in the surgery population were divided into training and validation sets according to 7:3 using benefit and no-benefit as dependent variables.
Based on the training group, factors that independently affect CSS in multivariate Cox analysis are included, and a prediction model is constructed by using logistic regression, and the nomogram is used to visualize the model. The nomogram are used as a quantitative tool to calculate which NSCLC patients will benefit from surgery. Scores for each clinical variable was calculated and summed to obtain the patient’s total score. The total score corresponds to the probability that the patient will benefit from the procedure. Patients with a probability of >50% are the ones who benefit from PTR.
ROC analysis shows the performance of the classification model. The AUC values of the training group and the validation group are calculated respectively to verify the discrimination of the model (AUC <.5, not significant; .5 < AUC <.7, low significant; .7 < AUC < 1.0, high significance). 16
Draw calibration curve to evaluate the calibration degree of the model. If the model calibration is correct, dots on the calibration plot should be close to a 45° diagonal line, shows calibration performance as measured by the Brier score (lower is better). 17 If the Brier score = 0, the model was considered to have perfect overall performance, and the predicted and actual values were in perfect agreement. If the Brier score is > .25, the model was considered to have no value.
Clinical Application
Decision curve analysis (DCA) was used to evaluate the clinical utility of the model in decision-making. DCA evaluates the clinical benefit of the prediction model by plotting the net benefit (NB) within the range of clinical reasonable risk threshold. 18
According to the calculation of the prediction model, all elderly patients with stage IA NSCLC in SEER database after PSM were divided into 3 groups (surgery-benefit group, surgery-no benefit group and no-surgery group). The CSS of the 3 groups were compared by K-M survival analysis to evaluate whether the surgery-benefit group screened by the prediction model can really benefit.
Results
Patient Demographics and Survival Analysis
Baseline characteristics for overall survival in patients with NSCLC.
Cancer-specific survival for Populations With Different Treatment Modalities
Kaplan-Meier survival analysis before PSM found that the prognosis of the surgery group was better than that of the no-surgery group (82 vs. 31 months; P < .0001) (Figure 2A). After PSM, a longer median CSS (55 vs. 39 months; P < .0001) was still observed in patients who underwent surgery than in patients who did not undergo surgery (Figure 2B). Kaplan–Meier plots show the specific survivals of NSCLC patients according to the group before PSM (A) or after PSM (B).
Screening of Independent Prognostic Factors Affecting CSS
In order to explore which factors lead to the difference in survival between the surgery group and the no-surgery group, univariate and multivariate COX analyses were performed on the baseline characteristics of the after PSM population, and visualize it with a forest plot.
Univariate COX regression analysis showed that age, gender, race, marital, grade, histology, and surgery procedure were independent prognostic factors affecting CSS (all P < .05). However, location, laterality, and AJCC-T were not significantly correlated with CSS (Figure 3). The variable with P < .05 was selected to include multivariate Cox analysis. Among them, age, gender, race, marital, tumor grade, histology, and surgery were independent prognostic factors (Figure 4). Univariate Cox regression analysis of factors affecting CSS. Multivariate Cox regression analysis of factors affecting CSS.

In particular, male was an independent risk factor for developing cancer-specific mortality compared to female (HR = 1.719, 95%CI: 1.454-2.033, P < .001). Single patients have a higher risk of cancer-specific death than married patients (HR = 1.331, 95%CI: 1.128-1.570, P = .001). The benefits were not significantly different in LUAD and LUSC (P = .297). However, other rare subtypes benefit more than LUAD and LUSC (HR = .739, 95%CI: .575-.949, P = .017). Among the different surgical approaches, benefits for patients undergoing segmentectomy are significantly better than those for the non-surgical population (HR = 1.584, 95%CI: 1.268- 1.978, P < .001).
Construction of the Predictive Model
The above independent prognostic factors were included in the analysis of distinguishing the people who benefited from surgery. Firstly, according to the population data after PSM and referring to the median CSS (39 months, n = 1725) of no-surgery patients, patients undergoing surgery were divided into benefit population (survival month > 39 months, n = 1005) and no-benefit population (survival month ≤ 39 months, n = 720).
Secondly, in order to avoid the phenomenon of over fitting or under fitting caused by the construction of a single data model, the benefit population and no-benefit population are randomly assigned to the training group (total N = 1207, surgery benefit n = 763, surgery-no benefit n = 444) and the validation set (total N = 518, surgery benefit n = 242, surgery-no benefit n = 276) according to the ratio of 7:3.
Finally, based on the training group data, a prediction model was established by logistic regression analysis to find the elderly stage IA NSCLC surgery candidates, and the prediction model was visualized by nomogram (Figure 5). The probability of surgical benefit was calculated by adding the scores of each variable. Patients with a prediction probability greater than 0.5 were classified as “surgery-benefit group”. On the contrary, it is the surgery-no benefit group. Nomogram was used to identify patients with NSCLC who would benefit from surgery.
The ROC curve is drawn, and the results show that the prediction model has good discrimination in the training group (AUC = 0.678) (Figure 6A). The calibration curve shows a good calibration degree (training group Brier score: 0.211) (Figure 6B). The DCA curves of the training group confirmed the good clinical practical value of the prediction model (Figure 6C). Receiver operating characteristic curve of the nomogram in the training (A) and validation (D) groups. (B) The calibration plots of the training group. (E) The calibration plots of the validation group. Decision curve analysis of training group (C) and validation group (F).
Validation and Clinical Application of the Nomogram
At the same time, the model performance is verified with the validation group data. The ROC curve results showed that the prediction model was equally well distinguished in the validation group (AUC = 0.622) (Figure 6D). The calibration curve shows a good degree of calibration (validation group Brier score: 0.238) (Figure 6E). The DCA curves of the validation group confirmed the good clinical utility of the prediction model (Figure 6F).
Using the constructed prediction model, Kaplan-Meier survival analysis was used to analyze the population after PSM, and the survival results of “surgery-benefit group”, “surgery-no benefit group” and “no-surgery group” in the training group and validation set were displayed respectively. In the training group and validation group, the median CSS of the surgery benefit group was higher than that of the surgery-no benefit group and no-surgery group (training group: 83 vs 20 vs 39 months, P < 0.0001; validation group: 81 vs 23 vs 39 months, P < 0.0001) (Fig. 7(A), (B)). Kaplan–Meier analysis of a comparison of benefit candidates, no-benefit candidates and no-surgery patients in the training group (A) and validation group (B) after using our nomogram.
Discussion
The standard surgical treatment for early-stage NSCLC has been lobectomy, in which individual units of the lung lobes are removed along with their specific regions of the hilum and mediastinal lymphatics. 19 A previous diagnosis and treatment guideline 20 proposed that in the treatment of early NSCLC, surgery has better survival time and lower recurrence rate. However, not all patients with early NSCLC are suitable for surgery. Statistics showed that the incidence rate and mortality rate of elderly people increased exponentially 21 compared with younger patients. In clinical treatment, surgeons are often reluctant to recommend lobectomy for patients with older age, complications or poor pulmonary function. Preoperative multiple complications and poor pulmonary function may lead to high recurrence rate and mortality, resulting in poor surgical treatment effect. 22 Therefore, it is urgent to establish a reliable and simple discrimination method for elderly stage IA NSCLC.
We constructed a prediction model and proposed several findings. First, NSCLC patients who underwent surgery survived longer than those who did not, which is consistent with previous studies.6,23,24 Second, although surgery can improve the survival of elderly patients with stage IA NSCLC to some extent, not all patients with surgery have a longer survival than those without surgery. We established a visual nomogram to screen the patients who benefit most from surgery and prolong their survival time; at the same time, patients who are not suitable for surgery can also be screened in order to reduce the additional physical and economic burden. Finally, the effectiveness and stability of the model are verified internally.
According to our model, age, gender, race, grade, histological type, Marital and surgery methods are predictive variables to estimate whether elderly patients with stage IA NSCLC can benefit from surgery. Among them, surgery methods, tumor grade, gender and age have the greatest impact on the benefit of surgery.
Among various surgical procedures, pneumonectomy is a traditional treatment modality that allows complete resection of tumor tissue, but also excessive removal of normal lung tissue. 25 The lung function of the elderly is inherently poor and the surgical tolerance is not strong, so total pneumonectomy for the elderly is usually considered with caution, and total pneumonectomy is only considered when partial resection cannot remove the complete lesion. This usually means that the primary lesion is large or has metastasized, and the prognosis is usually poor. 26 In a retrospective study, it was found that lobectomy and segmental resection have similar clinical efficacy in the treatment of early NSCLC patients, 27 but segmental resection can maintain healthy lung tissue as much as possible, with less trauma, protect lung function, and promote postoperative recovery.27,28 Several other population-based studies have found that lobectomy has a significant survival advantage over segmental resection.29-31 With our results, lobectomy is more beneficial than segmental resection. This may be due to the fact that fewer regional lymph nodes are removed by lung segmental resection and the presence of potentially positive lymph nodes is more likely to lead to postoperative recurrence or distant metastasis of lung cancer, resulting in poorer patient prognosis. Further prospective randomized trials, as well as more detailed stratification, are still needed to explore reasonable surgical treatment modalities.
Previous studies have shown that tumor grade is an independent prognostic factor for patients with non-small cell lung cancer, 32 and the higher the tumor grade, the worse the prognosis. 33 A recent study found that low differentiation of lung cancer is not only associated with low survival rate, but also can predict the survival benefit of cisplatin-based adjuvant chemotherapy after surgery, 34 which is similar to our results, Tumor grade can predict the surgical benefit rate of patients. 35 Our results also show a surgical benefit in patients with stage G4, which may be a false positive due to the fact that this population is too small.
In our prediction model, female can increase the surgical benefit rate. The results were the same as in 2 studies based on the seer database, where male patients underwent surgery with higher lung cancer-related mortality compared to female patients. 36 Another study on the survival rate of elderly lung adenocarcinoma also shows that female patients have a better survival rate. 36 Overage is usually 37 considered as a predictor of poor OS. 38 Different from previous studies, our research focuses on elderly patients (≥75 y). The results show that this law is also applicable to judge whether the operation benefits. The older the elderly patients are, the less likely they will benefit from the operation.
We also found that postoperative survival was better in patients with LUAD than in patients with LUSC, with the result that in the same study based on a Japanese population, patients with LUAD who underwent surgery had a better prognosis than patients with LUSC. 39 Marital status has been shown to be associated with survival in NSCLC patients in many studies, with married patients often having a better prognosis than single patients,40,41 possibly because spouses provide strong social support for cancer patients, alleviating their depression and anxiety and increasing their desire to survive. 42
Our results have some practical implications, as well as implications for future research. First, with the advent of aging, older patients' may gain more benefit with surgical treatment when clinical guidelines are strictly followed. Second, the variables in this study are only some of the factors affecting prognosis, and more blood and imaging biomarkers are worthy of further exploration to construct a more integrated and comprehensive prediction model. Finally, in this study, the prognostic impact of different procedures also differed, which provides a data reference basis to further promote the study of individualized treatment.
This study has several limitations. First, this study is retrospective, and bias is inevitable. Despite the PSM adjustment, we attempted to reduce this bias with extensive data analysis and rigorous statistical methods. Second, 1 of the undeniable shortcomings of this study is that the SEER database does not document common complications and pulmonary function in older patients that could affect decisions about surgical treatment. Third, the conclusion of this study still needs to include more variables to improve the effectiveness of the model. However, based on the current state of data, seer is the world’s most extensive comprehensive database of study populations. Analysis using this database remains an ideal method for our study of surgical beneficiaries.
Conclusion
We constructed a simple and effective prediction model to select the best operable elderly patients with stage IA NSCLC who can benefit from surgery. Using our prediction model, clinicians can formulate more accurate treatment strategies for elderly patients with stage IA NSCLC.
Supplemental Material
Supplemental Material - Surgical Treatment is Still Recommended for Patients Over 75 Years with IA NSCLC: A Predictive Model Based on Surveillance, Epidemiology and End Results Database
Supplemental Material for Surgical Treatment is Still Recommended for Patients Over 75 Years with IA NSCLC: A Predictive Model Based on Surveillance, Epidemiology and End Results Database by Cancer Control by Yafeng Liu, Wenting Zhang, Jing Wu, Wenyang Wang, Jiawei Zhou, Jianqiang Guo, Qingsen Wang, Xin Zhang, Jun Xie, Yingru Xing, and Dong Hu in Cancer Control
Footnotes
Acknowledgments
I would like to declare on behalf of my co-authors that the work described was original research that has not been published previously, and not under consideration for publication elsewhere, in whole or in part. All the authors listed have approved the manuscript that is enclosed.
Author Contribution
HD and WJ: conception and design, and study supervision. LY, ZW, ZX, WQ, GJ and ZJ: development of methodology, analysis and interpretation of data, and writing of the manuscript. XY, XJ and WW: review of the manuscript.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by the National Natural Science Foundation of China (No. 81971483), the Collaborative Innovation Project of Colleges and Universities of Anhui Province (GXXT-2020-058).
Ethical Approval
This study was conducted in accordance with the principles of the Declaration of Helsinki. It has been approved by the Medical Ethics Committee of the Cancer Hospital of Anhui University of Technology (No. 2022-KY-FZZX-03). Written informed consent is not required in this retrospective analysis.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
