Abstract
Objective
Combined small cell lung cancer (CSCLC) with distant metastasis (DM) is an aggressive disease with a poor prognosis. Effective nomograms are needed to predict DM and early death in patients with CSCLC and DM.
Methods
This retrospective study included patients with CSCLC from the Surveillance, Epidemiology, and End Results database between 2004 and 2015. Risk factors for DM and early death were analyzed by univariate and multivariate logistic regression. Nomograms were constructed based on the results in a training cohort and confirmed in a validation cohort, and their performances were assessed by concordance index (C-index), receiver operating characteristic curve (ROC), calibration curve, and decision curve analysis (DCA).
Results
A total of 788 patients with CSCLC were selected, including 364 patients with metastatic CSCLC. Sex, tumor site, T stage, and N stage were independent risk factors for DM, while age, surgery, chemotherapy, and liver metastasis were independent risk factors for early death. C-index, ROC, calibration, and DCA curve analyses all showed good predictive performances for both nomograms.
Conclusions
These nomograms could reliably predict DM risk in CSCLC patients and early death in CSCLC patients with DM, and may thus help clinicians to assess these risks and implement individualized therapies.
Introduction
According to the Global Cancer Statistics 2020, lung cancer was the leading cause of cancer mortality worldwide, with an estimated 2.2 million new cases and 1.8 million new deaths in 2020. 1 Small cell lung cancer (SCLC) is characterized as a highly lethal and aggressive type, accounting for around 15% of lung cancer cases, while non-small cell lung cancer (NSCLC) accounts for nearly 80%. The World Health Organization Histology Criteria 2015 defined CSCLC as a subset of SCLC diagnosed by pathological biopsy including components of both SCLC and any other subtype of NSCLC, such as adenocarcinoma, squamous cell carcinoma, or large cell carcinoma. 2 The incidence of CSCLC is variable, accounting for approximately 5% to 20% of all SCLC cases. 3 SCLC is a recalcitrant carcinoma with high propensities for metastasis and recurrence. Approximately one third of patients with SCLC are initially diagnosed with early-stage disease, while around 70% of cases are diagnosed at an advanced stage.4,5 In a retrospective study of 181 patients with CSCLC, 99 patients (54.7%) had recurrence after surgery, leading to poor survival. 6 Although standard chemotherapy and radiation plus immune checkpoint inhibitors have been recommended, survival is still limited, especially in patients with SCLC and distant metastasis (DM). It is therefore imperative to identify patients with CSCLC at high risk of metastasis.
In a study of 43,156 patients with T1-2 NSCLC, 6944 patients (16.09%) developed DM, and race, sex, histology, T and N stage, marital status, tumor size, grade, and laterality were all identified as independent risk factors for metastasis. 7 In another series of 18,187 patients with SCLC, 22.07% of patients were diagnosed with bone metastasis, with a median survival of 6 months, with age, sex, N stage, and tumor size all related to bone metastasis. 8 Zhao et al. reported that low expression levels of Cullin5 and SOCS3 promoted metastasis, drug-resistance, and a poor prognosis in SCLC [13]. Other research also revealed that high neuron-specific enolase, neutrophil-to-lymphocyte, Friend leukemia virus integration 1, and the long noncoding RNA CCAT2 were associated with SCLC metastasis.9–11 Risk factors for metastasis based on laboratory markers have been identified in patients with NSCLC and SCLC; however, the risk factors based on clinicopathological parameters correlated with metastatic CSCLC remain unclear.
The treatment response is currently transient in most patients with SCLC and DM, resulting in a median survival of 1 year. In a comprehensive analysis of 358 patients with extensive-stage disease, 43.8% of patients died within 12 months, 11 while another study in China also reported a median survival of just 11 months, and 58.4% of patients with extensive-stage SCLC died within 1 year. 12 Patients with CSCLC and DM are prone to early death, defined as an overall survival time of less than 3 months after diagnosis. Identification of the factors contributing to early death may help clinicians to develop individualized treatment strategies, which might in turn improve overall survival and alleviate the suffering caused by the disease. There are currently no published in-depth studies on mortality within 3 months of metastatic CSCLC. There is thus a need for a simple-to-use, predictive model to distinguish the potential risk factors for early death.
Nomograms are widely used to predict DM and early death in cancer patients; however, most existing nomograms are derived from patients with metastatic SCLC or CSCLC without DM, and there are no nomograms for diagnosis and early death in patients with CSCLC and DM.
We therefore aimed to construct novel nomograms to predict DM in patients with CSCLC and early death in patients with CSCLC and DM, based on demographic and clinicopathologic variables from the Surveillance, Epidemiology, and End Results (SEER) database. The resulting nomograms may facilitate individualized patient care and medical therapy.
Materials and methods
Patients and data collection
This retrospective study used data extracted from the SEER database between 2004 and 2015. The inclusion criteria were as follows: (a) malignant tumor located in the main bronchus and lung (site code: C340-C349); (b) patients diagnosed with primary CSCLC (histology code: 8045); (c) diagnostic confirmation based on positive histology, laboratory test/marker, or microscopic confirmation; and (d) complete T stage, N stage, and M stage according to the 6th edition American Joint Committee on Cancer staging system. We excluded patients with unknown overall survival time. Data on clinical variables including age at diagnosis, sex, race, grade, tumor site, surgery, chemotherapy, radiotherapy, lung metastasis, brain metastasis, bone metastasis, and liver metastasis were also obtained. We also excluded patients with missing data on race. Age at diagnosis was divided into under 60 years old, 60 to 70 years old, and over 70 years old. Tumor grade was divided into grade I to II, grade III to IV, and unknown.
Patients were divided randomly into training and validation cohorts at a ratio of 7:3, using R software (www.r-project.org). The selection processes for the two study cohorts are shown in Figure 1.

Study selection process. CSCLC, combined small cell lung cancer; DM, distant metastasis; SEER, Surveillance, Epidemiology, and End Results database.
This retrospective study was approved by the Institutional Review Board of The First Affiliated Hospital Shaoyang University and was exempt from the need for informed consent. All information from the SEER database has been de-identified and no personal identifying information was used in our analysis, and informed consent was therefore not required. The reporting of this study conforms to STROBE guidelines. 13
Statistical analysis
All statistical analyses were carried out using R software (version 4.0.3) and a P-value <0.05 (two-sided) was considered statistically significant. Numbers and percentages were used to summarize categorical variables. The training and validation cohorts were extracted randomly using R software at a ratio of 7:3, and the distribution and difference between the two cohorts were examined by χ2 or Fisher’s exact tests. Independent risk factors for DM in patients with CSCLC and for early death in patients with CSCLC and DM were analyzed by univariate and multivariate logistic regression analyses. Risk factors with a P-value <0.05 in univariate analysis were included in the multivariate analysis. A predictive nomogram was constructed based on the results of multivariate analysis in the training cohort and was validated in the validation cohort using the “rms” package. The discrimination abilities of the nomograms in both the training and validation cohorts were determined using receiver operating characteristic (ROC) curves and the corresponding area under the curve (AUC), and the C-index. Calibration curves were plotted to determine the consistency between the predicted and actual probabilities. Decision curve analysis (DCA) curves were generated to evaluate the clinical benefits and improved performance of the nomogram.
Results
Baseline characteristics
A total of 788 patients diagnosed with primary CSCLC met the inclusion criteria and were enrolled in the present study between 2004 and 2015, including 364 patients (46.2%) diagnosed with DM. The patients’ baseline characteristics are presented in Table 1. The DM rates were similar in the training and validation cohorts (46.5% and 45.6%, respectively).
Baseline characteristics of patients with combined small cell lung cancer.
DM-related risk factors
We explored potential risk factors for DM by univariate logistic regression analysis. Sex, tumor site, T stage, and N stage were all correlated with DM (P < 0.05) and were further included in multivariate analysis, which identified male sex, main bronchus, T4 stage, and higher N stage as independent risk factors for DM in patients with CSCLC (Table 2).
Univariate and multivariate logistic regression analyses of risk factors associated with distant metastasis in patients with combined small cell lung cancer.
HR, hazard ratio; CI, confidence interval.
Nomogram construction and validation
We constructed a diagnostic risk nomogram based on the four independent risk factors in multivariate analysis (Figure 2). The C-indexes in the training and validation cohorts were 0.769 (95% confidence interval: 0.730–0.808) and 0.764 (0.704–0.823), respectively, and the AUCs of the ROC curves were 0.769 (0.729–0.808) and 0.769 (0.704–0.824), respectively, indicating that both these factors showed good discrimination (Figure 3a, b). Calibration curves showed good consistency for predicting DM between the predicted and actual results (Figure 3c, d). In addition, DCA showed that the nomogram provided excellent benefits. A larger net benefit was associated with a better predictive performance of the diagnostic risk model, implying that the nomogram could provide an effective and precise tool for DM prediction (Figure 3e, f).

Diagnostic nomogram for predicting distant metastasis in patients with combined small cell lung cancer. Tumor site code: 1: left upper lobe; 2: left lower lobe; 3: right upper lobe; 4: right middle lobe; 5: right lower lobe; 6: main bronchus; 7: unspecific. DM, distant metastasis.

Validation of the diagnostic nomogram. Receiver operating characteristic curve (a), calibration curve (c), and decision curve analysis (DCA) (e) in the training cohort, and receiver operating characteristic curve (b), calibration curve (d), and DCA (f) in the validation cohort. DM, distant metastasis; AUC, area under the curve.
Factors associated with early death
Patients lacking data on surgery, radiotherapy, and chemotherapy were excluded from the study on early death, and this cohort therefore included 363 patients with metastatic CSCLC, including 150 patients (41.3%) who experienced early death. The demographic characteristics of all patients with DM are summarized in Table 3. There were no significant differences between the training and validation cohorts. Univariate and multivariate logistic regression analyses identified four independent risk factors associated with early death in patients with DM, namely age, surgery, chemotherapy, and liver metastasis (Table 4).
Baseline characteristics of patients with combined small cell lung cancer and distant metastasis.
Univariate and multivariate logistic regression analyses of risk factors associated with early death in patients with combined small cell lung cancer and distant metastasis.
HR, hazard ratio; CI, confidence interval.
Nomogram construction and validation
We constructed a nomogram to predict early death in patients with CSCLC and DM based on the independent risk factors screened by multivariate analysis (Figure 4). The C-indexes in the training and validation cohorts were 0.840 (0.792–0.889) and 0.898 (0.833–0.963), respectively, and the AUCs were 0.840 (0.791–0.89) and 0.898 (0.833–0.964), respectively (Figure 5a, b). The calibration curves were good predictors and observers in terms of early death risk (Figure 5c, d). In addition, DCA showed a significant improvement in the net benefit of the nomogram with a wide range of threshold probabilities in both the training and validation sets, implying good clinical application value of this nomogram (Figure 5e, f).

Risk nomogram for predicting early death in patients with combined small cell lung cancer and distant metastasis. Tumor site code: 1: left upper lobe; 2: left lower lobe; 3: right upper lobe; 4: right middle lobe; 5: right lower lobe; 6: main bronchus; 7: unspecific.

Validation of the risk nomogram. Receiver operating characteristic curve (a), calibration curve (c), and decision curve analysis (DCA) (e) in the training cohort, and receiver operating characteristic curve (b), calibration curve (d), and DCA (f) in the validation cohort. EM, early death; AUC, area under the curve.
Discussion
To the best of our knowledge, this was the first study to identify independent risk factors for DM and early death in patients with CSCLC based on a large-scale, population-based, national cancer database. We also established two risk nomograms with reliable accuracy and discriminative ability and validated these by ROC, calibration, and DCA curves. The results suggested that these two nomograms may provide a practical tool to help clinicians identify patients at high risks of DM and early death, as well as to determine the optimal clinical interventions for patients diagnosed with metastatic CSCLC.
The incidence of DM at the time of initial diagnosis of SCLC is generally more than 60%, and patients with metastatic SCLC have poorer overall and median survival, 14 whereas the incidence of DM in the current cohort was only 46.3%. Accurately identifying patients at high risk of metastasis may facilitate the implementation of individualized therapeutic strategies; however, despite extensive research on extensive-stage SCLC, studies focusing on metastatic CSCLC are scarce. Cai et al. indicated that the liver was the most common metastatic site and the lung was the least common, 15 and Lu et al. established a diagnostic nomogram based on age, sex, N stage, and tumor size to predict liver metastasis with good accuracy in SCLC. 16 Other researchers exploring metastasis-related factors in SCLC validated some laboratory markers, such as interferon-induced transmembrane protein 1, KMT2C, and DLL3, as useful and efficient biomarkers to estimate the risk of metastasis in SCLC17–19; however, these are expensive, time-consuming, and invasive, thus limiting their clinical use, and their accuracy and efficiency have not been confirmed in large cohorts. Nomograms based on clinical variables have been used to predict DM efficiently in other tumors. Chao et al. combined age, sex, T stage, N stage, histology, tumor location, and pathological grade to predict metastasis in esophageal cancer, 20 while a nomogram based on race, age, primary site, tumor depth, size, and grade was established in patients with superficial gastric cancer without lymph node metastasis. 21 Nevertheless, no studies have concentrated on metastasis in CSCLC based on clinical characteristics.
In the current study, we identified four risk factors for DM in CSCLC using logistic regression analyses, namely sex, tumor site, T stage, and N stage. As for other tumors, T stage and N stage were confirmed to be associated with DM in CSCLC.19,22 Patients with centrally located primary tumors have visibly worse survival outcomes compared with patients with peripheral SCLC. 23 This may support the current finding that DM was more frequently associated with CSCLC located in the main bronchus, but further investigations are required to clarify this phenomenon. Males are predominant in both SCLC and CSCLC,12,24,25 and male sex was associated with a high probability of DM, as reported in previous studies.8,14,22
Similarly, a novel tool was established to predict early death in patients with uterine sarcoma, which provided a better prediction than the FIGO stage system. 26 Shi et al. also built a model to predict early death in patents with stage IV esophageal cancer based on histological grade, chemotherapy, liver metastasis, and bone metastasis. 27 These studies demonstrate the feasibility and significance of nomograms for predicting early mortality in patients with cancer. Notably however, recent studies have focused on the prognoses of patients with NSCLC and SCLC without metastases or with brain metastasis, and early death in patients with CSCLC and DM has not been explored. Shen et al. revealed that age and liver metastasis were risk factors for early death in patients with lung cancer with synchronous brain metastasis, 28 while another study demonstrated that age, surgery, and chemotherapy were independent risk factors for early death in patients with advanced epithelial ovarian cancer (both FIGO stage III–IV). 29 In line with these studies, we found that age was significantly correlated with early death in patients with metastatic CSCLC. Wang et al. reported that extensive disease, no surgery, elevated neuron-specific enolase, and elevated neutrophil/lymphocyte ratio were associated with worse prognosis in patients with CSCLC, 25 while surgical resection significantly improved overall and lung cancer-specific survival in a propensity score-matched analysis of stage III SCLC patients from America and China. 30 In the current study, surgery improved the risk of early death in patients with metastatic CSCLC; notably however, the limited number of patients with metastatic CSCLC undergoing surgery means that this result must be treated with caution, and more large prospective clinical trials are needed to clarify the benefits of surgery in patients with CSCLC and DM. In a study of 207 patients with metastatic SCLC, response to chemotherapy was the most important factor related to a better prognosis. 31 Although chemotherapy plus immune checkpoint inhibitors are recommended for patients with extensive-stage SCLC, platinum plus etoposide has also been recommended for many years and is a common treatment for advanced SCLC in clinical practice.32,33 Chemotherapy reduced the risk of early death in patients with metastatic CSCLC in our study, consistent with previous reports.34,35 The benefits of radiotherapy for metastatic CSCLC thus remain unknown and require further investigation.
Liver metastasis was usually an early event whereas bone and brain metastases tended to be late occurrences in a series of 1009 patients with SCLC. 23 In addition, patients with extensive-stage with liver metastasis alone or in combination with other organs have worse outcomes. The liver is the most common site of DM in solid tumors because of its anatomical location and vascular features. 36 Moreover, Wu et al. revealed that liver metastasis was an independent prognostic factor leading to shorter progression-free and overall survival in patients with stage IV lung adenocarcinoma (median progression-free survival, 6.7 vs 11.2 months, P < 0.001; median overall survival, 9.2 vs 17.5 months, P < 0.001). 37 As expected and in line with published data, our study showed that the liver was the most metastatic organ, associated with a significantly increased risk of early death.
This study had some limitations. First, this was a retrospective study with potential selection bias, and prospective randomized controlled studies are therefore required to confirm our results. Second, inherent bias could not be avoided because of the lack of external validation. Third, no detailed information on the CSCLC components was included because the data are not available in the SEER database. Finally, the data on tumor grade and metastatic organs were incomplete, and their impact on metastasis and early death may thus have been underestimated.
Conclusion
We comprehensively demonstrated that sex, tumor site, T stage, and N stage were independent risk factors for DM in patients with CSCLC, while age, liver metastasis, surgery, and chemotherapy were independent risk factors associated with early death in patients with CSCLC and DM. These simple-to-use nomograms will facilitate precise clinical decision-making and improve the prognosis for patients with metastatic CSCLC.
Supplemental Material
sj-pdf-1-imr-10.1177_03000605241238689 - Supplemental material for Risk factors and nomograms for diagnosis and early death in patients with combined small cell lung cancer with distant metastasis: a population-based study
Supplemental material, sj-pdf-1-imr-10.1177_03000605241238689 for Risk factors and nomograms for diagnosis and early death in patients with combined small cell lung cancer with distant metastasis: a population-based study by Hui Yin, Zhi Hu and Jie Yang in Journal of International Medical Research
Footnotes
Acknowledgements
We thank the developers of the R package who shared their code freely. We also acknowledge the contribution of the SEER database for access to the data.
Author contributions
The study was conceptualized and designed by HY and ZH. The figures and tables were generated by HY. Statistical results were completed by HY. The initial manuscript was written by HY and ZH. This study was scrutinized in every aspect by JY who critically reviewed the original article.
Declaration of conflicting interests
None.
Funding
None.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
