Abstract
Introduction
Lung cancer, the leading cause of cancer-related mortality globally accounting for a substantial proportion of deaths in both men and women.1,2 It can be predominantly categorized into 2 major subtypes: SCLC and non-small cell lung cancer (NSCLC). 3 Among its subtypes, SCLC is a highly aggressive neuroendocrine tumor responsible for approximately 250 000 deaths annually. 4 The unique clinical behaviors and treatment challenges of SCLC—such as its aggressive nature, rapid recurrence, early metastasis, and poor prognosis—emphasize the need for tailored therapeutic strategies. 5
In recent years, surgical intervention has emerged as a viable treatment option for SCLC, with large retrospective studies suggesting that surgery may offer superior outcomes compared to chemoradiotherapy in select cases.6,7 Several international guidelines from American, European, and Japanese medical associations recommend surgery with adjuvant chemotherapy for early-stage SCLC patients without mediastinal nodal disease.8,9 In the study by Yang et al, 10 29.6% (681/2301) of SCLC patients were identified as eligible for surgery, characterized mainly by T1N0 tumors. Radiomics tools can further help identify patients who may be suitable candidates for potential neoadjuvant chemotherapy before surgery.
Complete resection of SCLC lesions while minimizing damage to healthy lung tissue and addressing peritumoral disease remains a significant clinical challenge. CT is the primary imaging modality for SCLC diagnosis, provides critical morphological information for screening and evaluation. 11 However, considerable overlap in CT features among lung cancer subtypes often complicates accurate diagnosis based solely on visual assessment.12,13 Radiomics offers a solution by extracting high-dimensional quantitative imaging features for improved tumor characterization, potentially supporting diagnosis, staging, and treatment planning.14,15 While CT-based radiomics has shown promise in guiding precise surgical decisions in SCLC, but most studies to date have focused on histological subtyping, 16 and its application in surgical decision-making remains limited.
Recent research indicates that tumor’s physiological microenvironment, particularly the peritumoral region, harbors critical prognostic information that may be overlooked by analyses limited to the tumor itself.17-20 To address this limitation, we incorporated peritumoral radiomic features extracted from the extended 2-mm region surrounding visible lung lesions on CT. Using these features, we developed a radiomics model aimed at differentiating SCLC from NSCLC. This study aims to provide a more comprehensive understanding by integrating both lesion and peritumoral radiomic features with clinical data to assess their potential for distinguishing SCLC from NSCLC. Furthermore, we explore the potential applications of these radiomic signatures in guiding surgical decision-making for SCLC patients. Ultimately, this approach aims to improve accuracy in characterizing lesion extent, facilitate early diagnosis, enable precise staging, and support the development of personalized treatment strategies.
Materials and Methods
Data Collection
This retrospective study enrolled 113 patients with histopathologically confirmed primary lung tumors, diagnosed between March 2022 and August 2023, comprising 54 cases of SCLC and 59 cases of NSCLC. Inclusion criteria included histopathological diagnosis of primary lung cancers and no prior treatment. Exclusion criteria included history of other primary malignancies, certain pathological types, incomplete clinical data, prior treatment, and lesions with a maximum diameter <1 cm.
Collected clinical data included demographic information (sex, age, smoking history, lymph node metastasis), tumor characteristics (tumor diameter, imaging features), and laboratory biomarkers, which included neuron-specific enolase (NSE), carcinoembryonic antigen (CEA), cancer antigen 125 (CA125), and cytokeratin 19 fragment (CYFRA21-1). These data were used to characterize the patient population and explore potential correlations between clinical features and radiomic characteristics.
The Ethics Committee of The First Affiliated Hospital of Zhejiang Chinese Medical University approved the study (2023-KLS-098-01). The requirement for informed consent was waived due to the retrospective study design.
CT Scanning Protocol
CT imaging was performed using Toshiba Aquilion ONE (Toshiba Medical Systems, Japan) and Siemens Somatom Sensation 64-slice scanners (Siemens Healthcare, Germany). Scanning parameters for the Toshiba system included: tube voltage 120 kV, tube current 200 mA, detector collimation 0.5 mm × 64 mm, matrix 512 × 512, slice thickness 1.0 mm, rotation time 0.4 s, and reconstruction kernel FC51. Parameters for the Siemens system were: tube voltage 120 kV, automatic tube current modulation, detector collimation 0.6 mm × 64 mm, matrix 512 × 512, slice thickness 0.75 mm, and reconstruction kernel B3H. Standard lung windows (window level: −600 HU, window width: 1300 HU) were used, with adjustments as needed to visualize internal lesion characteristics. Images were routinely transferred to the Picture Archiving and Communication System (PACS) for storage.
Lesion Segmentation and Radiomics Feature Extraction
Image Selection
CT images of selected patients were retrieved from the PACS. Images were reviewed by radiologists with over 10 years of diagnostic experience to ensure the presence of complete sequences without artifacts before segmentation.
Lesion and Peritumoral Region Segmentation
Thin-slice axial CT images in DICOM format were imported into 3D Slicer (version 5.3.0) for segmentation. Regions of interest (ROIs) were manually delineated on lung window images, with reference to mediastinal windows, to ensure accurate lesion coverage. Each lesion was outlined, carefully avoiding blood vessels, bronchi, and extrathoracic normal tissues. The original ROIs were semi-automatically segmented and extended by 2 mm outward, with manual adjustments to exclude normal structures. For lesions with unclear margins, only regions with significantly high density were delineated. In cases with multiple lesions, the largest lesion was selected for analysis. Segmentation and review were performed independently by 2 experienced radiologists, each with over 10 years of expertise in thoracic imaging. In cases of disagreement, a third radiologist with more than 15 years of experience in thoracic imaging adjudicated the final decision. The workflow of the study is illustrated in Figure 1. Overall Workflow of This Study
Radiomics Feature Extraction, Selection, and Model Construction
The reporting of this study conforms to TRIPOD guidelines. 21
Using the pyradiomics module in 3D Slicer, 1050 radiomics features were automatically extracted for each ROI, including shape, first-order statistics, and texture features such as gray-level co-occurrence matrix (GLCM), gray-level size zone matrix (GLSZM), gray-level dependence matrix (GLDM), gray-level run-length matrix (GLRLM), and neighborhood gray-tone difference matrix (NGTDM). Advanced filtering techniques, including wavelet and Laplacian of Gaussian (LoG) filters, were applied to enhance feature extraction. Wavelet filters decomposed images into multiscale components, while LoG filters emphasized regions with high gray-level variation. Smaller sigma values highlighted finer textures, whereas larger values emphasized coarser patterns.22,23
Logistic regression models were constructed to calculate radiomics scores for each patient. The dataset was standardized and split into training and testing sets at a 7:3 ratio. Feature selection was performed using t-tests, least absolute shrinkage and selection operator (LASSO) regression, and maximum relevance minimum redundancy (mRMR). Logistic regression models were developed for original and expanded ROIs based on selected features. We also performed a five-fold cross-validation (internal validation), which is recognized as an effective way to generate unbiased patient-specific predictions. 24
Clinical and Clinical-Radiomics Nomogram Models
Independent clinical predictors for SCLC were identified through t-tests and LASSO regression, elucidating the key predictive factors. Following this, we developed a clinical logistic regression model incorporating the selected clinical variables. Additionally, these features were integrated with the radiomics score (rad scores) to construct a combined clinical-radiomics nomogram.
Statistical Analysis
Feature stability was evaluated using the Intraclass correlation coefficient (ICC). Here, an ICC of ≥0.75-0.89 was considered good reproducibility and an ICC ≥0.90 was considered excellent reproducibility as recommended by Koo et al. 25 Differences in continuous variables were analyzed using independent t-tests, while categorical variables were assessed with Fisher’s exact test or the chi-square test. Diagnostic performance was evaluated using receiver operating characteristic (ROC) curves and clinical impact curves (CIC). Key metrics included area under the curve (AUC), sensitivity, specificity, and accuracy. Statistical comparisons of ROC curves were performed using the DeLong method. 26 The Hosmer-Lemeshow test assessed nomogram calibration using the statsmodels package. Statistical analyses were conducted in Python (version 3.9.13), with significance set at P < .05.
Results
Reproducibility Analysis
Inter-observer reproducibility analysis of both the whole nodule and solid component volumes delineated by the 2 radiologists demonstrated good agreement, with all ICCs exceeding 0.75.
Comparison of Clinical Factors Between SCLC and NSCLC Patients
Comparison of Clinical Factors Between SCLC and NSCLC Patients
Selection of Radiomics and Clinical Features
Feature selection was performed using t-tests and least absolute shrinkage and LASSO regression, which retained 11 features from the original ROI and 20 features from the expanded ROI. The efficacy of LASSO regression in selecting texture features within lesions, along with the detailed procedure and results of feature selection, are illustrated in Figures 2 and 3. To further optimize feature dimensions and improve model efficiency, the mRMR classifier was employed, identifying the most relevant and least redundant features. A total of 10 representative features were selected from each ROI configuration, with their respective weight distributions displayed in Figure 4. Moreover, t-tests and LASSO analysis identified gender, smoking history, maximum tumor diameter, glitch, and NSE levels as independent clinical predictors for SCLC. Selection of Radiomics Texture Features Using t-test and LASSO Regression Feature Selection Flowchart Feature Weights Selected From Original and Expanded ROIs. (A) Feature Weights Within the Original ROI. (B) Feature Weights Within the 2-mm Expanded ROI


Model Performance Comparison
The clinical model demonstrated limited diagnostic capability, with an accuracy of 0.70, sensitivity of 0.77, and specificity of 0.65, primarily attributable to its low specificity, which resulted in a higher false-positive rate. The original ROI radiomics model showed improved diagnostic performance, but remained suboptimal. In contrast, the expanded ROI model achieved higher accuracy (0.83), sensitivity (0.80), and specificity (0.84), capturing additional lesion information and enhancing discrimination ability. As illustrated in Figure 5, the expanded ROI model’s AUC in the test set was 0.85 (95% CI, 0.80-0.97), outperforming the clinical model (AUC = 0.71, 95% CI, 0.79-0.90) and the original ROI model (AUC = 0.76, 95% CI, 0.76-0.87). Model Performance Comparison. (A) AUC of the Clinical Model. (B) AUC of the Original ROI Model. (C) AUC of the Expanded ROI Model. (D) AUC of the Clinical-Radiomics Nomogram
A clinical-radiomics nomogram was developed by combining expanded ROI radiomics features with clinical predictors for SCLC prediction, as shown in Figure 6. This model demonstrated superior performance, with an AUC of 0.96 (95% CI, 0.88-1.00), accuracy of 0.91, sensitivity of 0.92, and specificity of 0.90 in the test set. Hosmer-Lemeshow tests indicated good calibration in both the test (P = .891) and training sets (P = .663). DeLong tests confirmed significant differences in diagnostic performance between models (P < .05). The diagnostic performance of each model is detailed in Table 2. Clinical-Radiomics Nomogram Constructed by Integrating Radiomics Features and Clinical Factors to Diagnose SCLC Diagnostic Performance Metrics of Each Model in the Test Set
CIC analysis, as presented in Figure 7, revealed that the clinical-radiomics nomogram consistently predicted more true-positive cases at a high-risk threshold above 72%, whereas other models achieved optimal benefits at thresholds >80%. These findings suggest that the clinical-radiomics nomogram has the potential to improve treatment decision-making in clinical practice. The Clinical Impact Curves. (A) CIC of the Clinical Model. (B) CIC of the Original ROI Model. (C) CIC of the Expanded ROI Model. (D) CIC of the Clinical-Radiomics Nomogram
Discussion
Conventional single-energy CT imaging evaluates tumor characteristics based on attenuation, morphology, and invasiveness, with treatment response assessed through changes in tumor volume and density. 20 However, relying solely on morphological assessments often falls short in accurately determining tumor pathology, highlighting the need for more comprehensive evaluation methods. Radiomics enhances tumor characterization by extracting extensive quantitative imaging features, enabling a detailed depiction of tumor phenotypes and offering a more nuanced understanding of tumor biology.22,23 Advanced statistical and machine learning methods can identify clinically significant features, providing high accuracy and non-invasive evaluation potential.27-30
The biological significance of the peritumoral microenvironment in tumor growth and progression has been well established. 31 However, radiomics studies on SCLC have primarily focused on intralesional features and prognosis,32-34 with limited exploration of peritumoral characteristics. Integrating radiomics features from both the lesion and the peritumoral microenvironment may improve diagnostic accuracy and predictive efficiency, offering a novel perspective for surgical guidance by incorporating microscopic textural complexity and multidimensional data. This approach has the potential to provide a more comprehensive understanding of SCLC, enabling more accurate diagnosis and treatment planning.
In this study, we constructed 2 ROIs based on SCLC and NSCLC lesions respectively: the original ROI, which is based on visually identifiable SCLC or NSCLC lesion areas on CT images, and an expanded ROI with a 2-mm margin from the original ROI. Meanwhile, we identified clinical features such as gender, smoking history, maximum tumor diameter, glitch, and NSE levels as effective differentiators between SCLC and other lung cancers. After extracting the radiomics features from the 2 ROIs and performing dimensionality reduction, logistic regression models were established for the clinical variables, original ROI, and expanded ROI to evaluate their diagnostic performance. The results showed that compared to the clinical models, the radiomics-based model derived from the original ROI demonstrated superior performance compared to the clinical model, with higher accuracy (0.70 vs 0.76), specificity (0.65 vs 0.75), and sensitivity (0.77 vs 0.74). In previous studies investigating the role of radiomics in the differential diagnosis of lung cancer, Wu et al 35 reported an AUC of 0.72 for distinguishing lung cancer subtypes. Similarly, Junior et al 36 demonstrated an AUC of 0.71 in the training cohort when utilizing radiomics features from CT images to classify lung cancer subtypes, which is comparable to the performance of our original ROI model. In contrast, Liu et al reported a higher AUC of 0.82. The lower performance in our study compared to Liu et al 37 may be attributed to differences in sample size, which could have influenced the predictive performance of the radiomics models. However, our analysis of the expanded ROI showed that the results from our expanded ROI model outperformed theirs. Furthermore, compared to the radiomics model constructed using the original ROI, the expanded ROI model notably further improved accuracy (0.83 vs 0.76), sensitivity (0.80 vs 0.74), specificity (0.85 vs 0.75), and AUC (0.85 vs 0.76).A related conclusion has also been discussed in the study by Jia et al. 38 In their study, the expanded ROI demonstrated significantly higher accuracy (0.89 vs 0.82), sensitivity (0.85 vs 0.86), specificity (0.84 vs 0.79), and AUC (0.92 vs 0.86) compared to the original ROI. The advantage of the expanded ROI radiomics model over the original ROI model may lie in its ability to capture subtle features in the peritumoral microenvironment that are often undetectable by human observation, which also suggests that it may have captured certain peritumoral information. This finding is consistent with evidence suggesting that extending surgical margins improves overall survival in SCLC.39,40
Moreover, we calculated the rad score derived from the expanded ROI model and integrated it with the clinical features used to construct the clinical model. Based on this, we developed a clinical-radiomics nomogram that achieved superior diagnostic performance (AUC = 0.96), significantly outperforming both the clinical model (AUC = 0.76) and the expanded ROI radiomics model alone (AUC = 0.85). Our findings align closely with those reported by Liu et al, 37 where their radiomics nomogram achieved an AUC of 0.94 compared to our 0.96. In the nomogram, rad scores and NSE hold significant diagnostic weight. NSE, a well-established neuroendocrine marker, continues to play an important role in distinguishing SCLC from NSCLC when combined with imaging-based predictors.41,42 Similarly, in the studies by Liu et al 37 and Gigika et al, 42 NSE was also identified as a crucial clinical marker. However, unlike previous studies where clinical-radiomics nomograms offered no significant advantages over radiomics models alone,34,43 our approach demonstrated superior performance, likely due to the incorporation of peritumoral features. The nomogram’s visual representation facilitates clinical decision-making, providing an effective tool for individualized prediction of SCLC and NSCLC. CIC analysis further validates the clinical utility of the nomogram. In contrast to the clinical model and the 2 standalone radiomics models, the clinical-radiomics nomogram demonstrated high concordance between predicted and actual SCLC cases at the moderate-risk threshold (0.71), whereas the other 3 models required higher risk thresholds exceeding 0.8. This finding suggests that extending the surgical margin by an additional 2 mm could enhance SCLC resection and improve postoperative outcomes, highlighting its potential for supporting surgical risk assessment, postoperative prognosis prediction, and enabling more precise surgical planning.
Despite its promising findings, this study has limitations. It is a single-center retrospective study with a limited sample size. The number of SCLC cases is relatively small, potentially introducing bias and reducing generalizability. Lymph node status is a key prognostic factor for non-metastatic lung cancer,44-47 with radiomics providing superior accuracy in detecting metastasis through primary tumor analysis compared to size-based CT criteria.48,49
In future studies, we plan to conduct a prospective multi-center validation study involving 3-5 medical centers across different regions, targeting a sample size increase of at least 50%. This will include the establishment of standardized imaging protocols and centralized standard operating procedures (SOPs) for image processing and feature extraction. Additionally, we will implement training for research personnel to ensure technical standardization across centers. To address clinical integration, we will develop decision support tools integrated with PACS systems to provide real-time radiomics-based risk stratification for lymph node involvement, directly informing surgical candidacy and operative approach selection. The prospective validation will specifically focus on surgical cohorts to evaluate how radiomics predictions influence treatment decisions and patient outcomes. We will also incorporate comprehensive mediastinal staging data to evaluate the model’s performance in predicting lymph node involvement, which represents a critical prognostic factor that can significantly influence treatment planning and patient outcomes. We aim to refine the diagnostic and decision-making framework combining CT radiomics with clinical features for SCLC.
Conclusions
This study developed an efficient clinical-radiomics nomogram by integrating CT radiomics and clinical features of SCLC patients, demonstrating excellent diagnostic performance. Compared to traditional clinical models, the expanded ROI radiomics model significantly improved diagnostic accuracy and specificity, highlighting the importance of peritumoral radiomic features in SCLC diagnosis. The nomogram not only achieved a high AUC in the test set but also showed superior predictive performance at intermediate-risk thresholds, offering a precise, non-invasive tool with potential utility in individualized treatment planning. Future studies should incorporate longitudinal patient survival data and prospective surgical outcome assessments to comprehensively evaluate the clinical utility of this nomogram.
Footnotes
Ethial Approval
The study was reviewed and approved by the Ethics Committee of The First Affiliated Hospital of Zhejiang Chinese Medical University, located in Zhejiang Province, China. The approval number is 2023-KLS-098-01, and the approval date is July 19, 2023. The ethical review confirmed compliance with the Helsinki Declaration and the “Measures for the Ethical Review of Biomedical Research Involving Humans”.
Patient Consent
Not applicable. The requirement for informed consent was waived due to the retrospective study design.
Author Contributions
Conceptualization, Shiwei Wang; Data curation, Hao Zheng; Formal analysis, Lanqi Fu; Investigation, Yujie Ding; Methodology, Shucheng Huang; Resources, Junna Wang; Software, Jie Lin; Supervision, Junna Wang; Visualization, Yuan Dong; Writing – original draft, Jie Lin.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by Medical and Health Research Project of Zhejiang Province (2023KY853); Zhejiang Traditional Chinese Medicine Science and Technology Project (2023ZL393).
Declaration of Conflicting Interest
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
