Abstract
Introduction
This study aimed to assess the predictive value of integrating ultrasonographic features, pathological characteristics, and inflammatory markers for axillary lymph node metastasis (ALNM) in early-stage breast cancer (BC), and to construct a corresponding nomogram.
Methods
A retrospective review was conducted on clinical data from 287 early-stage BC patients who underwent surgery at Shenzhen Luohu People’s Hospital between January 2020 and March 2024. Based on histopathological evaluation, patients were categorized into ALNM-positive (ALNM+) and ALNM-negative (ALNM−) groups. Independent predictors of ALNM were identified using univariate and multivariate logistic regression analyses. These variables were used to develop a predictive nomogram. Model performance was evaluated by concordance index (C-index), receiver operating characteristic (ROC) curve, calibration plot, and decision curve analysis (DCA), assessing its accuracy, discrimination, calibration, and clinical utility.
Results
Multivariate analysis identified vascular invasion, neutrophil-to-lymphocyte ratio (NLR), lymphocyte count, tumor size, lymph node echogenicity, and margin characteristics as independent predictors of ALNM. The nomogram showed excellent discriminative ability (AUC = 0.944, 95% CI: 0.906-0.981; C-index = 0.944, 95% CI: 0.906-0.982) and good calibration (Brier score = 0.063). DCA indicated meaningful clinical benefit across relevant threshold probabilities.
Conclusion
The nomogram developed in this study demonstrates strong predictive performance and clinical value for preoperative ALNM assessment in early-stage BC. It may serve as a practical tool to guide individualized surgical and therapeutic decision-making.
Introduction
Breast cancer (BC) is the most prevalent malignancy among women worldwide and is marked by significant heterogeneity and variable aggressiveness. 1 Despite advances in standardized treatments, axillary lymph node metastasis (ALNM) remains a major indicator of poor prognosis. Axillary lymph node (ALN) status is essential not only for TNM staging but also for guiding surgical and adjuvant treatment decisions.2,3 Sentinel lymph node biopsy (SLNB) has replaced axillary lymph node dissection (ALND) as the preferred method for evaluating ALN status in early-stage BC. 4 However, SLNB lacks perfect diagnostic accuracy, with reported false-negative rates in high-quality studies ranging from approximately 4.6% to 16.7%, and it is associated with complications such as lymphedema, hematoma, nerve injury, and upper limb dysfunction, all of which can compromise postoperative quality of life.5,6 With improved imaging techniques, most patients are now diagnosed at earlier stages, with smaller tumor burdens and lower ALNM rates, 7 reducing the clinical benefit of both SLNB and ALND. These challenges highlight the urgent need for accurate, non-invasive methods to assess ALN status preoperatively, aiming to improve diagnostic precision while minimizing surgical morbidity.
Conventional imaging techniques—including mammography, ultrasonography, computed tomography (CT), and magnetic resonance imaging (MRI)—are commonly used to evaluate ALNM. However, their sensitivity in detecting micrometastases remains limited. 8 Diagnostic accuracy varies across modalities and is often influenced by operator experience, introducing potential bias in preoperative assessment.9,10 Current clinical prediction models mainly rely on imaging findings, clinicopathological features, and conventional tumor biomarkers,8,11-13 but largely neglect the critical role of the tumor-associated inflammatory microenvironment in metastasis. Inflammation contributes to tumor progression by modulating immune responses and establishing a pro-metastatic niche. Chronic inflammation, in particular, promotes oncogenesis through sustained tissue damage and transformation, with approximately 20% of cancers linked to long-standing inflammatory conditions.14,15 Recent studies highlight inflammation as a key driver of breast cancer metastasis. 16 The inflammatory microenvironment facilitates lymphatic spread through two main mechanisms. First, tumor-associated neutrophils (TANs) enhance lymphangiogenesis by releasing matrix metalloproteinase-9 (MMP-9) and vascular endothelial growth factor (VEGF), which remodel the extracellular matrix and create pathways for tumor invasion. 17 Second, systemic immune-inflammatory markers—such as the neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio (PLR), and systemic immune-inflammation index (SII)—indicate a shift toward pro-inflammatory, immunosuppressive states that support tumor dissemination. 18 Unlike static anatomical imaging, blood-based inflammatory biomarkers offer dynamic, cost-effective, and readily available insight into tumor-host interactions. Their integration into diagnostic workflows may improve the accuracy of ALNM prediction and support more personalized therapeutic approaches in breast cancer.
This study presents a novel nomogram that combines inflammatory markers, ultrasound features, and pathological parameters to predict preoperative ALNM risk. Integrating these complementary indicators enhances predictive accuracy and supports more informed clinical decision-making.
Materials and Methods
Study Population
In this retrospective study, clinical data were collected from patients who were pathologically diagnosed with breast cancer and underwent surgical treatment at the Department of Breast and Thyroid Surgery, Luohu People’s Hospital of Shenzhen, between January 2020 and March 2024. Based on postoperative pathology, patients were categorized into axillary lymph node metastasis-positive (ALNM+) and ALNM-negative (ALNM-) groups. The inclusion criteria were as follows: (1) Primary early-stage breast cancer confirmed by histopathological examination; (2) Complete clinical records available; (3) Preoperative breast ultrasonography and routine blood testing performed; (4) Axillary lymph node status determined via SLNB or ALND; (5) All surgeries performed by the same surgical team. The exclusion criteria included: (1) Presence of severe preoperative infection; (2) Diagnosis of multiple primary malignant tumors; (3) Receipt of neoadjuvant therapy or evidence of distant metastasis before surgery; (4) Age over 80 years. A total of 287 patients were included, with 233 assigned to the ALNM+ group and 54 to the ALNM− group. The patient selection process is outlined in Figure 1. The study was conducted in accordance with the TRIPOD guidelines.
19
Ethical approval was granted by the Ethics Committee of Shenzhen Luohu People’s Hospital (Approval No. 2024-LHQRMYY-KYLL-044; Approval Date: July 21, 2024). A waiver of informed consent was approved due to the retrospective nature of the study. Flow Chart. Illustration of Patient Inclusion
Ultrasound Examination
Preoperative breast ultrasonography was performed on all patients by two experienced sonographers, each with more than 5 years of specialization in breast imaging. All examinations were conducted independently, and the sonographers were blinded to the histopathological results. The following breast sonographic features were recorded: tumor size, shape, echogenicity (hypoechoic or hyperechoic), posterior acoustic features, lymph node echotexture, margin characteristics, presence of a hyperechoic halo, calcifications, and color Doppler vascularity (classified as absent, minimal, moderate, or marked). All imaging parameters were classified according to the fifth edition of the American College of Radiology Breast Imaging Reporting and Data System (BI-RADS) Ultrasound Lexicon. 20
Clinical Data Collection
Clinical and pathological data were extracted from the hospital’s electronic medical record system. Collected variables included patient age, tumor location, histological grade (I-III), estrogen receptor (ER), progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2), Ki-67, CK5/6, P63, vascular invasion, preoperative hematologic parameters (neutrophil, lymphocyte, monocyte, and platelet counts), and serum tumor markers (CEA, CA125, and CA153). Inflammatory indices were calculated as follows: NLR = neutrophil count/lymphocyte count; PLR = platelet count/lymphocyte count; LMR = lymphocyte count/monocyte count; and SII = (platelet count × neutrophil count)/lymphocyte count. Hormone receptor and HER2 statuses were assessed in accordance with established clinical guidelines. 21 ER and PR positivity were defined as nuclear staining in ≥1% of tumor cells by immunohistochemistry (IHC). HER2 expression was considered negative with IHC scores of 0 or (+), positive with (+++), and equivocal with (++). Equivocal cases were further evaluated by fluorescence in situ hybridization (FISH). HER2 amplification confirmed by FISH was classified as positive; non-amplified cases were deemed HER2-negative.
Cutoff Determination and Variable Stratification
Optimal cutoff values for tumor markers (CEA, CA125, CA153), hematologic indices (neutrophil, lymphocyte, monocyte counts), inflammatory markers (NLR, PLR, LMR, SII), and tumor size were determined using the Youden index derived from receiver operating characteristic (ROC) curve analysis. All continuous variables were subsequently dichotomized into high and low levels based on these thresholds.
Statistical Analysis
All statistical analyses were performed using IBM SPSS Statistics (version 23.0) and R software (version 4.3.2). Continuous variables following normal distribution were expressed as mean ± standard deviation (SD) and compared using independent t-tests. Non-normally distributed variables were presented as median (Q1, Q3) and analyzed using the Mann–Whitney U test. Categorical variables were summarized as counts and percentages [n (%)] and compared using the Chi-square test. Univariate logistic regression was conducted to examine associations between clinical variables and ALN status. Variables with P < 0.05 were ranked by importance using the random forest algorithm and included in multivariate logistic regression to identify independent predictors of ALN metastasis. Significant predictors were incorporated into a nomogram model. Model performance was evaluated by assessing discrimination (concordance index [C-index] and area under the curve [AUC]), calibration (calibration plots), and clinical utility (decision curve analysis [DCA]). A two-sided P-value <0.05 was considered statistically significant.
Results
Clinicopathological Features of Patients
A total of 287 patients with early-stage breast cancer were included based on predefined eligibility criteria. Patients were stratified into axillary lymph node metastasis-positive (ALNM+, n = 233) and metastasis-negative (ALNM−, n = 54) groups based on histopathological findings. The mean age was 51 years (range: 24-80 years), with no significant difference between groups (P > 0.05).
The incidence of vascular invasion was significantly higher in the ALNM+ group (48.2%) than in the ALNM- group (15.1%) (P < 0.001), as shown in Figure 2A. Histological grade II-III tumors were more prevalent in the ALNM+ group, reflecting a higher malignancy grade (P < 0.01). Two Groups of Patients With Axillary Lymph Node Metastasis (A) Proportion of Vascular invasion; (B) Proportion of Tumor Size
Univariate Analysis of Preoperative Clinicopathological Characteristics
Univariate Analysis of Preoperative US Characteristics
Predictors of ALNM in Early Breast Cancer
Figure 3A illustrates the correlations among 28 clinical and imaging variables. Univariate logistic regression identified 18 factors significantly associated with ALNM, including tumor location, vascular invasion, PR status, Ki67, P63, histological grade, CA125, CA153, lymphocyte count, NLR, SII, tumor size, lymph node echotexture and echogenicity, calcification, posterior acoustic features, margin characteristics, and blood flow (P < 0.05; Tables 1 and 2). A random forest algorithm ranked these variables by predictive importance, highlighting tumor size and lymph node echotexture as the most influential (Figure 3B). Multivariate logistic regression identified six independent predictors of ALNM: vascular invasion (OR = 6.24, P < 0.001), lymphocyte count (OR = 161.04, P < 0.001), NLR (OR = 3.58, P < 0.05), tumor size (OR = 17.63, P < 0.001), lymph node echotexture (OR = 16.70, P < 0.001), and margin characteristics (OR = 5.44, P < 0.001) (Table 3). (A) Correlations Between Variables; (B) the Importance Ranking of Significant Variables in Univariate Analysis Multivariate Analysis of Preoperative Clinicopathological Characteristics Associated With LNM
Nomogram Construction and Validation
A nomogram was constructed using six independent predictors to estimate the preoperative risk of ALNM in early-stage breast cancer (Figure 4). Each predictor was assigned a score based on its relative contribution, enabling calculation of a total risk score and corresponding probability of metastasis. The model demonstrated excellent discriminative performance, with an AUC of 0.944 (95% CI: 0.906-0.981) (Figure 5A). Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were 89.3%, 90.7%, 97.6%, and 66.2%, respectively. The C-index was 0.944 and remained robust at 0.929 after 1000 bootstrap iterations, indicating strong accuracy and internal validity. Calibration analysis showed close alignment between predicted and observed outcomes (Figure 5B), with a low Brier score of 0.063, reflecting high calibration precision. DCA further confirmed the model’s clinical utility across a wide range of threshold probabilities (Figure 6), supporting its potential to inform individualized surgical decision-making. A Nomogram for Predicting the Risk of Lymph Node Metastasis in Early Breast Cancer (A) The Receiver Operating Characteristic Curve (ROC) of the Nomogram; (B) the Calibration Curve DCA Analysis of ALNM Prediction Model for Breast Cancer


Discussion
Breast cancer remains one of the most prevalent malignancies among women globally. 1 Despite advances in multimodal treatments—such as surgery, chemotherapy, radiotherapy, targeted therapy, and immunotherapy—accurate preoperative staging of axillary lymph nodes is crucial for guiding personalized treatment planning and optimizing clinical outcomes. 21 However, conventional assessment methods often lack sufficient specificity, sensitivity, or objectivity, and their invasiveness limits feasibility for widespread screening of ALNM. To overcome these challenges, this study integrated ultrasonographic features, pathological factors, and inflammatory biomarkers to identify independent predictors of ALNM. Based on these variables, a predictive nomogram was developed as a practical tool to enhance individualized clinical decision-making.
Unlike conventional predictive models, our approach incorporates immune-inflammatory biomarkers, which have gained increasing attention in oncology for their clinical relevance.12,22 These markers offer several advantages: they are non-invasive, low-cost, objective, and easily obtainable, making them practical tools for clinical application. Furthermore, tumor development and progression are closely associated with chronic inflammation and immune dysregulation within the tumor microenvironment.23,24 Inflammatory indices such as the NLR, PLR, and LMR reflect the dynamic interplay between tumor-promoting inflammation and anti-tumor immunity. Guo et al. 23 reported that elevated preoperative NLR and PLR correlate with poor prognosis in breast cancer. A high NLR typically results from increased neutrophils and decreased lymphocytes, indicating heightened proinflammatory activity and suppressed immune surveillance. This imbalance promotes tumor proliferation, angiogenesis, and metastasis, supporting the utility of NLR as a predictive marker for ALNM. In our multivariate analysis, NLR was identified as an independent risk factor for ALNM in early-stage breast cancer (OR = 6.04, P < 0.05). In contrast, PLR and LMR were not independently associated with ALNM, possibly due to sample variability, selection bias, or inconsistent threshold definitions.
The SII—derived from neutrophil, lymphocyte, and platelet counts—has been proposed as a comprehensive marker of systemic inflammation and immune function. 25 Although prior studies linked elevated SII with increased ALNM risk, 26 our multivariate analysis did not confirm SII as an independent predictor (P > 0.05). This discrepancy may stem from variations in study populations, cutoff criteria, or limited sample size. Further large-scale, multicenter research is warranted to validate the prognostic value of SII. Lymphocyte profiling has also emerged as a key factor in ALNM risk stratification. Previous studies 27 have reported a significant association between elevated lymphocyte levels and both axillary metastasis and locoregional recurrence. Notably, patients with three or more metastatic lymph nodes exhibited higher lymphocyte counts than those with one or two, suggesting that lymphocyte-mediated immune responses may influence the extent of nodal involvement.
Vascular invasion is a key step in the invasion–metastasis cascade of cancer progression. During tumor development, cancer cells secrete growth and angiogenic factors that stimulate fibrous tissue proliferation and upregulate VEGF, promoting neovascularization and lymphangiogenesis. These microenvironmental changes facilitate tumor expansion, invasion, and metastatic spread. 28 Our findings reinforce vascular invasion as a significant predictor of ALNM in early-stage breast cancer (OR = 6.24, P < 0.05), aligning with established models of metastatic progression.29,30 This highlights the importance of microvascular dynamics in predicting nodal involvement and justifies its inclusion in our model.
We also evaluated the utility of ultrasonographic features in predicting ALNM. As the primary imaging modality for assessing axillary lymph nodes and guiding interventions, ultrasound plays a central role in clinical management. Multivariate analysis identified significant associations between ALNM and tumor size (P < 0.001), lymph node echogenicity (P < 0.001), and tumor margin characteristics (P < 0.001). Tumors exceeding 14 mm in diameter were strongly associated with increased ALNM risk (OR = 17.63, 95% CI: 5.23-59.46), consistent with the progressive rise in metastatic potential as tumor volume increases. This may be due to the tumor’s enhanced ability to invade the basement membrane and infiltrate surrounding glandular tissues, increasing access to periductal lymphatics and facilitating lymphatic dissemination. Approximately 75% of breast lymph drains via the axillary pathway. Tumors in the outer quadrants, especially the upper outer quadrant, are more likely to metastasize to axillary nodes, resulting in higher ALNM rates than tumors in other locations. In contrast, inner quadrant tumors preferentially spread through the internal mammary chain. Accordingly, surgical resection and thorough assessment remain essential to achieve accurate staging and guide optimal therapeutic strategies. These anatomical differences in lymphatic drainage likely explain the variation in ALNM incidence by tumor location.30-32 In our study, most ALNM cases involved tumors in the upper outer quadrant (70.4%), followed by the lower outer quadrant (14.8%), a pattern consistent with previous anatomical and clinical reports.
The tumor margin, where cancer cells often accumulate, is a common site of invasive behavior. 33 Prior research has identified irregular margins as a risk factor for ALNM11,34; our results confirmed irregular margins as an independent predictor. This may reflect increased collagen deposition at the invasive front, where fibrosis correlates with aggressiveness. VEGF overexpression, stimulated by tumor-secreted growth factors, promotes angiogenesis and lymphangiogenesis, while MMP-9 contributes to basement membrane degradation, enabling local invasion. 35 Guo et al. 36 proposed that color Doppler flow imaging (CDFI) grading could serve as a predictive marker for ALNM. While our univariate analysis supported this, CDFI grade did not remain significant in multivariate analysis. This may reflect limitations of our retrospective, single-center study and the inherent subjectivity of CDFI interpretation, which could introduce bias.
Of the sonographic features analyzed, only abnormal lymph node echogenicity emerged as an independent predictor of ALNM (OR = 15.44, P < 0.001). Other primary tumor characteristics—including vascularity, echogenicity, hyperechoic halo, posterior acoustic shadowing, and calcification—were not independently associated with ALNM. This contrasts with prior studies linking some of these features, particularly hyperechoic halo and calcification, to metastatic potential.28,37 These inconsistencies may be due to variations in imaging techniques or population differences. Standardized, multicenter studies are needed to clarify the prognostic value of these sonographic features in ALNM.
Various nomogram models have been developed to predict ALNM in breast cancer, with the goal of guiding individualized treatment decisions.8,38,39 Among them, the Memorial Sloan Kettering Cancer Center (MSKCC) model is a well-established example, incorporating nine clinicopathological variables and achieving an AUC of 0.754. However, it does not include imaging features, particularly those derived from ultrasonography. As interest in noninvasive imaging increased, researchers began integrating preoperative imaging data into predictive models. Xiong et al. 11 developed a nomogram combining conventional ultrasound features with clinicopathological factors, yielding an AUC of 0.705. More recently, advances in artificial intelligence (AI) have enabled the creation of more accurate and personalized models. For example, Li et al. 40 applied deep learning to ultrasound images to predict ALNM, achieving an AUC of 0.72, with 72.6% accuracy, 65.5% sensitivity, and 78.9% specificity—highlighting the potential of AI-enhanced imaging in preoperative evaluation. Our model incorporates ultrasonographic characteristics, histopathological findings, and preoperative inflammatory markers—all routinely assessed in standard clinical practice—without adding cost or procedural complexity. It demonstrated excellent predictive performance, with an AUC of 0.944 (95% CI: 0.906-0.981) and a C-index of 0.944. Calibration curves indicated strong agreement between predicted and actual outcomes, supported by a low Brier score of 0.068. Moreover, DCA confirmed substantial clinical benefit across a broad range of threshold probabilities. Compared to existing models, our nomogram shows superior discriminative power by integrating multimodal clinical data. It offers a practical, accurate, and visually accessible tool for preoperative ALNM risk stratification in early-stage breast cancer, enhancing clinical decision-making and improving communication between physicians and patients.
This study has several limitations. First, its single-center, retrospective design and relatively small sample size may introduce selection bias. Validation through large-scale, multicenter prospective studies is warranted. Second, the analysis did not include other ALNM-related variables such as preoperative medical history, molecular subtypes, and additional inflammatory markers (eg, C-reactive protein-to-albumin ratio [CAR]), which may affect the model’s generalizability. Third, future research should investigate the molecular pathways linking tumor-associated inflammation and ALNM to support the development of more precise therapeutic strategies.
Conclusion
This study presents a predictive model incorporating vascular invasion, neutrophil-to-lymphocyte ratio, lymphocyte count, tumor size, lymph node echogenicity, and margin irregularity, which demonstrates high predictive accuracy and clinical utility in identifying patients at elevated risk for ALNM preoperatively, thereby guiding precision treatment and helping avoid overtreatment.
Footnotes
Ethical Approval
Ethical approval was granted by the Ethics Committee of Shenzhen Luohu People’s Hospital (Approval No. 2024-LHQRMYY-KYLL-044; Approval Date: July 21, 2024). A waiver of informed consent was approved due to the retrospective nature of the study.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Shenzhen Basic Research Project (No. JCYJ20190812171215641); Shenzhen Key Medical Discipline (SZXK054).
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
Original data are available upon reasonable request to the corresponding author.
