Abstract
OBJECTIVES:
This study explores a deep learning (DL) approach to predicting bone metastases in breast cancer (BC) patients by combining clinical information, such as the fat index, with features derived from computed tomography (CT) images.
METHODS:
CT imaging data and clinical information were collected from 431 BC patients who underwent radical surgical resection at Harbin Medical University Cancer Hospital. The areas of muscle and adipose tissue were measured on CT images at the level of the eleventh thoracic vertebra (T11). Histogram of oriented gradients (HOG) and local binary pattern (LBP) features were extracted from the CT images, and network features were derived from the LBP features, HOG features, and CT images through DL. The network features were combined with clinical information to predict bone metastases in BC patients using the gradient boosting decision tree (GBDT) algorithm. Regularized Cox regression models were employed to identify independent prognostic factors for bone metastasis.
RESULTS:
The combination of clinical information and network features extracted from LBP features, HOG features, and CT images using a convolutional neural network (CNN) yielded the best performance, achieving an AUC of 0.922 (95% confidence interval [CI]: 0.843–0.964,
CONCLUSION:
The subcutaneous fat index could predict bone metastasis in BC patients. The multimodal deep learning algorithm demonstrated superior performance in assessing bone metastases in BC patients.
Keywords
Introduction
Breast cancer (BC) is the most common cancer in women and the second leading cause of cancer death among women globally [1]. Bone is one of the most common metastatic sites in BC patients [2]. Bone metastasis is accompanied by osteoclast-mediated bone destruction, and affected patients have a 5-year overall survival rate of 22.8% [3, 4]. Early detection of bone metastasis is crucial for improving survival. However, the predictors of bone metastasis have not been sufficiently elucidated. Therefore, there is an urgent need to identify novel markers for bone metastasis.
Obesity is associated with an increased risk of death in the general population [5], but this contradicts some reports on the relationship between obesity and mortality in cancer patients [6, 7]. An important reason for this ‘obesity paradox’ may be that body mass index (BMI) cannot distinguish well between individual muscle and fat tissue distributions [8, 9]. In particular, a higher BMI can mask low muscle mass, and a lower BMI can mask excess adiposity. Subcutaneous adipose tissue (SAT) and visceral adipose tissue (VAT) are two different physical forms of white adipose tissue [10]. VAT is currently regarded as a metabolic and endocrine organ that can influence systemic immunological disorders and body weight homeostasis. SAT, a metabolic storehouse, is linked to visceral fat deposition [11]. In a number of tumor types, including breast cancer, colorectal cancer, hepatocellular carcinoma, and gastric cancer, elevated VAT has been associated with worse overall survival (OS) [12, 13, 14]. Similarly, the SAT index can be used to predict the outcomes of several cancers, such as head and neck, breast, and prostate cancer [15, 16, 17]. The visceral-to-subcutaneous fat area ratio (VSR) is an independent predictor of poor prognosis in type 1 endometrial cancer and gastric cancer [18, 19]. Recent studies have demonstrated that adipocyte-BC cell interactions are critical for the development of BC and its related bone metastases [20].
Radiomics involves extracting quantitative characteristics from digital images and converting the data into high-dimensional information. Its primary objective is to develop decision-support tools [21, 22]. Typical radiomics includes image segmentation within the region of interest (ROI), followed by the selection and extraction of features such as size, shape, and texture. Statistical methods or machine learning techniques are then utilized to derive the final clinical outcomes [23]. Deep learning (DL) is widely applied in medical image analysis due to its exceptional performance [24]. DL algorithms can extract features from medical images that are beyond human recognition capabilities, enabling automatic quantitative evaluations without introducing additional errors associated with manual feature extraction. However, many studies only employ a single set of medical images, lacking comprehensive clinical information, thus limiting the final predictive outcomes. Multimodal deep learning models offer the potential to encompass additional modalities beyond image data [25, 26, 27].
Local binary pattern (LBP) serves as an efficient texture description technique. By comparing the gray value of each pixel with its neighboring pixels, it characterizes the image’s texture features using a two-level system [28, 29, 30]. This method finds extensive use in face recognition, yielding favorable results [31, 32, 33]. Histograms of oriented gradient (HOG), an algorithm for extracting feature histograms from local pixel blocks [34], have enjoyed significant success in object detection, particularly within pedestrian detection scenarios [35, 36, 37]. LBP boasts advantages in rotation and grayscale invariance, effectively capturing image texture features, while HOG excels in capturing local shape information, maintaining strong invariance to geometric and optical variations. Consequently, convolutional neural networks (CNN) can extract a broader spectrum of features from LBP and HOG features, which diverge from the original computed tomography (CT) images.
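The pixel-versus-neighbour comparison that defines LBP can be sketched in a few lines. The following is a minimal, dependency-free illustration of the basic 3×3 LBP operator and is not the implementation used in this study; the function name `lbp_codes`, the clockwise neighbour ordering, and the `>=` comparison are assumptions made for illustration.

```python
from typing import List

# Offsets of the 8 neighbours, clockwise from the top-left pixel.
# The ordering is an illustrative choice; any fixed order gives a valid LBP code.
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
              (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_codes(image: List[List[int]]) -> List[List[int]]:
    """Return the basic 3x3 LBP code (0-255) for every interior pixel."""
    rows, cols = len(image), len(image[0])
    codes = []
    for r in range(1, rows - 1):
        row_codes = []
        for c in range(1, cols - 1):
            centre = image[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(NEIGHBOURS):
                # A neighbour at least as bright as the centre sets its bit to 1.
                if image[r + dr][c + dc] >= centre:
                    code |= 1 << bit
            row_codes.append(code)
        codes.append(row_codes)
    return codes


# Synthetic 3x3 patch of gray values (not patient data).
patch = [[10, 20, 30],
         [40, 50, 60],
         [70, 80, 90]]
codes = lbp_codes(patch)  # one interior pixel, so one code
```

Because each code depends only on whether neighbours are brighter or darker than the centre, adding a constant to every gray value leaves the codes unchanged, which is the grayscale invariance mentioned above.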
In this study, we have developed and validated a neural network-based DL algorithm to assess predictors for bone metastases in breast cancer patients. Additionally, regularized Cox regression models were employed to analyze independent prognostic factors for bone metastases.
Materials and methods
Patients and data sets
This study protocol was approved by the Ethics Committee of Harbin Medical University Cancer Hospital. As it was a retrospective study, the requirement for informed consent was waived. The study comprised 431 patients who underwent radical surgical resection at Harbin Medical University Cancer Hospital between January 1, 2015, and December 31, 2016. The inclusion criteria for this study were as follows: (1) Patients did not receive neoadjuvant chemotherapy or other treatments before surgery. (2) Patients were at least 18 years old. (3) Pathological findings were histologically confirmed. (4) Patients had complete clinical and follow-up data. (5) Patients had no history of other malignancies and no metastatic disease at the time of diagnosis. The patients were assigned to the training cohort, validation cohort, and test cohort.
All clinical information, including age, menopausal status, histology type, and body composition, was retrieved from the records of breast cancer (BC) patients. Distant metastasis-free survival (DMFS) was computed from the surgery date to the occurrence of distant metastasis or the last follow-up date, with assessments conducted every 3 months during the first 2 years after surgery and every 6 months during years 3–5. Distant metastasis outcomes were derived from patient imaging conducted during the follow-up period. Body mass index (BMI, kg/m²) was calculated by dividing weight (kg) by height squared (m²). The erector spinae area, visceral adipose tissue area, and subcutaneous adipose tissue area were obtained with ImageJ software version 1.53a (Wayne Rasband, National Institutes of Health, USA). Different tissues were differentiated based on CT Hounsfield Units (HU). HU was set from
CT image of T11 level, paraspinal muscles, subcutaneous fat and visceral fat. Figure (a) is the CT image of the T11 level. The yellow area in Figure (b) is the visceral fat in the T11 level CT image. The yellow area in (c) is the subcutaneous fat in the T11 level CT image. The yellow area in (d) is the paraspinal muscles in the T11 level CT image.
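As a worked example of the body-composition quantities used here, the sketch below computes BMI as weight divided by height squared, together with a height-normalised tissue area index. Normalising a tissue area (cm²) by height squared (m²) is an assumption inferred from the area-per-height-squared unit reported for the SFI cutoff in the Discussion; the function names and the input values are hypothetical.

```python
def bmi(weight_kg: float, height_m: float) -> float:
    """Body mass index: weight (kg) divided by height squared (m^2)."""
    return weight_kg / height_m ** 2


def area_index(area_cm2: float, height_m: float) -> float:
    """Tissue area (cm^2) normalised by height squared (m^2).

    Assumed definition of indices such as SFI and VFI; the unit is cm^2/m^2.
    """
    return area_cm2 / height_m ** 2


# Hypothetical patient, for illustration only.
weight_kg, height_m, sfa_cm2 = 64.0, 1.60, 76.8
patient_bmi = bmi(weight_kg, height_m)        # approximately 25.0 kg/m^2
patient_sfi = area_index(sfa_cm2, height_m)   # approximately 30.0 cm^2/m^2
```

Two patients with the same BMI can have very different area indices, which is why the regional measurements carry information that BMI alone does not.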
The choice of CNN used as a feature encoder has a notable influence on classification outcomes. To identify a suitable model for the prediction of bone metastases, we assessed the performance of ResNet34, ResNet50, and ResNet101.
Feature Extraction with HOG and LBP
LBP feature and HOG feature. Figure (a) is the LBP feature extracted from the T11 layer CT image. Figure (b) is the HOG feature extracted from the T11 layer CT image.
The HOG and LBP features were respectively extracted from the CT images, as illustrated in Fig. 2. For HOG, features were calculated using a unit size of 16
The overall pipeline of the model. The fat area mask, LBP feature, HOG feature, and CT image are combined according to the channel dimension to form the input. ResNet combines the attention mechanism guided by the fat mask to form an encoder model to extract features from image data. Lasso-Cox screens clinical information data for features, and then combines them with extracted network features to form multimodal data. GBDT is a classifier for multimodal data.
The study’s workflow is depicted in Fig. 3. To improve model robustness and mitigate overfitting, we applied horizontal and vertical flipping, alongside standardized data augmentation techniques [38]. To focus the model on the subcutaneous fat area, we incorporated a mask-guided attention mechanism that amplifies the response within the mask area. The mask-guided attention process is detailed in the appendix. All images were standardized to a size of 500 × 400 pixels, ensuring uniform distance scaling. Preprocessing was performed with the Torchvision toolkit (version 0.11.3) in Python (version 3.8.12). PyTorch 1.10.2 was used as the backend for all model training.
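The flipping augmentation can be illustrated without any imaging library. This is a dependency-free sketch and not the Torchvision pipeline used in the study; the helper names, the flip probability `p = 0.5`, and the fixed seed are assumptions.

```python
import random
from typing import List

Image = List[List[float]]


def hflip(image: Image) -> Image:
    """Mirror the image left to right."""
    return [row[::-1] for row in image]


def vflip(image: Image) -> Image:
    """Mirror the image top to bottom."""
    return [row[:] for row in image[::-1]]


def random_flip(image: Image, rng: random.Random, p: float = 0.5) -> Image:
    """Apply each flip independently with probability p (p is an assumed value)."""
    if rng.random() < p:
        image = hflip(image)
    if rng.random() < p:
        image = vflip(image)
    return image


# Synthetic 2x3 image and a fixed seed for reproducibility.
img = [[1.0, 2.0, 3.0],
       [4.0, 5.0, 6.0]]
augmented = random_flip(img, random.Random(0))
```

When several channels and a fat mask describe the same slice, the same flip would need to be applied to all of them so that the mask stays aligned with the image.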
The 431 patients were divided into three approximately equal groups for three-fold cross-validation, with two groups used as the training cohort and the remaining group as the test cohort. Then, 20% of the training cohort was randomly selected as the validation cohort. The training cohort was used to update the CNN model weights, while the validation cohort guided the selection of model hyperparameters. The network weights were initialized from a model pre-trained on ImageNet. ResNet101 was chosen as the base model for feature extraction.
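The splitting scheme described above can be sketched as follows. This is one possible reading of the procedure, assuming a simple random (non-stratified) shuffle of patient indices; the function name, the seed, and the rounding of the 20% hold-out are illustrative assumptions.

```python
import random
from typing import Dict, List


def three_fold_splits(n_patients: int, val_fraction: float = 0.2,
                      seed: int = 0) -> List[Dict[str, List[int]]]:
    """Build 3 cross-validation splits of patient indices.

    Each fold serves once as the test cohort; a fraction of the remaining
    patients is held out as the validation cohort.
    """
    rng = random.Random(seed)
    indices = list(range(n_patients))
    rng.shuffle(indices)
    folds = [indices[i::3] for i in range(3)]  # three near-equal groups

    splits = []
    for k in range(3):
        test = folds[k]
        rest = [i for j in range(3) if j != k for i in folds[j]]
        rng.shuffle(rest)
        n_val = round(val_fraction * len(rest))
        splits.append({"train": rest[n_val:], "val": rest[:n_val], "test": test})
    return splits


splits = three_fold_splits(431)
```

With 431 patients the three test folds cannot be exactly equal in size; this sketch produces folds of 144, 144, and 143 patients.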
Before training the CNN, each patient’s image was assigned a label of either 0 or 1 according to the presence or absence of bone metastases. During training, the augmented images were fed into the CNN, and its parameters were updated through backpropagation. The model used CrossEntropyLoss as its loss function, and the Adam optimizer updated the parameters with a batch size of 64, a learning rate of 1e-6, and 200 iterations. Further details of the training outcomes can be found in the appendix.
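To make the loss concrete, the sketch below evaluates the cross-entropy for a two-class output in plain Python. It reproduces the mathematical definition rather than the PyTorch training code of this study; the logits are synthetic, and treating label 1 as "bone metastasis" is an assumption, since the mapping of labels is not stated.

```python
import math
from typing import List


def cross_entropy(logits: List[float], label: int) -> float:
    """Cross-entropy of one sample: -log(softmax(logits)[label])."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def batch_loss(batch_logits: List[List[float]], labels: List[int]) -> float:
    """Mean cross-entropy over a batch (mean is PyTorch's default reduction)."""
    losses = [cross_entropy(z, y) for z, y in zip(batch_logits, labels)]
    return sum(losses) / len(losses)


# Synthetic logits for two patients; label 1 is assumed to mean "bone metastasis".
logits = [[2.0, 0.0], [0.0, 0.0]]
labels = [0, 1]
loss = batch_loss(logits, labels)
```

A confident correct prediction (first sample) contributes a small loss, whereas an undecided output (second sample) contributes log 2, so minimising the mean pushes the network toward confident, correct outputs.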
Deep learning can extract high-throughput features via supervised learning, exploiting the information embedded within images. The original CT image and the HOG and LBP features were concatenated along the channel dimension and fed into ResNet101. The network features were taken from the ResNet101 linear layer and then merged with the clinical data by horizontal concatenation. These fused features were subsequently used to train a gradient boosting decision tree (GBDT) model.
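The two fusion steps, channel-wise stacking of the image inputs and horizontal concatenation of the feature vectors, can be sketched as below. The arrays are tiny synthetic stand-ins, the helper names are hypothetical, and the three clinical values are invented for illustration; the count of 16 network features follows the X1–X16 labelling used for the correlation heat map.

```python
from typing import List

Image = List[List[float]]


def stack_channels(*channels: Image) -> List[Image]:
    """Stack same-sized 2D arrays along a leading channel dimension."""
    shape = (len(channels[0]), len(channels[0][0]))
    for ch in channels:
        if (len(ch), len(ch[0])) != shape:
            raise ValueError("all channels must share the same height and width")
    return list(channels)


def fuse(network_features: List[float], clinical_features: List[float]) -> List[float]:
    """Horizontal concatenation of the two modalities into one feature row."""
    return list(network_features) + list(clinical_features)


# Tiny synthetic 2x2 inputs standing in for the CT image, HOG map, LBP map and fat mask.
ct = [[0.1, 0.2], [0.3, 0.4]]
hog = [[0.0, 1.0], [1.0, 0.0]]
lbp = [[0.5, 0.5], [0.5, 0.5]]
mask = [[0.0, 1.0], [0.0, 1.0]]
cnn_input = stack_channels(ct, hog, lbp, mask)   # shape: 4 x 2 x 2

network_features = [0.0] * 16          # placeholder for the 16 CNN features
clinical_features = [3.2, 1.0, 27.7]   # hypothetical clinical values
row = fuse(network_features, clinical_features)
```

One such row per patient forms the table on which a tree-based classifier can be trained; scikit-learn's `GradientBoostingClassifier` would be one possible GBDT implementation, although the library used in this study is not stated.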
Characteristics of study population by bone metastasis
Qualitative variables are represented as
The data were presented as percentages or mean
Results
Clinical information
The baseline characteristics, stratified by the presence of bone metastasis, are presented in Table 1. Patients with bone metastases had higher levels of carcinoembryonic antigen (CEA) and larger tumor sizes. Significant differences were also observed in lymphatic vascular invasion (LVI), molecular subtype, radiotherapy, and histologic type. Among body composition parameters, significant differences were identified in subcutaneous fat tissue area (SFA) and subcutaneous fat index (SFI), as well as in visceral fat tissue area (VFA) and visceral fat index (VFI).
The predictors of bone metastases in patients with breast cancer
95% confidence intervals included in brackets. – indicates coefficient
The results of the regularized Cox analysis are presented in Table 2. The Lasso regression algorithm employs the L1 norm for shrinkage penalties and retains LVI, Ki67 expression, histologic type, CEA, BMI, SFA, and SFI for the multivariate Cox regression analyses. The findings revealed that CEA (HR: 1.088, 95% CI: 1.049–1.129,
The performance of distinct deep learning models varies when applied to different datasets. Notably, ResNet101 emerged as the most effective model for predicting bone metastases, as evidenced by its superior performance in the three-fold cross-validation, as depicted in Table 3. In the test cohort, ResNet34 and ResNet50 yielded AUCs of 0.599 (95% CI: 0.476, 0.715,
The performance comparison of different models
95% confidence intervals included in brackets. AUC area under the receiver operating characteristic curve, ACC accuracy, SENS sensitivity, SPEC specificity. T training cohort, V validation cohort, I-T independent test cohort.
The prediction of bone metastasis result
95% confidence intervals included in brackets. AUC area under the receiver operating characteristic curve, ACC accuracy, SENS sensitivity, SPEC specificity. GBDT gradient boosting decision tree, SVM support vector machine, RF random forest. T training cohort, V validation cohort, I-T independent test cohort.
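For readers less familiar with the reported metrics, the sketch below shows how AUC, ACC, SENS, and SPEC can be computed from labels and predicted probabilities. The data are synthetic and the 0.5 decision threshold is an assumption; the thresholds used for the tables are not stated in the text.

```python
from typing import List, Tuple


def auc(labels: List[int], scores: List[float]) -> float:
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def acc_sens_spec(labels: List[int], scores: List[float],
                  threshold: float = 0.5) -> Tuple[float, float, float]:
    """Accuracy, sensitivity and specificity at a decision threshold."""
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    return (tp + tn) / len(labels), tp / (tp + fn), tn / (tn + fp)


# Synthetic labels and predicted probabilities (not study data).
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_score = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.2, 0.1]
area = auc(y_true, y_score)
accuracy, sensitivity, specificity = acc_sens_spec(y_true, y_score)
```

AUC is threshold-free, whereas ACC, SENS, and SPEC all depend on the chosen cut-point, so the latter three are only comparable between models when the thresholding rule is the same.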
We explored various machine learning models to integrate clinical information with network features extracted by CNN. The results are presented in Table 4. Notably, the optimal performance was achieved when employing the GBDT model to amalgamate different feature predictions. Combining clinical information with features extracted by ResNet101 (utilizing feature extraction from T11 level CT images, HOG, LBP, and mask), yielded the most favorable predictive outcomes, with an AUC of 0.922 (95% CI: 0.843, 0.964,
Comparison of receiver operating characteristic (ROC) curves between different models for predicting bone metastases. Figure (a) shows the ROC curves on the test set for different ResNet models using CT images only. Figures (b-c) show the ROC curves of different machine learning models under different data conditions.
Correlation coefficient heat map between different modal features. The features from X1 to X16 represent the CNN network features extracted from the combination of CT, HOG, LBP, and Mask, while the remaining features encompass clinically informative attributes.
We calculated the Pearson correlation coefficients between features of different modalities (clinical information and network features) and visualized them as a heat map, as shown in Fig. 5. In addition to the strong linear relationships among the network features (X1–X16), the body composition parameters (BMI-VFI) also show a degree of linear relationship with the network features, which suggests that the CNN extracted features similar to body composition. The network features are less correlated with the clinical features other than body composition parameters, so feature fusion can play a complementary role.
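The quantity shown in each cell of such a heat map is the pairwise Pearson coefficient. A minimal sketch with synthetic columns follows; the column names and values are hypothetical and chosen so that the result can be checked by hand.

```python
import math
from typing import List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def correlation_matrix(columns: List[List[float]]) -> List[List[float]]:
    """Pairwise Pearson coefficients; this matrix is what a heat map displays."""
    return [[pearson(a, b) for b in columns] for a in columns]


# Synthetic columns: one network feature and two clinical variables.
x1 = [1.0, 2.0, 3.0, 4.0]
sfi = [2.0, 4.0, 6.0, 8.0]     # perfectly linear in x1
age = [4.0, 3.0, 2.0, 1.0]     # perfectly anti-linear in x1
matrix = correlation_matrix([x1, sfi, age])
```

Pearson correlation captures only linear dependence, so a low coefficient between a network feature and a clinical variable does not rule out a nonlinear relationship.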
Gradient-weighted class activation mapping (Grad-CAM) was used to highlight the regions of an input CT image that influenced the predictions made by the ResNet model [39]. Figure 6 illustrates the responses of several convolutional layers within the deep learning architecture to CT images from three different patients. The results indicate that subcutaneous fat in the CT images plays a pivotal role in the model’s learning process, which is consistent with the results of the Lasso-Cox regression analysis.
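The core Grad-CAM computation is compact enough to show directly: each feature map is weighted by the spatial average of its gradient, the weighted maps are summed, and negative values are clipped. The sketch below applies this to hand-made 2×2 feature maps; it is an illustration of the general method with synthetic numbers, not the code that produced Figure 6.

```python
from typing import List

Map2D = List[List[float]]


def grad_cam(activations: List[Map2D], gradients: List[Map2D]) -> Map2D:
    """Grad-CAM heat map from one layer's feature maps and their gradients.

    activations[k] is the k-th feature map; gradients[k] is the gradient of
    the class score with respect to that map (same shape).
    """
    rows, cols = len(activations[0]), len(activations[0][0])
    # Channel weight = global average of the gradient over spatial positions.
    weights = [sum(sum(r) for r in g) / (rows * cols) for g in gradients]
    cam = [[0.0] * cols for _ in range(rows)]
    for w, a in zip(weights, activations):
        for i in range(rows):
            for j in range(cols):
                cam[i][j] += w * a[i][j]
    # ReLU keeps only regions that push the class score up.
    return [[max(0.0, v) for v in row] for row in cam]


# Two synthetic 2x2 feature maps with hand-picked gradients.
acts = [[[1.0, 0.0], [0.0, 2.0]],
        [[0.0, 3.0], [1.0, 0.0]]]
grads = [[[1.0, 1.0], [1.0, 1.0]],        # mean gradient  1.0
         [[-2.0, 0.0], [0.0, -2.0]]]      # mean gradient -1.0
heat = grad_cam(acts, grads)
```

In practice the resulting map is usually upsampled to the input resolution and normalised before being overlaid on the image as a colour heat map.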
Discussion
In this study, we identified SFI as an independent prognostic factor for bone metastases in breast cancer (BC) patients. We also developed a multimodal prediction model for bone metastases in BC patients that combines deep learning with GBDT. The multimodal model showed promising predictive performance on the test cohort, with an AUC of 0.922, a sensitivity of 82.7%, and a specificity of 83.2%. These results support the viability of a multimodal approach to predicting bone metastases in BC patients. Such a model could improve diagnostic proficiency, particularly among less-experienced physicians. To our knowledge, this study is the first to use a multimodal model for predicting bone metastases in BC. Table 5 compares our results with other studies on predicting breast cancer metastasis. The AUC and ACC of our model exceed those of alternative models built on other datasets [40, 41, 42, 43, 44, 45, 46].
Comparison of results from related work
ML: machine learning.
Responses of several convolutional layers in ResNet to CT images of different patients. Grad-CAM results of three CT images. Red areas represent a greater impact on the prediction, blue areas the opposite.
Low SFI is independently associated with increased mortality and poorer survival in cancer patients, as indicated by prior studies [47]. Dong et al. similarly discovered that reduced subcutaneous fat was linked to a more adverse prognosis for gastric cancer, encompassing overall survival and disease-free survival [48]. Black et al. also reported that diminished subcutaneous fat has been correlated with reduced survival in operable colorectal cancer [49]. Furthermore, a recent study substantiated that lower SFI independently predicted poor overall survival in hepatocellular carcinoma [50]. Another study proposed that patients with bone metastases exhibiting elevated SFI and VFI demonstrated superior overall survival [51]. However, Bradshaw et al. observed that, among women with non-metastatic breast cancer, increased SAT was associated with shorter survival [16]. These divergent findings might stem from variances in study populations, inclusion criteria, and biomarkers of SAT. Consequently, further research is warranted to elucidate the role of SAT in breast cancer.
To date, the mechanisms underlying the predictive effect of SAT on bone metastases have not been clearly identified. SAT is the primary source of adiponectin and leptin, which are pivotal players in the regulation of bone health and bone metastasis in BC [52]. Adiponectin exhibits pro-apoptotic and anti-proliferative properties in human BC cells [53, 54]. Research has indicated that adiponectin hampers the metastatic process by suppressing the adhesion, invasion, and migration of BC cells through activation of the AMPK/S6K axis and upregulation of liver kinase B1 (LKB1) [53]. In sum, compelling evidence suggests that lower levels of circulating adiponectin predict heightened BC risk and a less favorable prognosis [55]. Moreover, the mass of adipose tissue corresponds directly to leptin synthesis and plasma levels, enhancing lipid metabolism and insulin sensitivity [56]. Leptin, in turn, regulates bone health by modulating bone density, growth, and adiposity. Investigations have found lower serum leptin levels in premenopausal BC patients than in healthy counterparts [57]. Furthermore, leptin has been correlated with improved prognosis among patients with colorectal cancer [58]. Hence, adipokines originating from adipose tissue, including leptin, appear to influence distinct phases of the bone metastatic cascade. Nonetheless, further research is warranted to validate the mechanisms through which SAT potentially protects against bone metastasis in BC.
Within our cohort, we also demonstrated a notable association of CEA, molecular subtype, and LVI with bone metastasis in multivariate analysis. In a retrospective study by Chen et al., CEA was not observed to independently predict bone metastasis in BC; however, a significant difference in the distribution of bone metastasis in breast cancer was noted [59]. Further research and validation are essential to determine the significance of CEA in predicting the likelihood of bone metastasis in breast cancer. A SEER population-based study showed that histologic type was an independent factor for bone metastases in breast cancer [60]. The study carried out by Nishimura et al. demonstrated a strong correlation between Ki-67 expression and bone metastasis in breast cancer [61]. A separate retrospective study indicated that ER
Meanwhile, no association between BMI and bone metastasis was observed in our study. The relationship between BMI and postoperative outcomes in cancer patients has been a subject of controversy. A previous report unveiled the inconsistency of the association between BMI and cancer survival across diverse cancer types and stages [56]. Disparities in the prognostic implications of BMI may, in part, arise due to variations in body composition among individuals with comparable BMI values (e.g., more fat or muscle in one patient than in another) [64]. In essence, BMI falls short in discerning between fat and fat-free mass or distinguishing diverse fat deposits for an accurate assessment of body composition [65]. In this study, we opted for regional measurements of obesity, such as SFI and VFI, which mitigate the risk of misestimation and enhance the coherence of our findings.
Kaplan-Meier analysis of bone metastases stratified by SFI. Patients were grouped with an SFI of 27.7 as the cutoff value. The difference in the probability of bone metastasis between the two groups was compared.
Our results indicate that SFI has substantial potential for risk stratification of breast cancer bone metastasis. We identified a cutoff of 27.7 cm²/m², which yielded a sensitivity of 0.727 (95% confidence interval: 0.694–0.751) and a specificity of 0.743 (95% confidence interval: 0.701–0.785). The Kaplan-Meier curve for bone metastasis stratified by this threshold is shown in Fig. 7. This finding has clear clinical implications, as it may help clinicians make more precise diagnostic and therapeutic decisions. For predicting breast cancer bone metastasis, we propose measuring an individual’s SFI as a risk assessment marker, which can then be stratified to suit specific diagnostic requirements. A further avenue to explore is incorporating SFI as an additional input feature when training the artificial intelligence model, as a form of “feature engineering,” so that the model can exploit SFI-derived information to improve prediction accuracy.
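A stratified Kaplan-Meier analysis of the kind shown in Fig. 7 can be sketched with the product-limit estimator. The example below uses six invented patients; the function names are hypothetical, and assigning values exactly equal to the cutoff to the high-SFI group is an assumption.

```python
from typing import Dict, List, Tuple

Curve = List[Tuple[float, float]]


def kaplan_meier(times: List[float], events: List[int]) -> Curve:
    """Product-limit estimate: (time, probability of remaining event-free).

    events[i] is 1 if bone metastasis was observed at times[i],
    0 if the patient was censored at the last follow-up.
    """
    curve = []
    survival = 1.0
    for t in sorted(set(times)):
        at_risk = sum(1 for u in times if u >= t)
        observed = sum(1 for u, e in zip(times, events) if u == t and e == 1)
        if observed:
            survival *= 1.0 - observed / at_risk
            curve.append((t, survival))
    return curve


def km_by_cutoff(values: List[float], times: List[float],
                 events: List[int], cutoff: float) -> Dict[str, Curve]:
    """Separate curves for patients below and at-or-above the cutoff."""
    low = [(t, e) for v, t, e in zip(values, times, events) if v < cutoff]
    high = [(t, e) for v, t, e in zip(values, times, events) if v >= cutoff]
    return {
        "low": kaplan_meier([t for t, _ in low], [e for _, e in low]),
        "high": kaplan_meier([t for t, _ in high], [e for _, e in high]),
    }


# Invented follow-up data (months) for six patients; not study data.
sfi = [20.1, 24.5, 26.0, 30.2, 33.8, 41.0]
months = [10.0, 22.0, 40.0, 35.0, 60.0, 60.0]
event = [1, 1, 0, 1, 0, 0]
curves = km_by_cutoff(sfi, months, event, cutoff=27.7)
```

Censored patients leave the risk set without lowering the curve, which is what distinguishes this estimate from a simple proportion of patients who developed metastasis.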
Studies have demonstrated that CNN can effectively employ multi-view bone scan images to automatically diagnose bone metastases [66, 67]. For the prediction of lymph node metastasis, CNN has been utilized to extract features from ultrasound images and SWE images [68, 69]. In our study, we showcase the efficacy of a deep learning model based on CNN in conjunction with the GBDT method for the accurate prediction of bone metastases. This is achieved by integrating CT image features with the clinical information of BC patients. Additionally, we establish that the inclusion of HOG and LBP features alongside CT image features leads to further enhancement of predictive performance. In contrast to traditional qualitative reasoning, quantitative evaluation of imaging data yields more precise predictive outcomes.
Deep neural networks are often referred to as “black box” models because it is difficult to discern which input components drive the predicted labels. Our goal was to reveal the regions of the CT images on which the CNN focuses when predicting bone metastasis. To achieve this, we employed the Grad-CAM method, which visualizes the CNN’s attention as a heat map.
Our study has several limitations. First, the data come from a single center, so data from multiple centers are needed to validate the model comprehensively. Second, the retrospective nature of the study introduces a degree of selection bias. Finally, the study exclusively comprises Chinese participants, which limits the extrapolation of the results to other ethnic groups.
In summary, deep learning is a valuable technique for extracting pertinent information from CT images to predict bone metastases. Combining the extracted features with clinical data in a GBDT model further improves prediction accuracy. After validation and refinement in a larger and more diverse population, our multimodal model has the potential to become a useful auxiliary diagnostic tool in clinical practice.
Authors’ contributions
S.D.M. conceived the experiment; W.J.H. and W.J.Z. collected the data and performed data cleaning; H.B.J. and K.C. conducted the experiments and analyzed the results; S.D.M., W.J.H. and H.B.J. wrote the manuscript; W.J.H. and R.T.W. reviewed the manuscript.
Funding
This study was funded by the National Cancer Center Climb Plan (NCC201908B09) and the Heilongjiang Provincial Postdoctoral Funding Project (LBH-Z15100).
Guarantor
The scientific guarantor of this publication is Shidi Miao.
Informed consent
Written informed consent was not required because this was a retrospective study; informed consent from all participants was waived.
Ethical approval
Harbin Medical University Cancer Hospital Institutional Ethics Review Board approved this retrospective study.
Code availability
We provide the Python source code of DLR model training, which is freely available at
Supplementary data
The supplementary files are available to download from http://dx.doi.org/10.3233/CBM-230219.
Footnotes
Acknowledgments
Thanks to Harbin University of Science and Technology for providing the equipment needed for the experiment, and thanks to Harbin Cancer Hospital for providing the relevant data needed for the experiment.
Conflict of interest
The authors of this manuscript declare no relationships with any companies whose products or services may be related to the subject matter of the article.
