Abstract

Seroma is a common complication after mastectomy. To the best of our knowledge, no prediction models have been developed for it. Therefore, the medical records of total mastectomy patients were retrospectively reviewed. Data from 120 subjects were divided into a training-validation data set (96 subjects) and a testing data set (24 subjects). A 9-layer artificial neural network (ANN) was trained on the data, and the model was validated using 10-fold cross-validation. Model performance was assessed with a confusion matrix in the validating data set. The receiver operating characteristic curve was constructed, and the area under the curve (AUC) was calculated. Pathology type, presence of hypertension, presence of diabetes, receipt of neoadjuvant chemotherapy, body mass index, and axillary lymph node (LN) management (i.e., sentinel LN biopsy or axillary LN dissection) were selected as predictive factors in the model developed from the neural network algorithm. The model yielded an AUC of 0.760, which corresponds to an acceptable level of discrimination. Sensitivity, specificity, accuracy, and positive and negative predictive values were 100%, 52.9%, 66.7%, 46.7%, and 100%, respectively. Our model, which was developed from the ANN algorithm, can predict seroma after total mastectomy with high sensitivity. Nevertheless, external validation is still needed to confirm the performance of this model.

Keywords

Seroma, mastectomy, breast cancer, artificial neural network

1. Introduction

Breast cancer is a significant cause of illness in Thailand and around the world. In the United States, breast cancer is the most common cancer in females and the second leading cause of cancer death after lung cancer [1]. According to 2019 records, 271,270 people in the United States were diagnosed with breast cancer, and 42,260 died of it [2]. In Thailand, breast cancer is the most common cancer in females [3]. The age-standardized incidence rate was approximately 27.9 per 100,000 population per year in 2012 and is expected to reach 30.3 per 100,000 population per year in 2025 [4].

Treatment for breast cancer mainly consists of surgery, chemotherapy, hormone therapy, and radiotherapy [1]. Among surgical techniques, mastectomy with or without axillary lymph node dissection (ALD) is still frequently performed, especially for locally advanced tumors. Complications after mastectomy include seroma, bleeding, hematoma, surgical site infection, lymphedema, and skin flap necrosis [5–9].

Fig. 1.

Example of artificial neural network structure.

Seroma is a common complication after breast surgery. Its reported incidence ranges from 15% to 81% [9–11]. The occurrence of seroma affects further treatment. For instance, chemotherapy or radiotherapy could be delayed. Seroma can also delay wound healing, increase the infection rate, and increase the financial burden [11,12]. Many factors have been identified as potential risks for seroma. These risk factors are divided into three groups: patient factors, disease factors, and operative factors.

Among patient factors, seroma is associated with aging [11–13], hypertension [9,14], obesity [12,15], and smoking [16]. The presence of malignant axillary lymph nodes, the number of malignant nodes [14,17,18], and receipt of neoadjuvant chemotherapy [19] have been identified as the critical disease-related factors. Risk factors related to surgery include axillary lymph node dissection [11,12,15], dissection with conventional electrocautery [11,20], prolonged operative time [15], and extensive chest wall dissection or increased dead space from surgery [11,14].

Several studies have examined seroma prevention [11,14,15], and many have investigated the use of quilting stitches [15,21], fibrin glue [22,23], or sclerotherapy [11,15] to obliterate dead space. Moreover, some studies claimed that dissection with ultrasonic scissors can prevent seroma [15,24,25]. However, routine use of these strategies increases operative time and cost.

To the best of our knowledge, no prediction models are available for screening patients who would benefit from additional procedures targeting seroma prevention. Hence, we used our data to create a model that predicts the occurrence of seroma using the artificial neural network (ANN) machine learning algorithm.

Table 1

Demographic data and baseline characteristics

Characteristics	Seroma	Non-seroma	p-value
	(N = 33)	(N = 87)
Age (years),
Mean ± SD	63.27 ± 12.03	59.11 ± 11.67	0.087
BMI (kg/m²),
Mean ± SD	26.55 ± 4.79	24.84 ± 4.60	0.075
Hypertension, N (%)	24 (72.3)	41 (47.1)	0.012
Diabetes, N (%)	10 (30.3)	17 (19.5)	0.207
Pathology, N (%)			0.198
IDC	30 (90.9)	82 (94.3)
ILC	0 (0.0)	2 (2.3)
DCIS	3 (9.1)	1 (1.2)
Others	0 (0.0)	2 (2.3)
Pathological grade
Poor differentiation, N (%)	2 (6.1)	8 (9.2)	0.725
Lymphovascular invasion
Present, N (%)	7 (21.2)	15 (17.2)	0.616^†
Tumor stage, N (%)			0.302
CIS	2 (6.1)	2 (2.3)
I	9 (27.3)	23 (26.4)
II	13 (39.4)	48 (55.2)
III	7 (21.2)	9 (10.3)
IV	2 (6.1)	5 (5.8)
Node stage, N (%)			0.598
0	14 (42.4)	48 (55.2)
I	12 (36.4)	26 (29.9)
II	5 (15.2)	9 (10.3)
III	2 (6.1)	4 (4.6)
Neoadjuvant chemotherapy
Received, N (%)	8 (24.2)	10 (11.5)	0.081^†
Axillary operation, N (%)			0.325^†
SLB	13 (39.4)	43 (49.4)
ALD	20 (60.6)	44 (50.6)
Number of LNs removed,
Median (IQR)	11 (6, 18)	8 (3, 15)	0.059^∗

^†Chi-square test, ^∗Mann–Whitney test. ALD axillary lymph node dissection, BMI body mass index, CIS carcinoma in situ, DCIS ductal carcinoma in situ, IDC invasive ductal carcinoma, ILC invasive lobular carcinoma, LN lymph node, SLB sentinel lymph node biopsy.

2. Methods

Data on mastectomy patients from December 2016 to December 2019 were collected retrospectively. Variables, including age, body mass index (BMI), hypertension and diabetes status, pathology and pathological grade, lymphovascular invasion status, tumor and node stage, neoadjuvant chemotherapy status, axillary lymph node (LN) treatment, and the number of removed LNs, were gathered. Seroma was defined as fluid collection in the mastectomy area after drain removal.

Fig. 2.

Receiver operating characteristic curve of the artificial neural network model.

Variables were described as mean and standard deviation (SD) or frequency and percentage. These variables were compared between seroma and non-seroma cases using an independent t-test (or Mann–Whitney test) or Fisher exact test (or chi-square test). All analyses were performed using Stata version 16.1.

2.1. Model development and validation

Data from 120 mastectomy patients were split into two groups of 96 (80%) and 24 (20%) patients. The model was trained and tested in the first group (96 patients), and validated in the second group (24 patients). Because the model is intended for prevention, we selected only pre- and intra-operative factors for the seroma prediction model. Surgeons using the model can therefore add special procedures during the operation to avoid seroma.

The ANN used here is a Multilayer Perceptron, one type of supervised machine learning algorithm. It comprises an input layer, hidden layers, and an output layer (see Fig. 1). Each layer usually contains multiple units, called “nodes”. The number of nodes and layers differs among algorithms and tasks. The data enter the input layer and are then passed to the hidden layers. These layers capture the important features of the data and pass them to the output layer. The output is then compared with the ground truth. Using the back-propagation algorithm, the network adjusts the weights of the model’s parameters. The goal is to reduce the error relative to the ground truth without overfitting the training data.

The model was trained using a 9-layer neural network. The activation function was the hyperbolic tangent, with the limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) optimization algorithm, α = 10⁻⁵ for regularization, and a differential learning rate. Validation was then performed using 10-fold cross-validation.
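The configuration described above could be set up in Scikit-learn roughly as follows. This is a minimal sketch under stated assumptions, not the study's actual code: the data are synthetic stand-ins for the six predictors, the hidden-layer widths (seven hidden layers of eight nodes) are hypothetical because the paper does not report them, and feature standardization is an added convenience. No learning-rate setting appears because Scikit-learn's L-BFGS solver does not use one.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold, cross_val_score, train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-in for six predictors in 120 patients; NOT the study data.
X = rng.normal(size=(120, 6))
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=120) > 0.6).astype(int)

# 96 patients for model development, 24 held out for performance evaluation.
X_dev, X_hold, y_dev, y_hold = train_test_split(
    X, y, test_size=24, stratify=y, random_state=0
)

# tanh activation, L-BFGS solver, and alpha = 1e-5 follow the text;
# the layer widths below are an assumption made for illustration only.
model = make_pipeline(
    StandardScaler(),
    MLPClassifier(
        hidden_layer_sizes=(8,) * 7,
        activation="tanh",
        solver="lbfgs",
        alpha=1e-5,
        max_iter=500,
        random_state=0,
    ),
)

# 10-fold cross-validation on the 96-patient development set.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
scores = cross_val_score(model, X_dev, y_dev, cv=cv, scoring="accuracy")
print("mean cross-validated accuracy:", scores.mean())
```

Stratified splitting is used in this sketch so that each fold contains both outcome classes, which matters when the event is relatively uncommon.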

2.2. Model performance evaluation

The validating data set, including 24 patients, was used for performance evaluation. The receiver operating characteristic (ROC) curve was plotted, and the area under the curve (AUC) was calculated. Sensitivity, specificity, and positive and negative predictive values (PPV and NPV) were also reported. All steps of model development and validation were conducted using Scikit-learn 0.22.1.
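The AUC can be read as the probability that a randomly chosen seroma case receives a higher predicted score than a randomly chosen non-seroma case, with ties counted as one half. The sketch below computes it directly from that definition. The function name and the labels and scores are hypothetical illustrations, not the study's predictions.

```python
def auc_from_scores(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive case ranked above negative case
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical predicted probabilities for 3 seroma and 5 non-seroma cases.
labels = [1, 1, 1, 0, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.8, 0.4, 0.3, 0.2, 0.1]
auc = auc_from_scores(labels, scores)
print(round(auc, 3))
```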

3. Results

Seroma was observed in 33 patients. Only hypertension status differed significantly between the seroma and non-seroma groups (72.3% versus 47.1%, respectively; p-value 0.012). Demographic data and baseline characteristics are shown in Table 1.

Pathology, hypertension, diabetes, neoadjuvant chemotherapy status, and BMI were selected as predictive factors. Axillary LN treatment was also included in the model because many previous studies observed a higher risk of seroma after axillary LN dissection [11,12,15]. The AUC (95%CI) of our ANN model was 0.760 (0.676, 0.844) in the validating data set (see Fig. 2). Sensitivity, specificity, accuracy, PPV, and NPV were 100%, 52.9%, 66.7%, 46.7%, and 100%, respectively.
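The five reported measures all follow from a single 2 × 2 confusion matrix. The sketch below shows the arithmetic. The cell counts (7 true positives, 8 false positives, 9 true negatives, 0 false negatives) are not stated in the paper; they are inferred here as the counts among 24 patients that reproduce the reported percentages, and the function name is a hypothetical helper.

```python
def screening_metrics(tp, fp, tn, fn):
    """Standard diagnostic measures from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),            # seroma cases flagged
        "specificity": tn / (tn + fp),            # non-seroma cases cleared
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "ppv": tp / (tp + fp),                    # flagged patients with seroma
        "npv": tn / (tn + fn),                    # cleared patients without seroma
    }


# Inferred counts for the 24-patient data set (assumption, see lead-in).
metrics = screening_metrics(tp=7, fp=8, tn=9, fn=0)
percentages = {name: round(100 * value, 1) for name, value in metrics.items()}
print(percentages)
```

Under these counts, every patient who developed seroma was flagged, while 8 of the 17 patients without seroma were flagged as well, which is why sensitivity and NPV are high and PPV is below 50%.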

4. Discussion

Seroma is a common complication after mastectomy and can prolong drainage. Many surgical techniques have been used to avoid this problem, but no predictive models have been developed for screening patients at risk of seroma. To the best of our knowledge, the present model, which was derived from the ANN algorithm, is the first model for this task. This machine learning model yielded an AUC (95%CI) of 0.760 (0.676, 0.844). Its high sensitivity (100%) makes this ANN model suitable for screening purposes.

Machine learning is now applied in many medical studies. For instance, the ANN algorithm was better at diagnosing anterior and inferior myocardial infarction than the conventional electrocardiographic criteria in the study of Hedén et al. [26]. In another study, ANN was used to predict intensive care unit outcome, including the length of hospital stay, in trauma patients, and satisfactory model performance was achieved [27]. An advantage of ANN is that it can overcome multicollinearity; thus, more factors can be included in an ANN model. Furthermore, the algorithm can handle non-linear problems and correlation among factors [28].

Nonetheless, the present model has some limitations. It was derived from a small data set, which could affect the model’s accuracy and which caused the straight-line appearance of the ROC curve. Only pre- and intra-operative predictive factors were used, in the expectation that the model could guide additional intra-operative procedures; however, adding more factors (e.g., patient’s age, number of nodes removed, extra-nodal spread, or size of breast) may improve predictive accuracy. Additionally, ANN does not provide exposure effects (i.e., odds ratio or relative risk). Without this clear picture of each factor's effect, some physicians would be reluctant to use the ANN model in their practice.

In conclusion, this study presents an ANN model that aims to screen patients who might benefit from additional seroma prevention techniques. The performance of the model is satisfactory. Please see https://seroma.herokuapp.com/ for the web application of the model. The impact of model implementation should be explored in future studies.

Footnotes

Acknowledgements

This manuscript was proofread and edited by Nattakrit Tongpoonsakdi, and a comprehensive English language review was conducted by Stephen Pinder, a medical English specialist.

Statement of ethics

This study involved retrospective data collection and had been approved by the ethics committee before data retrieval (Registration number 070/62).

Conflict of interest

All authors declare no conflict of interest.

Funding source

None.

Author contributions

Data were collected by Panupong Nakchuai and validated by Pakkapol Sukhvibul. Basic statistical analysis was performed by Amarit Tansawet. Artificial neural network model development and model deployment were performed by Sermkiat Lolak. The manuscript was drafted by Panupong Nakchuai and Pakkapol Sukhvibul. Amarit Tansawet designed the study under Suphakarn Techapongsatorn’s supervision.

References

1. Goetz, Gradishar, Anderson, Abraham, Aft, Allison, NCCN guidelines insights: Breast cancer, Version 3.2018: Featured updates to the NCCN guidelines, J Natl Compr Canc Netw, 17(2): 118–126, 2019.

2. Siegel, Miller, Jemal, Cancer statistics, 2019, CA Cancer J Clin, 69(1): 7–34, 2019.

3. Sriplung, Viroj Tangcharoensathien, Breast cancer screening among women in Thailand: Analyses of population-based household surveys, J Med Assoc Thai, 97(11): 1106–1118, 2014.

4. Virani, Bilheem, Chansaard, Chitapanarux, Daoprasert, Khuanchana, National and subnational population-based incidence of cancer in Thailand: Assessing cancers with the highest burdens, Cancers (Basel), 9(8): 108, 2017.

5. Pyfer, Chatterjee, Chen, Nigriny, Czerniecki, Tchou, Early postoperative outcomes in breast conservation surgery versus simple mastectomy with implant reconstruction: A NSQIP analysis of 11,645 patients, Ann Surg Oncol, 23(1): 92–98, 2016.

6. Al-Hilli, Thomsen, Habermann, Jakub, Boughey, Reoperation for complications after lumpectomy and mastectomy for breast cancer from the 2012 National Surgical Quality Improvement Program (ACS-NSQIP), Ann Surg Oncol, 22(3): 459–469, 2015.

7. Olsen, Nickel, Fox, Margenthaler, Ball, Mines, Incidence of surgical site infection following mastectomy with and without immediate reconstruction using private insurer claims data, Infect Control Hosp Epidemiol, 36(8): 907–914, 2015.

8. Doscher, Schreiber, Weichman, Garfein, Update on post-mastectomy lymphedema management, Breast J, 22(5): 553–560, 2016.

9. Akinci, Cetin, Aslan, Kulacoglu, Factors affecting seroma formation after mastectomy with full axillary dissection, Acta Chir Belg, 109(4): 481–483, 2009.

10. Hashemi, Kaviani, Najafi, Ebrahimi, Hooshmand, Montazeri, Seroma formation after surgery for breast cancer, World J Surg Oncol, 2(1): 44, 2004.

11. Agrawal, Ayantunde, Cheung, Concepts of seroma formation and prevention in breast cancer surgery, ANZ J Surg, 76(12): 1088–1095, 2006.

12. Burak Jr, Goodman, Young, Farrar, Seroma formation following axillary dissection for breast cancer: Risk factors and lack of influence of bovine thrombin, J Surg Oncol, 64(1): 27–31, 1997.

13. Loo, Chow, Factors predicting seroma formation after mastectomy for Chinese breast cancer patients, Indian J Cancer, 44(3): 99–103, 2007.

14. Pogson, Adwani, Ebbs, Seroma following breast cancer surgery, Eur J Surg Oncol, 29(9): 711–717, 2003.

15. Van Bemmel, Van de Velde, Schmitz, Liefers, Prevention of seroma formation after axillary dissection in breast cancer: A systematic review, Eur J Surg Oncol, 37(10): 829–835, 2011.

16. Sforza, Husein, Atkinson, Zaccheddu, Unraveling factors influencing early seroma formation in breast augmentation surgery, Aesthet Surg J, 37(3): 301–307, 2017.

17. Bryant, Baum, Postoperative seroma following mastectomy and axillary dissection, Br J Surg, 74(12): 1187, 1987.

18. Petrek, Peters, Nori, Knauer, Kinne, Rogatko, Axillary lymphadenectomy: A prospective, randomized trial of 13 factors influencing drainage, including early or delayed arm mobilization, Arch Surg, 125(3): 378–382, 1990.

19. Woodworth, McBoyle, Helmer, Beamer, Seroma formation after breast cancer surgery: Incidence and predicting factors/discussions, Am Surg, 66(5): 444–450, 2000.

20. Porter, O’Connor, Rimm, Lopez, Electrocautery as a factor in seroma formation following mastectomy, Am J Surg, 176(1): 8–11, 1998.

21. Chen, Li, Song, Zheng, Yu, Systematic review and meta-analysis of the use of quilting to prevent seroma formation after axillary lymphadenectomy, Int J Clin Exp Med, 9(2): 760–772, 2016.

22. Carless, Henry, Systematic review and meta-analysis of the use of fibrin sealant to prevent seroma formation after breast cancer surgery, Br J Surg, 93(7): 810–819, 2006.

23. Fawzy, Gaber, Role of fibrin glue in seroma reduction after modified radical mastectomy, Int Surg J, 4(7): 2103–2109, 2017.

24. Huang, Yu, Wei, Qin, Mo, Yang, Harmonic scalpel versus electrocautery dissection in modified radical mastectomy for breast cancer: A meta-analysis, PLoS One, 10(11): e0142271, 2015.

25. Michalik, Matkowski, Biecek, Szynglarewicz, The use of ultrasonic scalpel lowers the risk of post-mastectomy seroma formation in obese women, J Cancer, 10(15): 3481–3485, 2019.

26. Hedén, Edenbrandt, Haisty Jr, Pahlm, Artificial neural networks for the electrocardiographic diagnosis of healed myocardial infarction, Am J Cardiol, 74(1): 5–8, 1994.

27. Gholipour, Rahim, Fakhree, Ziapour, Using an Artificial Neural Networks (ANNs) model for prediction of Intensive Care Unit (ICU) outcome and length of stay at hospital in traumatic patients, J Clin Diagn Res, 9(4): 19–23, 2015.

28. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes, J Clin Epidemiol, 49(11): 1225–1231, 1996.

Prediction of seroma after total mastectomy using an artificial neural network algorithm