Sage Journals: Discover world-class research

Abstract

The coronavirus disease 2019 (COVID-19) pandemic, which emerged in late 2019, has caused millions of infections and fatalities globally, disrupting various aspects of human society, including socioeconomic, political, and educational systems. One of the key challenges during the COVID-19 pandemic is accurately predicting the clinical development and outcome of the infected patients. In response, scientists and medical professionals globally have mobilized to develop prognostic strategies such as risk scores, biomarkers, and machine learning models to predict the clinical course and outcomes of COVID-19 patients. In this contribution, we deployed a mathematical approach called matrix factorization feature selection to select the most relevant features from the anonymized laboratory biomarkers and demographic data of COVID-19 patients. Based on these features, developed a model that leverages the deep stacking neural network (DSNN) to aid in clinical care by predicting patients’ mortality risk. To gauge the performance of our suggested model, performed a comparative analysis with principal component analysis plus support vector machine, deep learning, and random forest, achieving outstanding performances. The DSNN model outperformed all the other models in terms of area under the curve (96.0%), F₁-score (98.1%), recall (98.5%), accuracy (99.0%), precision (97.7%), specificity (97.0%), and maximum probability of correction decision (93.4%). Our model outperforms the clinical predictive models regarding patient mortality risk and classification in the literature. Therefore, we conclude that our robust model can help healthcare professionals to manage COVID-19 patients more effectively. We expect that early prediction of COVID-19 patients and preventive interventions can reduce the mortality risk of patients.

Keywords

Biomarkers deep learning COVID-19 healthcare machine learning

WHAT IS ALREADY KNOWN ABOUT THIS SUBJECT?

⇒ Coronavirus disease 2019 (COVID-19) was the cause of severe acute lung damage, with clinical symptoms ranging from asymptomatic to moderate, with cough, fever, and muscular discomfort, to bilateral pneumonia with chronic lung failure and multi-organ dysfunction, which could ultimately lead to death.

⇒ Several machine learning and deep learning algorithms have been developed and tested based on laboratory variables and CT images to predict the death risk of COVID-19 patients.

⇒ There is a need to identify the most important clinical and laboratory features contributing to the risk of death in COVID-19 patients and to develop clinical prediction models that can integrate and analyze these parameters in real time to predict patient outcomes.

WHAT ARE THE NEW FINDINGS?

⇒ The metric factorization feature selection (MFFS) method was utilized for significant COVID-19 feature selection.

⇒ Exploring the potential of deep stacking neural networks (DSNN) model in predicting the mortality risk of COVID-19 patients.

⇒ Our analysis shows that patients at higher mortality risks had coexisting conditions, including multi-organ dysfunction syndrome (MODS) and factors such as age structure and comorbidities.

HOW MIGHT IT IMPACT CLINICAL PRACTICE IN THE FORESEEABLE FUTURE?

⇒ This contribution will significantly support the healthcare system in making the right decision and providing the appropriate facilities for all coronavirus patients; such preventative measures enhance patient survival rates.

⇒ The clinical care application model can be used in hospitals to identify which patients require more attention.

⇒ The clinical predictive model can also be used to predict the mortality risk of patients with other diseases such as heart, kidney, lung, and breast cancer.

Introduction

The novel coronavirus disease 2019 (COVID-19) pneumonia disease was first appeared in Wuhan, China, in late 2019 and later declared a pandemic by the World Health Organization (WHO).¹ It spreads through respiratory droplets in the air breathed when an infected person coughs or sneezes, talks, and breathes. The infectious virus causes massive devastation to public health, social order, educational systems, and economic systems. Consequently, the global infection and mortality rates are around 768 million and 6.9 million, respectively. In this situation, the existing healthcare system was insufficient to cope with such calamitous challenges. Therefore, advanced clinical technologies, such as artificial intelligence (AI) and machine learning (ML), must be industrialized to manage and identify high-risk patients with COVID-19.²

Moreover, the majority of COVID-19 patients convalesce without intervention, but some necessitate intensive care unit (ICU) admission. The COVID-19 infection is usually elevated in the pediatric and adolescent cohorts, and the geriatric population and the subjects with concurrent pathologies were also susceptible to adverse outcomes.^3–5 Currently, safe and effective vaccines are accessible in hospitals, but COVID-19 persists worldwide due to its constant genetic evolution, including the B.1.1.7 and B.1.351 variants, which pose significant challenges for the healthcare system regarding mortality risk.⁶ Thus, advanced AI and ML technologies are essential to advance the healthcare system’s scalability, cost-effectiveness, and quality performance. In the meantime, AI and ML can help develop a prognostic model for COVID-19 that can predict the risk of severe outcomes and guide the optimal treatment strategies for patients.

Literature review

Since then, researchers and medical professionals worldwide have made significant efforts to effectively prognosis and minimize the proliferation of COVID-19 through various measures, including social isolation, wearing masks, routine hand washing, self-quarantine, and vaccination. Researchers have recently leveraged AI, ML, and deep learning (DL) algorithms to develop a predictive model for prompt diagnosis and prognosis.^7,8

AI approaches have proven effective in predicting disease spread rates, delineating illness duration, determining the peak of infectious diseases, elucidating the course of COVID-19 transmission, and classifying the patients in healthcare systems. Traditional methods involving statistical and classical mathematical algorithms have fallen short in addressing the exigencies of the current pandemic landscape.^9–13 Moreover, the ML and DL approaches might be significant for dealing with prognosis and diagnosis.¹⁴ In the literature, the procedures and the treatment of the coronavirus were carried out by Zhang et al.¹⁵ Chen et al.¹⁶ studied the prevention and novel treatment of COVID-19. Holland et al.¹⁷ set up a particular medical kit for frontline workers against the coronavirus virus. Burdick et al.¹⁸ used an extreme gradient boosting (XGBoost) classifier model to predict invasive mechanical ventilation of COVID-19 patients within 24 h of an initial phase, employing data from the first 2 h after admission. A detailed explanation of the potential role of biomarkers in COVID-19 patients can be found in Malik et al.¹⁹ In the study of Ramón et al.,²⁰ five ML models were deployed to predict the mortality rate of COVID-19 patients, including k-nearest neighbors(KNN), Gaussian nave Bayes (GNB), decision tree (DT), and support vector machine (SVM), where XGB achieved better performance in terms of accuracy (92%). Yadaw et al.²¹ proposed ML models, including XGBoost, Logistic Regression (LR), SVM, and Random Forest (RF), to predict the death risk of COVID-19 patients. The dataset (n = 3842) was collected between March 9, 2020 and April 9, 2020, from the Mount Sinai Health System in New York City, NY, USA. In terms of performance, XGBoost shows an excellent outcome with an accuracy of 92.0%. In their study, J. L. Guadiana-Alvarez et al.²² utilized DL and ML algorithms to forecast the fatality risk of COVID-19 patients. The study included a sample size of 1503 patients with 48 features. The performance of the DL model was benchmarked against other models such as RF, XGBoost, LR, artificial neural network (ANN), and SVM. The DL model demonstrated superior performance with a maximum probability of correct decision (93%), accuracy (95%), precision (91%), recall (100%), and F₁-score (93%).

Alakus et al.²³ developed retrospective DL models for COVID-19 patients using laboratory findings from patients examined at Hospital Israelite Albert Einstein in Sao Paulo, Brazil, and patients tested for SARS-CoV-2 in early 2020. The initial dataset consisted of 5644 patients with 111 laboratory test results. After filtering, the final dataset included 18 laboratory findings from 600 patients. In the balanced dataset, 520 patients tested negative for PCR test results and 80 tested positive. The predictive model demonstrated a maximum performance of 86.66% accuracy, 91.89% F₁-score, 86.75% precision, 99.42% recall, and 62.50% area under the curve (AUC) for identifying COVID-19 patients. Moulaei et al.²⁴ used ML algorithms to predict the death rate of coronavirus; around 1500 patients’ datasets were collected from Ayatollah Taleghani Hospital, affiliated with Abadan University of Medical Sciences. Moreover, they consider dyspnea, ICU admission, and oxygen treatment the most predictive factors out of 38. Results showed that smoking, alanine aminotransferase, and platelet count were the least reliable predictors of mortality from COVID-19. Based on the experimental results, RF has the highest performance in terms of accuracy (95.03%), sensitivity (90.70%), precision (94.2%), specificity (95.10%), and ROC (99.97%). In addition, there are few studies of specific ML and DL algorithms to predict mortality risk in COVID-19 patients.

Our clinical predictive model is a much more sophisticated approach for minimizing the generalization error rate of one or more generalizers. Deep stacking neural networks (DSNN) work by reducing the biases of the generalizers concerning a given learning set. This minimization proceeds by generalizing in a second space, whose output is the accurate prediction. DSNN architecture is more advanced with classifiers, and it is utilized for different purposes such as preventing network intrusion, intrusion detection wireless, and indicated footballer position.^25–28

Since most ML and DL models are handy for a big dataset, these models often struggle with small datasets and generalization well. To overcome these challenges, we propose to use integrated DL with stacking generalization systems that can leverage multiple algorithms. Then this system is employed to enhance the prediction accuracy of COVID-19 patients in hospitals. Our study offers the following contributions:

The clinical predictive DSNN model was implemented to predict mortality risk in COVID-19 patients based on laboratory markers and demographics instead of CT images or radiograph data.

The matrix factorization feature selection (MFFS) mathematical system was deployed for the significant feature selection.

A dataset of COVID-19 patients from Hospital Universitario HM Madrid, Spain, was utilized to validate the novel predictive model. The experimental results showed that DSNN outperformed state-of-the-art algorithms.

The efficacy of the DSNN model was corroborated by two distinct experiments, one with 24 significant features and another with nine paramount COVID-19 features.

The management of COVID-19 patients at advanced levels was ensured by biomarker functionality in patients with systemic manifestations of the disease.

Multiple organ dysfunction syndromes (MODS) and other comorbidities associated with COVID-19 in mortality risk were studied.

The clinical predictive model’s performance was evaluated with accuracy, recall, F₁-score, precision, specificity, AUC, and maximum probability of correction decision (MPCD).

To our knowledge, no study has yet utilized DSNN algorithms to predict the fatality risk of COVID-19 patients based on laboratory tests and demographic results. This study may motivate other researchers to test the model in the laboratory using different biomarkers, which also can be used for other diseases.

The rest of the study is structured as follows. After reviewing the related works, the methodology for early database characterization of COVID-19 patients was discussed in the section “Materials and methods.” We then present the ML and DL application models and performance metrics in the section “Results.” Section “Discussion” details the experimental setup, results, and discussion. Finally, we conclude the study with our findings in the section “Conclusions and future study.”

Materials and methods

The proposed methodology is illustrated in Figure 1 with a schematic diagram. The approach includes several steps: (i) describing and cleaning the dataset, (ii) using the MFFS-based approach for selecting significant features, (iii) applying data splitting techniques, and (iv) developing, training, and testing the proposed model. The expounded details are elucidated in the following subsections.

Figure 1.

Schematic exemplification of the clinical care model.

Clinical dataset description

This study leveraged big data analytics techniques to analyze a large-scale clinical dataset of 2307 PCR-confirmed COVID-19 patients from Hospital Universitario HM Madrid, Spain. The raw dataset comprised 92 features and included a varied patient population of 1377 males and 930 females. The clinical data have shown that the majority 71% of patients had favorable outcomes and were discharged from the hospital. However, a significant proportion of 16% of patients had fatal outcomes and succumbed to the disease. The remaining 13% of patients were still receiving treatment in the hospital at the time of data collection. Patient identity (ID) was maintained using a unique identifier to ensure data privacy and security.

Data cleaning and feature selection technique

In the data cleaning phase of our study, we faced several technical challenges with our raw dataset, such as class imbalance, inconsistent units of measurement, redundant information, a mix of numerical and categorical data types, and missing values. To fix all these problems in the dataset, we applied various data preprocessing techniques such as resampling to balance the classes, feature selection to remove redundant information, unit conversion to standardize the units of measurement, encoding to convert categorical data into the numerical format, and the KNN imputation scheme was deployed to predict the missing values based on the similarity or distance between the existing records. We applied specific filtering criteria to our raw dataset to ensure quality and relevance. We only included records of patients who had a PCR test positive and whose outcomes were known (either discharged or deceased). In addition only included the successive aged ≥ $18$ years, with SpO₂ scores and a registered patient ID. We use a raw dataset with 92 features, including the maximum, minimum, and standard of some biomarkers and demographical variables. For example, Troponin_max, Troponin_min, and Troponin are three features of the same biomarker. These features are unnecessary and can affect our analysis negatively. We removed the maximum and minimum features and retained only standard features. After applying these filtering criteria to the raw dataset, there were only 40 features left with a 1463 sample size.

However, some of these features still did not meet our inclusion criteria. Therefore, applied the MFFS approach to refine the dataset features further and identify the most significant ones for predicting the fatality risk of COVID-19 patients in advance. Wang et al.²⁹ proposed the MFFS scheme for important attribute selection. This method assumes all features exist on a linear manifold within the Euclidean space $R$ . By making this assumption, the method can effectively identify and select the most relevant features for analysis. Herein, the pseudo-code of MFFS is as follows:

Input: Data matrix

X \in R^{n \times d},

number of COVID-19 features

k

Output: Selected COVID-9 feature subset

S

1. Initialize: the feature weight matrix

W \in R^{d \times k}

and the representation matrix

H \in R^{k \times n}

2. repeat
3. Update

W

and fix

H

4. Update

H

and fix

W

5. until convergence
6. Select

S

as the most extensive

k

index set in

W

7. return

S

where W functions as an indicator matrix for the designated features derived from an index set S of selected features, H represents the representation matrix of the initial feature within the specified feature space, and K shows the number of COVID-19 features.

After implementing the MFFS technique, there was a significant reduction possible in the number of COVID-19 features. Out of the 40 features, only 24 were retained based on high rank. Of these 24 features, 17 belong to laboratory markers, 6 comorbidities features, and the age of patients. Features 1–18 were considered numerical data $\in R$ , while features 19–24 were categorical data in [True, False]; all the selected features with importance scores are seen in Table 1. This dimensionality reduction can potentially improve the clinical care models’ efficiency and accuracy.

Table 1.

MFFS approach deployed to select significant features in COVID-19 patients: |S|= feature important score; $\bar{x}$ (Min–Max) = feature mean value, minimum, and maximum value; p-value = probability value.

S. No	COVID-19 features (K)	Acronym with units/cod	\|S\|	$\bar{x}$ (Min–Max) values	p-Value
01	Leukocyte count	LEUC (×1000/ $μ L$ )	0.6315	6.17 (6.05–9.91)	<0.001
02	C-reactive protein	CPR (mg/L)	0.0773	91.0 (40.10–131.30)	<0.001
03	Troponin	TNI (ng/L)	0.0727	29.74 (24.01–35.47)	0.002
04	Neutrophils count	NEU (×1000/ $μ L$ )	0.0488	7.55 (6.0–9.10)	<0.001
05	Platelet count	PLAQ (×1000/ $μ L$ )	0.0437	275.45 (200.00–350.90)	<0.001
06	Lymphocyte count	LIN (×1000/ $μ L$ )	0.0410	20.2 (16.10–24.3)	<0.001
07	IgM (Immunoglobulin M)	IGM (mg/dL)	0.0351	102.07 (94.02–110.12)	0.052
08	Interleukin 6	IL-6 (ng/L)	0.0338	216.115 (192.73–239.5)	0.043
09	Ferritin	FER (ng/mL)	0.0300	1338.065 (1150.13–1526)	<0.001
10	Prothrombin time (s)	PT (s)	0.0276	75.775 (70.58–80.97)	<0.001
11	Lactate dehydrogenase	LDH (IU/L)	0.0233	619.23 (501.91–736.55)	<0.001
12	Oxygen saturation (%)	SpO2 (%)	0.0223	92.9 (86.7–99.1)	<0.001
13	Lactate	LAC (mmol/L)	0.0207	2.01 (1.73–2.29)	<0.001
14	Creatinine	Crea (mg/dL)	0.0093	0.965 (0.91–1.02)	<0.001
15	D-Dimer	DD (ng/mL)	0.0079	2852.15 (1194.71–509.60)	<0.001
16	IgG (Immunoglobulin G)	IGM (mg/dL)	0.0066	102.07 (94.02–110.12)	0.053
17	Monocytes	MO (×1000/ $μ L)$	0.0042	360 (120–950)	<0.001
18	Age	Age (yr)	0.2701	59 (18–110)	<0.001
Comorbidities
19	Hypertension	I10 (mm Hg)	0.2699	Categorical values [True, False]	0.008
20	Diabetes	E11(mg/dL)	0.0963		<0.001
21	Heart disease	I25 (%)	0.0411		<0.001
22	Lung disease	J80 (L)	0.0099		<0.001
23	Kidney disease	N17 (ML)	0.0089		<0.001
24	Liver disease	K76.9 (IU/L)	0.0016		0.051

In addition, the statistical approach t-test and p-value have been utilized further to verify the significance of the selected 24 features, and a t-test was deployed to compare the means of two sets of features and determine whether their differences were statistically significant. The p-value represents the probability that the difference between the two sets of features is due to chance. If the value of $p \leq 0.05$ , then it is considered statistically significant.³⁰ Thus, all the selected features by MFFS were also significant in the p-value, and for all those features whose p-values are less than 0.001, we write it as <0.001.

Moreover, to optimize the DSNN model for minimal features and maximal performance, we conducted another experiment with only nine key features. These features were chosen based on their statistical significance and clinical relevance: leukocyte, age, hypertension, C-reactive protein, troponin, neutrophils, interleukin, platelet count, and lymphocyte. These features reflect the immunopathological response and the comorbidities of organ dysfunction that lead to COVID-19. Among all the delineated features in Table 1, only leukocytes, age, and hypertension have the highest score $(> 0.1)$ , indicating their statistical significance and clinical relevance for COVID-19. Leukocytes are white blood cells that fight infections. Leukopenia (<6.16 × 10⁹/L) means a low leukocyte count, which weakens the immune system and increases the mortality risk of COVID-19 patients. Leukocytosis count (>10 × 10⁹/L) indicates a high leukocyte count, which reflects an active immune response and is often seen in asymptomatic COVID-19 patients.³¹ Age is a notorious risk factor for mortality and severe COVID-19 outcomes. Owing to the weakened immunological response against COVID-19, the geriatric population may also be more vulnerable to COVID-19.³² Hypertension is a common comorbid disease that may affect the expression of angiotensin-converting enzyme 2, the receptor for adherence, and internalization of COVID-19 into host cells. Hypertension may also impair the cardiovascular system’s ability to cope with the stress and inflammation induced by COVID-19.³³ The significance score encapsulates how each distinct feature influences the prognostication of the fatality rate for patients afflicted with COVID-19.

Experimental setup

The entire experimental process is illustrated in Figure 1 and consists of three main steps: (1) data processing and cleaning, (2) development of the proposed model, and (3) model evaluation and prediction.

Step 1: In the initial phase of the experiment, data preprocessing and cleaning were performed. Subsequently, the dataset was partitioned into training and test sets with an 80:20 ratio. All classification models were trained using TensorFlow and Keras on a CPU with 8 GB RAM, utilizing Python 3.9.0.

Step 2: In this stage, we describe the overall structure of the DSSN model, which can be seen in Figure 2. First, we trained and tested three models: RF, principal component analysis (PCA) + SVM, and recorded their performance in Table 6. After that, we experimented with the DSNN model, which has two main levels: level 0 (or base model) and level 1(meta-model). The level 0 (or based models) included PCA + SVM, RF, and DL, while the convolutional neural network (CNN) was used as the meta-model in level 1. We used an 80:20 typically train and test technique for the training and test dataset. In such a way, we trained three different base models and then saved their results. The best weight of every model was recorded using the ModelCheck function, which is activated after the validation loss reaches the minimum value. CNN utilized a level 1 model and trained it using the output features of the base model.

Step 3: Finally, the robust models were evaluated by the key performance measures (KPM) shown in Table 6, and the performance of the DSSN was also compared with RF, PCA + SVM, and DL.

Figure 2.

The deep stacking neural network architecture.

Machine learning and DL application models

DL is a subfield of machine learning, which itself is a branch of AI. DL models use multi-layered neural networks to learn and make predictions or decisions without being explicitly programmed to perform these tasks.³⁴ Voluminous research in healthcare has demonstrated the varied applications of AI in healthcare-related sustainable development goals. The implementation of AI has commenced for a wide variety of health problems, including tuberculosis and malaria, the diagnosis and prognosis of patients afflicted with COVID-19, diagnosing cancer by mammography, identification of brain tumors via magnetic resonance imaging, and predicting Alzheimer’s disease.^35–38 The advancement of AI transformation may guide medical professionals to provide more effective care and treatment closer to the patients. AI never controls patients’ illnesses and gives them superhuman disease management skills.³⁹ To select the most robust model for predicting the inflammatory and immune responses of COVID-19 patients, we applied and compared four different models: PCA with SVM, RF, DL, and DSNN. These models were trained and tested using laboratory test results and demographic data of COVID-19 patients.

PCA with SVM

The PCA is a well-established statistical tool deployed to examine the correlations between items and variables to determine principal components with a significant correlation. Deploying the few fundamental principal components changed a high-dimensionality dataset into a low-dimensional dataset, lowering the dataset’s dimensionality. As a result, the number of principal components equals the number of eigenvalues of the correlation matrix, with a value always greater than 1. SVM is an ML algorithm for classification and regression analysis. When SVM combines with PCA, it can improve the model’s performance by reducing the features’ dimensionality and removing noise from the data.⁴⁰ In this contribution, PCA with SVM was deployed as a base model to classify COVID-19 patients and the fatality risk prediction strategy for minimizing preventable deaths from the public health perspective.

Random forest

It is an ensemble approach that joins multiple decision trees. Because of randomness, it offers much greater performance than a single decision tree and can reduce model variance. The RF models easily handle all kinds of regression and classification issues because it has less sensitive and requires fewer parameters to configure.⁴¹ It performs better when the number of variables is significantly greater than the number of observations. Let us consider the input variable $x$ and a continuous output variable $y$ . The prediction model is managed based on random forest and bootstrap methods using the following phases⁴²:

Phase 1: Supposed that a bootstrap sample $Z$ of size $n$ from the training data.

Phase 2: Develop a random forest tree $T_{b}$ to the bootstrapped data by recursively replicating the following steps for every end node of the tree till the smallest node size $n_{\min}$ is achieved. Chose $m$ number of splitting random variables from the $p$ inputs. For this, select the best-split point among the $m$ , and deploy the residual sum of squares (RSS) to split the $m$ random node into two sub-nodes, and mathematically presented as in equation (1).

RSS = \sum_{left} (y_{i} - y_{l})^{2} + \sum_{right} (y_{i} - y_{R})^{2}

(1)

where $y_{l}$ and $y_{R}$ indicate the mean $y$ -value for the left nodes and right nodes, respectively. In such a way, we obtained the output of the ensemble of trees $[T_{b}]_{1}^{B}$ (T) and predict a new point $x$ : ${\hat{f}}_{rf}^{B} (x) = \frac{1}{B} \sum_{b}^{B} T_{b} (x)$ .

Deep learning

Recently, DL architectures have garnered great interest across myriad disciplines due to their unrivaled representational capabilities, particularly in computer vision, but they are also expanding into other fields, including medical diagnosis, prognosis, and predictions. DL uses a hierarchy of several layers to understand different representation levels.⁴³ DL is a neural network with three primary layers: input, hidden, and output. Mathematically presented as in equation (2).

Y_{i} = f (W_{i} . Y_{i - 1} + b_{i})

(2)

The above equation shows the output of the layer in the network, where f, W_i, b, and Y_i–1 are, respectively, defined as the activation function, weight, bias, and output of the previous layer. In addition, the network has multiple hidden layers, each subject to the output of the last layer; this ultimately generates the output variable Y_i as COVID-19 patients’ mortality risk.

Deep stacking neural network

A DSNN is a type of neural network architecture. It is inspired by DL and the idea of stacked generalization. It combines several different models (or heterogeneous ensembles) to create a more robust and accurate model.²⁸ The complete architecture of the DSNN model is seen in Figure 2. Also, in the main unit of DSNN, the hidden layer and output layer were sigmoidal nonlinear and linear, respectively. The linear property ensures greater competence and optimization for the output network’s weight. The DSNN architecture includes the fundamental structure of single and multi-layered units; the functional neural network unit consists of two training sets of weights and one hidden layer. The lowest unit’s topology consists of the linear input and hidden output layers with linear and nonlinear groups. It is significant to observe that the fine-tuning model of DSNN weight is implemented in each unit rather than generally through all layers. Total training samples in activity matrix H are known in the hidden layer. Hereafter, the upper weight matrix U can be efficiently learned. Suppose that the training vector X = [x₁, …, x_i,…, x_N] in which each vector x_i is represented as x_i = [x₁, …, x_ji, …, $X_{Di}$ ], D indicates the size of the input vector, while the number of training samples N, the hidden units L, and C shows the dimension of the output vector. The lowest input module of DSNN has a hidden layer W which consists of a set of nonlinear units. U denotes the output units, and the value of the weight matrix consists of 0 to 1. So, all the modules in DSSN have the same architecture and are interconnected, such as linear input linked with the hidden layer. The higher module input unit is a subset of a neighboring lower output Z_n unit in the DSNN, and is mathematically presented in equation (3).

Z_{n} = U^{T} K_{n}

(3)

where $K_{n} = σ (W^{T} x_{n})$ which goes to the hidden layer vector for sample $n = D \times L,$ and U = L × C, representing the upper and lower layer block, respectively, $σ$ denotes the activation function. Let us assume that N is the total number of samples in the training set, with $S = [S_{1}, . . ., s_{n}, . . ., S_{N}]$ where the $s_{n}$ vector is $[s_{n}, \dots, s_{jn}, \dots, s_{N}], W and U$ parameters are then determined to minimize the average total square error as follows.

E = \frac{1}{2} \sum_{i = 1}^{n} {‖ z_{n} - s_{n} ‖}^{2} = \frac{1}{2} Sr [(Z - S) - (Z - {S)}^{T}]

(4)

for output, $y_{i} = U^{T} k_{n} = σ U^{T} (W^{T} x_{n}) = (UW) .$ Let K = [k₁, …, k_nn, …, k_N], and W input weight matrix is known, error derivative to U (equation (5)):

U = {(K K^{T})}^{- 1} K S^{T} = F (W)

(5)

Deploying Lagrangian multiplier^44,48 to resolve the optimizing problem for $W$ output weight matrix.

E = \frac{1}{2} \sum^{n} | | Z_{n} (U, W) - S_{n} | |^{2} + λ | | U - F (W) | |

(6)

To fine-tune the W output weight matrix, Batch-mode gradient descent^45,49 was used; it can be calculated as in equation (7).

\frac{dE}{dW} = 2 X [\begin{matrix} K^{T °} {(1 - K)}^{T °} \\ [K^{T} (K S^{T}) (S K^{T}) - S^{T} (S K^{+})] \end{matrix}]

(7)

where $K^{+} = K^{T} {(K K^{T})}^{- 1}$ the pseudo-inverse of K, and “°” element-wise multiplication. The Batch-mode gradient method was used instead of the conventional back-propagation method because it trains more efficiently.⁵⁰ One of the most critical components in a neural network is the activation function because it helps the network use important information and suppress irrelevant data points.⁵¹ This model exercised two activation functions: rectilinear unit (ReLu) and sigmoid function.

Key performance measures

The proposed models were evaluated using performance metrics. COVID-19 patients who were discharged or survived were classified as class 1, while those who deceased were classified as class 0. The performance metric helps us understand whether the model can discriminate between the two classes. It provides metrics such as true positive (TP), false positive (FP), true negative (TN), and false negative (FN). The KPM used were recall, precision, specificity, the AUC, F₁-score, accuracy, and MPCD.^22,46

Recall: The ratio of TP to the sum of true positives and false negatives (TP + FN). The formula for the recall is:

Recall = \frac{TP}{TP + FN}

Precision: The ratio of TP to the sum of true positives and false negatives (TP + FP). The formula for precision is:

Precision = \frac{TP}{TP + FP}

Specificity: The ratio of TN to the sum of true negatives and false positives (TN + FP). The formula for specificity is:

Specificity = \frac{TN}{TN + FP}

Accuracy: The ratio of the number of correct predictions made by the model on the test dataset to the total number of predictions made. The formula for test accuracy is:

Accuracy = \frac{TP + TN}{TP + TN + FP + FN}

F₁-score: It is the harmonics mean of precision and recall. The mathematical formula for F₁-score is:

F_{1} - score = \frac{2 (Precision \times Recall)}{(Precision + Recall)}

MPCD: The maximum probability of correct decision (MPCD) is a probabilistic-based measure used to evaluate the performance of a classification model. It is designed to analyze highly imbalanced data structures. It can be used to maximize the probability of correct classification for the minority class while maintaining a high level of accuracy for the majority class.

MPCD = (1 - \frac{FN}{FN + TP}) (1 - \frac{FP}{FP + TN})

where $α = \frac{FP}{FP + TN}$ , and $β = \frac{FN}{FN + TP}$ .

Mean Absolute Error (MAE) is the average of the absolute difference between the predicted and actual values. The formula for MES is:

MES = \frac{1}{k} \sum_{j = 1}^{k} | y_{j} - y |

where $y_{i}$ and y represent predicted value and actual value, respectively.

Results

This section commences by elucidating the descriptive statistical analyses of the important factors that exacerbate the mortality risk of COVID-19 patients; it also presents the main results of the proposed model.

Symptom-based characterization of COVID-19 patients

There is considerable interest in employing symptoms to prioritize COVID-19 diagnostics and establish symptom-based management. However, this approach may be unsuccessful in identifying patients who require testing for COVID-19 infection. The criteria to determine who should undergo RT-PCR testing vary from region to region and ultimately depend on hospital resources and public health goals. The classification of patients based on COVID-19 early symptoms can be divided into three categories. (i) General symptoms: One epidemiological patient history, two clinical symptoms, one etiological indication of COVID-19 infection, and patients with pneumonia indications on CT scans or PCR are required. (ii) Mild symptoms: Fever, cough, fatigue, body aches, headaches, sore throat, and chest distress, required isolation. (iii) Severe symptoms: At rest, the $Sp O_{2} \leq 93$ %, causing shortness of breath. Heart rate (HR) interval was $\geq 30 bpm$ and lung infection was >50%. Positive results from PCR may indicate a high risk of developing MODS and shock in patients with viral infections.⁴⁷

Biomarkers-based classification criteria for COVID-19 patients

A characteristic that can be scientifically assessed as an indicator of normal biological and pathological processes is a biomarker. Biomarkers have become an important diagnostic method in managing COVID-19 patients, the etiological organism for COVID-19.¹⁹ In hospital practice, biomarkers ensure that resources are utilized as effectively as possible to improve clinical treatment and avoid serious complications.⁴⁸ COVID-19 is a multisystem disease caused by a diffuse systematic process with a complicated combination of immunological inflammatory and coagulated cascades. The genetics and acquired variations in the host immune system further affect the host with wide heterogeneity in the clinical picture, course, and outcomes.¹⁹ The COVID-19 biomarkers offer valuable information such as detection and classification to set prognosis criteria, timely diagnosis, manage high-risk cohorts, formulate hospital admission and discharge criteria, rationalize therapies, and forecast mortality risk.

Also, the use of COVID-19 biomarkers in children is highlighted, particularly to forecast multisystem inflammatory syndrome in a child (MIS-C). Eventually, biomarkers can be employed as predictive tools for doctors to identify and effectively manage patients more likely to suffer adverse outcomes from COVID-19. We aim to use biomarkers results and predict the death risk of COVID-19 patients and classification criteria using the DSNN model. The predictive criteria for biomarkers derived from various literature sources are shown in Table 2.

Table 2.

Classification of biomarkers and mortality risk criteria in COVID-19 patients.

Name of biomarker	Threshold values	Mean value min–max/range
Lung impairment in COVID-19 biomarkers⁴⁹
Ang-2	$> 2800 pg / mL$	$(25 - 60) pg / mL$
RAgE	$< 208 pg / mL$	$(188.4 - 3964.4) pg / mL$
VCAM-1	$> 1312 ng / mL$	$(449 - 1103) ng / mL$
ICAM-1	$> 1092 ng / mL$	$(100 - 200) ng / mL$
Cardiac biomarkers⁵⁰
High-sensitivity cardiac troponin I (hs-cTnI)	$7.12 ng / L$	$(0 - 14) ng / L$
NT-pro BNP	$5.11 pg / mL$	$(0 - 300) pg / mL$
Ck-MB	$4.86 IU / L$	$(5 - 25) IU / L$
Myoglobin (MYO)	$3.56 nmol / L$	$(1.28 - 3.67) nmol / L$
Renal biomarkers⁵¹
BUN elevated above the standard value	$11.07 mg / DL$	$(7 - 21) mg / DL$
Serum creatinine elevated above the expected value	$4.42 mg / DL$	$(0.84 - 1.21) mg / DL$
BUA—elevated above the average value	$2.92 mg / DL$	$(2.4 - 7.0) mg / DL$
MIS-C biomarkers⁵²
C-reactive protein	$149 mg / L$	$(< 6.0) mg / L$
Erythrocyte sedimentation rate	$50 mm / h$	$(< 13) mm / h$
D-dimer	$2523 ng / mL$	$(< 500) ng / mL$
Ferritin	$539 mg / L$	$(20 - 300) mg / L$
Lactate dehydrogenase	$359 U / L$	$(110 - 210) U / L$

COVID-19 association with multi-organ dysfunction syndrome

Although there have been a lot of basic and clinical studies on MODS, much work remains to be done to keep patients safe.⁵³ Dataset information reveals that MODS is one of the critical factors in increasing the risk of death among COVID-19 patients. Table 3 shows the distribution of MODS and mortalities in COVID-19 patients; of the 1463 COVID-19 patients in the dataset, 791 were diagnosed with MODS. In addition, 110 of these 791 patients had lung disease, and 77 died. Heart disease was another risk factor for mortality; 35 of 80 patients with this disease had a fatal outcome. The liver disease affected 150 patients, of whom 120 were successfully treated and discharged from the hospital. Kidney disease can lead to dangerous complications; only 150 of 200 patients with this condition survived. When coronavirus infects diabetic patients, the outcome can be worse; according to data analysis, 51 of 251 patients died. According to the data analysis, lung infection contributed most to the mortality rate of COVID-19 patients. This study examined the impact of MODS on the fatality risk of COVID-19 patients. The results showed that COVID-19 patients with MODS had a higher mortality risk than those without MODS. Especially, lung diseases showed a higher mortality rate among COVID-19 patients, and these findings suggest that the fatality of COVID-19 patients depends on various factors, and MODS is one of them.

Table 3.

Distribution of MODS and mortality in COVID-19 patients by comorbidities.

Features name	MODS with COVID-19 (n = 791)	Deceased patients (n = 243)
Lung diseases	110 (13.90%)	77 (70.00%)
Heart diseases	80 (10.11%)	35 (43.75%)
Liver diseases	150 (18.96%)	30 (20.00%)
Kidney diseases	200 (25.28%)	50 (25.00%)
Diabetes	251 (31.73%)	51 (20.31%)

MODS, multi-organ dysfunction syndrome.

COVID-19 with other comorbidities

The mortality risk of COVID-19 patients was also influenced by other comorbidities such as fever, catarrhal table, oncological diseases, discomfort, thoracic outlet syndrome (TOS), and dysuria. Table 4 presents the statistical facts about the mortality risk of COVID-19 patients along with comorbidities or symptoms: according to data analysis, 9.25% of 270 patients with fever died. Catarrh is a condition that involves phlegm and mucus in the throat and nose. Influenza, colds, and allergies can cause it. COVID-19 mortality and catarrh are unclear. Catarrh by itself is usually not a serious condition. However, it can be a sign of a COVID-19 infection, which can lead to complications and increase the chance of death.⁵⁴ According to the data records, 11.37% of the 211 patients who had catarrh with COVID-19 died. Oncological disease, commonly referred to as cancer, presents a multifaceted association with COVID-19. In general, patients diagnosed with oncological conditions may exhibit increased vulnerability to contracting COVID-19 and may experience greater severity and fatality rates after infection,⁵⁵ 35.13% of 37 patients deceased with the oncological disease, 15.62% of 64 patients passed away with general distress. TOS compresses nerves, veins, and arteries between the base of the neck and upper chest. It affects shoulder, arm, and hand pain, numbness, and weakness,⁵⁶ 20% of 140 patients died with TOS, and 66.66% of nine patients with dysuria died as reported in the dataset. Hence, it is apparent that the co-occurrence of preexisting medical conditions with COVID-19 significantly influences the mortality rate of COVID-19.

Table 4.

Distribution of mortality in COVID-19 patients with symptoms.

Features name	COVID-19 with symptoms (n = 731)	Deceased patients (n = 106)
Fever	270 (36.93%)	25 (9.25%)
Catarrh	211 (28.86%)	24 (11.37%
Oncological diseases	37 (05.06%)	13 (35.13%)
Discomfort	64 (08.75%)	10 (15.62%)
Thoracic outlet syndrome (TOS)	140 (19.15)	28 (20.00%)
Dysuria	9 (01.30%)	06 (66.66%)

Notably, the study included 1463 patients who were infected with COVID-19. Among them, 266 succumbed to the disease, from 18 to 110 years old. Another cause of death was classified into MODS and other comorbidities. Some patients had both MODS and comorbidities, while others had only one of them. Among the 266 fatalities, 243 were related to MODS, either by itself or in conjunction with other comorbidities, and 106 were attributed to other comorbidities, either alone or in conjunction with MODS. Therefore, the total number of deaths was equal to the combination of MODS-related and comorbidity-related deaths without counting the patients who had both conditions twice. In other words, the union of MODS and other comorbidities deaths equal to the total death representing in Table 5.

Table 5.

Distribution of mortality in COVID-19 patients by age group.

S. No	Age group	Total patients (n = 1463)	Deceased patients (n = 266)
01	18–38	160 (10.94%)	08 (05.0%)
02	39–58	377 (25.77%)	49 (12.99%)
03	59–78	430 (29.39%)	105 (23.95%)
04	79–98	490 (33.49%)	99 (20.20%)
05	$\geq 99$	06 (0.41%)	05 (83.33%)

COVID-19 with age structure

The study by Rymkiewicz⁵⁷ reported that the increased mortality rate of COVID-19 in geriatric populations is attributed to immunosenescence. The descriptive statistical analysis in Table 5 indicates that COVID-19 distresses all age groups, but not equally. Geriatric populations have a higher mortality rate compared to working folks. The infection (morbidity) rate is typically higher in younger individuals, while mortality is higher in the geriatric population.

Key results from the proposed model

This subsection delineates the experimental outcomes and furnishes a discourse on the findings, which entailed devising and implementing clinical prediction classifiers, including PCA + SVM, RF, DL, and DSNN. In this contribution, two experiments were executed; both utilized identical models and conformed to the same performance standards. However, the number of features deviated between the two experiments. So, in the first experiment, 24 features were utilized for all proposed classification algorithms to generate mortality predictions and assess the performance of the clinical prognostic models, as the outcomes illustrated in Table 6. Figure 3 compares the performance of the proposed clinical predictive models, where the DSNN model outperforms other models in all seven metrics, as evidenced by the higher bar height. This study applied PC + SVM to prognoses COVID-19 patients, and the performance profile of this classifier can be visualized in Figure 3, including F₁-score (0.972), AUC (0.914), recall (0.969), accuracy (0.972), precision (0.975), specificity (0.950), and MPCD (0.899). The RF classifier demonstrated strong performance across all seven indicators, as proved by the AUC (0.869), F₁-score (0.953), recall (0.950), accuracy (0.975), precision (0.957), specificity (0.907), and the MPCD (0.887). The DL algorithm was implemented using and obtained excellent results across multiple metrics, as indicated by the AUC (0.941), F₁-score (0.966), recall (0.960), specificity (0.965), accuracy (0.971), precision (0.973), and MPCD (0.920). The DSNN model outperformed all other application models, achieving an AUC (0.960), F₁-score (0.981), recall (0.985), accuracy (0.990), precision (0.977), specificity (0.970), and MPCD (0.934). In addition to that, the clinical care model exhibited a high-performance indicator in terms of accuracy achieved on both the training and test sets, which attained 0.990 and 0.985, respectively, and also visualized model predictive accuracy and loss function in Figure 4, the training and testing accuracy and loss function curves at some stage closely aligned, implying that our model looks stable and generalizing well to new data.

Table 6.

Summary of the outcomes of clinical predictive models with 24 attributes.

Algorithms	Key performance metric (KPM)
	AUC	F₁-score	Recall	Accuracy	Precision	Specificity	MPCD
DL	0.941	0.966	0.960	0.971	0.973	0.965	0.920
PCA + SVM	0.914	0.972	0.969	0.972	0.975	0.950	0.899
RF	0.869	0.953	0.950	0.975	0.957	0.907	0.887
DSNN	0.960	0.981	0.985	0.990	0.977	0.970	0.934

AUC, area under the curve; DL, deep learning; DSNN, deep stacking neural network; MPCD, maximum probability of correction decision; PCA, principal component analysis; RF, random forest; SVM, support vector machine. The bold values in Table 6 highlight algorithmic excellence across diverse performance metrics, simplifying interpretation and comparison.

Figure 3.

Performance comparison among clinical predictive application models.

Figure 4.

The performance of the DSNN model with 24 features is shown by its (a) accuracy and (b) loss.

In the second experiment, we used a parsimonious set of 09 features for forecasting COVID-19 patients’ mortality risk. This experiment trained and validated the DSNN model to ensure that the proposed model could attain high performance and efficiency with minimum features. The model achieved an AUC (0.971), F₁-score (0.934), recall (0.927), training accuracy (0.932), test accuracy (0.925), precision (0.941), specificity (0.966), and MPCD (0.901), also shown in Table 7. Hence, Figure 5 describes the accuracy and loss function of the proposed model with nine features, indicating its capability to forecast the mortality risk of COVID-19 patients accurately.

Table 7.

Summarizes the comparison of our study with existing studies for mortality prediction in COVID-19 patients.

KPM							Feature size	Sample size	Reference
AUC	F_1-score	Accuracy	Recall	Precision	Specificity	MPCD
0.960	0.981	0.990	0.985	0.977	0.970	0.934	24	1463	Our study
0.971	0.934	0.932	0.927	0.941	0.966	0.901	09
0.625	0.918	0.866	0.994	0.867	N/A	N/A	18	600	Alaku et al.²³
0.99	N/A	0.950	0.907	0.942	0.910	N/A	38	1500	Moulaei et al.²⁴
0.942	N/A	0.899	N/A	N/A	N/A	N/A	32	307382	Pourhomayoun et al.⁶³
0.844	0.742	N/A	0.742	N/A	0.800	N/A	15	235	deMoraes et al.⁶²
0.921	0.921	0.923	0.924	0.925	N/A	N/A	15	203	Ramón et al.²⁰

AUC, area under the curve; MPCD, maximum probability of correction decision; N/A, means not available. Bold shows excellence across diverse performance metrics.

Figure 5.

The performance of the DSNN model with 09 features as shown by its (a) accuracy and (b) loss curves.

Discussion

This section discusses the results of the proposed models, their implications, and limitations, and compares them with the existing works on ML and DL models for COVID-19 prognostication. Furthermore, we analyze the performance of each KPM for all proposed models: PCA + SVM, RF, DL, and DSNN, by conducting an experimental study using two distinct feature sets. First, we discuss the outcomes of using the clinical predictive model on a dataset with 24 features, and second, we explore the effects of reducing the feature set to 9 features, as follows.

Accuracy is a study feature that indicates how well experimental parameters correspond to population parameters and ensures the validity and reliability of the results.⁵⁸ Our all-predictive models achieved an accuracy greater than 0.970, demonstrating their robustness and performance in predicting fatal risk among COVID-19 patients. In particular, our DSNN model achieved test and train accuracies of 0.985 and 0.990, respectively. Clearly, accuracy serves as a rudimentary metric to evaluate the performance of the model. Therefore, to make a robust prediction model, one more advanced performance indicator has been measured, named precision and recall. Precision refers to the proportion of correct detections reported by a model. In simple words, the ratio between the true positive observation and the total predicted positive observation. The clinical model’s perfect precision would have been 1.⁵⁹ Our clinical predictive models obtained outstanding precision scores exceeding 0.950, particularly DSNN being the best with a score of 0.977, it nearly optimal precision. Recall efficiently predicts the true positives of each available category. In the significant classification process, the recall value for the best model is always measured as 1.⁵⁹ All four models obtained the best recall value greater than 0.950, while the DSNN model gained 0.985. Instead of using a curve to evaluate the classifier’s performance, we want to use a single scalar metric, or F₁-score, and compute the F₁-score as the harmonic mean of precision and recall⁵⁹; and suggested models obtained F₁-score greater than 0.950, especially DSNN achieved 0.981. The AUC was applied in the classification task to ascertain the classes that yielded the most accurate predictions. The evaluation criteria for the AUC are delineated in Mandrekar.⁶⁰ When the AUC score is 0.50, it means no discrimination ability, while scores ranging from 0.60 to 0.80 are deemed reasonable, scores between 0.80 and 0.90 are considered excellent, and values exceeding 0.90 are exceptional. The RF’s AUC score can be regarded as good, as the range of results fell between 0.8 and 0.90. The AUC values of the other three models were exceptional, and scores exceeding 0.90 are superior. Based on the AUC score performance, all clinical predictive models can be deployed to predict the mortality rate of COVID-19 patients. Finally, we calculate the model loss function, which measures how well the model fits the data and how robust it is to over-fitting.⁶¹ Figure 4(b) shows that the training and validation loss rapidly converge to a minimum within the first 90 epochs, after which they remain stable and closed for several epochs. It implies that our clinical care model is highly generalizable on unseen data.

Moreover, we experimented with only nine features, the clinical predictive model attained high efficacy and performance in prognosticating COVID-19 patients’ mortality risk. It achieved plausible performance with minimal features, exceeding 0.90 for all the KPM, showing that our model is robust for unseen data. All new experimented outcomes are seen in Table 7. The model training and test accuracy, minimal error rates, and loss values also provided evidence of its stability and robustness, as shown in Figure 5. These outcomes designate the prognostic algorithms’ capacity to precisely depict the nonlinear and complex relationships among the parameters and the outcome and its ability to generalize well to unseen data. The DSNN methodology is a distinctive and flexible technique that can effectively handle diverse sparsity arrangements while upholding consistent accuracy. This model exhibits advantages over other extant algorithms regarding accuracy, efficiency, and flexibility. Hence, this research work entirely focused on these five key indicators for model evaluation. The remaining indicator was computed for use in making comparisons.⁶²

Evaluation of results against existing studies

In this subsection, Table 7 compares our study with existing scholarly work on ML and DL frameworks for the prediction of COVID-19 patients using biomarkers and demographic features. Furthermore, researchers utilize various attributes in their model for training and evaluation. However, since all the authors were considered laboratory markers and demographic features, all the authors adopted different approaches for significant feature selection. Table 7 shows that our DSNN algorithm with 24 features outperforms all the other methods in terms of most KPM. The second experiment with nine features also achieves commendable performance and efficiency in predicting COVID-19 mortality risk. Thus, the framework yields satisfactory outcomes even when utilizing a limited number of features.

The comparative results also demonstrate that our contribution has a larger sample size than most of the other studies, except for Pourhomayoun et al.,⁶³ which has a much larger sample size but lower performance, and it shows that our DSNN framework can handle large and complex datasets better than existing ones. The outcomes also accentuate the gaps in some of the previous research papers, such as missing scores for some KPM or low scores for some KPM. Thus, our study achieved higher performance measures than previous studies with fewer features, indicating its superiority and efficiency in predicting the risk of death in COVID-19 patients based on biomarkers and demographics.

Study limitations

We acknowledge the limitations of our studies. First, our data were acquired from Hospital Universitario HM Madrid, Spain; clinical data do not reflect global COVID-19 patients, which may confine the applicability of our findings to other populations and settings. Second, our studied data were limited to 24 attributes, which may not cover all the relevant factors influencing the mortality risk of COVID-19 patients. Third, our data were subjected to some missing values and outliers, which may impair the quality and validity of our results. Fourth, our framework was trained and tested on a relatively small sample size, which may diminish its statistical significance and accuracy.

Conclusions and future study

COVID-19 was the black swan of the 20th century, resulting in a disastrous global health crisis that endangered human civilization and well-being. This prognostic modeling study has identified the most effective feature selection technique for forecasting the outcome of COVID-19 patients. This paper presents a novel predictive model known as DSNN that can accurately predict the mortality risk of COVID-19 patients. Our experimental results reveal that the higher mortality risks also had MODS and other coexisting medical conditions such as age structure and existing diseases. Compared the DSNN model with the other proposed models, it was proved that the stacking ensemble with the DL model has effectively enhanced prediction accuracy. The expectation is that this clinical care model will become notably beneficial for the healthcare system in making sensible decisions and providing ample facilities for COVID-19 patients; such preventive measures maximize patient survival rates.

In the future, the targeted treatments will be designed; meanwhile, sufficient actual data are expected to be accessible for algorithm validation and a plan to deploy the quantum neural network model for efficient and accurate performance.

Footnotes

Authors’ note

Xu Songhua is also affiliated to Institute of Medical Artificial Intelligence the Second Affiliated Hospital XJTU, Shaanxi, China.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

Ethics approval

The anonymized data were obtained via special request to HM Hospitals (Spain) under their COVID DATA SAVE LIVES initiative, which was approved by their Ethics Committee. Our analysis adhered to HM Hospitals’ data use terms and protection standards.

Data availability statement

The dataset and related outcomes of this research work will be accessible from the corresponding author on a realistic request.

References

World Health Organization. Coronavirus disease (COVID-19): weekly epidemiological, update 1. Geneva: World Health Organization, 2020.

World Health Organization. Coronavirus disease 2019 (COVID-19): situation report, 72. Geneva: World Health Organization, 2020.

Stratton

Tang

YW.

The Wuhan SARS-CoV-2—What’s next for China. J Med Virol 2020; 92: 546–547.

Burki

The origin of SARS-CoV-2. Lancet Infect Dis 2020; 20: 1018–1019.

Shei

Wang

, et al. Pathological findings of COVID-19 associated with acute respiratory distress syndrome. Lancet Respirat Med 2020; 8: 420–422.

Marks

Gruppuso

Adashi

EY.

Urgent need for next-generation COVID-19 vaccines. JAMA 2023; 329: 19–20.

Shamout

Shen

, et al. An artificial intelligence system for predicting the deterioration of COVID-19 patients in the emergency department. npj Digital Med 2021; 4: 80.

Ortiz

Trivedi

Desbiens

, et al. Effective deep learning approaches for predicting COVID-19 outcomes from chest computed tomography volumes. Sci Rep 2022; 12: 1716.

Chen

Zhou

Dong

, et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet 2020; 395(10223): 507–513.

10.

Yang

, et al. Clinical course and outcomes of critically ill patients with SARS-CoV-2 pneumonia in Wuhan, China: a single-centered, retrospective, observational study. Lancet Respir Med 2020; 8: 475–481.

11.

Zhou

, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet 2020; 395: 1054–1062.

12.

Wang

, et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus–infected pneumonia in Wuhan, China. JAMA 2020; 323: 1061–1069.

13.

Spadon

Hong

Brandoli

, et al. Pay attention to evolution: time series forecasting with deep graph-evolution learning. IEEE Trans Pattern Analysis Mach Intell 2021; 44: 5368–5384.

14.

Lalmuanawma

Hussain

Chhakchhuak

Applications of machine learning and artificial intelligence for Covid-19 (SARS-CoV-2) pandemic: a review. Chaos, Solitons & Fractals 2020; 139: 110059.

15.

Darlenski

Tsankov

COVID-19 pandemic and the skin: what should dermatologists know?

Clinics Dermatol 2020; 38: 785–787.

16.

Chen

, et al. Clinical study of mesenchymal stem cell treatment for acute respiratory distress syndrome induced by epidemic influenza A (H7N9) infection: a hint for COVID-19 treatment. Engineering 2020; 6: 1153–1161.

17.

Holland

Zaloga

Friderici

CS.

COVID-19 personal protective equipment (PPE) for the emergency physician. Visual J Emerg Med 2020; 19: 100740.

18.

Burdick

Lam

Mataraso

, et al. Prediction of respiratory decompensation in Covid-19 patients using machine learning: the READY trial. Comput Biol Med 2020; 124: 103949.

19.

Malik

Patel

Mehta

, et al. Biomarkers and outcomes of COVID-19 hospitalisations: systematic review and meta-analysis. BMJ Evid Based Med 2021; 26: 107–108.

20.

Ramón

Torres

Milara

, et al. eXtreme gradient boosting-based method to classify patients with COVID-19. J Investigative Medicine 2022; 70: 1472–1480.

21.

Yadaw

Y-c

Bose

, et al. Clinical features of COVID-19 mortality: development and validation of a clinical prediction model. Lancet Digital Health 2020; 2: e516–e525

22.

Guadiana

JL-Alvarez

Hussain

Morales-Menendez

, et al. Prognosis patients with COVID-19 using deep learning. BMC Med Informat Decision Making 2022; 22: 1–18.

23.

Alakus

Turkoglu

Comparison of deep learning approaches to predict COVID-19 infection. Chaos Solitons Fractals 2020; 140: 110120.

24.

Moulaei

Shanbehzadeh

Mohammadi-Taghiabad

, et al. Comparing machine learning algorithms for predicting COVID-19 mortality. BMC Med Informat Decision Making 2022; 22(1): 1–12.

25.

Rajadurai

Gandhi

UD.

A stacked ensemble learning model for intrusion detection in wireless network. Neural Comput Appl 2022; 34: 15387–15395.

26.

Buyrukoğlu

Savaş

Stacked-based ensemble machine learning model for positioning footballer. Arabian J Sci Eng 2023; 48: 1371–1383.

27.

Tang

Wang

Deep stacking network for intrusion detection. Sensors 2022; 22: 25.

28.

Wolpert

DH.

Stacked generalization. Neural Networks 1992; 5: 241–259.

29.

Wang

Pedrycz

Zhu

, et al. Subspace learning for unsupervised feature selection via matrix factorization. Pattern Recognit 2015; 48: 10–19.

30.

Krzywinski

Altman

Significance, P values and t-tests. Nat Methods 2013; 10(11): 1041–1042.

31.

Huang

Zhu

Bai

, et al. Multidimensional analysis of risk factors for the severity and mortality of patients with COVID-19 and diabetes. Infect Dis Ther 2020; 9: 981–1002.

32.

Grifoni

Alonzi

Alter

, et al. Impact of aging on immunity in the context of COVID-19, HIV, and tuberculosis. Front Immunol 2023; 14: 2657.

33.

Gallo

Calvez

Savoia

Hypertension and COVID-19: current evidence and Perspectives. High Blood Pressure Cardiovasc Prevent 2022; 29: 115–123.

34.

Nogales

Garcia-Tejedor

Monge

, et al. A survey of deep learning models in medical therapeutic areas. Artif Intellig Med 2021; 112: 102020.

35.

Farahani

Firouzi

Chang

, et al. Constant, and K. Mankodiya. Towards fog-driven IoT eHealth: Promises and challenges of IoT in medicine and healthcare. Future Gener Comput Syst 2018; 78: 659–676.

36.

Lassau

Ammari

Chouzenoux

, et al. Integrating deep learning CT-scan model, biological and clinical variables to predict severity of COVID-19 patients. Nat Commun 2021; 12(1): 634.

37.

Liu

Zhi

The impact of big data and artificial intelligence on the future medical model. J Life Environ Sci (PeerJ) 2018; 39: 1–4.

38.

Schwalbe

Wahl

Artificial intelligence and the future of global health. Lancet 2020; 395(10236): 1579–1586.

39.

Dhar

Ranganathan

Machine learning capabilities in medical diagnosis applications: Computational results for hepatitis disease. Int J Biomed Eng Technol 2015; 17: 330–340.

40.

Mahmoudi

Heydari

, et al. Principal component analysis to study the relations between the spread rates of COVID-19 in high risks countries. Alexandria Eng J 2021; 60: 457–464.

41.

Bernard

Heutte

Adam

On the selection of decision trees in random forests. In: 2009 International joint conference on neural networks, 2009, pp. 302–307. New York: IEEE.

42.

Friedman

Hastie

Tibshirani

. The elements of statistical learning: data mining, inference, and prediction. 2nd ed. New York: Springer, 2009, pp. 601–617.

43.

Min

Lee

Yoon

Deep learning in bioinformatics. Brief Bioinf 2017; 18: 851–869.

44.

Silvey

SD.

The Lagrangian multiplier test. Annals Math Statistics 1959; 30: 389–407.

45.

Burges

Shaked

Renshaw

, et al. Learning to rank using gradient descent. In: Proceedings of the 22nd international conference on machine learning, 2005, pp. 89–96. New York, NY: Association for Computing Machinery.

46.

Ong

Haw

S-C

K-W

. Deep Learning based-recommendation system: an overview on models, datasets, evaluation metrics, and future trends. In: Proceedings of the 2019 2nd international conference on computational intelligence and intelligent systems, 2019, pp. 6–11.

47.

Rodríguez

Agapito

Rodríguez

MSA

, et al. COVID-19: critical appraisal of the evidence. Anales de Pediatría (English Edition) 2021; 95: 207.

48.

Biomarkers Definitions Working Group. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Ther 2001; 69: 89–95.

49.

Spadaro

Fogagnolo

Campo

, et al. Markers of endothelial and epithelial pulmonary injury in mechanically ventilated COVID-19 ICU patients. Crit Care 2021; 25: 1–9.

50.

Qin

Cheng

Zhou

, et al. Redefining cardiac biomarkers in predicting mortality of inpatients with COVID-19. Hypertension 2020; 76: 1104–1112.

51.

Zhou

Zhang

Dobrinina

Urinalysis, but not blood biochemistry, detects the early renal impairment in patients with COVID-19. Diagnostics 2022; 12: 602.

52.

Dufort

Koumans

Chow

, et al. Multisystem inflammatory syndrome in children in New York State. N Engl J Med 2020; 383: 347–358.

53.

Zhao

Xia

Tao

, et al. Global research status of multiple organ dysfunction syndrome during 2001–2021: a 20-year bibliometric analysis. Front Med 2022; 9: 814381.

54.

Bouayed

Sorry, I am sneezing and coughing but I do not have COVID-19. Brain Behav Immun 2022; 101: 57–58.

55.

Denys

Guiu

Chevallier

, et al. Interventional oncology at the time of COVID-19 pandemic: problems and solutions. Diagnost Intervention Imaging 2020; 101: 347–353.

56.

Ohman

Annest

Azizzadeh

, et al. Evaluation and treatment of thoracic outlet syndrome during the global pandemic due to SARS-CoV-2 and COVID-19. J Vasc Surg 2020; 72: 790–798.

57.

Rymkiewicz

Heng

Vasudev

, et al. The immune system in the aging human. Immunol Res 2012; 53: 235–250.

58.

Wynants

Van Calster

Collins

, et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. BMJ 2020; 369.

59.

Bengio

Goodfellow

Courville

Deep learning. Cambridge, MA: MIT Press, 2017.

60.

Mandrekar

JN.

Receiver operating characteristic curve in diagnostic test assessment. J Thorac Oncol 2010; 5: 1315–1316.

61.

Faber

Rajko

How to avoid over-fitting in multivariate calibration—the conventional validation approach and an alternative. Analytica Chimica Acta 2007; 595: 98–106.

62.

de Moraes Batista

Miraglia

Donato

THR

, et al. COVID-19 diagnosis prediction in emergency care patients: a machine learning approach. MedRxiv, 2020.

63.

Pourhomayoun

Shakibi

Predicting mortality risk in patients with COVID-19 using machine learning to help medical decision-making. Smart Health 2021; 20: 100178.

Clinical predictions of COVID-19 patients using deep stacking neural networks

Abstract

Keywords

Introduction

Literature review

Materials and methods

Clinical dataset description

Data cleaning and feature selection technique

Experimental setup

Machine learning and DL application models

PCA with SVM

Random forest

Deep learning

Deep stacking neural network

Key performance measures

Results

Symptom-based characterization of COVID-19 patients

Biomarkers-based classification criteria for COVID-19 patients

COVID-19 association with multi-organ dysfunction syndrome

COVID-19 with other comorbidities

COVID-19 with age structure

Key results from the proposed model

Discussion

Evaluation of results against existing studies

Study limitations

Conclusions and future study

Footnotes

Authors’ note

Declaration of conflicting interests

Funding

Ethics approval

Data availability statement

References