Abstract
Background
Recently, studies on COVID-19 have focused on the epidemiology of the disease and clinical characteristics of patients, as well as on the risk factors associated with mortality during hospitalization in critical COVID-19 cases. However, few research has been performed on the prediction of disease progression in particular group of patients in the early stages of COVID-19.
Methods
The study included 338 patients with COVID-19 treated at two hospitals in Wuhan, China, from December 2019 to March 2020. Predictors of the progression of COVID-19 from mild to severe stages were selected by the logistic regression analysis.
Results
COVID-19 progression to severe and critical stages was confirmed in 78 (23.1%) patients. The average value of the neutrophil-to-lymphocyte ratio (NLR) was higher in patients in the disease progression group than in the improvement group. Multivariable logistic regression analysis revealed that elevated NLR, LDH and IL-10 were independent predictors of disease progression. The optimal cut-off value of NLR was 3.75. The values of the area under the curve, reflecting the accuracy of predicting COVID-19 progression by NLR was 0.739 (95%CI: 0.605–0.804). The risk model based on NLR, LDH and IL-10 had the highest area under the ROC curve.
Conclusions
The performed analysis demonstrates that high concentrations of NLR, LDH and IL-10 were independent risk factors for predicting disease progression in patients at the early stage of COVID-19. The risk model combined with NLR, LDH and IL-10 improved the accuracy of the prediction of disease progression in patients in the early stages of COVID-19.
Keywords
Introduction
The novel coronavirus, named by the World Health Organization (WHO) as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), was first identified in the city of Wuhan, China, in December 2019. 1 Given the rapid outbreak of the coronavirus disease 2019 (COVID-19) caused by SARS-CoV-2 in China and its rapid spread of COVID-19 pandemic worldwide, global concerns have emerged. 2 The extent and burden of COVID-19 are particularly evident in countries lacking adequate medical resources. As of 24 April 2020, 84,312 patients have been diagnosed with COVID-19 in China, and 4642 (5.5%) of them died.
The analysis of 44,672 confirmed cases of COVID-19 in China revealed that the majority of SARS-CoV-2 infections were mild or general type, while 889 (1.2%) were asymptomatic, 13.8% were severe and 4.7% were critical cases. 3 The mortality rate in critical COVID-19 cases is approximately 49%. 3 Severe patients usually develop dyspnoea or hypoxaemia one week after the onset of the disease. Critical COVID-19 patients develop rapidly acute respiratory distress syndrome, septic shock, refractory metabolic acidosis, coagulation disorders and multi-organ failure. 4 In the early days of COVID-19 spread in Wuhan, patients with severe or critical COVID-19 were admitted to a hospital, while patients with mild or general COVID-19 were advised to be isolated at home. With increased awareness of the spread and progression of COVID-19, efforts have been made throughout China to admit and treat all confirmed or suspected COVID-19 patients. It has been reported that patients in Guangzhou, China, had a better prognosis than in Wuhan. Patients in Wuhan were more likely to be admitted to an ICU and had higher mortality. 5 This difference highlights the importance of appropriate care and treatment in early mild cases, especially in patients at high risk of disease progression. Early detection of risk factors for progression to severe illness can help to provide adequate supportive care and treatment, decreasing the number of patients that develop the severe condition, reducing mortality and alleviating the shortages of medical resources.
The neutrophil-to-lymphocyte ratio (NLR) and systemic immune-inflammation index (SII), both easily calculated from a routine blood test, are indicators of inflammation and immune response.2,6 Elevated NLR represents a risk factor for mortality not only from infectious diseases, but also from malignancies, intracerebral haemorrhage and dermatomyositis.7–9 Similarly, increased SII has a prognostic value in a variety of malignancies, including hepatocellular carcinoma, breast cancer and oesophageal squamous cell carcinoma.10–12 It has been documented that severe and critical COVID-19 cases tend to have higher neutrophil counts and lower lymphocyte counts.1,13 Whether NLR or SII could be an independent predictor of COVID-19 progression in the early (mild or moderate) stages of the disease remains to be determined.
Thus far, studies on COVID-19 have focused on the epidemiology of the disease and clinical characteristics of patients,14–19 as well as on the risk factors associated with mortality during hospitalization in critical COVID-19 cases. However, no research has been performed on the prediction of progression in patients in the early stages of the disease. To address this lack of data, the present research was designed to determine whether baseline values of NLR and SII in patients with mild or moderate COVID-19 can be used as reliable predictors of the progression of the disease in its early stages.
Methods
Study design and participants
The present investigation was designed as a retrospective study involving a total of 476 COVID-19 patients admitted consecutively to the Union Hospital of Huazhong University of Science and Technology and the Wuhan Asia General Hospital, from December 2019 to March 2020. The clinical outcomes, discharge from hospital or death in hospital were recorded up to 31 March 2020. Both hospitals are located in Wuhan, Hubei Province, and were designated by the Chinese government as hospitals responsible for the treatment of COVID-19 patients. The disease was diagnosed according to the WHO guidelines.
Exclusion criteria were as follows: (1) patients that were pregnant or under 18 years old; (2) patients with severe or critical COVID-19 at admission; (3) patients receiving nonstandard treatment before the admission (Figure 1). Based on the exclusion criteria, 338 COVID-19 patients were included in the final analyses.

Flow diagram of patient disposition: A total of 338 COVID-19 patients were enrolled in this study.
All patients admitted to the hospital with mild or moderate COVID-19 were divided into the improvement group and the progression group. The improvement group included patients who recovered after admission and were discharged from the hospital. The progression group included patients who after the admission progressed to severe or critical condition, or died in the hospital, regardless whether the final outcome was discharged or death,
The protocol of the study was approved by the Research Ethics Committee of the Tongji Medical College. The data used in the study were anonymized, and the requirement for informed consent was waived by the Committee.
Data collection
Complete clinical data for all COVID-19 patients were collected from the medical records of the patients. They included demographic, clinical, laboratory, imaging, treatment and outcome information. The laboratory data and CT scan of the lung were collected corresponded to the results of the first test performed upon admission. Prior to the analysis, the patient information was de-identified and anonymized. The NLR was calculated from the results of neutrophil and lymphocyte counts. SII was calculated according to the results of a routine blood test, according to the formula: SII=N × P/L, where N, P and L represents the count of neutrophils, platelets and lymphocytes, respectively. 6
Laboratory examination
Laboratory confirmation of the infection of SARS-CoV-2 was conducted by local CDC in accordance with the Chinese CDC protocol. Pharyngeal swab samples were collected from all patients and the samples were stored in a viral-transport medium for laboratory testing. Real-time RT-PCR was performed to exclude infection with other respiratory viruses including influenza A virus, coxsackie virus, influenza B virus, respiratory syncytial virus, enterovirus and parainfluenza virus. Automatic biochemical analyser, AU5800 (Beckman Coulter, USA), was used for measuring the concentrations of ALT, AST, albumin and creatinine. Liquid Assayed Multiqual was performed in QC procedures. Automated blood analyser, XE-2100 (Sysmex, Japan), was used for measuring the count of white blood cells, neutrophils, lymphocytes and platelets. The lymphocyte test kit (Beckman Coulter Inc., FL, USA) was used for lymphocyte subset analysis. Plasma cytokines (IL10, IL6, IL4, IL2, IFN – γ and TNF – α) were detected using the human Th1/2 cytokine kit II (BD Ltd, Franklin lakes, NJ, USA). All laboratory tests were conducted in accordance with the product manual.
Study definitions
The classification of the severity of COVID-19 was based on the “Diagnosis and Treatment Protocol for COVID-19 (Trial Version 7)” (4). The clinical classifications of illness severity of COVID-19 were as follows: (1) mild type, with mild clinical symptoms and the absence of signs of pneumonia on imaging; (2) moderate type, with fever, respiratory tract symptoms and signs of pneumonia were identified on imaging; (3) severe type, characterized by one of the following: (a) respiratory distress, respiratory rate ≥ 30 breaths/min; (b) mean oxygen saturation ≤ 93% in the resting state; (c) oxygenation index ≤ 300 mmHg; and (4) critical type, characterized by one of the following: (a) shock; (b) respiratory failure requiring mechanical ventilation; (c) organ failure requiring ICU admission. Fever was defined as an axillary temperature of at least 37.3°C. 20 The duration of viral shedding was defined as the time from the onset of illness to the second negative nucleic acid test. 4
Statistical analysis
Summary statistics of the demographic data, clinical characteristics, laboratory results and radiographic findings were expressed as median and the interquartile range (IQR) for continuous variables and as frequencies and proportions for categorical variables. If the variance in the improvement group and the progression group was the same, continuous variables were compared by the Student’s t-test; otherwise, the Welch’s t-test was used. Categorical variables were compared by the χ2 test or Fisher's exact test.
The relationship between the NLR treated as a continuous variable, and the progression of COVID-19 was examined first; subsequently, the relationship was evaluated considering NLR as a categorical variable according to the best threshold value. These relationships were examined using univariate and multivariate logistic regression analyses, and odds ratio (OR) and 95% confidence interval (CI) were calculated. After univariate logistic regression analysis using the backward stepwise procedure (LR), only the variables with a P-value of less than 0.1 were considered for multivariate analysis to identify predictors of progression of COVID-19. To avoid overfitting in multivariate analysis, six of those variables were selected on the basis of clinical constraints and previous studies. Previous investigations have documented that older age was associated with poor prognosis in COVID-19 patients. 20 Recent studies indicated that patients with severe COVID-19 had higher C-reactive protein (CRP) and lactate dehydrogenase (LDH) concentrations than patients with non-severe COVID-19. 13 Moreover, in comparison with patients with mild COVID-19, the concentrations of IL-10 and IFNγ in patients with severe COVID-19 were significantly increased in the early stage of the disease, and most detected cytokines peaked in the serum three to six days after the onset.21,22 Therefore, NLR, CRP, LDH, interleukin-10(IL-10) and interferon γ(IFNγ) were selected as the six variables for the multivariable logistic regression analysis.
To further validate the association between elevated NLR and disease progression, propensity score matching (PSM) was used to eliminate confounding bias. 23 All COVID-19 patients were divided into two groups, the high NLR (≥3.75) group and the low NLR (<3.75) group, based on the best threshold value of NLR predicting disease progression. The propensity score was calculated using the logistic regression model in which age, gender, hypertension, coronary heart disease, diabetes, chronic obstructive pulmonary disease, fever and cough were considered. The matching was performed using a 1:1 ratio. By this approach, patients in the high NLR group were matched with patients in the low NLR group having the closest propensity score. Patients selected by PSM were enrolled in a new cohort and subjected to further analysis of the association between NLR elevation and disease progression. The best threshold value of NLR was calculated according to the Youden index that was the sum of sensitivity and specificity minus 1. The two-sided P-value of less than 0.05 was considered to indicate a statistically significant difference. SPSS 24.0 software was used to perform PSM and for statistical analysis.
Results
Patient characteristics
A total of 338 COVID-19 patients, in which the diagnosis was confirmed by the detection of SARS-CoV-2 RNA, were enrolled (Figure 1). Their demographic, clinical, laboratory, radiographic and outcomes data are listed in Table 1. Six patients died during hospitalization, and 332 were discharged. The median age of the patients was 57.5 years (IQR 35.0–68.0), and the majority were female (56.8%). Comorbidities were present in 154 (45.6%) patients, with hypertension being the most common comorbidity, followed by diabetes and coronary heart disease. The most frequent symptoms on admission were fever, followed by cough, fatigue, chest distress and myalgia. In routine blood tests, the median values of neutrophils, lymphocytes, platelet, SII and NLR were 3.05 (×109/L), 1.30 (×109/L), 205.50 (×109/L), 503.43 (×109/L), 2.31, respectively.
Baseline characteristics of COVID-19 patients (n = 338).
NLR: neutrophil-to-lymphocyte ratio; SII: systemic immune-inflammation index; ALT: alanine aminotransferase; AST: aspartate aminotransferase; BUN: blood urea nitrogen; LDH: lactate dehydrogenase; CRP: C-reactive protein; ESR: erythrocyte sedimentation rate; IL: interleukin; TNFα: tumor necrosis factor α; IFNγ: interferon γ.
Note: Data are median (IQR), n (%), or n/N (%). P values were calculated by t-test, Welch’s t-test, χ2 test, or Fisher’s exact test, as appropriate.
Relationship between clinical characteristics and COVID-19 progression
Seventy-eight (23.1%) COVID-19 patients developed disease progression after the admission, and six (7.7%) of them died while hospitalized (Table 1). Table 1 also indicates that patients in the progression group were 6.5 years older than in the improvement group (62.5 vs. 56.0 years, P = 0.024). There was no significant difference in the ratio of males to females between the two groups (P = 0.548). Compared with patients in the improvement group, patients in the progression group had a higher rate of developing a fever, chest distress, myalgia, nausea or vomiting and palpitation (all P<0.05). They also had a higher count of neutrophils (3.34 vs. 2.92 × 109/L, P = 0.014), lower count of lymphocytes (0.93 vs. 1.40 × 109/L, P<0.001) and higher NLR (3.90 vs. 2.08, P<0.001) and SII (807.86 vs. 453.16 × 109/L, P<0.001) values. In addition, they had higher concentrations of creatine kinase, LDH, CRP, ESR, IL-10 and IFNγ (all P<0.05) on admission. The incidence of lung consolidation detected by imaging was higher in the progression group (74.4% vs. 43.8%, P<0.001).
Risk factors associated with COVID-19 progression
To identify the predictors of COVID-19 progression, multiple clinical parameters were evaluated by logistic regression analysis (Table 2). In univariable analysis, the odds of disease progression were higher in patients with fever or cough (Table 2). Age, elevated neutrophil count, reduced lymphocyte count, elevated NLR, SII, creatine kinase, LDH, CRP, ESR, IL-10 and IFNγ were also associated with disease progression. To avoid overfitting in multivariate analysis, age, NLR, CRP, LDH, IL-10 and IFNγ were selected for multivariate regression analysis, as justified in the Methods section. Multivariable logistic regression analysis documented that increased NLR, LDH and IL-10 were independent predictors of disease progression in COVID-19 patients with ORs of 1.183 (95% CI: 1.045–1.339), 1.007 (95%CI: 1.003–1.010) and 1.344 (95%CI: 1.080–1.672), respectively.
Risk factors associated with COVID-19 progression.
OR: odds ratio. NLR: neutrophil-to-lymphocyte ratio; SII: systemic immune-inflammation index; ALT: alanine aminotransferase; AST: aspartate aminotransferase; BUN: blood urea nitrogen; LDH: lactate dehydrogenase; CRP: C-reactive protein; ESR: erythrocyte sedimentation rate; IL: interleukin; TNFα: tumor necrosis factor α; IFNγ: interferon γ.
A risk model was established based on NLR, LDH and IL-10, and receiver operating characteristic (ROC) curves were performed to assess the value of risk model and other single parameters (Figure 2). The values of the area under the ROC curve (AUROC), reflecting the accuracy of predicting COVID-19 progression, were 0.739 (95%CI: 0.605–0.804) for NLR, superior to other single parameters (Table S1). Moreover, the risk model based on NLR, LDH and IL-10 had the highest AUROC (Table S1).

ROC curve: Predicting COVID-19 progression with risk model and other single parameters.
The ROC curve analysis indicated that the optimal cut-off values for predicting the progression of COVID-19 were 3.75 for NLR, 213.5 for LDH, 4.15 for IL-10, respectively. Table 3 presents sensitivity, specificity, positive predictive values (PPV) and negative predictive values (NPV) (95%CI) of risk model and other single parameters. The Yoden index of risk model was superior to other single parameters (Table 3).
Sensitivity, specificity and negative and positive predictive values of a NLR of 3.75 and LDH of 213.5 U/L in the prediction of COVID-19 progression.
NLR: neutrophil-to-lymphocyte ratio; LDH: lactate dehydrogenase; IL: interleukin; NPV: negative predictive value; PPV: positive predictive value.
Risk factors associated with COVID-19 progression analysed after propensity score matching
To further validate the association between elevated NLR and disease progression, PSM was used to reduce confounding bias. The patients were divided into two groups based on the best threshold value of NLR predicting disease progression: the high NLR (≥3.75) group and the low NLR (<3.75) group. The high NLR group had a higher incidence of COVID-19 progression than the low NLR group (P<0.001). Seventy-four propensity score-matched pairs of COVID-19 patients were selected and enrolled in a new cohort. However, a statistically significant difference in COVID-19 progression continued to be present between the high and low NLR groups (P<0.001) (Table 4). After univariate analysis, the variables with P < 0.1, including age, NLR, LDH, CRP, IL-10 and IFNγ, were selected for multivariate regression analysis using the backward stepwise procedure (LR). This analysis confirmed that NLR ≥3.75 was still an independent predictor of disease progression in COVID-19 patients (Table S2).
Comparison of clinical parameters between the low and high NLR groups.
Note: Data are median (IQR) or n (%). P values were calculated by t-test, Welch’s t-test, χ2 test, or Fisher’s exact test, as appropriate.
NLR: neutrophil-to-lymphocyte ratio.
Discussion
The present study has demonstrated that NLR, measured at the early stage of the disease, is a significant independent predictor for the progression of COVID-19 to severe and critical stages. Moreover, the finding that NLR ≥3.75 is an independent predictor of disease progression was also confirmed in the propensity score-matched cohort. The PPV of NLR is actually quite poor. Moreover, the risk model combined with NLR, LDH and IL-10 enlarged the accuracy of the prediction of COVID-19 progression, with AUROC of 0.77 (95% CI: 0.71–0.83). The relevance of early adequate treatment to prevent mild or moderate cases from developing into severe ones is well-recognized since the treatment of critical COVID-19 patients requires not only significant medical resources but, most importantly, results in a high mortality rate. 20 When thinking about COVID, there is a spectrum of asymptomatic to very severe illness and the precise point on the journey is very time dependent. So getting a cohort who are in a similar clinical state on the journey to more severe illness is difficult, especially when the criteria for hospitalization depends on signs and symptoms. So the rate of disease progression will also affect this. One person might have a slow disease progression and present late with mild symptoms, another might have rapid disease progression and present earlier with mild or moderate symptoms.
Although previous studies reported several models for predicting disease progression to a severe or critical state,24–34 both mild and severe COVID-19 patients were included in these papers dealing with COVID-19 prediction of disease progression to very severe illness.24–34 It seems unreasonable to apply a risk model based on the analysis of the parameters of severe COVID-19 patients to mild COVID-19 patients. Consequently, patients admitted to the hospital with mild or moderate COVID-19 were included in this study, and patients with severe or critical COVID-19 at admission were excluded. All patients admitted to the hospital with mild or moderate COVID-19 were divided into the improvement group and the progression group. The laboratory data and CT scan of the lung were collected corresponded to the results of the first test performed upon admission. This current analysis provides a new evidence-based risk model for early identification of mild or moderate COVID-19 patients at high risk of COVID-19 progression.
Previous studies have documented that older age is a significant independent predictor of mortality in MERS and SARS.35,36 The current investigation has found that older age is also associated with increased odds of in-hospital death in COVID-19 patients. 37 However, age is not an independent predictor of the progression of COVID-19 to severe or critical stages. COVID-19 is commonly susceptible in the population, and there is a risk of progression to severe disease. 4 A possible explanation of this apparent discrepancy might be that older patients with severe or critical COVID-19 have a higher risk of death. Since patients with severe or critical COVID-19 on admission were not included in the study, the fatality rate among the analysed population was lower than that documented in previous reports. 37
In agreement with previous research, 4 the present analysis has shown that an increased concentration of LDH at the early stage of COVID-19 was also a significant independent predictor for disease progression. Moreover, COVID-19 patients in the progression group had higher concentrations of IL-10 and IL-6 at the early stage of the disease, although differences in IL-6 between the groups were not statistically significant. However, IL-10 at the early stage was also a significant independent predictor for COVID-19 progression. The identification of elevated concentrations of inflammatory factors in COVID-19 patients is consistent with previous studies. 38 The current work documented that, except for IL-6, the serum concentration of cytokines peaked at three to six days after the onset of the disease in critically ill COVID-19 patients; these cytokines included IL-10, IL-2, IL-4, TNFα and IFNγ. 38 In addition, concentrations of IL-10 and IL-6 continued to increase in patients with severe COVID-19, and with the concentrations of IL-6 beginning to decrease after 16 days. 38
Some limitations of the present work should be acknowledged. First, this investigation was designed as a retrospective study, and this study was based on just 338 patients and relatively few cases of progressing COVID-19 were included in the analyses. Thus, large-scale multinational prospective cohort studies are necessary to confirm the results and strengthen the conclusions reached. Second, the proper cut-off value of single parameters is still to be verified by large-scale multinational prospective cohort studies. Then, mild or general cases were treated before they progressed to severe cases. As stated earlier, patients with mild COVID-19 are at risk of progressing to the severe stage, and once they become severe, they have a high mortality rate. Therefore, it would be unreasonable not to treat the patients already admitted to the hospital. Although they were treated at different sites, the two designated hospitals followed the same guidelines for treating mild or moderate COVID-19 specified in the Diagnosis and Treatment Protocol for COVID-19 issued by the National Health Commission of China. Thus, the use of the data from two different centres did not affect the results of the study, since similar treatment strategies were employed.
In conclusion, the performed analysis demonstrates that high concentrations of NLR, LDH and IL-10 were independent risk factors for predicting disease progression in COVID-19 patients at the early stage of the disease. The values of the AUROC of NLR were superior to other single parameters. The risk model combined with NLR, LDH and IL-10 improved the accuracy of the prediction of disease progression in patients in the early stages of COVID-19. Assessment of predictors might facilitate early identification of COVID-19 patients at high risk for disease progression and ensure timely administration of appropriate treatment to prevent mild cases from becoming severe.
Supplemental Material
sj-pdf-1-acb-10.1177_00045632211011194 - Supplemental material for A composite risk model predicts disease progression in early stages of COVID-19: A propensity score-matched cohort study
Supplemental material, sj-pdf-1-acb-10.1177_00045632211011194 for A composite risk model predicts disease progression in early stages of COVID-19: A propensity score-matched cohort study by Jianjun Xu, Yang Gao, Shaobo Hu, Suzhen Li, Weimin Wang, Yuzhe Wu, Zhe Su, Xing Zhou, Xiang Cheng and Qichang Zheng in Annals of Clinical Biochemistry
Supplemental Material
sj-pdf-2-acb-10.1177_00045632211011194 - Supplemental material for A composite risk model predicts disease progression in early stages of COVID-19: A propensity score-matched cohort study
Supplemental material, sj-pdf-2-acb-10.1177_00045632211011194 for A composite risk model predicts disease progression in early stages of COVID-19: A propensity score-matched cohort study by Jianjun Xu, Yang Gao, Shaobo Hu, Suzhen Li, Weimin Wang, Yuzhe Wu, Zhe Su, Xing Zhou, Xiang Cheng and Qichang Zheng in Annals of Clinical Biochemistry
Footnotes
Acknowledgements
This work was supported by the Natural Science Foundation of Hubei Province (No. 2019CFB729), the National Natural Science Foundation of China (No. 81903173, No. 81874231).
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Natural Science Foundation of Hubei Province (No. 2019CFB729), the National Natural Science Foundation of China (No.81903173, No. 81874231).
Ethical approval
This retrospective study was approved by the Tongji Medical College research ethics committee, Wuhan, China (NO: IORG0003571).
Guarantor
Qichang Zheng.
Contributorship
Jianjun Xu, Yang Gao, Shaobo Hu and Qichang Zheng researched literature and conceived the study. Suzhen Li, Weimin Wang, Yuzhe Wu, Zhe Su, Xing Zhou and Xiang Cheng were involved in protocol development, gaining ethical approval, patient recruitment and data analysis. Jianjun Xu, Shaobo Hu and Yang Gao wrote the first draft of the manuscript. All authors reviewed and edited the manuscript and approved the final version of the manuscript.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
