Abstract
Objective
To identify factors associated with recovery time from coronavirus disease 2019 (COVID-19).
Methods
In this retrospective study, data for patients with COVID-19 were obtained between 21 June and 30 August 2020. An accelerated failure time (AFT) model was used to identify covariates associated with recovery time (days from hospital admission to discharge). AFT models with different distributions (exponential, log-normal, Weibull, generalized gamma, and log-logistic) were generated. Akaike’s information criterion (AIC) was used to identify the most suitable model.
Results
A total of 730 patients with COVID-19 were included (92.5% recovered and 7.5% censored). Based on its low AIC value, the log-logistic AFT model was the best fit for the data. The covariates affecting length of hospital stay were oxygen saturation, lactate dehydrogenase, neutrophil-lymphocyte ratio, D-dimer, ferritin, creatinine, total leucocyte count, age > 80 years, and coronary artery disease.
Conclusions
The log-logistic AFT model accurately described the recovery time of patients with COVID-19.
Keywords
Introduction
The World Health Organization (WHO) has described the 2000s as the “century of non-communicable diseases”. Globally, the incidence of non-communicable diseases such as type 2 diabetes mellitus, hypertension, and dyslipidemia is increasing, resulting in increased morbidity and mortality. At the end of 2019, a virulent communicable disease called coronavirus disease 2019 (COVID-19) began to spread and subsequently developed into a global pandemic. COVID-19, caused by severe acute respiratory syndrome coronavirus-2, has affected every part of the world and is associated with high morbidity and a large economic burden. Managing COVID-19 is a major challenge for healthcare workers. As a communicable disease, COVID-19 affects all ages and sexes. Symptoms vary from mild and moderate to severe or fatal in some patients. Mild symptoms include pyrexia, sore throat, and dry cough. Some patients also report loss of taste and smell, severe headache, myalgia, joint pain, and gastrointestinal symptoms. Fewer patients exhibit severe symptoms such as chest pain and shortness of breath. The WHO has reported that approximately 85% of patients exhibit mild to moderate symptoms and recover well. By contrast, 10% to 15% of patients exhibit severe symptoms, and 5% require intensive care unit support with high mortality rates. A small proportion of individuals who contract COVID-19 remain asymptomatic and can spread the infection to others. Globally as of April 2021, 134 million cases of COVID-19 had been reported; approximately 107 million individuals recovered and approximately 2 million died. In India, 12.9 million cases of COVID-19 were reported; 11.8 million individuals recovered and 166,892 deaths were reported by the Union Ministry of Health and Family Welfare. 1 Recovery rate and recovery time among patients with COVID-19 depend upon various factors. The WHO has stated that the mean recovery time for patients with mild to moderate symptoms is around 14 days. By contrast, recovery requires around 3 to 6 weeks in individuals with severe COVID-19. A meta-analysis showed that the mean recovery time from symptom onset was 22.2 days (range: 18–83 days). 2
A study of 538 Chinese patients with COVID-19 (January to March 2020) found that the mean duration of hospital stay was 19 days (interquartile range 14–23 days). Recovery time depended on age, COVID-19 severity, and area of residence. 3 Another study of Belgian patients with COVID-19 reported hospitalization durations of 3 to 10.4 days after accounting for factors such as age, sex, and nursing home admission. 4 A study of 221 patients with COVID-19 in India reported an average recovery time of 25 days (95% confidence interval 16–34 days); this estimate accounted for only sex and age. 5
Accurate identification of the recovery time and length of hospital stay of patients with COVID-19 is an urgent priority for healthcare delivery. Recovery time also depends upon the underlying health status of the individual including any comorbid conditions such as diabetes, hypertension, cardiopulmonary disease, and renal failure. In this study we also examined laboratory parameters such as creatinine, lactate dehydrogenase (LDH), ferritin, hemoglobin (Hb), total leukocyte count (TC), polymorphonuclear leukocytes, lymphocytes, and D-dimer. These factors were not assessed in previous studies.
Methodology
This was a retrospective cohort study. The study was approved by the institutional ethics committee of Sri Ramachandra Institute of Higher Education and Research (EC-NI/20/AUG/75/51). Because this was a retrospective study of medical records, the requirement for informed consent was waived. The study was registered in the Clinical Trials Registry – India. Patients with COVID-19 admitted to the hospital of Sri Ramachandra Institute of Higher Education and Research were included. Patients admitted for other reasons and diagnosed with COVID-19 as part of routine evaluation were not included. Patient data were collected from medical records including demographic characteristics, past and current medical history (e.g., metabolic and other co-morbid diseases), clinical symptoms of COVID-19 (e.g., fever, sore throat, cough, myalgia, loss of taste and smell, and headache), and laboratory parameters such as oxygen saturation (SpO2), creatinine, LDH, ferritin, Hb, TC, neutrophil-lymphocyte ratio (NLR), and D-dimer.
The semiparametric Cox model is the most common statistical tool used for time to event analysis as it clearly defines the effects of covariates on the outcome variable. The parametric accelerated failure time (AFT) model is a better choice only if the distribution of survival time is well defined. In the AFT model, interpretation is simple because it is directly dependent on survival time; in this model the effects of covariates are directly proportional to survival time. In the current study, we examined the recovery time from COVID-19 and factors influencing it among patients in a tertiary care hospital in Chennai using an AFT model. Recovery time (length of hospital stay) was the dependent variable. Patients who died or were discharged against medical advice were censored.
AFT model
In survival analysis, various statistical procedures can be used to identify factors associated with response variables over time. Parametric models are classified into proportional hazards models and AFT models. AFT models use log-logistic, log-normal, and generalized gamma distributions whereas proportional hazards models use the Gompertz distribution. Weibull and exponential distributions are used in both proportional hazards and AFT models.
The AFT model is formulated as:
The time ratio (TR) is calculated by taking the exponent of the coefficient (exp(bi)). A positive coefficient (bi) or TR >1 indicates that the effect of the covariate prolongs recovery time, whereas a negative coefficient or TR < 1 indicates a shorter recovery time compared with the reference category.
To select the most suitable model, Akaike’s information criterion (AIC) was calculated as follows:
Statistical analysis
Data were entered in MS Excel (Microsoft, Redmond, WA, USA) and analyzed using STATA software version 14.0 (StataCorp, College Station, TX, USA). Univariate analysis was performed for all variables separately. Covariates showing statistical significance in the univariate analysis were included in the multivariate AFT model. The AIC was used to select the best-fitting AFT model (exponential, log-normal, generalized gamma, log-logistic, or Weibull distribution). Values of p < 0.05 were considered statistically significant.
Results
This was a retrospective cohort study comprising 730 patients with COVID-19 (456 men and 274 women). Among the participants, 675 (92.5%) recovered, 30 (4.1%) were discharged against medical advice, and 25 (3.4%) died. Thus, 55 (7.5%) patients were censored. The median age was 48 years (interquartile range 35–61 years). Almost all patients (97.4%) had COVID-19 symptoms. The most common symptom at the time of hospital admission was fever (86.9%) followed by sore throat (50.5%) and cough (38.8%). The least common symptom at the time of hospital admission was vomiting (1.4%) and loose stool (2.5%). The median recovery time of hospitalized patients was 7 days. The mean duration from symptom onset to hospitalization was 4.6 days (standard deviation 2.3 days).
In AFT models using exponential, Weibull, generalized gamma, log-normal, and log-logistic distributions, 2, 5, 8, 8, and 9 parameters were significantly associated with time to recovery, respectively. Irrespective of model, SpO2 and LDH were significantly associated with recovery time. Other than in the model using an exponential distribution, NLR was significantly associated with recovery time for all other distributions. As shown in Table 1, the most suitable distribution was log-logistic as shown by its low AIC value (AIC = 675.398). Using the AFT model with log-logistic distribution, covariates significantly associated with recovery time were SpO2, LDH, NLR, creatinine, D-dimer, ferritin, TC, age >80 years, and coronary artery disease. Clinical symptoms, sex, and other comorbid conditions including diabetes, hypertension, and chronic kidney disease were not significantly associated with recovery time. The AFT model with log-logistic distribution showed that age >80 years vs. ≤60 years (TR = 1.461, p = 0.017), SpO2 <95% vs. ≥95% (TR = 1.481, p < 0.001), ferritin <25 ng/mL or >200 ng/mL vs. 25–200 ng/mL (TR = 1.129, p = 0.017), TC <4000 cells/µL or >11,000 cells/µL vs. 4000–10,000 cells/µL (TR = 1.111, p = 0.007), NLR >3 vs. ≤3 (TR = 1.098, p = 0.01), D-dimer >0.55 µg/mL vs. ≤0.55 µg/mL (TR = 1.094, p = 0.01), LDH >300 international units (IU)/L vs. ≤300 IU/L (TR = 1.181, p < 0.001), creatinine >1.4 mg/dL vs. ≤1.4 mg/dL (TR = 1.226, p = 0.038), and coronary artery disease (TR = 1.268, p = 0.004) were associated with prolonged length of hospitalization. The goodness of fit of the AFT model was tested. The AFT model with log-logistic distribution had a R2 value of 0.47, indicating that 47% of variation in recovery time was explained by the model.
Parameters of multivariate AFT model of time to recovery from COVID-19.
AFT, accelerated failure time; COVID-19, coronavirus disease 2019; SE, standard error; ref, reference; NLR, neutrophil to lymphocyte ratio; SpO2, oxygen saturation; LDH, lactate dehydrogenase; IU, international units; TC, total leukocyte count; DM, diabetes mellitus; HT, hypertension; CKD, chronic kidney disease; CAD, coronary artery disease; AIC, Akaike’s information criterion.
Discussion
The current study aimed to identify factors associated with length of hospital stay among patients with COVID-19. The most commonly used statistical tool for analyzing time to event data is the Cox proportional hazards model. 7 An AFT model is a parametric survival model used to assess the effects of covariates on a response variable, and is an alternative to the Cox model for time-to-event data. AFT models do not require Cox proportional hazards modeling as an a priori test. 8 For parametric models, it is necessary to identify the appropriate distribution for survival time. Identifying the appropriate distribution will result in better fit of the model to the data. AFT model interpretation directly relates to the survival time and time ratio rather than the hazard ratio. For time to event analysis, Cox models are very common and only a few studies of breast cancer, gastric cancer, ovarian cancer, anxiety disorder, and liver failure have used AFT models. A recent study of patients with COVID-19 conducted in Singapore found that an AFT model with log-logistic distribution was more suitable than other AFT models. 9 This is in agreement with the results of our study, which showed that the AFT with log-logistic distribution was the most suitable model. This model showed that abnormal values of SpO2, creatinine, LDH, NLR, D-dimer, ferritin, TC, age >80 years, and coronary artery disease were associated with longer hospital stays. These results are consistent with recent studies of D-dimer levels 10 and the NLR.10,11 However, our findings regarding LDH disagree with the results of a study conducted in China, 10 which found that LDH was not associated with length of hospital stay but was instead associated with increased risk of complications.
In the current study, COVID-19 symptoms were not associated with length of hospital stay. In contrast, a study conducted in China reported an association between dyspnea and length of hospitalization. 10 Our results showed that sex was not associated with length of hospitalization, in agreement with the results of previous studies.4,5,10 Thus, the AFT model with log logistic distribution is the most suitable model for predicting time to recovery among patients with COVID-19.
Limitations
This was a retrospective cohort study conducted among 730 patients with COVID-19 in a single tertiary care hospital in southern India. Hence, future studies with larger sample sizes should be conducted among individuals of various ethnicities to extend the results of the current study.
Conclusions
An AFT model with log-logistic distribution was the most suitable model for predicting length of hospital stay among patients with COVID-19. Clinical parameters such as SpO2, creatinine, LDH, NLR, D-dimer, ferritin, TC, age >80 years, and coronary artery disease were associated with time to recovery. Understanding the factors that reduce length of hospitalization could help in developing strategies to optimize delivery of health care services.
Footnotes
Funding
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Declaration of conflicting interest
All the authors have no conflicts of interest.
Author contributions
Study design: Gayathri T, Ravanan R.
Data collection: Gayathri T, Lakshmi M.
Statistical Analysis: Gayathri T.
Results interpretation: Gayathri T, Ravanan R.
Manuscript preparation: Gayathri T, Ravanan R, Lakshmi M.
