Abstract
Background:
Poor adherence to tuberculosis (TB) treatment is a substantial barrier to global TB control. The aim of this study was to construct a nomogram for predicting the probability of TB treatment default.
Methods:
A total of 1185 TB patients who had received treatment between 2010 and 2011 in Peru were analyzed in this study. Patient demographics, social, and medical information were recorded. Predictors were selected by least absolute shrinkage and selection operator (LASSO) regression analysis, and a nomogram for predicting TB treatment default was constructed by using multivariable logistic regression analysis. Bootstrapping method was applied for internal validation. Calibration and clinical utility of the nomogram was also evaluated.
Results:
The incidence of TB treatment default among the study patients was 11.6% (138/1185). Six predictors (secondary education status, alcohol use, illegal drug use, body mass index, multidrug-resistant tuberculosis, and human immunodeficiency virus serostatus) were selected through the LASSO regression analysis. A nomogram was developed based on the six predictors and it yielded an area under the curve (AUC) value of 0.797 [95% confidence interval (CI), 0.755–0.839]. In the internal validation, the AUC achieved 0.805 (95% CI, 0.759–0.844). Additionally, the nomogram was well-calibrated, and it showed clinical utility in decision curve analysis.
Conclusion:
A nomogram was constructed that incorporates six characteristics of the TB patients, which provides a good reference for predicting TB treatment default.
Introduction
Tuberculosis (TB) remains a heavy burden on global society and economy. 1 Alarmingly, although TB is usually curable with medication, there were still 1.2 million TB deaths among human immunodeficiency virus (HIV)-negative people and an additional 208,000 deaths among HIV-positive people globally in 2019. 1 With regard to TB medication regimens, current treatment periods are long. A 2-month intensive phase followed by a 4-month continuation phase is commonly recommended for drug-susceptible TB treatment, and, for multidrug-resistant TB (MDR-TB), a longer period of time is usually required.1,2 It has been recognized that poor adherence to treatment can lead to increased risk of transmission, drug resistance, and death. 3 TB treatment default has been a substantial barrier to global TB control. 3 Therefore, in order to decrease the occurrence of treatment default, it is critical to identify patients with poor adherence and then strengthen management and treatment supervision for these patients.
Several risk factors for TB treatment default have been proposed, including being male, alcohol abuse, illegal drug use, underweight, multidrug-resistant TB, previous treatment default, drug side effects, unemployment, and uncompleted secondary education.3–7 The potential causes of TB treatment default may be multifaceted. 8 There are many models for predicting TB treatment outcomes; however, few of them were built adhere to the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) statement.9–13 Since adherence is crucial in the treatment of diseases, such as TB, that require long-term medication, a method that can estimate patient treatment adherence would be very helpful.
In the current study, based on a completed prospective cohort study, demographic, social, and medical information of TB patients were analyzed retrospectively to develop a nomogram for predicting TB treatment default.
Research methods and design
Study population and ethics
In this study, a total of 1185 adult patients with first episode of smear-positive TB were enrolled between 2010 and 2011 in a semi-urban district of northeastern Lima, Peru, 3 which is classified by the World Health Organization (WHO) as a high TB burden country. 1 The majority of these patients were managed at public health facilities (33 community clinics and one hospital). 3 All the patients had a definitive treatment outcome (treatment completion or treatment default). 3 This study was a secondary analysis and it was approved by the Ethics Committee of Jinhua Municipal Central Hospital (No. 2019-133-001). The Ethics Committee waived the need for informed consent because all participant information was anonymous in this secondary analysis.
Variables collection
For this study, the following characteristics of the participants were collected: gender, age (year) (⩽21, 22–26, 27–37, and ⩾38), marital status (married, divorced, single, or widowed), poverty status (yes/no), 14 completed secondary education (yes/no), prison history (yes/no), smoking (never, former, or current), alcohol use (at least weekly or not), illegal drug use (yes/no), rehabilitation history (yes/no), MDR-TB (yes/no), body mass index (BMI) (normal: 18.5–24.9; underweight: <18.5, and overweight: ⩾25.0), HIV infection status (negative, positive, or test not done), coexisting diabetes (yes/no), and other chronic disease status (yes/no). 3 All the patients were treated (free of cost) through facility-based directly observed therapy short-course (DOTS) strategy and the treatment outcome (according to standard WHO definitions) was followed up.3,15 Treatment default in this study was defined as continuously missed doses ⩾2 months. 1
Statistical analysis
A multiple imputation method was applied to account for missing data. The baseline characteristics of the patients were summarized. The least absolute shrinkage and selection operator (LASSO) regression method was used for predictor selection.
16
Multivariable logistic regression analysis with a stepwise strategy was used to develop a predictive model and a nomogram. The discriminatory capacity of the nomogram was determined by calculating the area under the curve (AUC). Bootstrapping (resampling times = 500) was performed for the internal validation of the model.
17
Additionally, calibration of the nomogram was evaluated together with the unreliability test. Clinical utility of the nomogram was evaluated using decision curve analysis (DCA).
18
All statistical analyses were conducted with R software (version 3.5.3), and
Results
Among the 1185 participants, 138 [11.6%, 95% confidence interval (CI), 9.8–13.5%] defaulted from treatment, whereas 1047 (88.4%) patients completed treatment. The median (25–75% interquartile) time for the treatment default was 134 (78–205) days. The characteristics of participants are shown in Table 1. Missing value accounts for 0.27% of the total data.
Baseline characteristics of the study participants.
BMI, body mass index; HIV, human immunodeficiency virus; MDR-TB, multidrug-resistant tuberculosis; TB, tuberculosis.
In LASSO regression analysis (Figure 1), 7 variables with potential predictive value were selected from 15 variables according to non-zero coefficients [the selection criteria of 1 standard error (SE)]. These variables were secondary education status, alcohol use, illegal drug use, rehabilitation history, MDR-TB, BMI, and HIV status.

Potential predictors screening through the LASSO regression analysis. (a) Tuning parameter (lambda) selection of binomial deviance based on the 1-SE criteria. (b) The coefficients of variables were produced against the log (lambda) sequence. The dotted vertical line was drawn according to the 1-SE criteria, where seven nonzero coefficients were selected.
To construct a nomogram, the aforementioned seven variables were entered into multivariable logistic regression analysis and a stepwise selection was applied based on the likelihood ratio test with Akaike’s information criterion. Eventually, six predictors (all the variables described above, except for rehabilitation history) were incorporated into the nomogram with a statistical model as shown below: Y = −2.98386–0.49145 × (secondary education status = 1) + 0.83455 × (alcohol use = 1) + 1.60056 × (illegal drug use = 1) + 1.39013 × (MDR−TB = 1) + 0.85935 × (BMI = 1)−0.22707 × (BMI = 2) + 0.74679 × (HIV status = 1) + 0.91240 × (HIV status = 2).
The discriminatory capacity of the nomogram was determined by calculating the AUC. The AUC for the nomogram was 0.797 (95% CI, 0.755–0.839) (Figure 2a), whereas for the internal validation using the bootstrapping method (resampling times = 500) the AUC was 0.805 (95% CI, 0.759–0.844) (Figure 2b).

The ROC curve and AUC of the model (a) and the internal validation cohort (bootstrap resampling times = 500) (b). The vertical lines represent 95% CI.
A nomogram that incorporates the six predictors is shown in Figure 3. The probability of TB treatment default can be predicted quantitatively and individually by using this nomogram by calculating the total points of the corresponding point for each predictor.

Nomogram for predicting the probability of TB treatment default. First of all, find the point for each variable of a patient on the uppermost rule; then add all points together and find the total points; finally, the corresponding predicted probability of TB treatment default could be calculated on the lowest rule.
The proposed nomogram showed good calibration (Figure 4). The unreliability test yielded a

Calibration curve of the nomogram showing the degree of consistency between the predicted probability and the actual outcome. The shaded line represents a perfect prediction by an ideal model, and the dotted line represents the performance of the proposed nomogram. The unreliability test yielded a
DCA (Figure 5) revealed that, when the risk threshold of treatment default for TB patients was between 3% and 70% based on the predictive model, application of this nomogram to predict the risk of TB treatment default would add more benefit than either the treat-all or the treat-none strategy.

Decision curve analysis of the nomogram. The red curve represents the model. The decision curve shows that when the risk threshold is between 3% and 70%, application of this nomogram would add more net benefit than either a treat-all or treat-none strategy. For example, assume 100 patients were using the proposed nomogram. If the risk threshold was 20%, then approximately 30 (the corresponding value on the vertical coordinate is approximately 0.3) patients would benefit, without harming the interests of the remaining 70 patients (i.e., when 100 workers use the nomogram to decide whether to take interventions, the added net benefit is approximately 30%).
Discussion
In the present study, a nomogram for predicting the probability of TB treatment default among TB patients was developed. This nomogram incorporates six predictors: secondary education status, alcohol use, illegal drug use, MDR-TB, BMI, and HIV status.
The long duration required for drug treatment is one of the main reasons for the high rate of TB treatment default.
7
Default rates reportedly ranged from 0.5% to 56.6% based on 75 studies conducted in different countries, with a pooled proportion of 14.8%.
19
Therefore, taking measures to improve adherence to treatment is very important in global TB control.6,7,20 Fortunately, with the adjustment of treatment strategies in recent years, there are declining trends in loss to follow up (LTFU) and increasing trends in the success rate of TB treatment.21,22 Kibuule
Several factors affecting TB treatment default, such as alcohol use, illegal drug use, male gender, educational level, and prison or rehabilitation history, have been described in previous articles.3,5,24 As expected, patients with MDR-TB are more likely to default from treatment than those with drug-susceptible TB on the standard treatment regimens.
3
Several models for predicting TB treatment default have been reported. Rodrigo
The proposed nomogram in the present study was constructed based on a prospective study of TB patients, 3 and their demographics, socioeconomic, and medical information were included for analysis. Six potential predictors were selected by LASSO regression analysis and eventually included in the nomogram. All these six predictors could be easily available clinically. The nomogram showed good discriminatory ability (AUC: 0.797) and internal verification results. Moreover, the nomogram showed excellent calibration. More importantly, this nomogram was revealed to have good clinical utility in DCA evaluation. Specifically, DCA demonstrated that when the risk threshold is between 3–70%, application of this nomogram would add net benefit than either the treat-all or the treat-none strategy.
Limitations
Some limitations of this study are worth noting. Firstly, only smear-positive TB patients were included in this study, so the nomogram is suitable only for patients with smear-positive TB. Secondly, there are regional differences in TB epidemiology and socioeconomic status, and differences in TB treatment strategies at different times. This nomogram was constructed based on a study population collected 10 years ago from a TB high-incidence neighborhood of Lima, Peru. 3 Therefore, whether this nomogram can be applied to other regions and the current TB strategies require further verification. Thirdly, the study was also limited by the self-reporting of several variables, and some other potentially affecting factors (e.g., household support, healthcare-related infrastructure, quality of care, availability or change of healthcare practitioners, and treatment regimen change) were not included in the analysis because they were not available in the original data.27 Despite these limitations, this study developed a nomogram, which provides a good reference for other studies predicting TB treatment default.
Conclusions
A nomogram for predicting TB treatment default was developed, which incorporates six characteristics of TB patients. The proposed nomogram showed good discriminatory ability and calibration, as well as good clinical utility. It may be of great value to help adherence management in TB treatment.
Footnotes
Acknowledgements
I appreciate all the participants involved in the study, authors and providers (Dr. Brian Lackey (University of Texas School of Public Health Austin Regional Campus, Austin, Texas, United States of America), Dr. Carlos Seas (Instituto de Medicina Tropical Alexander von Humboldt, Universidad Peruana Cayetano Heredia, Lima, Peru; Facultad de Medicina Alberto Hurtado, Universidad Peruana Cayetano Heredia, Lima, Peru), Dr. Patrick Van der Stuyft (General Epidemiology and Disease Control Unit, Department of Public Health, Institute of Tropical Medicine, Antwerp, Belgium; Department of Public Health, Faculty of Medicine, Ghent University, Ghent, Belgium), and Dr. Larissa Otero (Instituto de Medicina Tropical Alexander von Humboldt, Universidad Peruana Cayetano Heredia, Lima, Peru) who share data unselfishly.26
Author contributions
S.W. contributed to the study design, data analysis and interpretation, the writing of the manuscript, and takes responsibility for the accuracy of the data analysis.
Conflict of interest statement
The author declares that there are no conflict of interests.
Funding
The author disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Saibin Wang, the Science and Technology Project of Jinhua City (2020XG-15, 2020-4-013).
