The calibration belt is a graphical approach designed to evaluate the goodness of fit of binary outcome models such as logistic regression models. The calibration belt examines the relationship between estimated probabilities and observed outcome rates. Significant deviations from the perfect calibration can be spotted on the graph. The graphical approach is paired to a statistical test, synthesizing the calibration assessment in a standard hypothesis testing framework. In this article, we present the calibrationbelt command, which implements the calibration belt and its associated test in Stata.
CoxD. R., and HinkleyD. V.1974. Theoretical Statistics.London: Chapman & Hall.
2.
FinazziS., PooleD., LucianiD., CogoP. E., and BertoliniG.2011. Calibration belt for quality-of-care assessment based on dichotomous outcomes. PLOS ONE6: e16110.
3.
HosmerD. W.Jr., and LemeshowS.1980. Goodness of fit tests for the multiple logistic regression model. Communications in Statistics—Theory and Methods9: 1043–1069.
Le GallJ.-R., LemeshowS., and SaulnierF.1993. A new simplified acute physiology score (SAPS II) based on a European/North American multicenter study. JAMA270: 2957–2963.
6.
NattinoG., FinazziS., and BertoliniG.2014a. Comments on ‘Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers’ by Peter C. Austin and Ewout W. Steyerberg. Statistics in Medicine33: 2696–2698.
7.
NattinoG., FinazziS., and BertoliniG.2014b. A new calibration test and a reappraisal of the calibration belt for the assessment of prediction models based on dichotomous outcomes. Statistics in Medicine33: 2390–2407.
8.
NattinoG., FinazziS., and BertoliniG.2016a. A new test and graphical tool to assess the goodness of fit of logistic regression models. Statistics in Medicine35: 709–720.
9.
NattinoG., FinazziS., BertoliniG., RossiC., and CarraraG.2016b. givitiR: The GiViTI Calibration Test and Belt. R package version 1.3. http://CRAN.R-project.org/package=givitiR.