Assessing the Calibration of Dichotomous Outcome Models with the Calibration Belt

Abstract

The calibration belt is a graphical approach for evaluating the goodness of fit of binary outcome models such as logistic regression models. It examines the relationship between estimated probabilities and observed outcome rates, so that significant deviations from perfect calibration can be spotted on the graph. The graphical approach is paired with a statistical test, which summarizes the calibration assessment in a standard hypothesis-testing framework. In this article, we present the calibrationbelt command, which implements the calibration belt and its associated test in Stata.
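As a rough illustration of what comparing estimated probabilities with observed outcome rates involves, the following Python sketch groups predictions into equal-width bins and reports both quantities for each bin. It is a simple binned comparison under stated assumptions, not the calibration belt or the calibrationbelt command; the function name binned_calibration and the data are hypothetical.

```python
def binned_calibration(probs, outcomes, n_bins=5):
    """Group predictions into equal-width probability bins and compare
    the mean estimated probability with the observed outcome rate."""
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, outcomes):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        bins[idx].append((p, y))
    rows = []
    for members in bins:
        if not members:
            continue  # skip empty bins
        n = len(members)
        mean_p = sum(p for p, _ in members) / n
        obs_rate = sum(y for _, y in members) / n
        rows.append((n, mean_p, obs_rate))
    return rows


# Hypothetical estimated probabilities and 0/1 outcomes, for illustration only.
probs = [0.1, 0.1, 0.3, 0.3, 0.9, 0.9]
outcomes = [0, 0, 0, 1, 1, 1]
for n, mean_p, obs_rate in binned_calibration(probs, outcomes):
    print(f"n={n}  mean estimated={mean_p:.2f}  observed rate={obs_rate:.2f}")
```

In a well-calibrated model the two columns would be close in every bin. A binned table like this gives no indication of whether a gap is statistically significant, which is the question the belt and its associated test address.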

Keywords

gr0071, calibrationbelt, logistic regression, calibration, goodness of fit, binary outcome
