Abstract
The Bayes theorem is advocated as the appropriate measure for the Weight of evidence in medical decision making. It is based on the calculation of posttest probability as a function of the accuracy of the test and pretest probability. Nevertheless, for subjective diagnostic findings, there might be substantial variability in the accuracy among human observers, making the point estimate of posttest probability imprecise. Although there is limited evidence regarding the actual variability of accuracy among observers for the majority of diagnostic findings, classical observer agreement studies provide us With an indirect estimate of such variability. The aim of this Work Was to explicate the relationship betWeen observer disagreement and variability of posttest probability. Using a random effects signal detection model With 3 stochastic components (betWeen subject, betWeen observer, and residual variations), the authors modeled diagnostic tests With various characteristics and calculated the expected betWeen-observer disagreement and 95% interval of the observers' posttest probability. For the majority of simulated conditions, variation in posttest probability Was surprisingly high, even in the presence of substantial agreement. Although the model is based on parametric assumptions, these results are a clue to a source of inaccuracy in the calculation of posttest probability. Practitioners should be aWare of such variation in their clinical practice, and diagnostic studies need to develop strategies to address this uncertainty.
Get full access to this article
View all access options for this article.
