Assessing the Reliability of Criterion-Referenced Measures Used To Evaluate Health-Education Programs

Abstract

The reliability of criterion-referenced tests (CRTs) used in health program evaluations can be conceptualized in different ways. Classical conceptualizations of reliability have limited usefulness when applied to health-related CRTs. When a cut-score is set, the test's reliability can be represented as the consistency of mastery/nonmastery classifications. When a cut-score is not set, the size of the standard error of measurement (SEM) of a domain score estimate is a reflection of the test's reliability. Formulas are presented for estimating appropriate SEMsfor CRTs. The SEM can be used in computing confidence intervals for domain score estimates andfor a cut-score.

Get full access to this article

View all access options for this article.

References

Berk, R.A. ( 1980) "A consumer's guide to criterion-referenced test reliability." J. of Educ. Measurement 17: 323-349.

Brennan, R.L. (1980) "Applications of generalizability theory," pp. 186-232 in R. A. Berk (ed.) Criterion-Referenced Measurement: The State of the Art. Baltimore, MD: Johns Hopkins Univ. Press.

Cohen, J. (1960) "A coefficient of agreement for nominal scales ." Educ. and Psych. Measurement 20: 37-46.

Cronbach, L.J. , G.C. Gleser , H. Nanda and N. Rajaratnam (1972) The Dependability of Behavioral Measurements: Theory of Generalizability for Scores and Profiles. New York: John Wiley.

Hambleton, R.K. and M.R. Novick (1973) "Toward an integration of theory and method for criterion-referenced tests." J. of Educ. Measurement 10: 159-170.

Hambleton, R.K. , H. Swaminathan , J. Algina , and D.B. Coulson (1978) "Criterion-referenced testing and measurement: A review of technical issues and developments." Rev. of Educ. Research 48: 1-47.

Lord, F.M. (1957) "Do tests of the same length have the same standard errors of measurement?" Educ. and Psych. Measurement 17: 510-521.

———and Novick, M.R. (1968) Statistical Theories of Mental Test Scores. Reading, MA: Addison-Wesley.

Millman, J. (1979) "Reliability and validity of criterion-referenced test scores," in R. E. Traub (ed.) New Directions for Testing and Measurement. San Francisco : Jossey-Bass.

10.

Popham, W.J. (1978) Criterion-Referenced Measurement. Englewood Cliffs, NJ: Prentice-Hall.

11.

Subkoviak, M.J. (1980) "Decision-consistency approaches," pp. 129-185 in R. A. Berk (ed.) Criterion-Referenced Measurement: The State of the Art. Baltimore, MD: Johns Hopkins Univ. Press.