Sage Journals: Discover world-class research

Abstract

For criterion-referenced (C-R) tests, variance was defined as a measure of the variability of scores from the criterion of minimal, acceptable performance. By analogy to norm-referenced test theory C-R reliability was defined as the proportion of observed variance attributable to true variance. The definition of C-R reliability was then extended to the mean of a number of parallel measures. A typical test situation was described as a randomized, complete block design. For this design a score model was formulated which incorporated the criterion of minimal, acceptable performance. Expected values for the mean squares, error and person, were derived and shown to be equal to observed and error variance for C-R tests. C-R reliability was then redefined in terms of the expected values, error and person. Using the methods developed, C-R was calculated for a hypothetical example.

Get full access to this article

View all access options for this article.

References

Dayton, C. M. The design of educational experiments . New York: McGraw-Hill, 1970.

Hambleton, R. K. and Novick, M. R. Toward an integration of theory and method for criterion-referenced tests. Journal of Educational Measurement, 1973, 10, 159-170.

Harris, C. W. An interpretation of Livingston's reliability coefficient for criterion-referenced tests. Journal of Educational Measurement, 1972, 9, 27-29.

Hoyt, C. J. Test reliability estimated by analysis of variance. Psychometrika, 1941, 6, 153-160.

Lindman, H. R. Analysis of variance in complex experimental designs. San Francisco: W. H. Freeman, 1974.

Livingston, S. A. Criterion-referenced applications of classical test theory. Journal of Educational Measurement , 1972, 9, 13-26.

Lord, F. M. and Novick, M. R. Statistical theories of mental test scores. Reading, Massachusetts: Addison-Wesley , 1968.

Winer, B. J. Statistical principles in experimental design (2nd ed.). New York: McGraw-Hill, 1971.

Criterion-Referenced Reliability Estimated by Anova

Abstract

Get full access to this article

References