Sage Journals: Discover world-class research

Abstract

While measurement textbooks typically recommend the use of four or five alternatives with multiple-choice test items theoretical work by Tversky (1964) and some empirical studies indicate that three-choice items may be optimal under certain circumstances. In this study the characteristics of tests composed entirely of two, three or four-choice items were investigated given a fixed total number of alternatives across the whole test (Tversky's condition). Their relative merits were also estimated after allowing for differences in testing time. The results showed that number of alternatives per item was inversely related to item difficulty but directly related to item discrimination. Reliability and standard error of measurement of three-choice item tests was equivalent or superior to tests of four or two-choice items and these results held up after taking account of testing time. The use of three-choice items in typical classroom settings is recommended.

Get full access to this article

View all access options for this article.

References

Costin, F. The optimal number of alternatives in multiple-choice achievement tests: Some empirical evidence for a mathematical proof. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1970, 30, 353-358.

Costin, F. Three-choice versus four-choice items: Implications for reliability and validity of objective achievement tests. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1972, 32, 1035-1038.

Ebel, R. L. Expected reliability as a function of choices per item. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT , 1969, 29, 565-570.

Feldt, L. S. A test of the hypothesis that Cronbach's Alpha or Kuder-Richardson Coefficient Twenty is the same for two tests. Psychometrika, 1969, 34, 363-373.

Grier, J. B. The number of alternatives for optimum test reliability. Journal of Educational Measurement , 1975, 12, 109-113.

Hogben, D. The reliability, discrimination and difficulty of word-knowledge tests employing multiple choice items containing three, four or five alternatives. Australian Journal of Education , 1973, 17, 63-68.

Hoyt, C. Test reliability estimated by analysis of variance. Psychometrika, 1941, 6, 153-160.

Lord, F. M. Optimal number of choices per item: A comparison of four approaches. Journal of Educational Measurement, 1977, 14, 33-38.

Mattson, D. The effects of guessing on the standard error of measurement and the reliability of test scores. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1965, 25, 727-730.

10.

Remmers, H. H. and Sageser, H. W. Reliability of multiple-choice measuring instruments as a function of the Spearman Brown prophecy formula, V. Journal of Educational Psychology, 1941, 32, 445-451.

11.

Symonds, P. M. Factors influencing test reliability . Journal of Educational Psychology, 1928 , 19, 73-87.

12.

Tversky, A. On the optimal number of alternatives at a choice point. Journal of Mathematical Psychology, 1964, 1, 386-391.

13.

Williams, B. J. and Ebel, R. L. The effect of varying the number of alternatives per item on multiple-choice vocabulary test items. In The 14th Yearbook of the National Council on Measurements used in Education, East Lansing, Michigan: Michigan State University, 1957, pp. 63-65.

14.

Zimmerman, W. S. and Humphreys, L. G. Item reliability as a function of the omission of misleads. American Psychologist, 1953, 8, 460-461.

A Comparison of Two,Three and Four-Choice Item Tests Given a Fixed Total Number of Choices

Abstract

Get full access to this article

References