This study compared the reliability and validity indexes of randomly parallel tests administered under inclusion, exclusion, and correction for guessing directions. It also compared the criterion-referenced grading decisions based on the different scoring methods. Inclusion and exclusion scores were not so highly correlated as theory would predict. There were no significant differences in the reliability and validity indices for the three scoring methods. However, the scoring methods differed substantially in the proportion of students assigned to different grade categories.
Get full access to this article
View all access options for this article.
References
1.
Abu-Sayf, F. K. (1979). The scoring of multiple-choice tests: A closer look. Educational Technology, 19, 5-15.
2.
Collet, L. S. (1971). Elimination scoring: An empirical evaluation. Journal of Educational Measurement, 8, 209-214.
3.
Coombs, C. H. (1953). On the use of objective examinations. Educational And Psychological Measurement, 13, 309-310.
4.
Dressel, P. L. and Schmid, J. (1953). Some modifications of the multiple-choice item. Educational And Psychological Measurement, 13, 574-595.
5.
Feldt, L. S. (1980). A test of the hypothesis that Cronbach's alpha reliability coefficient is the same for two tests administered to the same sample. Psychometrika, 45, 99-105.
6.
Frary, R. B. (1980). The effect of misinformation, partial information, and guessing on expected multiple-choice test item scores. Applied Psychological Measurement, 4, 79-90.
7.
Gibbons, J. D. , Olkin, I., and Sobel, M. A. (1979). Subset selection technique for scoring items on a multiple-choice test. Psychometrika, 44, 259-270.
8.
Jaradat, D. and Swaged, S. (1986). The subset selection technique for multiple-choice tests: An empirical inquiry. Educational And Psychological Measurement, 23, 369-376.
9.
Lamke, T. A. and Nelson, M. J. (1961). Henmon-Nelson tests of mental ability. Boston: Houghton-Mifflin.