Abstract
Judgmental methods for estimating passing scores typically have testing specialists evaluate the items of an intact test form. Alternatively, judges rate individual items from an item pool, and cutscores are computed from those ratings when items are later selected from the pool to assemble the operational test. This approach to establishing cutscores assumes (a) that the definition of minimal competency remains stable from the time of item review to the time the operational form is assembled and (b) that item rating judgments are invariant to test form contextual variables. This study examined the impact of overall test length and difficulty on expert judgments of item performance under the Nedelsky method. Results suggest that judges are fairly consistent in their item ratings regardless of overall test length or difficulty.
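For readers unfamiliar with the Nedelsky method, the standard computation it implies can be sketched as follows: for each multiple-choice item, a judge identifies the distractors a minimally competent examinee could rule out, the item's minimum pass level is the chance score over the remaining options, and the cutscore is the sum of those levels across items. The function name and the example ratings below are hypothetical, chosen only to illustrate the arithmetic.

```python
def nedelsky_cutscore(remaining_options):
    """Nedelsky cutscore for a multiple-choice test.

    remaining_options: for each item, the number of response options
    left after a judge eliminates the distractors a minimally
    competent examinee would recognize as wrong. The item's minimum
    pass level (MPL) is the chance score 1 / (options remaining);
    the test cutscore is the sum of the item MPLs.
    """
    return sum(1.0 / k for k in remaining_options)

# Hypothetical four-option items where the judge eliminated 2, 1, 3,
# and 0 distractors, leaving 2, 3, 1, and 4 plausible options.
cutscore = nedelsky_cutscore([2, 3, 1, 4])  # 1/2 + 1/3 + 1/1 + 1/4
```

The study summarized above asks, in effect, whether these per-item eliminations shift when the same items appear in longer or shorter, easier or harder, forms.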