This paper reviews some results on optimal design of tests with items of dichotomous and poly-tomous response formats and offers some rules and guidelines for optimal test assembly. The problem of optimal design of tests with dichotomous and polytomous response formats is discussed for two optimality criteria.
Get full access to this article
View all access options for this article.
References
1.
Adema, J. J. (1990). Models and algorithms for the construction of achievement tests. Unpublished doctoral thesis, University of Twente, The Nether-lands.
2.
Andrich, D. (1978). A rating scale formulation for ordered response categories. Psychometrika, 43, 561-573.
3.
Atkinson, A. C. (1982). Developments in the design of experiments. International Statistical Review, 50, 161-177.
4.
Berger, M. P. F. (1992). Sequential sampling designs for the two-parameter item response theory model. Psychometrika, 57, 521-538.
5.
Berger, M. P. F. (1994a). D-optimal sequential sam-pling designs for item response theory models. Joumal of Educational Statistics, 19, 43-56.
6.
Berger, M. P. F. (1994b). A general approach to algo-rithmic design of fixed-form tests, adaptive tests, and testlets. Applied Psychological Measurement, 18, 141-153.
7.
Berger, M. P. F. , & Mathijssen, E. (1997). Optimal test design for polytomous items. British Journal of Mathematical and Statistical Psychology, 50, 127-141.
8.
Berger, M. P. F. , & van der Linden, W. J. (1995). Ontwerpen van tests met verschillende opti-maliteitscriteria [Designing educational tests with different criteria of optimality]. Tijdschrift voor Onderwijsresearch, 20, 79-92.
9.
Berger, M. P. F , & Veerkamp, W. J. J. (1996). A re-view of selection methods for optimal test design. In M. Wilson & G. Engelhard (Eds.), Objective Measurement: Theory into practice (Vol. 3, pp. 437-455). Norwood NJ: Ablex.
10.
Bock, R. D. (1972) Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37, 29-51.
11.
Boekkooi-Timminga, E. (1989). Models for com-puterized test construction. Unpublished doctoral thesis, University of Twente, The Netherlands.
12.
Ford, I. , Kitsos, C. P., & Titterington, D. M. (1989). Recent advances in nonlinear experimental design. Technometrics, 31, 49-60.
13.
Hambleton, R. K. , & Swaminathan, H. (1985). Item response theory. Boston: Kluwer-Nijhoff.
14.
Jones, D. H. , & Jin, Z. (1994). Optimal sequential de-signs for on-line item estimation. Psychometrika, 59, 59-75.
15.
Lord, F. M. (1962). Estimating norms by item-sampling. Educational and Psychological Meas-urement, 22, 259-267.
16.
Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale NJ: Erlbaum.
17.
Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174.
18.
Pandey, T. N. , & Carlson, D. (1976). Assessing pay-offs in the estimation of the mean using multiple matrix sampling designs. In D. N. M. de Gruijter & L. J. van der Kamp (Eds.), Advances in psycho-logical and educational measurement. London: Wiley.
19.
Stocking, M. L. (1990). Specifying optimum exam-inees for item parameter estimation in item re-sponse theory. Psychometrika, 55, 461-475.
20.
Theunissen, T. J. J. M. (1986). Binary programming and test design. Psychometrika, 50, 411-420.
21.
Thissen, D. , & Wainer, H. (1982). Some standard er-rors in item response theory. Psychometrika, 47, 397-412.
22.
van der Linden, W. J. , & Boekkooi-Timminga, E. (1989). A maximin model for test design with practical constraints. Psychometrika, 53, 237-247.
23.
Wald, A. (1943). On efficient design of statistical in-vestigations. Annals of Mathematical Statistics, 14, 134-140.
24.
Wingersky, M. S. , & Lord, F. M. (1984). An inves-tigation of methods for reducing sampling error in certain IRT procedures. Applied Psychological Measurement, 8, 347-364.
25.
Wynn, H. P. (1970). The sequential generation of D-optimum experimental designs. Annals of Mathe-matical Statistics, 41, 1655-1664.