An integer programming approach to item bank design is presented that can be used to calculate an optimal blueprint for an item bank, in order to support an existing testing program. The results are optimal in that they minimize the effort involved in producing the items as revealed by current item writing patterns. Also presented is an adaptation of the models, which can be used as a set of monitoring tools in item bank management. The approach is demonstrated empirically for an item bank that was designed for the Law School Admission Test.
Get full access to this article
View all access options for this article.
References
1.
Ackerman, T. (1989, March). An alternative methodology for creating parallel test forms using the IRT information function. Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco.
2.
Adema, J. J. (1990). The construction of customized two-stage tests. Journal of Educational Measurement, 27, 241–253.
3.
Adema, J. J. (1992). Implementations of the branch-and-bound method for test construction. Methodika, 6, 99–117.
4.
Adema, J. J. , Boekkooi-Timminga, E., & van der Linden, W. J. (1991). Achievement test construction using 0-1 linear programming. European Journal of Operations Research, 55, 103–111.
5.
Armstrong, R. D. , & Jones, D. H. (1992). Polynomial algorithms for item matching. Applied Psychological Measurement, 16, 365–373.
6.
Armstrong, R. D. , Jones, D. H., & Wang, Z. (1994). Automated parallel test construction using classical test theory. Journal of Educational Statistics, 19, 73–90.
7.
Armstrong, R. D. , Jones, D. H., & Wang, Z. (1995). Network procedures to optimize test information curves with side constraints. In K. D. Lawrence (Ed.), Applications of management science: Network optimization applications (Volume 8; pp. 189–212). Greenwich CT: JAI Press.
8.
Armstrong, R. D. , Jones, D. H., & Wu, I.-L. (1992). An automated test development of parallel tests. Psychometrika, 57, 271–288.
9.
Berger, M. P. F. (1994). A general approach to algorithmic design of fixed-form tests, adaptive tests, and testlets. Applied Psychological Measurement, 18, 141–153.
10.
Boekkooi-Timminga, E. (1987). Simultaneous test construction by zero-one programming. Methodika, 1, 101–112.
11.
Boekkooi-Timminga, E. (1990). The construction of parallel tests from IRT-based item banks. Journal of Educational Statistics, 15, 129–145.
12.
Boekkooi-Timminga, E. (1991, June). A method for designing Rasch model based item banks. Paper presented at the annual meeting of the Psychometric Society, Princeton NJ.
13.
Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale NJ: Erlbaum.
14.
Luecht, R. D. (1998). Computer-assisted test assembly using optimization heuristics. Applied Psychological Measurement, 22, 224–236.
15.
Nemhauser, G. L. , & Wolsey, L. A. (1988). Integer and combinatorial optimization. New York: Wiley.
16.
Sanders, P. F. , & Verschoor, A. J. (1998). Parallel test construction using classical item parameters. Applied Psychological Measurement, 22, 212–223.
17.
Stocking, M. L. , & Swanson, L. (1998). Optimal design of item pools for computerized adaptive tests. Applied Psychological Measurement, 22, 271–279.
18.
Swanson, L. , & Stocking, M. L. (1993). A model and heuristic for solving very large item selection problems. Applied Psychological Measurement, 17, 151–166.
19.
Theunissen, T. J. J. M. (1985). Binary programming and test design. Psychometrika, 50, 411–420.
20.
Theunissen, T. J. J. M. (1996). Combinatorial issues in test construction.Unpublished doctoral dissertation, University of Amsterdam, The Netherlands.
21.
Timminga, E. , & Adema, J. J. (1995). Test construction from item banks. In G. H. Fischer & I. W. Molenaar (Eds.), The Rasch model: Foundations, recent developments, and applications(pp. 111–127). New York: Springer-Verlag.
22.
Timminga, E. , van der Linden, W. J., & Schweizer, D. A. (1996). ConTEST[Computer program and manual]. Groningen, The Netherlands: iec ProGAMMA.
23.
van der Linden, W. J. (1994). Optimum design in item response theory: Applications to test assembly and item calibration. In G. H. Fischer & D. Laming (Eds.), Contributions to mathematical psychology, psychometrics, and methodology(pp. 308–318). New York: Springer-Verlag.
24.
van der Linden, W. J. (1996). Assembling tests for the measurement of multiple traits. Applied Psychological Measurement, 20, 373–388.
25.
van der Linden, W. J. (1998a). Optimal assembly of psychological and educational tests. Applied Psychological Measurement, 22, 195–211.
26.
van der Linden, W. J. (Ed.) (1998b). Optimal test assembly. Applied Psychological Measurement, 22 (3) [Special issue].
27.
van der Linden, W. J. (in press). Optimal assembly of tests with item sets. Applied Psychological Measurement.
28.
van der Linden, W. J. ,& Adema, J. J. (1998). Simultaneous assembly of multiple test forms. Journal of Educational Measurement, 35, 185–198 (Erratum in Vol. 34, 90–91).
29.
van der Linden, W. J. , & Boekkooi-Timminga, E. (1989). Amaximin model for test design with practical constraints. Psychometrika, 54, 237–247.
30.
Wagner, H. M. (1978). Principles of operations research with applications to managerial decisions. London: Prentice/Hall.