Gulliksen, H. (1950). Theory of mental tests. Hillsdale, NJ: Lawrence Erlbaum.
2.
Linacre, J. M., & Wright, B. D. (2003). A user's guide to Bigsteps. Chicago: Mesa Press.
3.
Müller, J. M. (in press). The probability of obtaining two statistically different test scores as a test index. Educational and Psychological Measurement.
4.
Muraki, E., & Bock, R. D. (1997). PARSCALE 3: IRT based test scoring and item analysis for graded items and rating scales. Chicago: Scientific Software International.
5.
von Davier, M. (1998). WINMIRA: A Windows program for mixed Rasch models. Kiel, Germany: IPN.
6.
Zimowski, M.F., Muraki, E., Mislevy, R.J., &Bock, R. D. (1996). Bilog-MG: Multiple-group IRT analysis and test maintenance for binary items. Chicago: Scientific Software International.