Bielinski, J., & Davison, M.L. ( 1998). Gender differences by item difficulty interactions in multiple-choice mathematics items. American Educational Research Journal, 35, 455-476.
2.
Borsboom, D. ( 2006). When does measurement invariance matter? Medical Care, 44, 176-181.
3.
Furlow, C.F., Ross, T.R., & Gagné, P. (2009). The impact of multidimensionality on the detection of differential bundle functioning using simultaneous item bias test. Applied Psychological Measurement, 33, 441-464.
4.
Holland, P.W., & Thayer, D.T. ( 1988). Differential item performance and the Mantel-Haenszel procedure. In H. Wainer & H. I. Braun (Eds.), Test validity (pp. 129-145). Hillsdale, NJ: Lawrence Erlbaum.
5.
Holland, P. W., & Wainer, H. (Eds.). (1993). Differential item functioning . Hillside, NJ: Lawrence Erlbaum .
6.
Maller, S.J. ( 2001). Differential item functioning in the WISC-III: Item parameters for boys and girls in the national standardization sample. Educational and Psychological Measurement, 61, 793-817.
7.
Mantel, N., & Haenszel, W. ( 1959). Statistical aspects of the analysis of data from retrospective studies of disease. Journal of the National Cancer Institute , 22, 719-748.
8.
Monahan, P.O., & Ankenmann, R.D. (2005). Effect of unequal variances in proficiency distributions on Type I error of the Mantel-Haenszel chi-square test for differential item functioning. Journal of Educational Measurement, 42, 101-131.
9.
Narayanan, P., & Swaminathan, H. (1994). Performance of the Mantel-Haenszel and simultaneous item bias procedures for detecting differential item functioning . Applied Psychological Measurement, 18, 315-328.
10.
Rivas, L., Gabriel, E., Stark, S., & Chernyshenko, O.S. (2009). The effects of referent item parameters on differential item functioning detection using the free baseline likelihood ratio test. Applied Psychological Measurement , 33, 251-265.
11.
Roussos, L., & Stout, W. ( 1996). A multidimensionality-based DIF analysis paradigm. Applied Psychological Measurement, 20, 355-371.
12.
Shealy, R.T., & Stout, W.F. ( 1993). A model based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DIF as well as item bias/DIF. Psychometrika, 58, 159-194.
13.
Shih, C.L., & Wang, W.C. ( 2009). Differential item functioning detection using the multiple indicators, multiple causes method with a pure short anchor. Applied Psychological Measurement, 33, 184-199.
14.
Thissen, D., Steinberg, L., & Wainer, H. ( 1988). Use of item response theory in the study of group difference in trace lines. In H. Wainer & H. I. Braun (Eds.), Test validity (pp. 147-169). Hillsdale, NJ: Lawrence Erlbaum.
15.
Thissen, D., Steinberg, L., & Wainer, H. ( 1993). Detection of differential item functioning using the parameters of item response models. In P. W. Holland & H. Wainer (Eds.), Differential item functioning (pp. 67-113). Hillsdale, NJ: Lawrence Erlbaum.
Woods, C.M. ( 2009). Testing for differential item functioning with measures of partial association. Applied Psychological Measurement , 33, 538-554.
18.
Zumbo, B.D. ( 1999). A handbook on the theory and methods of differential item functioning (DIF): Logistic regression modeling as a unitary framework for binary and Likert-type (ordinal) item scores. Ottawa, Ontario, Canada: Directorate of Human Resources Research and Evaluation, Department of National Defense.