Abstract
Data from a 45-item vocabulary test (tryout edition) were used to assess the degree of congruence between the signed area measure, the unsigned area measure, Lord's chi-square statistic, and the Mantel-Haenszel (MH) technique in identifying items with significant differential item functioning (DIF). In both the Black-White and Female-Male comparisons, the three methods based on item response theory (IRT) identified the same items as having statistically significant DIF. There was close agreement between the MH technique and the three IRT-based procedures only in the Female-Male comparison; different items were identified as biased in the Black-White comparison.
