Abstract
The usefulness of three IRT-based methods and the Mantel-Haenszel technique in evaluating the measure ment equivalence of translated assessment instruments was investigated. A 15-item numerical test and an 18- item reasoning test that were originally developed in English and then translated to French were used. The analyses were based on four groups, each containing 1,000 examinees. Two groups of English-speaking ex aminees were administered the English version of the tests; the other two were French-speaking examinees who were administered the French version of the tests. The percent of items identified with significant differ ential item functioning (DIF) in this study was similar to findings in previous large-sample studies. The four DIF methods showed substantial consistency in identi fying items with significant DIF when replicated. Sug gestions for future research are provided.
Keywords
Get full access to this article
View all access options for this article.
