Abstract
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to identify low-cost modifications to the existing Yes/No Angoff method that would increase the validity and reliability of the recommended cut scores, using a convergent mixed-methods study design. The study used the Yes/No ratings as the baseline method in two rounds of ratings, differentiating the two methods by incorporating item maps and an Ordered Item Booklet, integral tools of the Mapmark and Bookmark methods, respectively. The results showed that internal validity evidence was similar across both methods, especially after the Round 2 ratings. When procedural validity evidence was considered, however, a preference emerged for the method in which panelists made their initial ratings without access to empirical item difficulty information, which was then provided on an item map as part of the Round 1 feedback. The findings highlight the importance of evaluating both internal and procedural validity evidence when selecting standard-setting methods.
