BachmanL.PalmerA. (2010). Language assessment in practice. New York: Oxford University Press.
8.
BärenfängerO.TschirnerE. (2012). Assessing evidence of validity of assigning CEFR ratings to the ACTFL Oral Proficiency Interview (OPI) and the Oral Proficiency Interview by computer (OPIc) (Technical Report 2012-US-PUB-1). Leipzig: Institute for Test Research and Test Development.
9.
BernhardtE.MolitorisJ.RomeoK.LinN.ValderramaP. (2015). Designing and sustaining a foreign language writing assessment programs at the postsecondary level. Foreign Language Annals, 48(3), 329–349. doi:10.1111/flan.12153
10.
BrownA. V.CoxT. L.ThompsonG. L. (2017). A comparative discourse analysis of Spanish past narrations from the ACTFL OPI and OPIc. Foreign Language Annals, 50(4), 793–807. doi:10.1111/flan.12302
11.
ChapelleC. A.EnrightM. K.JamiesonJ. (2010). Does an argument-based approach to validity make a difference?Educational Measurement: Issues and Practice, 29(1), 3–13. doi:10.1111/j.1745–3992.2009.00165.x
12.
CoxT. (2017). Understanding Intermediate-level speakers’ strengths and weaknesses: An examination of OPIc tests from Korean learners of English. Foreign Language Annals, 50(1), 84–113. doi:10.1111/flan.12258
DavinK. J.RempertT. A.HammerandA. A. (2014). Converting data to knowledge: One district’s experience using large-scale proficiency assessment. Foreign Language Annals, 47(2), 241–260. doi:10.1111/flan.12081
15.
GassS.WinkeP.Van GorpK. (2016). The Language Flagship Proficiency Initiative [research in the news]. Language Teaching, 49(4), 592–595.
16.
GlisanE. W.SwenderE.SurfaceE. A. (2013). Oral proficiency standards and foreign language teacher candidates: Current findings and future research directions. Foreign Language Annals, 46(2), 264–289. doi:10.1111/flan.12030
KissauS. (2014). The impact of the Oral Proficiency Interview on one foreign language teacher education program. Foreign Language Annals, 47(3), 527–545. doi:10.1111/flan.12092
19.
KnochU.ChapelleC. A. (2018). Validation of rating processes within an argument-based framework. Language Testing, 35(4), 477–499. doi:10.1177/0265532217710049
20.
LaFlairG. T.StaplesS. (2017). Using corpus linguistics to examine the extrapolation inference in the validity argument for a high-stakes speaking assessment. Language Testing, 34(4), 451–475. doi:10.1177/0265532217713951
21.
Liskin-GasparroJ. E. (2003). The ACTFL Proficiency Guidelines and Oral Proficiency Interview: A brief history and analysis of their survival. Foreign Language Annals, 36(4), 484–490. doi:10.1111/j.1944–9720.2003.tb02137.x
22.
MaloneM. E. (2003). Research on the Oral Proficiency Interview: Analysis, synthesis, and future directions. Foreign Language Annals, 36(4), 491–497. doi:10.1111/j.1944–9720.2003.tb02138.x
23.
McNamaraT. F. (1990). Item Response Theory and the validation of an ESP test for health professionals. Language Testing, 7(1), 52–75.
ThompsonG. L.CoxT. L.KnappN. (2016). Comparing the OPI and the OPIc: The effect of test method on oral proficiency scores and student preference. Foreign Language Annals, 49(1), 75–92. doi:10.1111/flan.12178
32.
TigchelaarM.BowlesR. P.WinkeP.GassS. (2017). Assessing the validity of ACTFL Can-Do Statements for spoken proficiency: A Rasch analysis. Foreign Language Annals, 50(3), 584–600. doi:10.1111/flan.12286
WindS. A.PetersonM. E. (2018). A systematic review of methods for evaluating rating quality in language assessment. Language Testing, 35(2), 161–192. doi:10.1177/0265532216686999
35.
WinkeP.GassS.MyfordC. (2012). Raters’ L2 background as a potential source of bias in rating oral performance. Language Testing, 30(2), 231–252. doi:10.1177/0265532212456968
36.
WinkeP.Van GorpK.GassS.VanPattenB. (2016, October1). Using different carrots: How incentivization affects proficiency testing outcomes.Paper presentation at the Midwest Association of Language Testers (MwALT) conference, Purdue University, West Lafayette, IN.