Al-HoorieA. H.CinagliaC.HiverP.HuenschA.IsbellD. R.LeungC.SudinaE. (2024). Open science: Considerations and issues for TESOL research. TESOL Quarterly, 58(1), 537–556. https://doi.org/10.1002/tesq.3304
2.
Al-HoorieA. H.VittaJ. P. (2019). The seven sins of L2 research: A review of 30 journals’ statistical quality and their CiteScore, SJR, SNIP, JCR Impact Factors. Language Teaching Research, 23(6), 727–744. https://doi.org/10.1177/1362168818767191
3.
BachmanL. F.PalmerA. S. (1982). The construct validation of some components of communicative proficiency. TESOL Quarterly, 16(4), 449–465. https://doi.org/10.2307/3586464
4.
BurtonJ. D. (2024). Evaluating the impact of nonverbal behavior on language ability ratings. Language Testing, 41(4), 729–758. https://doi.org/10.31219/osf.io/tc3qg
5.
ByrnesH. (2013). Notes from the editor. Modern Language Journal, 97(4), 825–827.
6.
ChapelleC. A. (2021). Argument-based validation in testing and assessment. Sage.
7.
ChapelleC. A.OckeyG. (2024). Open Science in language assessment research contexts: A reply to Winke. Language Testing, 41(4), 882–885. https://doi.org/10.1177/02655322241239377
8.
DimovaS.YanX.GintherA. (2022). Local tests, local contexts. Language Testing, 39(3), 341–354.
9.
DouglasD. (2001). Performance consistency in second language acquisition and language testing research: A conceptual gap. Second Language Research, 17(4), 442–456.
10.
DudleyA.MarsdenE. J.BovolentaG. (2024). A context-aligned two thousand test: Towards estimating high-frequency French vocabulary knowledge for beginner-to-low intermediate proficiency adolescent learners in England. Language Testing, 41(4), 759–791. https://doi.org/10.31219/osf.io/x6bzs
HaH. T.NguyenD. T. B.StoeckelT. (2024). What is the best predictor of word difficulty? A case of data mining using random forest. Language Testing, 41(4), 828–844. https://doi.org/10.1177/02655322241263628
IsaacsT.WinkeP. M. (2024). Purposeful turns for more equitable and transparent publishing in language testing and assessment. Language Testing, 41(1), 3–8. https://doi.org/10.1177/02655322231203234
17.
IsbellD. R.BrownD.ChenM.DerrickD. J.GhanemR.ArvizuM. N. G.SchnurE.ZhangM.PlonskyL. (2022). Misconduct and questionable research practices: The ethics of quantitative data handling and reporting in applied linguistics. The Modern Language Journal, 106(1), 172–195. https://doi.org/10.1111/modl.12760
18.
IsbellD. R.KimJ. (2023). Developer involvement and COI disclosure in high-stakes English proficiency test validation research: A systematic review. Research Methods in Applied Linguistics, 2(3), 100060. https://doi.org/10.1016/j.rmal.2023.100060
19.
IsbellD. R.SonY.-A. (2022). Measurement properties of a standardized Elicited Imitation Test: An integrative data analysis. Studies in Second Language Acquisition, 44(3), 859–885. https://doi.org/10.1017/S0272263121000383
KoizumiR.MaieR.YanagisawaA.In’namiY. (2024). Considerations to promote and accelerate Open Science: A response to Winke. Language Testing, 41(4), 892–897. https://doi.org/10.1177/02655322241239379
22.
KunnanA. J. (2018). Evaluating language assessments. Routledge.
23.
KyleK.EguchiM. (2024). Evaluating NLP models with written and spoken L2 samples. Research Methods in Applied Linguistics, 3(2), 100120.
LarssonT.PlonskyL.SterlingS.KytöM.YawK.WoodM. (2023). On the frequency, prevalence, and perceived severity of questionable research practices. Research Methods in Applied Linguistics, 2(3), 100064. https://doi.org/10.1016/j.rmal.2023.100064
26.
LiuM.Al-HoorieA. H.HiverP. V. (2024). Open access in language testing and assessment: The case of two flagship journals. Language Testing, 41(4), 703–728. https://doi.org/10.31219/osf.io/aedbu
27.
LiuM.ChongS. W.MarsdenE.McManusK.Morgan-ShortK.Al-HoorieA. H.PlonskyL.BolibaughC.HiverP.WinkeP.HuenschA.HuiB. (2023). Open scholarship in applied linguistics: What, why, and how. Language Teaching, 56(3), 432–437. https://doi.org/10.1017/S0261444822000349
28.
MarsdenE.Morgan-ShortK. (2023). (Why) Are open research practices the future for the study of language learning?Language Learning, 73, 344–387. https://doi.org/10.1111/lang.12568
29.
MarsdenE.Morgan-ShortK.ThompsonS.AbugaberD. (2018). Replication in second language research: Narrative and systematic reviews and recommendations for the field. Language Learning, 68(2), 321–391. https://doi.org/10.1111/lang.12286
30.
McNamaraT. F. (1990). Item Response Theory and the validation of an ESP test for health professionals. Language Testing, 7(1), 52–76. https://doi.org/10.1177/026553229000700105
PanJ.MarsdenE. (2024). Developing internet-based Tests of Aptitude for Language Learning (TALL): An open research endeavour. Language Testing, 41(4), 817–827. https://doi.org/10.1177/02655322241241849
TaylorL.BanerjeeJ. (2023). Accommodations in language testing and assessment: Safeguarding equity, access, and inclusion. Language Testing, 40(4), 847–855. https://doi.org/10.1177/02655322231186221
35.
WinkeP. (2024). Sharing, collaborating, and building trust: How Open Science advances language testing. Language Testing, 41(4), 845–859. https://doi.org/10.1177/02655322231211159
36.
WinkeP.BrunfautT. (Eds.). (2021). The Routledge handbook of second language acquisition and language testing. Routledge.
37.
YanJ.FanJ. (2024). Open Science for language assessment research and practice in China: A response to Winke. Language Testing, 41(4), 877–881. https://doi.org/10.1177/02655322231223100