Al-GheziR.VoskoboinikK.GetmanY.Von ZansenA.KallioH.KurimoM.HuhtaA.HildénR. (2023). Automatic speaking assessment of spontaneous L2 Finnish and Swedish. Language Assessment Quarterly, 20(4-5), 421–444.
2.
BachmanL.AdrianP. (2022). Language assessment in practice: Developing language assessments and justifying their use in the real world. Oxford University Press.
3.
BolenderB.FosterC.VispoelS. (2023). The criticality of implementing principled design when using AI technologies in test development. Language Assessment Quarterly, 20(4–5), 512–519. https://doi.org/10.1080/15434303.2023.2288266
4.
BoultonA. (2017). Data–driven learning and language pedagogy. In ThorneS.MayS. (Eds.), Language, education and technology: Encyclopedia of language and education (pp. 181–192). Springer. https://doi.org/10.1007/978-3-319-02237-6_15
5.
BriggsD. C. (2024). Strive for measurement, set new standards, and try not to be evil. Journal of Educational and Behavioral Statistics, 49(5), 694–701. https://doi.org/10.3102/10769986241238479
6.
ChapelleC. A. (2001). Computer applications in second language acquisition. Cambridge University Press.
7.
ChapelleC. A. (2012). Validity argument for language assessment: The framework is simple…. Language Testing, 29(1), 19–27.
8.
ChapelleC. A.VossE. (Eds.). (2021). Validity argument in language testing: Case studies of validation research. Cambridge University Press.
9.
ChenY.JensenS.AlbertL. J.GuptaS.LeeT. (2023). Artificial intelligence (AI) student assistants in the classroom: Designing chatbots to support student success. Information Systems Frontiers, 25, 161–182. https://doi.org/10.1007/s10796-022-10291-4
FerraraS.QunbarS. (2022). Validity arguments for AI-based automated scores: Essay scoring as an illustration. Journal of Educational Measurement, 59(3), 288–313. https://doi.org/10.1111/jedm.12333
12.
GalacziE.LuckinR. (2024). Generative AI and language education: Opportunities, challenges and the need for critical perspectives. Cambridge Papers in English Language Education. Cambridge University Press.
13.
GalacziE.TaylorL. (2018). Interactional competence: Conceptualisations, operationalisations, and outstanding questions. Language Assessment Quarterly, 15, 219–236.
14.
GallegosI. O.RossiR. A.BarrowJ.TanjimM. M.KimS.DernoncourtF.YuT.ZhangR.AhmedN. K. (2024). Bias and fairness in large language models: A survey. Computational Linguistics, 50(3), 1097–1179. https://doi.org/10.48550/arXiv.2309.00770
15.
Godwin-JonesR. (2022). Partnering with AI: Intelligent writing assistance and instructed language learning. Language Learning & Technology, 26(2), 5–24. https://doi.org/10.10125/73474
16.
GuoS.WangY.YuJ.WuX.AyikB.WattsF.LatifE.LiuL.LiuN.ZhaiX. (2025). Artificial intelligence bias on English language learners in automatic scoring. In Proceedings of the International Conference on AI in Education (pp. 1–12). https://doi.org/10.48550/arXiv.2505.10643
17.
HannahL.JangE. E.LeeM.-H.RussellB. (2025). Investigating construct representativeness and linguistic equity of automated oral reading fluency assessment with prosody. Language Testing, 42(4). https://doi.org/10.1177/02655322251348956
18.
HannahL.JangE. E.ShahM.GuptaV. (2023). Validity arguments for automated essay scoring of young students’ writing traits. Language Assessment Quarterly, 20(4–5), 399–420. https://doi.org/10.1080/15434303.2023.2288253
19.
JinY.FanJ. (2023). Testtaker engagement in AI technology-mediated language assessment. Language Assessment Quarterly, 20(4–5), 488–500. https://doi.org/10.1080/15434303.2023.2291731
20.
KhabbazbashiN.GalacziE.AllenH.LopesS.NakatsuharaF.HalleyK. (2023). Exploring the impact of generative AI on language education: Insights from teachers. Cambridge Papers in English Language Education. Cambridge University Press.
21.
OckeyG. J.Chukharev-HudilainenE.HirchR. R. (2023). Assessing interactional competence: ICE versus a human partner. Language Assessment Quarterly, 20(4–5), 377–398. https://doi.org/10.1080/15434303.2023.2237486
22.
O’SullivanB. (2023). Reflections on the application and validation of technology in language testing. Language Assessment Quarterly, 20(4–5), 501–511. https://doi.org/10.1080/15434303.2023.2291486
23.
PloughI.BanerjeeJ.IwashitaN. (2018). Interactional competence: Genie out of the bottle. Language Testing, 35(3), 427–445. https://doi.org/10.1177/0265532218772325
24.
PurpuraJ. E. (2025). Learning-oriented language assessment. In KunnanA. J. (Ed.), The concise companion to language assessment (pp. 22–41). John Wiley.
25.
RungeA.GoodwinS.AttaliY.PoeM.MulcaireP.LoK.-L.LaFlairG. T. (2025). A multi-stage interactive writing task for the assessment of English language writing proficiency. Language Testing, 42(4). https://doi.org/10.1177/02655322251349908
26.
SawakiY. (2022). Computer-based testing. In FulcherG.HardingL. (Eds.), The Routledge handbook of language testing (2nd ed., pp. 530–544). Routledge.
27.
SawakiY.IshiiY.YamadaH.TokunagaT. (2025). Examining the consistency of instructor vs. large language model ratings on summary content: Toward checklist-based feedback provision with second language writers. Language Testing, 42(4). https://doi.org/10.1177/02655322251349217
28.
ShermisM. D.BursteinJ. (Eds.). (2013). Handbook of automated essay evaluation: Current applications and future directions. Routledge.
29.
SuzukiS.TakatsuH.MatsuuraR.KoyamaM.SaekiM.MatsuyamaY. (2025). Feedforwarding diagnostic language assessment: Artificial intelligence-driven weakness identification and contextualised feedback for second language speaking. Language Testing, 42(4). https://doi.org/10.1177/02655322251348725
30.
VaswaniA.ShazeerN. M.ParmarN.UszkoreitJ.JonesL.GomezA. N.KaiserL.PolosukhinI. (2017). Attention is all you need. 31st International Conference on Neural Information Processing Systems. https://doi.org/10.48550/arXiv.1706.03762
31.
von DavierA. A.BursteinJ. (2024). AI in the assessment ecosystem: A human-centered AI perspective. In IlicP.CasebourneI.WegerifR. (Eds.), Intelligent systems reference library. Artificial intelligence in education: The intersection of technology and pedagogy (Vol. 261, pp. 93–109). Springer. https://doi.org/10.1007/978-3-031-71232-6_6
32.
VoskoboinikE.von ZansenA.PhanN.GetmanY.GrószT.KurimoM. (2025). Enhancing second language speech assessment: Integrating large language models for Finnish and Finland Swedish proficiency scoring. Language Testing, 42(4). https://doi.org/10.1177/02655322251351648
33.
VossE. (2021). The role of technology in learning-oriented assessment. In GebrilA. (Ed.) Learning, oriented language assessment: Putting theory into practice (pp. 207–224). Routledge.
34.
VossE. (2025). Comparison of traditional machine learning and neural network approaches for automated scoring of second language English essays. Language Testing, 42(4). https://doi.org/10.1177/02655322251348959
35.
WarschauerM.GrimesD. (2007). Audience, authorship, and artifact: The emergent semiotics of Web 2.0. Annual Review of Applied Linguistics27, 1–27.
36.
WarschauerM.XuY. (2024). Artificial intelligence for language learning: Entering a new era. Language Learning & Technology, 28(2), 1–4. https://hdl.handle.net/10125/73569
37.
XiX. (2023). Advancing language assessment with AI and ML–learning into AI is inevitable, but can theory keep up?Language Assessment Quarterly, 20(4–5), 357–376.
38.
ZhangD.HoangT.PanS.HuY.XingZ.StaplesM.XuX.LuQ.QuigleyA. (2023). Test-takers have a say: Understanding the implications of the use of AI in language tests (arXiv:2307.09885). https://doi.org/10.48550/arXiv.2307.09885