Sage Journals: Discover world-class research

Abstract

Get full access to this article

View all access options for this article.

References

Al-Ghezi

Voskoboinik

Getman

Von Zansen

Kallio

Kurimo

Huhta

Hildén

(2023). Automatic speaking assessment of spontaneous L2 Finnish and Swedish. Language Assessment Quarterly, 20(4-5), 421–444.

Bachman

Adrian

(2022). Language assessment in practice: Developing language assessments and justifying their use in the real world. Oxford University Press.

Bolender

Foster

Vispoel

(2023). The criticality of implementing principled design when using AI technologies in test development. Language Assessment Quarterly, 20(4–5), 512–519. https://doi.org/10.1080/15434303.2023.2288266

Boulton

(2017). Data–driven learning and language pedagogy. In Thorne

May

(Eds.), Language, education and technology: Encyclopedia of language and education (pp. 181–192). Springer. https://doi.org/10.1007/978-3-319-02237-6_15

Briggs

D. C.

(2024). Strive for measurement, set new standards, and try not to be evil. Journal of Educational and Behavioral Statistics, 49(5), 694–701. https://doi.org/10.3102/10769986241238479

Chapelle

C. A.

(2001). Computer applications in second language acquisition. Cambridge University Press.

Chapelle

C. A.

(2012). Validity argument for language assessment: The framework is simple…. Language Testing, 29(1), 19–27.

Chapelle

C. A.

Voss

(Eds.). (2021). Validity argument in language testing: Case studies of validation research. Cambridge University Press.

Chen

Jensen

Albert

L. J.

Gupta

Lee

(2023). Artificial intelligence (AI) student assistants in the classroom: Designing chatbots to support student success. Information Systems Frontiers, 25, 161–182. https://doi.org/10.1007/s10796-022-10291-4

10.

Choi

(2025). Generating language assessment content free from representational harms. Language Testing, 42(4). https://doi.org/10.1177/02655322251349560

11.

Ferrara

Qunbar

(2022). Validity arguments for AI-based automated scores: Essay scoring as an illustration. Journal of Educational Measurement, 59(3), 288–313. https://doi.org/10.1111/jedm.12333

12.

Galaczi

Luckin

(2024). Generative AI and language education: Opportunities, challenges and the need for critical perspectives. Cambridge Papers in English Language Education. Cambridge University Press.

13.

Galaczi

Taylor

(2018). Interactional competence: Conceptualisations, operationalisations, and outstanding questions. Language Assessment Quarterly, 15, 219–236.

14.

Gallegos

I. O.

Rossi

R. A.

Barrow

Tanjim

M. M.

Kim

Dernoncourt

Zhang

Ahmed

N. K

. (2024). Bias and fairness in large language models: A survey. Computational Linguistics, 50(3), 1097–1179. https://doi.org/10.48550/arXiv.2309.00770

15.

Godwin-Jones

(2022). Partnering with AI: Intelligent writing assistance and instructed language learning. Language Learning & Technology, 26(2), 5–24. https://doi.org/10.10125/73474

16.

Guo

Wang

Ayik

Watts

Latif

Liu

Zhai

(2025). Artificial intelligence bias on English language learners in automatic scoring. In Proceedings of the International Conference on AI in Education (pp. 1–12). https://doi.org/10.48550/arXiv.2505.10643

17.

Hannah

Jang

E. E.

Lee

M.-H.

Russell

(2025). Investigating construct representativeness and linguistic equity of automated oral reading fluency assessment with prosody. Language Testing, 42(4). https://doi.org/10.1177/02655322251348956

18.

Hannah

Jang

E. E.

Shah

Gupta

(2023). Validity arguments for automated essay scoring of young students’ writing traits. Language Assessment Quarterly, 20(4–5), 399–420. https://doi.org/10.1080/15434303.2023.2288253

19.

Jin

Fan

(2023). Testtaker engagement in AI technology-mediated language assessment. Language Assessment Quarterly, 20(4–5), 488–500. https://doi.org/10.1080/15434303.2023.2291731

20.

Khabbazbashi

Galaczi

Allen

Lopes

Nakatsuhara

Halley

(2023). Exploring the impact of generative AI on language education: Insights from teachers. Cambridge Papers in English Language Education. Cambridge University Press.

21.

Ockey

G. J.

Chukharev-Hudilainen

Hirch

R. R.

(2023). Assessing interactional competence: ICE versus a human partner. Language Assessment Quarterly, 20(4–5), 377–398. https://doi.org/10.1080/15434303.2023.2237486

22.

O’Sullivan

(2023). Reflections on the application and validation of technology in language testing. Language Assessment Quarterly, 20(4–5), 501–511. https://doi.org/10.1080/15434303.2023.2291486

23.

Plough

Banerjee

Iwashita

(2018). Interactional competence: Genie out of the bottle. Language Testing, 35(3), 427–445. https://doi.org/10.1177/0265532218772325

24.

Purpura

J. E.

(2025). Learning-oriented language assessment. In Kunnan

A. J.

(Ed.), The concise companion to language assessment (pp. 22–41). John Wiley.

25.

Runge

Goodwin

Attali

Poe

Mulcaire

K.-L.

LaFlair

G. T.

(2025). A multi-stage interactive writing task for the assessment of English language writing proficiency. Language Testing, 42(4). https://doi.org/10.1177/02655322251349908

26.

Sawaki

(2022). Computer-based testing. In Fulcher

Harding

(Eds.), The Routledge handbook of language testing (2nd ed., pp. 530–544). Routledge.

27.

Sawaki

Ishii

Yamada

Tokunaga

(2025). Examining the consistency of instructor vs. large language model ratings on summary content: Toward checklist-based feedback provision with second language writers. Language Testing, 42(4). https://doi.org/10.1177/02655322251349217

28.

Shermis

M. D.

Burstein

(Eds.). (2013). Handbook of automated essay evaluation: Current applications and future directions. Routledge.

29.

Suzuki

Takatsu

Matsuura

Koyama

Saeki

Matsuyama

(2025). Feedforwarding diagnostic language assessment: Artificial intelligence-driven weakness identification and contextualised feedback for second language speaking. Language Testing, 42(4). https://doi.org/10.1177/02655322251348725

30.

Vaswani

Shazeer

N. M.

Parmar

Uszkoreit

Jones

Gomez

A. N.

Kaiser

Polosukhin

(2017). Attention is all you need. 31st International Conference on Neural Information Processing Systems. https://doi.org/10.48550/arXiv.1706.03762

31.

von Davier

A. A.

Burstein

. (2024). AI in the assessment ecosystem: A human-centered AI perspective. In Ilic

Casebourne

Wegerif

(Eds.), Intelligent systems reference library. Artificial intelligence in education: The intersection of technology and pedagogy (Vol. 261, pp. 93–109). Springer. https://doi.org/10.1007/978-3-031-71232-6_6

32.

Voskoboinik

von Zansen

Phan

Getman

Grósz

Kurimo

(2025). Enhancing second language speech assessment: Integrating large language models for Finnish and Finland Swedish proficiency scoring. Language Testing, 42(4). https://doi.org/10.1177/02655322251351648

33.

Voss

(2021). The role of technology in learning-oriented assessment. In Gebril

(Ed.) Learning, oriented language assessment: Putting theory into practice (pp. 207–224). Routledge.

34.

Voss

(2025). Comparison of traditional machine learning and neural network approaches for automated scoring of second language English essays. Language Testing, 42(4). https://doi.org/10.1177/02655322251348959

35.

Warschauer

Grimes

(2007). Audience, authorship, and artifact: The emergent semiotics of Web 2.0. Annual Review of Applied Linguistics 27, 1–27.

36.

Warschauer

(2024). Artificial intelligence for language learning: Entering a new era. Language Learning & Technology, 28(2), 1–4. https://hdl.handle.net/10125/73569

37.

(2023). Advancing language assessment with AI and ML–learning into AI is inevitable, but can theory keep up? Language Assessment Quarterly, 20(4–5), 357–376.

38.

Zhang

Hoang

Pan

Xing

Staples

Quigley

(2023). Test-takers have a say: Understanding the implications of the use of AI in language tests (arXiv:2307.09885). https://doi.org/10.48550/arXiv.2307.09885

Advancing language assessment for teaching and learning in the era of the artificial intelligence (AI) revolution: Promises and challenges

Abstract

Get full access to this article

References