American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (2014). Standards for educational and psychological testing.
Bachman, L. F., & Palmer, A. S. (1996). Language testing in practice: Designing and developing useful language tests. Oxford University Press.
Bolus, R. E., Hinofotis, F. B., & Bailey, K. M. (1982). An introduction to generalizability theory in second language research. Language Learning, 32(2), 245–258. https://doi.org/10.1111/j.1467-1770.1982.tb00970.x
Byrnes, H. (2002). The role of task and task-based assessment in a content-oriented collegiate foreign language curriculum. Language Testing, 19(4), 419–437. https://doi.org/10.1191/0265532202lt238oa
Davidson, F., & Henning, G. (1985). A self-rating scale of English difficulty: Rasch scalar analysis of items and rating categories. Language Testing, 2(2), 164–179. https://doi.org/10.1177/026553228500200205
Dimova, S., & Kling, J. (2018). Assessing English-medium instruction lecturer language proficiency across disciplines. TESOL Quarterly, 52(3), 634–656. https://doi.org/10.1002/tesq.454
Fulcher, G. (2021). Language assessment literacy in a learning-oriented assessment framework. In A. Gebril (Ed.), Learning-oriented language assessment (pp. 34–48). Routledge. https://doi.org/10.4324/9781003014102
Hasselgreen, A., Carlsen, C., & Helness, H. (2004). European survey of language testing and assessment needs. Part one: General findings. http://www.ealta.eu.org/resources.htm
Henning, G., Hudson, T., & Turner, J. (1985). Item response theory and the assumption of unidimensionality for language tests. Language Testing, 2(2), 141–154. https://doi.org/10.1177/026553228500200203
Inbar-Lourie, O. (2008). Constructing a language assessment knowledge base: A focus on language assessment courses. Language Testing, 25(3), 385–402. https://doi.org/10.1177/0265532208090158
Inbar-Lourie, O. (2013). Guest editorial to the special issue on language assessment literacy. Language Testing, 30(3), 301–307. https://doi.org/10.1177/0265532213480126
Kremmel, B., & Harding, L. (2020). Towards a comprehensive, empirical model of language assessment literacy across stakeholder groups: Developing the language assessment literacy survey. Language Assessment Quarterly, 17(1), 100–120. https://doi.org/10.1080/15434303.2019.1674855
Lam, D. M. (2018). What counts as “responding”? Contingency on previous speaker contribution as a feature of interactional competence. Language Testing, 35(3), 377–401. https://doi.org/10.1177/0265532218758126
Levi, T., & Inbar-Lourie, O. (2020). Assessment literacy or language assessment literacy: Learning from the teachers. Language Assessment Quarterly, 17(2), 168–182. https://doi.org/10.1080/15434303.2019.1692347
Norris, J. M. (2006). The issue: The why (and how) of assessing student learning outcomes in college foreign language programs. The Modern Language Journal, 90(4), 576–583. https://www.jstor.org/stable/4127045
Ockey, G. J. (2007). Construct implications of including still image or video in computer-based listening tests. Language Testing, 24(4), 517–537. https://doi.org/10.1177/0265532207080771
Oral English Proficiency Program. (2019). The OEPT technical manual. Purdue University.
Pill, J., & Harding, L. (2013). Defining the language assessment literacy gap: Evidence from a parliamentary inquiry. Language Testing, 30(3), 381–402. https://doi.org/10.1177/0265532213480337
Song, M. (2008). Do divisible subskills exist in second language (L2): A structural equation modeling approach. Language Testing, 25(4), 435–464. https://doi.org/10.1177/0265532208094272
Taylor, L. (2013). Communicating the theory, practice and principles of language testing to test stakeholders: Some reflections. Language Testing, 30(3), 403–412. https://doi.org/10.1177/0265532213480338
Tsou, W., & Chen, F. (2014). ESP program evaluation framework: Description and application to a Taiwanese university ESP program. English for Specific Purposes, 33, 39–53. https://doi.org/10.1016/j.esp.2013.07.008
Vogt, K., & Tsagari, D. (2014). Assessment literacy of foreign language teachers: Findings of a European study. Language Assessment Quarterly, 11(4), 374–402. https://doi.org/10.1080/15434303.2014.960046
Watanabe, Y., Norris, J. M., & González-Lloret, M. (2009). Identifying and responding to evaluation needs in college foreign language programs. In J. M. Norris (Ed.), Toward useful program evaluation in college foreign language education (pp. 5–56). NFLRC Monographs. https://nflrc.hawaii.edu/PDFs/MG04TOC.pdf
Yan, X. (2014). An examination of rater performance on a local oral English proficiency test: A mixed-methods approach. Language Testing, 31(4), 501–527. https://doi.org/10.1177/0265532214536171
Yan, X., Dimova, S., & Ginther, A. (Eds.). (in preparation). Local language testing: Practice across contexts. Springer.
Yan, X., & Fan, J. (2021). “Am I qualified to be a language tester?”: Understanding the development of language assessment literacy across three stakeholder groups. Language Testing, 38(2), 219–246. https://doi.org/10.1177/0265532220929924
Yan, X., Thirakunkovit, S. P., Kauper, N. L., & Ginther, A. (2016). What do test-takers say? Test-taker feedback as input for quality management of a local oral English proficiency test. In J. Read (Ed.), Post-admission language assessment of university students (pp. 113–136). Springer.
Yan, X., Zhang, C., & Fan, J. J. (2018). Assessment knowledge is important, but . . .: How contextual and experiential factors mediate assessment practice and training needs of language teachers. System, 74, 158–168. https://doi.org/10.1016/j.system.2018.03.003