Abstract
Objective
We examine AI trust miscalibration—the discrepancy between an individual’s trust in AI and its actual performance—among university students. We assess how the length of explanations and students’ expertise shape the likelihood of alignment with AI recommendations.
Background
The relationship between explainability and users’ trust in AI systems has received little attention in the literature, even though AI-assisted processes increasingly affect all professions and hierarchical levels. Given that human–AI relationships are often formed during education, it is crucial to understand how individual and contextual factors influence students’ assessment of AI outputs.
Method
We conducted in-class experiments with 248 students from multiple universities. Participants first solved GMAT questions, then viewed an AI recommendation that was sometimes correct and sometimes incorrect, accompanied by an explanation of varying depth, and could then revise their initial answer. We operationalized “trust” as whether a student’s final answer aligned with the AI recommendation. We estimated logistic regression models with control variables, including mixed-effects specifications to account for repeated observations from the same participant.
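For readers who want to reproduce a comparable analysis, the sketch below shows one way to fit the pooled and mixed-effects logistic specifications described above in Python with statsmodels. The variable and file names (trials.csv, trust, expl_length, ai_correct, student_correct_initially, prior_agreement, student_id) are illustrative assumptions, not the paper’s exact model specification.

```python
import pandas as pd
import statsmodels.formula.api as smf
from statsmodels.genmod.bayes_mixed_glm import BinomialBayesMixedGLM

# Hypothetical long-format data: one row per student x question.
# Assumed columns: trust (1 = final answer matches the AI recommendation),
# expl_length, ai_correct, student_correct_initially, prior_agreement, student_id.
df = pd.read_csv("trials.csv")

# Pooled logistic regression with control variables (ignores repeated observations).
pooled = smf.logit(
    "trust ~ expl_length * ai_correct + student_correct_initially + prior_agreement",
    data=df,
).fit()
print(pooled.summary())

# Mixed-effects specification: a random intercept per student accounts for
# repeated observations from the same participant.
mixed = BinomialBayesMixedGLM.from_formula(
    "trust ~ expl_length * ai_correct + student_correct_initially + prior_agreement",
    vc_formulas={"student": "0 + C(student_id)"},
    data=df,
).fit_vb()
print(mixed.summary())
```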
Results
Explanation complexity is associated with higher trust on average, but its effect depends on who reads the explanation and whether the AI is correct. Students who had initially answered correctly were less willing to defer, especially when the AI was incorrect; conversely, agreement and consistency effects significantly amplified trust. These behavioral patterns highlight the conditions under which AI-generated explanations foster critical engagement or, conversely, encourage uncritical acceptance.
Conclusion
Our results point to an “AI knows better” heuristic at work, especially among nonexperts, whereby polished presentation is easily read as reliability, encouraging uncritical agreement with incorrect recommendations. In parallel, experts benefit more from deeper rationales when the AI is accurate, yet in many cases still under-rely on correct assistance. Overall, trust calibration is driven less by any single cue than by the alignment of student performance, AI reliability, and explanation design, with prior agreement acting as a powerful amplifier of subsequent alignment.
Application
Our findings imply that instructional approaches should promote independent reasoning before exposure to AI, deploy concise but diagnostically informative explanations, and include brief verification steps before AI recommendations are accepted, especially for nonexperts, who are more prone to harmful switches. Simple monitoring tools that track helpful versus harmful answer changes could support a more discerning and productive use of AI tools.