Abstract
Purpose
Artificial intelligence (AI) is increasingly integrated into healthcare, including psychiatric care. This study evaluates ChatGPT-4o’s reliability in answering frequently asked antidepressant-related questions by comparing its performance with that of psychiatrists across four key dimensions: accuracy, conciseness, readability, and clarity.
Design
A comparative study analyzing ChatGPT-4o-generated responses and those of psychiatrists with at least five years of clinical experience.
Setting
Participants were recruited through institutional and professional networks and provided with standardized questions derived from authoritative treatment guidelines.
Subjects
Twenty-six psychiatrists participated, and ChatGPT-4o responses were generated using a standardized prompt for each question.
Measures
Two independent psychiatrists evaluated accuracy and conciseness using a blinded rating system. Readability was assessed with the Flesch-Kincaid Grade Level test, and clarity was measured with the Writing Clarity Index Calculator.
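The Flesch-Kincaid Grade Level is a fixed formula over word, sentence, and syllable counts. A minimal sketch of the computation (syllable counting itself is assumed to be handled by whatever text-analysis tool produced the counts):

```python
def fk_grade(total_words: int, total_sentences: int, total_syllables: int) -> float:
    """Flesch-Kincaid Grade Level.

    FKGL = 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59

    The result approximates the US school grade needed to read the text.
    """
    return (0.39 * (total_words / total_sentences)
            + 11.8 * (total_syllables / total_words)
            - 15.59)


# Example: a 100-word, 5-sentence response with 150 syllables
# scores at roughly a 10th-grade reading level.
grade = fk_grade(total_words=100, total_sentences=5, total_syllables=150)
```

A lower grade level indicates text accessible to a broader patient population, which is why the metric is commonly used for patient-education material.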
Analysis
The Shapiro-Wilk test assessed normality. Paired t-tests were used for normally distributed data, and the Wilcoxon signed-rank test for non-normally distributed data. Statistical significance was set at P < .05.
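The decision rule described above (Shapiro-Wilk gating the choice between paired t-test and Wilcoxon signed-rank) can be sketched as follows; this is an illustrative reconstruction using `scipy.stats`, not the authors' actual analysis script:

```python
import numpy as np
from scipy import stats


def compare_paired(scores_a, scores_b, alpha=0.05):
    """Compare two paired score sets, choosing the test by normality.

    Shapiro-Wilk is applied to the paired differences; if they look
    normal (p > alpha), a paired t-test is used, otherwise the
    Wilcoxon signed-rank test.
    """
    diffs = np.asarray(scores_a, dtype=float) - np.asarray(scores_b, dtype=float)
    _, p_normality = stats.shapiro(diffs)

    if p_normality > alpha:
        test_name = "paired t-test"
        _, p_value = stats.ttest_rel(scores_a, scores_b)
    else:
        test_name = "Wilcoxon signed-rank"
        _, p_value = stats.wilcoxon(scores_a, scores_b)

    return test_name, p_value, p_value < alpha


# Hypothetical accuracy ratings for the same questions from two sources
gpt = [4.2, 3.8, 4.5, 4.0, 3.9, 4.3, 4.1, 3.7, 4.4, 4.0]
psy = [4.0, 4.1, 4.3, 4.2, 4.0, 4.1, 4.2, 3.9, 4.3, 4.1]
test_name, p_value, significant = compare_paired(gpt, psy)
```

The ratings in the example are invented for illustration; the study's own P values (e.g., P = .0645 for accuracy) came from the psychiatrists' blinded scores.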
Results
ChatGPT-4o showed comparable accuracy to psychiatrists (P = .0645) but was significantly more concise (P = .0019). Readability differences were not statistically significant (P = .0892), while psychiatrists provided clearer responses (P = .0059).
Conclusion
ChatGPT-4o delivers accurate and concise responses, highlighting its potential as a patient education tool. However, psychiatrists offer greater clarity, underscoring the indispensable role of clinical expertise in psychiatric care.