Performance of Three Conversational Artificial Intelligence Agents in Defining End-of-Life Care Terms

Abstract

Background:

Conversational artificial intelligence agents, or chatbots, are a transformational technology that remains understudied in end-of-life care.

Methods:

OpenAI’s ChatGPT, Google’s Bard, and Microsoft’s Bing were asked to define “terminally ill,” “end of life,” “transitions of care,” and “actively dying,” and to provide three references. Six physicians scored the outputs on a scale of 0–10 for accuracy, comprehensiveness, and credibility. Readability was assessed with the Flesch-Kincaid Grade Level and Flesch Reading Ease (FRE).
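The two readability measures named above are standard published formulas based on average sentence length and average syllables per word. The abstract does not say which tool the authors used to compute them, so the following is only a minimal sketch of the formulas themselves. The function names and the example counts are hypothetical, and the word, sentence, and syllable counts are assumed to be supplied by the caller because automated syllable counting is itself a heuristic.

```python
def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch Reading Ease (FRE): higher scores indicate easier text."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Flesch-Kincaid Grade Level: approximate US school grade needed to read the text."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


if __name__ == "__main__":
    # Hypothetical counts for illustration only; these are not data from the study.
    words, sentences, syllables = 100, 5, 150
    print(round(flesch_reading_ease(words, sentences, syllables), 2))
    print(round(flesch_kincaid_grade(words, sentences, syllables), 2))
```

Both formulas use the same two ratios, so longer sentences and more syllables per word lower the FRE score and raise the grade level together.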

Results:

Mean (standard deviation) scores for accuracy were 9 (1.9) for ChatGPT, 7.5 (2.4) for Bard, and 8.3 (2.4) for Bing. Comprehensiveness scores averaged 8.5 (1.7) for ChatGPT, 7.3 (2.1) for Bard, and 6.5 (2.3) for Bing. Credibility was low with a mean score of 3 (1.8). The mean FRE score was 41.7, and the mean grade level was 14.1, indicating low readability.

Conclusion:

Chatbot outputs had important deficiencies that necessitated clinician oversight to prevent misinformation.


