Abstract
Background
Artificial intelligence (AI) has progressed rapidly. ChatGPT, a rapidly expanding AI platform, has several growing applications in medicine and patient care. However, its ability to provide high-quality answers to patient questions about orthopedic procedures such as Tommy John surgery is unknown. Our objective was to evaluate the quality of information provided by ChatGPT 3.5 and 4.0 in response to patient questions regarding Tommy John surgery.
Methods
Twenty-five patient questions regarding Tommy John surgery were posed to ChatGPT 3.5 and 4.0. Readability was assessed via the Flesch-Kincaid Reading Ease, Flesch-Kincaid Grade Level, Gunning Fog Score, Simple Measure of Gobbledygook (SMOG), Coleman-Liau Index, and Automated Readability Index. The quality of each response was graded using a 5-point Likert scale.
Results
ChatGPT generated information at an educational level that greatly exceeds the recommended reading level. ChatGPT 4.0 produced slightly better responses to common questions regarding Tommy John surgery, with fewer inaccuracies, than ChatGPT 3.5.
Conclusion
Although ChatGPT can provide accurate information regarding Tommy John surgery, its responses may not be easily comprehended by the average patient. As AI platforms become more accessible to the public, patients must be aware of their limitations.
