Abstract
Background
Billing and coding for orthopaedic procedures is a complex process, with thousands of procedure codes and associated modifiers in existence. Foot and ankle surgery faces an additional challenge, as it exhibits among the highest variability in procedures performed of any orthopaedic subspecialty. This study aimed to investigate the capabilities of the top AI search engines in accurately identifying Current Procedural Terminology (CPT) codes for common foot and ankle procedures.
Methods
A comparative analysis of 3 publicly available AI search engines (ChatGPT, Bing, and Google Gemini) was performed, investigating their accuracy in generating CPT codes for common orthopaedic foot and ankle procedures. The generated CPT codes were recorded and compared with the codes generated by 3 fellowship-trained foot and ankle surgeons, which served as the reference standard. The Cohen kappa coefficient was used to determine the agreement of each AI platform with the surgeon coding reference standard.
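For readers unfamiliar with the statistic, Cohen's kappa corrects observed rater agreement for the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e). A minimal sketch of this computation is shown below; the CPT codes in the example are purely illustrative and do not reflect the study data.

```python
from collections import Counter

def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters' categorical labels.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement
    and p_e is the agreement expected by chance from each rater's marginals.
    """
    assert len(rater_a) == len(rater_b) and rater_a
    n = len(rater_a)
    # Observed agreement: fraction of items both raters coded identically.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    p_e = sum(counts_a[k] * counts_b.get(k, 0) for k in counts_a) / (n * n)
    return (p_o - p_e) / (1 - p_e)

# Hypothetical example: surgeon reference codes vs. AI-generated codes
# for 5 procedures (codes chosen for illustration only).
surgeon = ["28296", "28285", "27650", "28296", "28810"]
ai      = ["28296", "28285", "28270", "28292", "28810"]
print(round(cohen_kappa(surgeon, ai), 3))  # ≈ 0.524 (moderate agreement)
```

With 3 of 5 codes matching (p_o = 0.60) and a chance agreement of p_e = 0.16, kappa comes to roughly 0.52, which falls in the conventional "moderate agreement" band used in this study's interpretation.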
Results
The AI search engines correctly generated the appropriate CPT codes 44% of the time. Bing was the most accurate, generating correct CPT codes for 8 of the 13 procedures (62%) and partially correct codes for 3 of the 13 procedures (23%). ChatGPT demonstrated the worst accuracy, generating correct CPT codes only 23% of the time (3/13). Overall, the AI platforms demonstrated fair agreement with the reference standard (kappa = 0.201). Individually, Bing demonstrated moderate agreement (kappa = 0.405), Google Gemini demonstrated fair agreement (kappa = 0.255), and ChatGPT demonstrated poor agreement with the reference standard (kappa = 0.171).
Conclusion
Although the capabilities of AI show great promise for many industries, the results of this study urge caution in relying on AI to accurately generate orthopaedic foot and ankle procedure CPT codes.
Level of Evidence:
III, Comparative Study