Abstract
Artificial intelligence (AI)-based algorithms are rapidly entering the health care field and have the potential to improve patient care. Our article focuses on the use of autonomous AI algorithms (ie, algorithms that can make clinical decisions without human oversight) in diagnostic imaging. In this article, we have used the example of diabetic retinopathy screening to highlight some important aspects to be considered by developers, policymakers, and end users when bringing autonomous AI algorithms into clinical practice. We have divided these aspects into (1) following the principles of safety, efficacy, and equity in all phases of development and implementation of the algorithm; (2) regulatory processes involving medical records, medical liability, and patient privacy; (3) cost and billing; and (4) the role of health care providers.
Augmented or artificial intelligence (AI) algorithms are rapidly making inroads into health care. These can be assistive or autonomous. In assistive AI, a physician makes the final decision, whereas autonomous AI makes a clinical decision without physician involvement. Autonomous AI can be invaluable when specialists are not available. However, rigorous oversight and extensive validation is needed if autonomous AI is to be applied to clinical practice.
Many AI-based algorithms are being developed for the management of patients with diabetes mellitus (DM). While this article focuses on autonomous AI for diagnosis of diabetic retinopathy (DR), other applications in diabetes include artificial pancreas systems, fully automated closed loop, and hybrid closed loop insulin delivery systems automating insulin delivery based on continuous glucose monitoring data. 1 These systems make diabetes management easier for patients while improving glycemic control. Although the goal is to make this process fully automated, most systems adopt a hybrid approach where manual administration of insulin is required for meals. Some patients are also utilizing “do-it-yourself” algorithms where the patient is directly “in the loop” using an insulin pump, continuous glucose monitor, and open source algorithms to optimize glycemic control.2-4
An important vision-threatening complication in patients with DM is the development of DR. Early detection and treatment of DR can prevent blindness. Yet, adherence to screening guidelines is dismal, as low as 15%. 5 With advancements in deep learning algorithms, it is now possible for an automated algorithm to detect DR changes from fundus photos with the same accuracy as an eye doctor. 6 Autonomous AI-based detection enables real-time, point-of-care screening for DR, potentially improving access to screening services and preventing blindness. 7
In April 2018, the Food and Drug Administration (FDA) authorized the first autonomous AI system for early detection of DR. This allowed non-eye care professionals to detect referable DR, in primary care settings, without requiring an eye care provider to interpret retinal images, filling a huge unmet need for patients with DM. Many resources were invested in vetting the algorithm, conducting a pivotal clinical trial to establish safety and efficacy and the exact indications for which nonexperts in the real-world setting could use the algorithm. 8 In this article, we focus on the aspects involved in bringing an autonomous AI algorithm into clinical practice. Although these principles will apply to other autonomous AI algorithms in the pipeline, we share our experiences and lessons learnt from introducing an autonomous AI algorithm for early detection of DR.
(1)
In the pivotal trial for the IDx-DR retinal screening algorithm, the system was determined to have a sensitivity of 87.2% and specificity of 90.7%, in detecting more than mild DR among adults when compared with grading of wide-angle stereoscopic fundus photographs by the University of Wisconsin Fundus Photography Reading Center. 8 In comparison, the sensitivity and specificity of a dilated ophthalmic exam by board certified ophthalmologists was 34% and 100%, respectively, when compared with a similar, outcome-based reference standard of stereoscopic mydriatic fundus photos graded by a reading center. 11 Thus the IDx-DR algorithm was more sensitive when compared with experts in identifying early DR. Importantly, there were no significant differences in sensitivity or specificity by sex, race, or ethnicity, suggesting that it met the “bias” metric and can be successfully used across populations irrespective of their gender or race/ethnicity. 8 For diagnostic imaging, an important variable to assess is “diagnosability” or “imageability.” This is the proportion of patients who have an interpretable image using the AI algorithm when compared with the reference standard. In the case of IDx-DR, 96% of patients were able to get a disease classification via the AI algorithm when compared with the reference standard of the fundus photography reading center. 8
(2)
(3)
(4)
Levels of External Validity of the Reference Standard Used in AI-Based Clinical Studies.
AI, artificial or augmented intelligence; Reference standard or “truth” refers to the well-defined disease state or patient outcome that is used to train the algorithm. Source: Michael D Abramoff MD, PhD.
Potential Future Applications of AI in Diabetes
AI has the potential to help us with currently unmet challenges of managing patients with DM. One such challenge is the earlier identification of patients at risk for developing DM. Machine learning algorithms can leverage electronic health care data to learn the characteristics of patients with DM and potentially predict development of DM and its complications when patients are still in the subclinical stages.14,15 However, it has been much harder to achieve high SEE for these systems, as their inputs consist of noisy, subjective text-based data, rather than the more objective sensor-based data that is the basis for image-based autonomous AI. Recent advances in deep learning techniques have made it possible to extract information, previously thought to be impossible to extract from photos, such as an individual’s age, gender, glycated hemoglobin, and blood pressure levels. 16 This suggests that incorporation of retinal photos into prediction algorithms may further enhance our ability to diagnose diabetes in its earlier stages. However, before predictive algorithms are implemented in clinical practice they need to be carefully validated to ensure that they are safe, efficient, and equitable, which can only be done via preregistered clinical trials that include the complete workflow and human factors design, and compare AI algorithms with outcome data. A recent study reported that a prediction algorithm widely used to identify patients with medical needs exhibited racial bias by using health care cost as a proxy for severity of medical illness. Black patients incurred lower costs for the same level of disease severity as whites and were incorrectly identified by the algorithm as having fewer health care needs. 17 The possibility of bias needs to be carefully addressed in the design, validation, and implementation of algorithms.
In summary, AI algorithms offer the potential to improve patient care. However, autonomous AI algorithms, in particular, need to be assessed for safety, efficacy, and freedom from bias in well-designed clinical studies before they can be implemented for patient care. Physicians, regulatory authorities, and policymakers need to ensure that algorithms are rigorously validated and operated in clinical care according to their indications for use, and further research is needed to identify the real-world barriers encountered in implementing AI algorithms.
Footnotes
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Michael D. Abramoff, MD, PhD, FARVO, investor, board member, employee at IDx. Patents and patent applications: University of Iowa and IDx.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
