Abstract
Introduction:
Timely access to ophthalmological care is a challenge in primary health care (PHC). Teleophthalmology and artificial intelligence (AI) are tools to expand access, but real-world evidence remains limited. This study evaluates the diagnostic performance of an AI model for detecting retinal abnormalities in PHC, analyzing variability across clinical and demographic subgroups.
Methods:
This cross-sectional study analyzed 2,158 initial retinal exams, resulting in a final paired sample of 824 retinographies for accuracy analysis. The exams were conducted within a teleophthalmology workflow in the PHC of six municipalities in Minas Gerais, Brazil. A commercial AI model based on a convolutional neural network (Eyer Maps®) evaluated the images and was compared with classifications performed by an ophthalmologist, considered the reference standard. Cluster analysis was performed, and sensitivity, specificity, positive and negative predictive values, Cohen’s Kappa coefficient, and area under the receiver operating characteristic curve (AUC) were calculated.
Results:
The AI model demonstrated almost perfect agreement with the human specialist (Kappa = 0.819; p < 0.001). In the global sample, sensitivity was 83.0%, specificity was 98.0%, positive predictive value was 98.0%, negative predictive value was 86.0%, and AUC was 0.91. Eight distinct clusters were identified, in which the model maintained high specificity (96.0–100.0%), with sensitivity varying only in one specific subgroup.
Conclusions:
In a real-world teleophthalmology context within PHC, AI showed robust diagnostic performance and a high capacity for confirming retinal abnormalities, minimizing unnecessary referrals. The findings reinforce its potential as a supportive tool for organizing ophthalmological care and optimizing assistance workflows in PHC.
Keywords
Get full access to this article
View all access options for this article.
