Key Laboratory of Ocular Fundus Diseases, Department of Ophthalmology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China.
Intelligent Healthcare Unit, Baidu, Beijing, China.
Front Public Health. 2022 Jul 22;10:944967. doi: 10.3389/fpubh.2022.944967. eCollection 2022.
To assess the accuracy and robustness of the AI algorithm for detecting referable diabetic retinopathy (RDR), referable macular diseases (RMD), and glaucoma suspect (GCS) from fundus images in community and in-hospital screening scenarios.
We collected two color fundus image datasets, namely, PUMCH (556 images, 166 subjects, and four camera models) and NSDE (534 images, 134 subjects, and two camera models). The AI algorithm generates the screening report after taking fundus images. The images were labeled as RDR, RMD, GCS, or none of the three by 3 licensed ophthalmologists. The resulting labels were treated as "ground truth" and then were used to compare against the AI screening reports to validate the sensitivity, specificity, and area under the receiver operating characteristic curve (AUC) of the AI algorithm.
On the PUMCH dataset, regarding the prediction of RDR, the AI algorithm achieved overall results of 0.950 ± 0.058, 0.963 ± 0.024, and 0.954 ± 0.049 on sensitivity, specificity, and AUC, respectively. For RMD, the overall results are 0.919 ± 0.073, 0.929 ± 0.039, and 0.974 ± 0.009. For GCS, the overall results are 0.950 ± 0.059, 0.946 ± 0.016, and 0.976 ± 0.025.
The AI algorithm can work robustly with various fundus camera models and achieve high accuracies for detecting RDR, RMD, and GCS.
评估人工智能算法在社区和医院筛查场景中从眼底图像中检测可转诊糖尿病视网膜病变(RDR)、可转诊黄斑疾病(RMD)和青光眼疑似(GCS)的准确性和稳健性。
我们收集了两个彩色眼底图像数据集,即 PUMCH(556 张图像,166 名受试者,4 种相机型号)和 NSDE(534 张图像,134 名受试者,2 种相机型号)。人工智能算法在拍摄眼底图像后生成筛查报告。图像由 3 名持照眼科医生标记为 RDR、RMD、GCS 或三者均无。由此产生的标签被视为“真实标签”,然后与人工智能筛查报告进行比较,以验证人工智能算法的敏感性、特异性和接受者操作特征曲线(ROC)下面积(AUC)。
在 PUMCH 数据集上,对于 RDR 的预测,人工智能算法在敏感性、特异性和 AUC 方面的总体结果分别为 0.950±0.058、0.963±0.024 和 0.954±0.049。对于 RMD,总体结果为 0.919±0.073、0.929±0.039 和 0.974±0.009。对于 GCS,总体结果为 0.950±0.059、0.946±0.016 和 0.976±0.025。
人工智能算法可以在各种眼底相机型号下稳健工作,并实现对 RDR、RMD 和 GCS 的高准确性检测。