Yang Wei-Hua, Zheng Bo, Wu Mao-Nian, Zhu Shao-Jun, Fei Fang-Qin, Weng Ming, Zhang Xian, Lu Pei-Rong
Department of Ophthalmology, The First Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China.
Department of Ophthalmology, The First People's Hospital of Huzhou, Huzhou, Zhejiang, China.
Diabetes Ther. 2019 Oct;10(5):1811-1822. doi: 10.1007/s13300-019-0652-0. Epub 2019 Jul 9.
In April 2018, the US Food and Drug Administration (FDA) approved the world's first artificial intelligence (AI) medical device for detecting diabetic retinopathy (DR), the IDx-DR. However, there is a lack of evaluation systems for DR intelligent diagnostic technology.
Five hundred color fundus photographs of diabetic patients were selected. DR severity varied from grade 0 to 4, with 100 photographs for each grade. Following that, these were diagnosed by both ophthalmologists and the intelligent technology, the results of which were compared by applying the evaluation system. The system includes primary, intermediate, and advanced evaluations, of which the intermediate evaluation incorporated two methods. Main evaluation indicators were sensitivity, specificity, and kappa value.
The AI technology diagnosed 93 photographs with no DR, 107 with mild non-proliferative DR (NPDR), 107 with moderate NPDR, 108 with severe NPDR, and 85 with proliferative DR (PDR). The sensitivity, specificity, and kappa value of the AI diagnoses in the primary evaluation were 98.8%, 88.0%, and 0.89, respectively. According to method 1 of the intermediate evaluation, the sensitivity of AI diagnosis was 98.0%, specificity 97.0%, and the kappa value 0.95. In method 2 of the intermediate evaluation, the sensitivity of AI diagnosis was 95.5%, the specificity 99.3%, and kappa value 0.95. In the advanced evaluation, the kappa value of the intelligent diagnosis was 0.86.
This article proposes an evaluation system for color fundus photograph-based intelligent diagnostic technology of DR and demonstrates an application of this system in a clinical setting. The results from this evaluation system serve as the basis for the selection of scenarios in which DR intelligent diagnostic technology can be applied.
2018年4月,美国食品药品监督管理局(FDA)批准了全球首款用于检测糖尿病视网膜病变(DR)的人工智能(AI)医疗设备IDx-DR。然而,目前缺乏针对DR智能诊断技术的评估体系。
选取500张糖尿病患者的彩色眼底照片。DR严重程度从0级到4级不等,每个级别各有100张照片。随后,由眼科医生和智能技术分别对这些照片进行诊断,并通过应用评估体系比较两者的诊断结果。该体系包括初级、中级和高级评估,其中中级评估包含两种方法。主要评估指标为灵敏度、特异度和kappa值。
AI技术诊断出93张无DR的照片、107张轻度非增殖性DR(NPDR)照片、107张中度NPDR照片、108张重度NPDR照片以及85张增殖性DR(PDR)照片。初级评估中AI诊断的灵敏度、特异度和kappa值分别为98.8%、88.0%和0.89。根据中级评估的方法1,AI诊断的灵敏度为98.0%,特异度为97.0%,kappa值为0.95。在中级评估的方法2中,AI诊断的灵敏度为95.5%,特异度为99.3%,kappa值为0.95。在高级评估中,智能诊断的kappa值为0.86。
本文提出了一种基于彩色眼底照片的DR智能诊断技术评估体系,并展示了该体系在临床环境中的应用。该评估体系的结果为选择可应用DR智能诊断技术的场景提供了依据。