Li Yu-Hsuan, Sheu Wayne Huey-Herng, Chou Chien-Chih, Lin Chun-Hsien, Cheng Yuan-Shao, Wang Chun-Yuan, Wu Chieh Liang, Lee I-Te
Division of Endocrinology and Metabolism, Department of Internal Medicine, Taichung Veterans General Hospital, Taichung 40705, Taiwan.
Department of Computer Science & Information Engineering, National Taiwan University, Taipei 10617, Taiwan.
Life (Basel). 2021 Mar 5;11(3):200. doi: 10.3390/life11030200.
Deep learning-based software is developed to assist physicians in terms of diagnosis; however, its clinical application is still under investigation. We integrated deep-learning-based software for diabetic retinopathy (DR) grading into the clinical workflow of an endocrinology department where endocrinologists grade for retinal images and evaluated the influence of its implementation. A total of 1432 images from 716 patients and 1400 images from 700 patients were collected before and after implementation, respectively. Using the grading by ophthalmologists as the reference standard, the sensitivity, specificity, and area under the receiver operating characteristic curve (AUC) to detect referable DR (RDR) were 0.91 (0.87-0.96), 0.90 (0.87-0.92), and 0.90 (0.87-0.93) at the image level; and 0.91 (0.81-0.97), 0.84 (0.80-0.87), and 0.87 (0.83-0.91) at the patient level. The monthly RDR rate dropped from 55.1% to 43.0% after implementation. The monthly percentage of finishing grading within the allotted time increased from 66.8% to 77.6%. There was a wide range of agreement values between the software and endocrinologists after implementation (kappa values of 0.17-0.65). In conclusion, we observed the clinical influence of deep-learning-based software on graders without the retinal subspecialty. However, the validation using images from local datasets is recommended before clinical implementation.
基于深度学习的软件被开发出来以协助医生进行诊断;然而,其临床应用仍在研究中。我们将基于深度学习的糖尿病视网膜病变(DR)分级软件集成到一个内分泌科的临床工作流程中,由内分泌科医生对视网膜图像进行分级,并评估其实施的影响。实施前后分别收集了716例患者的1432张图像和700例患者的1400张图像。以眼科医生的分级作为参考标准,在图像层面检测可转诊糖尿病视网膜病变(RDR)的灵敏度、特异度和受试者操作特征曲线下面积(AUC)分别为0.91(0.87 - 0.96)、0.90(0.87 - 0.92)和0.90(0.87 - 0.93);在患者层面分别为0.91(0.81 - 0.97)、0.84(0.80 - 0.87)和0.87(0.83 - 0.91)。实施后每月的RDR率从55.1%降至43.0%。在规定时间内完成分级的月度百分比从66.8%提高到77.6%。实施后软件与内分泌科医生之间存在广泛的一致性值(kappa值为0.17 - 0.65)。总之,我们观察到了基于深度学习的软件对没有视网膜亚专业知识的分级人员的临床影响。然而,建议在临床实施前使用本地数据集的图像进行验证。