Grzybowski Andrzej, Brona Piotr, Krzywicki Tomasz, Gaca-Wysocka Magdalena, Berlińska Arleta, Święch Anna
Department of Ophthalmology, University of Warmia and Mazury, 10-719 Olsztyn, Poland.
Institute for Research in Ophthalmology, Foundation for Ophthalmology Development, 60-553 Poznan, Poland.
J Clin Med. 2022 May 31;11(11):3125. doi: 10.3390/jcm11113125.
Poland has never had a widespread diabetic retinopathy (DR) screening program and subsequently has no purpose-trained graders and no established grader training scheme. Herein, we compare the performance and variability of three retinal specialists with no additional DR grading training in assessing images from 335 real-life screening encounters and contrast their performance against IDx-DR, a US Food and Drug Administration (FDA) approved DR screening suite. A total of 1501 fundus images from 670 eyes were assessed by each grader with a final grade on a per-eye level. Unanimous agreement between all graders was achieved for 385 eyes, and 110 patients, out of which 98% had a final grade of no DR. Thirty-six patients had final grades higher than mild DR, out of which only two had no grader disagreements regarding severity. A total of 28 eyes underwent adjudication due to complete grader disagreement. Four patients had discordant grades ranging from no DR to severe DR between the human graders and IDx-DR. Retina specialists achieved kappa scores of 0.52, 0.78, and 0.61. Retina specialists had relatively high grader variability and only a modest concordance with IDx-DR results. Focused training and verification are recommended for any potential DR graders before assessing DR screening images.
波兰从未有过广泛的糖尿病视网膜病变(DR)筛查项目,因此没有专门培训过的分级人员,也没有既定的分级人员培训方案。在此,我们比较了三名未接受额外DR分级培训的视网膜专家在评估335次实际筛查病例图像时的表现和变异性,并将他们的表现与美国食品药品监督管理局(FDA)批准的DR筛查套件IDx-DR进行对比。每位分级人员对来自670只眼睛的1501张眼底图像进行了评估,并给出每只眼睛的最终分级。所有分级人员对385只眼睛和110名患者达成了一致意见,其中98%的最终分级为无DR。36名患者的最终分级高于轻度DR,其中只有两名患者在严重程度上没有分级人员意见分歧。由于分级人员完全意见不一致,共有28只眼睛进行了裁定。在人工分级人员和IDx-DR之间,有四名患者的分级结果不一致,范围从无DR到重度DR。视网膜专家的kappa评分分别为0.52、0.78和0.61。视网膜专家的分级人员变异性相对较高,与IDx-DR结果的一致性仅为中等。建议在评估DR筛查图像之前,对任何潜在的DR分级人员进行针对性培训和验证。