Moorfields Biomedical Research Centre, Moorfields Eye Hospital, London, United Kingdom.
Department of Social Policy, LSE Health, London School of Economics and Political Science, London, United Kingdom.
Ophthalmology. 2017 Mar;124(3):343-351. doi: 10.1016/j.ophtha.2016.11.014. Epub 2016 Dec 23.
With the increasing prevalence of diabetes, annual screening for diabetic retinopathy (DR) by expert human grading of retinal images is challenging. Automated DR image assessment systems (ARIAS) may provide clinically effective and cost-effective detection of retinopathy. We aimed to determine whether ARIAS can be safely introduced into DR screening pathways to replace human graders.
Observational measurement comparison study of human graders following a national screening program for DR versus ARIAS.
Retinal images from 20 258 consecutive patients attending routine annual diabetic eye screening between June 1, 2012, and November 4, 2013.
Retinal images were manually graded following a standard national protocol for DR screening and were processed by 3 ARIAS: iGradingM, Retmarker, and EyeArt. Discrepancies between manual grades and ARIAS results were sent to a reading center for arbitration.
Screening performance (sensitivity, false-positive rate) and diagnostic accuracy (95% confidence intervals of screening-performance measures) were determined. Economic analysis estimated the cost per appropriate screening outcome.
Sensitivity point estimates (95% confidence intervals) of the ARIAS were as follows: EyeArt 94.7% (94.2%-95.2%) for any retinopathy, 93.8% (92.9%-94.6%) for referable retinopathy (human graded as either ungradable, maculopathy, preproliferative, or proliferative), 99.6% (97.0%-99.9%) for proliferative retinopathy; Retmarker 73.0% (72.0 %-74.0%) for any retinopathy, 85.0% (83.6%-86.2%) for referable retinopathy, 97.9% (94.9%-99.1%) for proliferative retinopathy. iGradingM classified all images as either having disease or being ungradable. EyeArt and Retmarker saved costs compared with manual grading both as a replacement for initial human grading and as a filter prior to primary human grading, although the latter approach was less cost-effective.
Retmarker and EyeArt systems achieved acceptable sensitivity for referable retinopathy when compared with that of human graders and had sufficient specificity to make them cost-effective alternatives to manual grading alone. ARIAS have the potential to reduce costs in developed-world health care economies and to aid delivery of DR screening in developing or remote health care settings.
随着糖尿病患病率的不断增加,通过专家对视网膜图像进行分级来对糖尿病性视网膜病变(DR)进行年度筛查具有挑战性。自动化 DR 图像评估系统(ARIAS)可能会提供具有临床有效性和成本效益的视网膜病变检测。我们旨在确定 ARIAS 是否可以安全地引入 DR 筛查途径以替代人工分级。
对 2012 年 6 月 1 日至 2013 年 11 月 4 日期间参加常规年度糖尿病眼病筛查的 20258 例连续患者的人群进行了 DR 筛查的全国性筛查计划中,比较了人工分级和 ARIAS 的观测测量比较研究。
来自参加常规年度糖尿病眼病筛查的 20258 例连续患者的视网膜图像。
根据 DR 筛查的标准全国方案对视网膜图像进行手动分级,并由 3 种 ARIAS(iGradingM、Retmarker 和 EyeArt)进行处理。手动分级和 ARIAS 结果之间的差异被发送到一个阅读中心进行仲裁。
确定筛查性能(敏感性、假阳性率)和诊断准确性(筛查性能指标的 95%置信区间)。经济分析估计了每个适当筛查结果的成本。
ARIAS 的敏感性点估计值(95%置信区间)如下:EyeArt 为 94.7%(94.2%-95.2%)用于任何视网膜病变,93.8%(92.9%-94.6%)用于可分级视网膜病变(人工分级为不可分级、黄斑病变、前增生期或增生期),99.6%(97.0%-99.9%)用于增生性视网膜病变;Retmarker 为 73.0%(72.0%-74.0%)用于任何视网膜病变,85.0%(83.6%-86.2%)用于可分级视网膜病变,97.9%(94.9%-99.1%)用于增生性视网膜病变。iGradingM 将所有图像分类为有疾病或无法分级。与人工分级相比,EyeArt 和 Retmarker 都节省了成本,无论是作为初始人工分级的替代,还是作为主要人工分级之前的筛选,尽管后者的成本效益较低。
当与人工分级相比时,Retmarker 和 EyeArt 系统在可分级视网膜病变方面达到了可接受的敏感性,并且特异性足以使它们成为单独进行人工分级的具有成本效益的替代方法。ARIAS 有可能降低发达经济体的医疗保健成本,并有助于在发展中国家或偏远的医疗保健环境中提供 DR 筛查。