Tufail Adnan, Kapetanakis Venediktos V, Salas-Vega Sebastian, Egan Catherine, Rudisill Caroline, Owen Christopher G, Lee Aaron, Louw Vern, Anderson John, Liew Gerald, Bolter Louis, Bailey Clare, Sadda SriniVas, Taylor Paul, Rudnicka Alicja R
National Institute for Health Research Moorfields Biomedical Research Centre, Moorfields Eye Hospital, London, UK.
Population Health Research Institute, St George's, University of London, London, UK.
Health Technol Assess. 2016 Dec;20(92):1-72. doi: 10.3310/hta20920.
Diabetic retinopathy screening in England involves labour-intensive manual grading of retinal images. Automated retinal image analysis systems (ARIASs) may offer an alternative to manual grading.
To determine the screening performance and cost-effectiveness of ARIASs to replace level 1 human graders or pre-screen with ARIASs in the NHS diabetic eye screening programme (DESP). To examine technical issues associated with implementation.
Observational retrospective measurement comparison study with a real-time evaluation of technical issues and a decision-analytic model to evaluate cost-effectiveness.
A NHS DESP.
Consecutive diabetic patients who attended a routine annual NHS DESP visit.
Retinal images were manually graded and processed by three ARIASs: iGradingM (version 1.1; originally Medalytix Group Ltd, Manchester, UK, but purchased by Digital Healthcare, Cambridge, UK, at the initiation of the study, purchased in turn by EMIS Health, Leeds, UK, after conclusion of the study), Retmarker (version 0.8.2, Retmarker Ltd, Coimbra, Portugal) and EyeArt (Eyenuk Inc., Woodland Hills, CA, USA). The final manual grade was used as the reference standard. Arbitration on a subset of discrepancies between manual grading and the use of an ARIAS by a reading centre masked to all grading was used to create a reference standard manual grade modified by arbitration.
Screening performance (sensitivity, specificity, false-positive rate and likelihood ratios) and diagnostic accuracy [95% confidence intervals (CIs)] of ARIASs. A secondary analysis explored the influence of camera type and patients' ethnicity, age and sex on screening performance. Economic analysis estimated the cost per appropriate screening outcome identified.
A total of 20,258 patients with 102,856 images were entered into the study. The sensitivity point estimates of the ARIASs were as follows: EyeArt 94.7% (95% CI 94.2% to 95.2%) for any retinopathy, 93.8% (95% CI 92.9% to 94.6%) for referable retinopathy and 99.6% (95% CI 97.0% to 99.9%) for proliferative retinopathy; and Retmarker 73.0% (95% CI 72.0% to 74.0%) for any retinopathy, 85.0% (95% CI 83.6% to 86.2%) for referable retinopathy and 97.9% (95% CI 94.9 to 99.1%) for proliferative retinopathy. iGradingM classified all images as either 'disease' or 'ungradable', limiting further iGradingM analysis. The sensitivity and false-positive rates for EyeArt were not affected by ethnicity, sex or camera type but sensitivity declined marginally with increasing patient age. The screening performance of Retmarker appeared to vary with patient's age, ethnicity and camera type. Both EyeArt and Retmarker were cost saving relative to manual grading either as a replacement for level 1 human grading or used prior to level 1 human grading, although the latter was less cost-effective. A threshold analysis testing the highest ARIAS cost per patient before which ARIASs became more expensive per appropriate outcome than human grading, when used to replace level 1 grader, was Retmarker £3.82 and EyeArt £2.71 per patient.
The non-randomised study design limited the health economic analysis but the same retinal images were processed by all ARIASs in this measurement comparison study.
Retmarker and EyeArt achieved acceptable sensitivity for referable retinopathy and false-positive rates (compared with human graders as reference standard) and appear to be cost-effective alternatives to a purely manual grading approach. Future work is required to develop technical specifications to optimise deployment and address potential governance issues.
The National Institute for Health Research (NIHR) Health Technology Assessment programme, a Fight for Sight Grant (Hirsch grant award) and the Department of Health's NIHR Biomedical Research Centre for Ophthalmology at Moorfields Eye Hospital and the University College London Institute of Ophthalmology.
英国的糖尿病视网膜病变筛查涉及对视网膜图像进行劳动密集型的人工分级。自动化视网膜图像分析系统(ARIASs)可能为人工分级提供一种替代方案。
确定在英国国民医疗服务体系(NHS)糖尿病眼病筛查项目(DESP)中,ARIASs替代一级人工分级员或在一级人工分级前进行预筛查的筛查性能和成本效益。研究与实施相关的技术问题。
观察性回顾性测量比较研究,对技术问题进行实时评估,并使用决策分析模型评估成本效益。
NHS DESP。
连续参加NHS DESP年度常规检查的糖尿病患者。
视网膜图像由三个ARIASs进行人工分级和处理:iGradingM(版本1.1;最初由英国曼彻斯特的Medalytix Group Ltd公司开发,但在研究开始时被英国剑桥的Digital Healthcare公司收购,研究结束后又被英国利兹的EMIS Health公司收购)、Retmarker(版本0.8.2,葡萄牙科英布拉的Retmarker Ltd公司)和EyeArt(美国加利福尼亚州伍德兰希尔斯的Eyenuk Inc.公司)。最终的人工分级用作参考标准。由一个对所有分级情况均不知情的阅读中心对人工分级与ARIASs使用之间的部分差异进行仲裁,以创建经仲裁修改后的参考标准人工分级。
ARIASs的筛查性能(敏感性、特异性、假阳性率和似然比)和诊断准确性[95%置信区间(CIs)]。二次分析探讨了相机类型以及患者的种族、年龄和性别对筛查性能的影响。经济分析估计了每识别出一个适当筛查结果的成本。
共有20258名患者的102856张图像纳入研究。ARIASs的敏感性点估计如下:对于任何视网膜病变,EyeArt为94.7%(95%CI 94.2%至95.2%),对于可转诊视网膜病变为93.8%(95%CI 92.9%至94.6%),对于增殖性视网膜病变为99.6%(95%CI 97.0%至99.9%);对于任何视网膜病变,Retmarker为73.0%(95%CI 72.0%至74.0%),对于可转诊视网膜病变为85.0%(95%CI 83.6%至86.2%),对于增殖性视网膜病变为97.9%(95%CI 94.9至99.1%)。iGradingM将所有图像分类为“疾病”或“不可分级”,限制了对iGradingM的进一步分析。EyeArt的敏感性和假阳性率不受种族、性别或相机类型的影响,但敏感性随患者年龄增加略有下降。Retmarker的筛查性能似乎因患者年龄、种族和相机类型而异。无论是替代一级人工分级还是在一级人工分级前使用,EyeArt和Retmarker相对于人工分级均具有成本效益,尽管后者的成本效益较低。一项阈值分析测试了每位患者ARIASs的最高成本,在该成本之前,ARIASs用于替代一级分级员时,每个适当结果的成本比人工分级更高,Retmarker为每位患者3.82英镑,EyeArt为每位患者2.71英镑。
非随机研究设计限制了健康经济分析,但在本测量比较研究中,所有ARIASs处理的是相同的视网膜图像。
Retmarker和EyeArt对可转诊视网膜病变的敏感性和假阳性率(与作为参考标准的人工分级员相比)达到了可接受水平,似乎是纯人工分级方法具有成本效益的替代方案。未来需要开展工作来制定技术规范,以优化部署并解决潜在的管理问题。
国家卫生研究院(NIHR)卫生技术评估项目、视力保护基金(赫希赠款奖)以及卫生部在摩尔菲尔德眼科医院和伦敦大学学院眼科研究所设立的NIHR眼科生物医学研究中心。