Casey Eye Institute, Oregon Health & Science University, Portland, Oregon.
Arkansas Children's Hospital, University of Arkansas for Medical Sciences, Little Rock, Arkansas.
Ophthalmology. 2024 Nov;131(11):1290-1296. doi: 10.1016/j.ophtha.2024.06.006. Epub 2024 Jun 10.
To evaluate whether providing clinicians with an artificial intelligence (AI)-based vascular severity score (VSS) improves consistency in the diagnosis of plus disease in retinopathy of prematurity (ROP).
Multireader diagnostic accuracy imaging study.
Eleven ROP experts, 9 of whom had been in practice for 10 years or more.
RetCam (Natus Medical Incorporated) fundus images were obtained from premature infants during routine ROP screening as part of the Imaging and Informatics in ROP study between January 2012 and July 2020. From all available examinations, a subset of 150 eye examinations from 110 infants were selected for grading. An AI-based VSS was assigned to each set of images using the i-ROP DL system (Siloam Vision). The clinicians were asked to diagnose plus disease for each examination and to assign an estimated VSS (range, 1-9) at baseline, and then again 1 month later with AI-based VSS assistance. A reference standard diagnosis (RSD) was assigned to each eye examination from the Imaging and Informatics in ROP study based on 3 masked expert labels and the ophthalmoscopic diagnosis.
Mean linearly weighted κ value for plus disease diagnosis compared with RSD. Area under the receiver operating characteristic curve (AUC) and area under the precision-recall curve (AUPR) for labels 1 through 9 compared with RSD for plus disease.
Expert agreement improved significantly, from substantial (κ value, 0.69 [0.59, 0.75]) to near perfect (κ value, 0.81 [0.71, 0.86]), when AI-based VSS was integrated. Additionally, a significant improvement in plus disease discrimination was achieved as measured by mean AUC (from 0.94 [95% confidence interval (CI), 0.92-0.96] to 0.98 [95% CI, 0.96-0.99]; difference, 0.04 [95% CI, 0.01-0.06]) and AUPR (from 0.86 [95% CI, 0.81-0.90] to 0.95 [95% CI, 0.91-0.97]; difference, 0.09 [95% CI, 0.03-0.14]).
Providing ROP clinicians with an AI-based measurement of vascular severity in ROP was associated with both improved plus disease diagnosis and improved continuous severity labeling as compared with an RSD for plus disease. If implemented in practice, AI-based VSS could reduce interobserver variability and could standardize treatment for infants with ROP.
FINANCIAL DISCLOSURE(S): Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
评估为临床医生提供基于人工智能(AI)的血管严重程度评分(VSS)是否能提高早产儿视网膜病变(ROP)中 plus 病诊断的一致性。
多读者诊断准确性影像学研究。
11 名 ROP 专家,其中 9 名从事该领域工作 10 年或以上。
在 2012 年 1 月至 2020 年 7 月期间,作为 Imaging and Informatics in ROP 研究的一部分,使用 RetCam(Natus Medical Incorporated)对接受常规 ROP 筛查的早产儿进行眼底图像采集。从所有可获得的检查中,从 110 名婴儿中选择了 150 只眼的检查进行分级。使用 i-ROP DL 系统(Siloam Vision)为每一组图像分配一个基于 AI 的 VSS。要求临床医生对每一次检查进行 plus 病诊断,并在基线时和使用 AI 辅助后的 1 个月后分别分配一个估计的 VSS(范围为 1-9)。根据 3 位盲法专家标签和检眼镜诊断,对每只眼睛的检查进行参考标准诊断(RSD)。
与 RSD 相比,plus 病诊断的平均线性加权κ 值。标签 1 至 9 的受试者工作特征曲线(ROC)下面积(AUC)和精度-召回曲线(AUPR)与 RSD 相比的 plus 病 AUC。
当整合基于 AI 的 VSS 时,专家的一致性显著提高,从有意义(κ 值,0.69 [0.59,0.75])变为接近完美(κ 值,0.81 [0.71,0.86])。此外,如平均 AUC(从 0.94 [95%置信区间(CI),0.92-0.96]到 0.98 [95% CI,0.96-0.99];差值,0.04 [95% CI,0.01-0.06])和 AUPR(从 0.86 [95% CI,0.81-0.90]到 0.95 [95% CI,0.91-0.97];差值,0.09 [95% CI,0.03-0.14])所示,plus 病的判别能力也有显著提高。
与 RSD 相比,为 ROP 临床医生提供基于 AI 的 ROP 血管严重程度测量值,与提高 plus 病诊断和改善连续严重程度标记有关。如果在实践中实施,基于 AI 的 VSS 可以减少观察者间的变异性,并可以为 ROP 婴儿的治疗提供标准化。
在本文末尾的脚注和披露中可以找到专有或商业披露信息。