Department of Biomedical Sciences, University of Alcalá (IRYCIS), 28871, Madrid, Spain.
Department of Ophthalmology, Retina Unit, University Hospital "Príncipe de Asturias", 28805, Madrid, Spain.
Sci Rep. 2024 Jul 31;14(1):17633. doi: 10.1038/s41598-024-68489-2.
Several studies published so far used highly selective image datasets from unclear sources to train computer vision models and that may lead to overestimated results, while those studies conducted in real-life remain scarce. To avoid image selection bias, we stacked convolutional and recurrent neural networks (CNN-RNN) to analyze complete optical coherence tomography (OCT) cubes in a row and predict diabetic macular edema (DME), in a real-world diabetic retinopathy screening program. A retrospective cohort study was carried out. Throughout 4-years, 5314 OCT cubes from 4408 subjects who attended to the diabetic retinopathy (DR) screening program were included. We arranged twenty-two (22) pre-trained CNNs in parallel with a bidirectional RNN layer stacked at the bottom, allowing the model to make a prediction for the whole OCT cube. The staff of retina experts built a ground truth of DME later used to train a set of these CNN-RNN models with different configurations. For each trained CNN-RNN model, we performed threshold tuning to find the optimal cut-off point for binary classification of DME. Finally, the best models were selected according to sensitivity, specificity, and area under the receiver operating characteristics curve (AUROC) with their 95% confidence intervals (95%CI). An ensemble of the best models was also explored. 5188 cubes were non-DME and 126 were DME. Three models achieved an AUROC of 0.94. Among these, sensitivity, and specificity (95%CI) ranged from 84.1-90.5 and 89.7-93.3, respectively, at threshold 1, from 89.7-92.1 and 80-83.1 at threshold 2, and from 80.2-81 and 93.8-97, at threshold 3. The ensemble model improved these results, and lower specificity was observed among subjects with sight-threatening DR. Analysis by age, gender, or grade of DME did not vary the performance of the models. CNN-RNN models showed high diagnostic accuracy for detecting DME in a real-world setting. This engine allowed us to detect extra-foveal DMEs commonly overlooked in other studies, and showed potential for application as the first filter of non-referable patients in an outpatient center within a population-based DR screening program, otherwise ended up in specialized care.
目前已发表的多项研究使用了来源不明的高度选择性的图像数据集来训练计算机视觉模型,这可能导致结果被高估,而实际生活中进行的研究仍然很少。为了避免图像选择偏差,我们将卷积和循环神经网络 (CNN-RNN) 堆叠起来,对一系列完整的光学相干断层扫描 (OCT) 体素进行分析,并在现实世界的糖尿病视网膜病变筛查项目中预测糖尿病黄斑水肿 (DME)。进行了一项回顾性队列研究。在 4 年的时间里,纳入了 4408 名参加糖尿病视网膜病变 (DR) 筛查项目的患者的 5314 个 OCT 体素。我们将 22 个 (22) 个预先训练好的 CNN 平行排列,并在底部堆叠一个双向 RNN 层,使模型能够对整个 OCT 体素进行预测。视网膜专家组成员构建了 DME 的真实数据,后来用于训练具有不同配置的一组这些 CNN-RNN 模型。对于每个训练好的 CNN-RNN 模型,我们进行了阈值调整,以找到 DME 二进制分类的最佳截断点。最后,根据敏感性、特异性和接收器工作特性曲线下的面积 (AUROC) 及其 95%置信区间 (95%CI) 选择最佳模型。还探索了最佳模型的集合。5188 个体素是非 DME,126 个体素是 DME。三个模型的 AUROC 达到 0.94。在这些模型中,阈值为 1 时的敏感性和特异性 (95%CI) 范围为 84.1-90.5 和 89.7-93.3,阈值为 2 时为 89.7-92.1 和 80-83.1,阈值为 3 时为 80.2-81 和 93.8-97。集合模型提高了这些结果,并且在有威胁视力的 DR 患者中观察到特异性较低。按年龄、性别或 DME 分级进行分析并未改变模型的性能。CNN-RNN 模型在真实环境下检测 DME 具有较高的诊断准确性。该引擎使我们能够检测到在其他研究中通常被忽视的中心凹外 DME,并显示出作为基于人群的 DR 筛查项目中门诊中心非转诊患者的第一道筛选的应用潜力,否则这些患者将被转诊至专科治疗。
IEEE J Biomed Health Inform. 2020-12
Int Ophthalmol. 2024-4-23
Invest Ophthalmol Vis Sci. 2019-3-1
Turk J Ophthalmol. 2019-12-31
Cochrane Database Syst Rev. 2015-1-7
Bioengineering (Basel). 2025-4-23
Diabetes Metab Syndr Obes. 2025-4-26
Curr Opin Ophthalmol. 2023-9-1
Transl Vis Sci Technol. 2020-2-18