Zhu Wenhui, Qiu Peijie, Lepore Natasha, Dumitrascu Oana M, Wang Yalin
School of Computing and Augmented Intelligence, Arizona State University, AZ 85281, USA.
McKeley School of Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA.
Proc SPIE Int Soc Opt Eng. 2022 Nov;12567. doi: 10.1117/12.2669772. Epub 2023 Mar 6.
Lesion appearance is a crucial clue for medical providers to distinguish referable diabetic retinopathy (rDR) from non-referable DR. Most existing large-scale DR datasets contain only image-level labels rather than pixel-based annotations. This motivates us to develop algorithms to classify rDR and segment lesions via image-level labels. This paper leverages self-supervised equivariant learning and attention-based multi-instance learning (MIL) to tackle this problem. MIL is an effective strategy to differentiate positive and negative instances, helping us discard background regions (negative instances) while localizing lesion regions (positive ones). However, MIL only provides coarse lesion localization and cannot distinguish lesions located across adjacent patches. Conversely, a self-supervised equivariant attention mechanism (SEAM) generates a segmentation-level class activation map (CAM) that can guide patch extraction of lesions more accurately. Our work aims at integrating both methods to improve rDR classification accuracy. We conduct extensive validation experiments on the Eyepacs dataset, achieving an area under the receiver operating characteristic curve (AU ROC) of 0.958, outperforming current state-of-the-art algorithms.
病变外观是医疗人员区分可转诊糖尿病视网膜病变(rDR)与不可转诊糖尿病视网膜病变的关键线索。大多数现有的大规模糖尿病视网膜病变数据集仅包含图像级标签,而非基于像素的注释。这促使我们开发通过图像级标签对rDR进行分类和分割病变的算法。本文利用自监督等变学习和基于注意力的多实例学习(MIL)来解决这一问题。多实例学习是区分正例和负例的有效策略,有助于我们在定位病变区域(正例)时舍弃背景区域(负例)。然而,多实例学习仅提供粗略的病变定位,无法区分跨相邻图像块的病变。相反,自监督等变注意力机制(SEAM)生成一个分割级类激活映射(CAM),可以更准确地指导病变的图像块提取。我们的工作旨在整合这两种方法,以提高rDR分类准确率。我们在Eyepacs数据集上进行了广泛的验证实验,受试者工作特征曲线下面积(AU ROC)达到0.958,优于当前的最先进算法。