Khan Sultan Daud, Basalamah Saleh, Lbath Ahmed
Department of Computer Science, National University of Technology, Islamabad, Pakistan.
Department of Computer and Network Engineering, Umm Al-Qura University, Makkah, Saudi Arabia.
Med Biol Eng Comput. 2025 Feb 7. doi: 10.1007/s11517-025-03314-0.
Retinal diseases are a serious global threat to human vision, and early identification is essential for effective prevention and treatment. However, current diagnostic methods rely on manual analysis of fundus images, which heavily depends on the expertise of ophthalmologists. This manual process is time-consuming and labor-intensive and can sometimes lead to missed diagnoses. With advancements in computer vision technology, several automated models have been proposed to improve diagnostic accuracy for retinal diseases and medical imaging in general. However, these methods face challenges in accurately detecting specific diseases within images due to inherent issues associated with fundus images, including inter-class similarities, intra-class variations, limited local information, insufficient contextual understanding, and class imbalances within datasets. To address these challenges, we propose a novel deep learning framework for accurate retinal disease classification. This framework is designed to achieve high accuracy in identifying various retinal diseases while overcoming inherent challenges associated with fundus images. Generally, the framework consists of three main modules. The first module is Densely Connected Multidilated Convolution Neural Network (DCM-CNN) that extracts global contextual information by effectively integrating novel Casual Dilated Dense Convolutional Blocks (CDDCBs). The second module of the framework, namely, Local-Patch-based Convolution Neural Network (LP-CNN), utilizes Class Activation Map (CAM) (obtained from DCM-CNN) to extract local and fine-grained information. To identify the correct class and minimize the error, a synergic network is utilized that takes the feature maps of both DCM-CNN and LP-CNN and connects both maps in a fully connected fashion to identify the correct class and minimize the errors. The framework is evaluated through a comprehensive set of experiments, both quantitatively and qualitatively, using two publicly available benchmark datasets: RFMiD and ODIR-5K. Our experimental results demonstrate the effectiveness of the proposed framework and achieves higher performance on RFMiD and ODIR-5K datasets compared to reference methods.
视网膜疾病是对人类视力的严重全球威胁,早期识别对于有效预防和治疗至关重要。然而,目前的诊断方法依赖于眼底图像的人工分析,这在很大程度上取决于眼科医生的专业知识。这种人工过程既耗时又费力,有时还会导致漏诊。随着计算机视觉技术的进步,已经提出了几种自动化模型来提高视网膜疾病以及一般医学成像的诊断准确性。然而,由于与眼底图像相关的固有问题,包括类间相似性、类内变化、局部信息有限、上下文理解不足以及数据集中的类不平衡,这些方法在准确检测图像中的特定疾病方面面临挑战。为了应对这些挑战,我们提出了一种用于准确视网膜疾病分类的新型深度学习框架。该框架旨在在识别各种视网膜疾病时实现高精度,同时克服与眼底图像相关的固有挑战。一般来说,该框架由三个主要模块组成。第一个模块是密集连接多扩张卷积神经网络(DCM-CNN),它通过有效集成新型因果扩张密集卷积块(CDDCB)来提取全局上下文信息。该框架的第二个模块,即基于局部补丁的卷积神经网络(LP-CNN),利用(从DCM-CNN获得的)类激活映射(CAM)来提取局部和细粒度信息。为了识别正确的类别并最小化误差,使用了一个协同网络,该网络获取DCM-CNN和LP-CNN的特征图,并以全连接方式连接这两个图,以识别正确的类别并最小化误差。该框架通过使用两个公开可用的基准数据集RFMiD和ODIR-5K进行了一系列全面的定量和定性实验来评估。我们的实验结果证明了所提出框架的有效性,并且与参考方法相比,在RFMiD和ODIR-5K数据集上实现了更高的性能。