基于注意力机制和 EfficientNet-B3 网络的自动青光眼分级方法。

An automatic glaucoma grading method based on attention mechanism and EfficientNet-B3 network.

机构信息

School of Software Engineering, Xiamen University of Technology, Xiamen, China.

School of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China.

出版信息

PLoS One. 2024 Aug 16;19(8):e0296229. doi: 10.1371/journal.pone.0296229. eCollection 2024.

DOI:10.1371/journal.pone.0296229

PMID:39150930

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11329118/

Abstract

Glaucoma infection is rapidly spreading globally and the number of glaucoma patients is expected to exceed 110 million by 2040. Early identification and detection of glaucoma is particularly important as it can easily lead to irreversible vision damage or even blindness if not treated with intervention in the early stages. Deep learning has attracted much attention in the field of computer vision and has been widely studied especially in the recognition and diagnosis of ophthalmic diseases. It is challenging to efficiently extract effective features for accurate grading of glaucoma in a limited dataset. Currently, in glaucoma recognition algorithms, 2D fundus images are mainly used to automatically identify the disease or not, but do not distinguish between early or late stages; however, in clinical practice, the treatment of early and late glaucoma is not the same, so it is more important to proceed to achieve accurate grading of glaucoma. This study uses a private dataset containing modal data, 2D fundus images, and 3D-OCT scanner images, to extract the effective features therein to achieve an accurate triple classification (normal, early, and moderately advanced) for optimal performance on various measures. In view of this, this paper proposes an automatic glaucoma classification method based on the attention mechanism and EfficientNetB3 network. The EfficientNetB3 network and ResNet34 network are built to extract and fuse 2D fundus images and 3D-OCT scanner images, respectively, to achieve accurate classification. The proposed auto-classification method minimizes feature redundancy while improving classification accuracy, and incorporates an attention mechanism in the two-branch model, which enables the convolutional neural network to focus its attention on the main features of the eye and discard the meaningless black background region in the image to improve the performance of the model. The auto-classification method combined with the cross-entropy function achieves the highest accuracy up to 97.83%. Since the proposed automatic grading method is effective and ensures reliable decision-making for glaucoma screening, it can be used as a second opinion tool by doctors, which can greatly reduce missed diagnosis and misdiagnosis by doctors, and buy more time for patient's treatment.

摘要

青光眼感染在全球范围内迅速蔓延，预计到 2040 年，青光眼患者人数将超过 1.1 亿。早期识别和检测青光眼尤为重要，因为如果不在早期阶段进行干预，很容易导致不可逆转的视力损害甚至失明。深度学习在计算机视觉领域引起了广泛关注，并在眼科疾病的识别和诊断方面得到了广泛研究。在有限的数据集上，有效地提取有效特征以进行准确的青光眼分级具有挑战性。目前，在青光眼识别算法中，主要使用 2D 眼底图像来自动识别疾病，但无法区分早期或晚期；然而，在临床实践中，早期和晚期青光眼的治疗方法并不相同，因此更重要的是实现青光眼的准确分级。本研究使用包含模态数据、2D 眼底图像和 3D-OCT 扫描仪图像的私有数据集，提取其中的有效特征，以在各种指标上实现最佳性能的准确三重分类（正常、早期和中度晚期）。鉴于此，本文提出了一种基于注意力机制和 EfficientNetB3 网络的自动青光眼分类方法。分别构建 EfficientNetB3 网络和 ResNet34 网络，提取和融合 2D 眼底图像和 3D-OCT 扫描仪图像，实现准确分类。所提出的自动分类方法在最小化特征冗余的同时提高了分类精度，并在两分支模型中引入了注意力机制，使卷积神经网络能够关注眼睛的主要特征，忽略图像中无意义的黑色背景区域，从而提高模型的性能。结合交叉熵函数的自动分类方法达到了高达 97.83%的最高精度。由于所提出的自动分级方法是有效的，并确保了青光眼筛查的可靠决策，因此可以作为医生的第二意见工具，这可以大大减少医生的误诊和漏诊，并为患者的治疗争取更多时间。