Ai Zhuang, Huang Xuan, Feng Jing, Wang Hui, Tao Yong, Zeng Fanxin, Lu Yaping
Department of Research and Development, Sinopharm Genomics Technology Co., Ltd., Jiangsu, China.
Department of Ophthalmology, Beijing Chao-Yang Hospital, Capital Medical University, Beijing, China.
Front Neuroinform. 2022 Jun 16;16:876927. doi: 10.3389/fninf.2022.876927. eCollection 2022.
Optical coherence tomography (OCT) is a new type of tomography that has experienced rapid development and potential in recent years. It is playing an increasingly important role in retinopathy diagnoses. At present, due to the uneven distributions of medical resources in various regions, the uneven proficiency levels of doctors in grassroots and remote areas, and the development needs of rare disease diagnosis and precision medicine, artificial intelligence technology based on deep learning can provide fast, accurate, and effective solutions for the recognition and diagnosis of retinal OCT images. To prevent vision damage and blindness caused by the delayed discovery of retinopathy, a fusion network (FN)-based retinal OCT classification algorithm (FN-OCT) is proposed in this paper to improve upon the adaptability and accuracy of traditional classification algorithms. The InceptionV3, Inception-ResNet, and Xception deep learning algorithms are used as base classifiers, a convolutional block attention mechanism (CBAM) is added after each base classifier, and three different fusion strategies are used to merge the prediction results of the base classifiers to output the final prediction results (choroidal neovascularization (CNV), diabetic macular oedema (DME), drusen, normal). The results show that in a classification problem involving the UCSD common retinal OCT dataset (108,312 OCT images from 4,686 patients), compared with that of the InceptionV3 network model, the prediction accuracy of FN-OCT is improved by 5.3% (accuracy = 98.7%, area under the curve (AUC) = 99.1%). The predictive accuracy and AUC achieved on an external dataset for the classification of retinal OCT diseases are 92 and 94.5%, respectively, and gradient-weighted class activation mapping (Grad-CAM) is used as a visualization tool to verify the effectiveness of the proposed FNs. This finding indicates that the developed fusion algorithm can significantly improve the performance of classifiers while providing a powerful tool and theoretical support for assisting with the diagnosis of retinal OCT.
光学相干断层扫描(OCT)是近年来迅速发展且颇具潜力的一种新型断层扫描技术。它在视网膜病变诊断中发挥着越来越重要的作用。目前,由于各地区医疗资源分布不均、基层和偏远地区医生水平参差不齐以及罕见病诊断和精准医学的发展需求,基于深度学习的人工智能技术可为视网膜OCT图像的识别与诊断提供快速、准确且有效的解决方案。为防止因视网膜病变发现延迟导致的视力损害和失明,本文提出了一种基于融合网络(FN)的视网膜OCT分类算法(FN - OCT),以提高传统分类算法的适应性和准确性。将InceptionV3、Inception - ResNet和Xception深度学习算法用作基础分类器,在每个基础分类器之后添加卷积块注意力机制(CBAM),并使用三种不同的融合策略合并基础分类器的预测结果,以输出最终预测结果(脉络膜新生血管(CNV)、糖尿病性黄斑水肿(DME)、玻璃膜疣、正常)。结果表明,在涉及加州大学圣地亚哥分校常见视网膜OCT数据集(来自4686名患者的108312张OCT图像)的分类问题中,与InceptionV3网络模型相比,FN - OCT的预测准确率提高了5.3%(准确率 = 98.7%,曲线下面积(AUC) = 99.1%)。在用于视网膜OCT疾病分类的外部数据集上实现的预测准确率和AUC分别为92%和94.5%,并使用梯度加权类激活映射(Grad - CAM)作为可视化工具来验证所提出融合网络的有效性。这一发现表明,所开发的融合算法可显著提高分类器的性能,同时为辅助视网膜OCT诊断提供强大的工具和理论支持。