Cross-modal attention network for retinal disease classification based on multi-modal images.

Authors

Liu Zirong, Hu Yan, Qiu Zhongxi, Niu Yanyan, Zhou Dan, Li Xiaoling, Shen Junyong, Jiang Hongyang, Li Heng, Liu Jiang

Affiliations

School of Ophthalmology and Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou 325027, China.

Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China.

Publication information

Biomed Opt Express. 2024 May 14;15(6):3699-3714. doi: 10.1364/BOE.516764. eCollection 2024 Jun 1.

DOI:10.1364/BOE.516764

PMID:38867787

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11166426/

Abstract

Multi-modal eye disease screening improves diagnostic accuracy by providing lesion information from different sources. However, existing multi-modal automatic diagnosis methods tend to focus on the specificity of each modality and ignore the spatial correlation between images. This paper proposes a novel cross-modal retinal disease diagnosis network (CRD-Net) that extracts relevant features from images of different modalities to aid the diagnosis of multiple retinal diseases. Specifically, our model introduces a cross-modal attention (CMA) module to query and adaptively attend to lesion-relevant features in the different modal images. In addition, we propose multiple loss functions to fuse features with modality correlation and train a multi-modal retinal image classification network to achieve a more accurate diagnosis. Experimental evaluation on three publicly available datasets shows that our CRD-Net outperforms existing single-modal and multi-modal methods.
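
The abstract does not specify how the CMA module is built, so the following is only a minimal sketch of generic scaled dot-product cross-attention, in which features from one modality act as queries over features from another. The function name `cross_modal_attention`, the choice of fundus and OCT as the two modalities, the token counts, and the projection sizes are all illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_modal_attention(query_feats, context_feats, w_q, w_k, w_v):
    """Generic cross-attention (hypothetical helper, not the paper's CMA).

    query_feats:   (n_q, d) features from the modality that asks the question
    context_feats: (n_c, d) features from the other modality
    w_q, w_k, w_v: (d, d_h) projection matrices
    """
    q = query_feats @ w_q                      # (n_q, d_h)
    k = context_feats @ w_k                    # (n_c, d_h)
    v = context_feats @ w_v                    # (n_c, d_h)
    scores = q @ k.T / np.sqrt(q.shape[-1])    # (n_q, n_c) similarity scores
    weights = softmax(scores, axis=-1)         # each query row sums to 1
    return weights @ v, weights


# Assumed toy inputs: 16 fundus tokens and 24 OCT tokens, 32 channels each.
rng = np.random.default_rng(0)
fundus_tokens = rng.normal(size=(16, 32))
oct_tokens = rng.normal(size=(24, 32))
w_q, w_k, w_v = (rng.normal(scale=0.1, size=(32, 8)) for _ in range(3))

attended, weights = cross_modal_attention(fundus_tokens, oct_tokens, w_q, w_k, w_v)
```

Here each fundus token receives a weighted summary of the OCT tokens it is most similar to, which is one plausible reading of "query and adaptively attend to" features in another modality. A symmetric variant would swap the roles of the two inputs.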

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd1b/11166426/4f6c79119cca/boe-15-6-3699-g001.jpg
