Wang Weisen, Li Xirong, Xu Zhiyan, Yu Weihong, Zhao Jianchun, Ding Dayong, Chen Youxin
IEEE J Biomed Health Inform. 2022 Aug;26(8):4111-4122. doi: 10.1109/JBHI.2022.3171523. Epub 2022 Aug 11.
This paper tackles automated categorization of Age-related Macular Degeneration (AMD), a common macular disease among people over 50. Previous research efforts mainly focus on AMD categorization with a single-modal input, be it a color fundus photograph (CFP) or an OCT B-scan image. By contrast, we consider AMD categorization given a multi-modal input, a direction that is clinically meaningful yet mostly unexplored. Unlike prior art that follows a traditional pipeline of feature extraction followed by classifier training, whose stages cannot be jointly optimized, we opt for an end-to-end multi-modal Convolutional Neural Network (MM-CNN). Our MM-CNN is instantiated by a two-stream CNN, with spatially-invariant fusion to combine information from the CFP and OCT streams. In order to visually interpret the contribution of the individual modalities to the final prediction, we extend the class activation mapping (CAM) technique to the multi-modal scenario. For effective training of MM-CNN, we develop two data augmentation methods. One is GAN-based CFP/OCT image synthesis, with our novel use of CAMs as conditional input of a high-resolution image-to-image translation GAN. The other method is Loose Pairing, which pairs a CFP image and an OCT image on the basis of their classes instead of eye identities. Experiments on a clinical dataset consisting of 1,094 CFP images and 1,289 OCT images acquired from 1,093 distinct eyes show that the proposed solution obtains better F1 and Accuracy than multiple baselines for multi-modal AMD categorization. Code and data are available at https://github.com/li-xirong/mmc-amd.
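The Loose Pairing idea described above, pairing a CFP image with an OCT image by shared class label rather than by eye identity, can be sketched in a few lines. The function below is a hypothetical illustration (the helper name, sample format, and `num_pairs_per_class` parameter are assumptions, not the authors' actual implementation):

```python
import random
from collections import defaultdict

def loose_pairs(cfp_samples, oct_samples, num_pairs_per_class=4, seed=0):
    """Pair CFP and OCT samples that share a class label.

    cfp_samples / oct_samples: lists of (image_id, class_label) tuples.
    Returns (cfp_id, oct_id, class_label) training pairs; the two images
    need not come from the same eye, hence the pairing is "loose", which
    multiplies the number of usable multi-modal training pairs.
    """
    rng = random.Random(seed)
    cfp_by_class = defaultdict(list)
    oct_by_class = defaultdict(list)
    for img_id, label in cfp_samples:
        cfp_by_class[label].append(img_id)
    for img_id, label in oct_samples:
        oct_by_class[label].append(img_id)

    pairs = []
    # Only classes present in both modalities can be paired.
    for label in sorted(cfp_by_class.keys() & oct_by_class.keys()):
        for _ in range(num_pairs_per_class):
            pairs.append((rng.choice(cfp_by_class[label]),
                          rng.choice(oct_by_class[label]),
                          label))
    return pairs
```

Because pairing is by class, an eye contributing only a CFP image can still participate in multi-modal training, which is useful when the two modalities are incompletely matched, as in the dataset above (1,094 CFP vs. 1,289 OCT images from 1,093 eyes).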