• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 Transformer 的聚焦注意力机制在视网膜图像可解释分类中的应用。

Focused Attention in Transformers for interpretable classification of retinal images.

机构信息

LIV4D, Polytechnique Montréal, 2500 Ch. de Polytechnique, Montréal, QC, H3T 1J4, Canada.

Centre Universitaire d'Ophtalmologie, Maisonneuve-Rosemont Hospital, 5415 Boul. de l'Assomption, Montréal, QC, H1T 2M4, Canada.

出版信息

Med Image Anal. 2022 Nov;82:102608. doi: 10.1016/j.media.2022.102608. Epub 2022 Sep 7.

DOI:10.1016/j.media.2022.102608
PMID:36150271
Abstract

Vision Transformers have recently emerged as a competitive architecture in image classification. The tremendous popularity of this model and its variants comes from its high performance and its ability to produce interpretable predictions. However, both of these characteristics remain to be assessed in depth on retinal images. This study proposes a thorough performance evaluation of several Transformers compared to traditional Convolutional Neural Network (CNN) models for retinal disease classification. Special attention is given to multi-modality imaging (fundus and OCT) and generalization to external data. In addition, we propose a novel mechanism to generate interpretable predictions via attribution maps. Existing attribution methods from Transformer models have the disadvantage of producing low-resolution heatmaps. Our contribution, called Focused Attention, uses iterative conditional patch resampling to tackle this issue. By means of a survey involving four retinal specialists, we validated both the superior interpretability of Vision Transformers compared to the attribution maps produced from CNNs and the relevance of Focused Attention as a lesion detector.

摘要

视觉转换器最近在图像分类中崭露头角,成为一种具有竞争力的架构。这种模型及其变体的巨大流行,源于其高性能和产生可解释预测的能力。然而,这些特性在视网膜图像上仍需要深入评估。本研究提出了对几种转换器与传统卷积神经网络(CNN)模型在视网膜疾病分类方面的全面性能评估。特别关注多模态成像(眼底和 OCT)和对外部数据的泛化。此外,我们提出了一种通过归因图生成可解释预测的新机制。来自 Transformer 模型的现有归因方法存在生成低分辨率热图的缺点。我们的贡献称为聚焦注意力,使用迭代条件补丁重采样来解决这个问题。通过涉及四名视网膜专家的调查,我们验证了与 CNN 生成的归因图相比,Vision Transformer 具有更高的可解释性,以及 Focused Attention 作为病变检测器的相关性。

相似文献

1
Focused Attention in Transformers for interpretable classification of retinal images.基于 Transformer 的聚焦注意力机制在视网膜图像可解释分类中的应用。
Med Image Anal. 2022 Nov;82:102608. doi: 10.1016/j.media.2022.102608. Epub 2022 Sep 7.
2
HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images.HTC-retina:一种使用来自光学相干断层扫描图像的变压器-卷积神经网络的混合视网膜疾病分类模型。
Comput Biol Med. 2024 Aug;178:108726. doi: 10.1016/j.compbiomed.2024.108726. Epub 2024 Jun 9.
3
How to Extract More Information With Less Burden: Fundus Image Classification and Retinal Disease Localization With Ophthalmologist Intervention.如何用更少的负担提取更多信息:带有眼科医生干预的眼底图像分类和视网膜疾病定位。
IEEE J Biomed Health Inform. 2020 Dec;24(12):3351-3361. doi: 10.1109/JBHI.2020.3011805. Epub 2020 Dec 4.
4
Vision Transformer-based recognition of diabetic retinopathy grade.基于 Vision Transformer 的糖尿病视网膜病变分级识别。
Med Phys. 2021 Dec;48(12):7850-7863. doi: 10.1002/mp.15312. Epub 2021 Nov 16.
5
Development and Validation of Deep Learning Models for Screening Multiple Abnormal Findings in Retinal Fundus Images.深度学习模型在视网膜眼底图像多种异常发现筛查中的开发与验证。
Ophthalmology. 2020 Jan;127(1):85-94. doi: 10.1016/j.ophtha.2019.05.029. Epub 2019 May 31.
6
Multi-Label Retinal Disease Classification Using Transformers.基于 Transformer 的多标签视网膜疾病分类。
IEEE J Biomed Health Inform. 2023 Jun;27(6):2739-2750. doi: 10.1109/JBHI.2022.3214086. Epub 2023 Jun 5.
7
Attention to Lesion: Lesion-Aware Convolutional Neural Network for Retinal Optical Coherence Tomography Image Classification.关注病灶:用于视网膜光学相干断层扫描图像分类的病灶感知卷积神经网络。
IEEE Trans Med Imaging. 2019 Aug;38(8):1959-1970. doi: 10.1109/TMI.2019.2898414. Epub 2019 Feb 8.
8
Effects of Hypertension, Diabetes, and Smoking on Age and Sex Prediction from Retinal Fundus Images.高血压、糖尿病和吸烟对视网膜眼底图像年龄和性别预测的影响。
Sci Rep. 2020 Mar 12;10(1):4623. doi: 10.1038/s41598-020-61519-9.
9
Deep Ensemble Learning for Retinal Image Classification.基于深度集成学习的视网膜图像分类。
Transl Vis Sci Technol. 2022 Oct 3;11(10):39. doi: 10.1167/tvst.11.10.39.
10
Scale-space approximated convolutional neural networks for retinal vessel segmentation.用于视网膜血管分割的尺度空间逼近卷积神经网络。
Comput Methods Programs Biomed. 2019 Sep;178:237-246. doi: 10.1016/j.cmpb.2019.06.030. Epub 2019 Jun 29.

引用本文的文献

1
Low-Rank Fine-Tuning Meets Cross-modal Analysis: A Robust Framework for Age-Related Macular Degeneration Categorization.低秩微调与跨模态分析:一种用于年龄相关性黄斑变性分类的稳健框架。
J Imaging Inform Med. 2025 Apr 29. doi: 10.1007/s10278-025-01513-7.
2
Discriminative, generative artificial intelligence, and foundation models in retina imaging.视网膜成像中的判别式、生成式人工智能及基础模型。
Taiwan J Ophthalmol. 2024 Nov 28;14(4):473-485. doi: 10.4103/tjo.TJO-D-24-00064. eCollection 2024 Oct-Dec.
3
Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection.
具有多方向选择机制的多分辨率视觉曼巴用于视网膜疾病检测
Front Cell Dev Biol. 2024 Oct 11;12:1484880. doi: 10.3389/fcell.2024.1484880. eCollection 2024.
4
Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.医学图像分析中视觉转换器与卷积神经网络的比较:系统评价。
J Med Syst. 2024 Sep 12;48(1):84. doi: 10.1007/s10916-024-02105-8.
5
A Comprehensive Review of AI Diagnosis Strategies for Age-Related Macular Degeneration (AMD).年龄相关性黄斑变性(AMD)人工智能诊断策略的综合综述
Bioengineering (Basel). 2024 Jul 13;11(7):711. doi: 10.3390/bioengineering11070711.
6
Multi-label classification of retinal diseases based on fundus images using Resnet and Transformer.基于眼底图像的 Resnet 和 Transformer 的视网膜疾病多标签分类。
Med Biol Eng Comput. 2024 Nov;62(11):3459-3469. doi: 10.1007/s11517-024-03144-6. Epub 2024 Jun 14.
7
Glaucoma detection model by exploiting multi-region and multi-scan-pattern OCT images with dynamical region score.基于动态区域评分利用多区域和多扫描模式光学相干断层扫描(OCT)图像的青光眼检测模型
Biomed Opt Express. 2024 Feb 2;15(3):1370-1392. doi: 10.1364/BOE.512138. eCollection 2024 Mar 1.
8
Ultrasound Image Analysis with Vision Transformers-Review.基于视觉Transformer的超声图像分析——综述
Diagnostics (Basel). 2024 Mar 4;14(5):542. doi: 10.3390/diagnostics14050542.
9
Automated detection of nine infantile fundus diseases and conditions in retinal images using a deep learning system.使用深度学习系统自动检测视网膜图像中的九种婴儿眼底疾病及状况。
EPMA J. 2024 Feb 15;15(1):39-51. doi: 10.1007/s13167-024-00350-y. eCollection 2024 Mar.
10
Multi-Dataset Comparison of Vision Transformers and Convolutional Neural Networks for Detecting Glaucomatous Optic Neuropathy from Fundus Photographs.用于从眼底照片中检测青光眼性视神经病变的视觉Transformer与卷积神经网络的多数据集比较
Bioengineering (Basel). 2023 Oct 30;10(11):1266. doi: 10.3390/bioengineering10111266.