• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于细粒度医学图像分类的Transformer注意力融合

Transformer attention fusion for fine grained medical image classification.

作者信息

Badar Danyal, Abbas Junaid, Alsini Raed, Abbas Tahir, ChengLiang Wang, Daud Ali

机构信息

College of Computer Science, Chongqing University, Chongqing, China.

School of Big Data and Software Engineering, Chongqing University, Chongqing, China.

出版信息

Sci Rep. 2025 Jul 1;15(1):20655. doi: 10.1038/s41598-025-07561-x.

DOI:10.1038/s41598-025-07561-x
PMID:40596233
Abstract

Fine-grained visual classification is fundamental for medical image applications because it detects minor lesions. Diabetic retinopathy (DR) is a preventable cause of blindness, which requires exact and timely diagnosis to prevent vision damage. The challenges automated DR classification systems face include irregular lesions, uneven distributions between image classes, and inconsistent image quality that reduces diagnostic accuracy during early detection stages. Our solution to these problems includes MSCAS-Net (Multi-Scale Cross and Self-Attention Network), which uses the Swin Transformer as the backbone. It extracts features at three different resolutions (12 × 12, 24 × 24, 48 × 48), allowing it to detect subtle local features and global elements. This model uses self-attention mechanics to improve spatial connections between single scales and cross-attention to automatically match feature patterns across multiple scales, thereby developing a comprehensive information structure. The model becomes better at detecting significant lesions because of its dual mechanism, which focuses on both attention points. MSCAS-Net displays the best performance on APTOS and DDR and IDRID benchmarks by reaching accuracy levels of 93.8%, 89.80% and 86.70%, respectively. Through its algorithm, the model solves problems with imbalanced datasets and inconsistent image quality without needing data augmentation because it learns stable features. MSCAS-Net demonstrates a breakthrough in automated DR diagnostics since it combines high diagnostic precision with interpretable abilities to become an efficient AI-powered clinical decision support system. The presented research demonstrates how fine-grained visual classification methods benefit detecting and treating DR during its early stages.

摘要

细粒度视觉分类对于医学图像应用至关重要,因为它能检测出微小病变。糖尿病视网膜病变(DR)是一种可预防的致盲原因,需要准确及时的诊断以防止视力受损。自动化DR分类系统面临的挑战包括病变不规则、图像类别之间分布不均以及图像质量不一致,这些都会降低早期检测阶段的诊断准确性。我们针对这些问题的解决方案包括MSCAS-Net(多尺度交叉与自注意力网络),它以Swin Transformer作为主干。该模型在三种不同分辨率(12×12、24×24、48×48)下提取特征,使其能够检测到细微的局部特征和全局元素。此模型使用自注意力机制来改善单尺度之间的空间连接,并通过交叉注意力自动匹配多尺度之间的特征模式,从而构建出一个全面的信息结构。由于其双重机制专注于两个注意力点,该模型在检测重大病变方面表现更佳。MSCAS-Net在APROS、DDR和IDRID基准测试中分别达到了93.8%、89.80%和86.70%的准确率,展现出最佳性能。通过其算法,该模型无需数据增强就能解决数据集不平衡和图像质量不一致的问题,因为它能学习到稳定的特征。MSCAS-Net将高诊断精度与可解释能力相结合,成为一个高效的人工智能驱动的临床决策支持系统,在自动化DR诊断方面取得了突破。本研究展示了细粒度视觉分类方法如何在DR的早期阶段有助于检测和治疗。

相似文献

1
Transformer attention fusion for fine grained medical image classification.用于细粒度医学图像分类的Transformer注意力融合
Sci Rep. 2025 Jul 1;15(1):20655. doi: 10.1038/s41598-025-07561-x.
2
..
Int Ophthalmol. 2025 Jun 27;45(1):266. doi: 10.1007/s10792-025-03602-6.
3
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
4
Coinciding Diabetic Retinopathy and Diabetic Macular Edema Grading With Rat Swarm Optimization Algorithm for Enhanced Capsule Generation Adversarial Network.基于大鼠群优化算法的增强胶囊生成对抗网络实现糖尿病视网膜病变和糖尿病黄斑水肿分级的同步
Microsc Res Tech. 2025 Feb;88(2):555-563. doi: 10.1002/jemt.24709. Epub 2024 Nov 2.
5
TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet:一种新颖的跨尺度级联分层Transformer 网络,用于增强视网膜血管分割。
Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.
6
ThreeF-Net: Fine-grained feature fusion network for breast ultrasound image segmentation.ThreeF-Net:用于乳腺超声图像分割的细粒度特征融合网络。
Comput Biol Med. 2025 Aug;194:110527. doi: 10.1016/j.compbiomed.2025.110527. Epub 2025 Jun 14.
7
A novel deep learning framework for retinal disease detection leveraging contextual and local features cues from retinal images.一种用于视网膜疾病检测的新型深度学习框架,利用来自视网膜图像的上下文和局部特征线索。
Med Biol Eng Comput. 2025 Feb 7. doi: 10.1007/s11517-025-03314-0.
8
Optical coherence tomography (OCT) for detection of macular oedema in patients with diabetic retinopathy.光学相干断层扫描(OCT)用于检测糖尿病视网膜病变患者的黄斑水肿。
Cochrane Database Syst Rev. 2011 Jul 6(7):CD008081. doi: 10.1002/14651858.CD008081.pub2.
9
Fine-Grained Classification of Pressure Ulcers and Incontinence-Associated Dermatitis Using Multimodal Deep Learning: Algorithm Development and Validation Study.使用多模态深度学习对压疮和失禁相关性皮炎进行细粒度分类:算法开发与验证研究
JMIR AI. 2025 May 1;4:e67356. doi: 10.2196/67356.
10
MRI software and cognitive fusion biopsies in people with suspected prostate cancer: a systematic review, network meta-analysis and cost-effectiveness analysis.磁共振成像软件联合认知融合活检用于疑似前列腺癌患者:系统评价、网络荟萃分析和成本效果分析。
Health Technol Assess. 2024 Oct;28(61):1-310. doi: 10.3310/PLFG4210.

本文引用的文献

1
Diabetic retinopathy prediction based on vision transformer and modified capsule network.基于视觉转换器和改进胶囊网络的糖尿病视网膜病变预测。
Comput Biol Med. 2024 Jun;175:108523. doi: 10.1016/j.compbiomed.2024.108523. Epub 2024 Apr 25.
2
Interpretable Detection of Diabetic Retinopathy, Retinal Vein Occlusion, Age-Related Macular Degeneration, and Other Fundus Conditions.糖尿病性视网膜病变、视网膜静脉阻塞、年龄相关性黄斑变性及其他眼底疾病的可解释性检测。
Diagnostics (Basel). 2024 Jan 5;14(2):121. doi: 10.3390/diagnostics14020121.
3
HDR-EfficientNet: A Classification of Hypertensive and Diabetic Retinopathy Using Optimize EfficientNet Architecture.
HDR-EfficientNet:一种使用优化的EfficientNet架构对高血压性和糖尿病性视网膜病变进行的分类
Diagnostics (Basel). 2023 Oct 17;13(20):3236. doi: 10.3390/diagnostics13203236.
4
Development of revised ResNet-50 for diabetic retinopathy detection.用于糖尿病性视网膜病变检测的改进型 ResNet-50 的开发。
BMC Bioinformatics. 2023 Apr 19;24(1):157. doi: 10.1186/s12859-023-05293-1.
5
Diabetic Retinopathy Prediction Based on Wavelet Decomposition and Modified Capsule Network.基于小波分解和改进胶囊网络的糖尿病视网膜病变预测。
J Digit Imaging. 2023 Aug;36(4):1739-1751. doi: 10.1007/s10278-023-00813-0. Epub 2023 Mar 27.
6
A Regression-Based Approach to Diabetic Retinopathy Diagnosis Using Efficientnet.一种基于回归的使用Efficientnet进行糖尿病视网膜病变诊断的方法。
Diagnostics (Basel). 2023 Feb 17;13(4):774. doi: 10.3390/diagnostics13040774.
7
EDLDR: An Ensemble Deep Learning Technique for Detection and Classification of Diabetic Retinopathy.EDLDR:一种用于糖尿病视网膜病变检测与分类的集成深度学习技术。
Diagnostics (Basel). 2022 Dec 30;13(1):124. doi: 10.3390/diagnostics13010124.
8
CoT-XNet: contextual transformer with Xception network for diabetic retinopathy grading.CoT-XNet:用于糖尿病视网膜病变分级的基于Xception网络的上下文变换器
Phys Med Biol. 2022 Dec 6;67(24). doi: 10.1088/1361-6560/ac9fa0.
9
Applying supervised contrastive learning for the detection of diabetic retinopathy and its severity levels from fundus images.应用监督对比学习从眼底图像中检测糖尿病性视网膜病变及其严重程度。
Comput Biol Med. 2022 Jul;146:105602. doi: 10.1016/j.compbiomed.2022.105602. Epub 2022 May 10.
10
A Survey on Vision Transformer.视觉Transformer综述
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):87-110. doi: 10.1109/TPAMI.2022.3152247. Epub 2022 Dec 5.