• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过将Softmax和池化算子集成到视觉变换器中进行病变分类和糖尿病视网膜病变分级。

Lesion classification and diabetic retinopathy grading by integrating softmax and pooling operators into vision transformer.

作者信息

Liu Chong, Wang Weiguang, Lian Jian, Jiao Wanzhen

机构信息

School of Intelligence Engineering, Shandong Management University, Jinan, China.

Department of Ophthalmology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, China.

出版信息

Front Public Health. 2025 Jan 6;12:1442114. doi: 10.3389/fpubh.2024.1442114. eCollection 2024.

DOI:10.3389/fpubh.2024.1442114
PMID:39835306
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11743363/
Abstract

INTRODUCTION

Diabetic retinopathy grading plays a vital role in the diagnosis and treatment of patients. In practice, this task mainly relies on manual inspection using human visual system. However, the human visual system-based screening process is labor-intensive, time-consuming, and error-prone. Therefore, plenty of automated screening technique have been developed to address this task.

METHODS

Among these techniques, the deep learning models have demonstrated promising outcomes in various types of machine vision tasks. However, most of the medical image analysis-oriented deep learning approaches are built upon the convolutional operations, which might neglect the global dependencies between long-range pixels in the medical images. Therefore, the vision transformer models, which can unveil the associations between global pixels, have been gradually employed in medical image analysis. However, the quadratic computation complexity of attention mechanism has hindered the deployment of vision transformer in clinical practices. Bearing the analysis above in mind, this study introduces an integrated self-attention mechanism with both softmax and linear modules to guarantee efficiency and expressiveness, simultaneously. To be specific, a portion of query and key tokens, which are much less than the original query and key tokens, are adopted in the attention module by adding a set of proxy tokens. Note that the proxy tokens can fully utilize both the advantages of softmax and linear attention.

RESULTS

To evaluate the performance of the presented approach, the comparison experiments between state-of-the-art algorithms and the proposed approach are conducted. Experimental results demonstrate that the proposed approach achieves superior outcome over the state-of-the-art algorithms on the publicly available datasets.

DISCUSSION

Accordingly, the proposed approach can be taken as a potentially valuable instrument in clinical practices.

摘要

引言

糖尿病视网膜病变分级在患者的诊断和治疗中起着至关重要的作用。在实际操作中,这项任务主要依靠人工视觉系统进行人工检查。然而,基于人工视觉系统的筛查过程劳动强度大、耗时且容易出错。因此,已经开发了大量自动化筛查技术来处理这项任务。

方法

在这些技术中,深度学习模型在各种类型的机器视觉任务中都取得了有前景的成果。然而,大多数面向医学图像分析的深度学习方法都是基于卷积操作构建的,这可能会忽略医学图像中远距离像素之间的全局依赖性。因此,能够揭示全局像素之间关联的视觉Transformer模型已逐渐应用于医学图像分析中。然而,注意力机制的二次计算复杂度阻碍了视觉Transformer在临床实践中的应用。考虑到上述分析,本研究引入了一种同时具有softmax和线性模块的集成自注意力机制,以确保效率和表现力。具体而言,通过添加一组代理令牌,在注意力模块中采用比原始查询和键令牌少得多的一部分查询和键令牌。请注意,代理令牌可以充分利用softmax和线性注意力的优点。

结果

为了评估所提出方法的性能,进行了最先进算法与所提出方法之间的对比实验。实验结果表明,在所公开的数据集上,所提出的方法比最先进的算法取得了更好的结果。

讨论

因此,所提出的方法可以被视为临床实践中一种潜在有价值的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/ea0677748b6f/fpubh-12-1442114-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/2337abbc1611/fpubh-12-1442114-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/b3e75a7329c5/fpubh-12-1442114-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/1fafb158bd6b/fpubh-12-1442114-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/39dce6f890e3/fpubh-12-1442114-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/d80d9491643f/fpubh-12-1442114-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/ea0677748b6f/fpubh-12-1442114-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/2337abbc1611/fpubh-12-1442114-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/b3e75a7329c5/fpubh-12-1442114-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/1fafb158bd6b/fpubh-12-1442114-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/39dce6f890e3/fpubh-12-1442114-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/d80d9491643f/fpubh-12-1442114-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a3a/11743363/ea0677748b6f/fpubh-12-1442114-g0006.jpg

相似文献

1
Lesion classification and diabetic retinopathy grading by integrating softmax and pooling operators into vision transformer.通过将Softmax和池化算子集成到视觉变换器中进行病变分类和糖尿病视网膜病变分级。
Front Public Health. 2025 Jan 6;12:1442114. doi: 10.3389/fpubh.2024.1442114. eCollection 2024.
2
Identification of diabetic retinopathy lesions in fundus images by integrating CNN and vision mamba models.通过整合卷积神经网络(CNN)和视觉曼巴模型识别眼底图像中的糖尿病视网膜病变病变。
PLoS One. 2025 Jan 28;20(1):e0318264. doi: 10.1371/journal.pone.0318264. eCollection 2025.
3
ResViT FusionNet Model: An explainable AI-driven approach for automated grading of diabetic retinopathy in retinal images.ResViT融合网络模型:一种用于视网膜图像中糖尿病视网膜病变自动分级的可解释人工智能驱动方法。
Comput Biol Med. 2025 Mar;186:109656. doi: 10.1016/j.compbiomed.2025.109656. Epub 2025 Jan 16.
4
A deep learning based model for diabetic retinopathy grading.一种基于深度学习的糖尿病视网膜病变分级模型。
Sci Rep. 2025 Jan 30;15(1):3763. doi: 10.1038/s41598-025-87171-9.
5
Vision Transformer-based recognition of diabetic retinopathy grade.基于 Vision Transformer 的糖尿病视网膜病变分级识别。
Med Phys. 2021 Dec;48(12):7850-7863. doi: 10.1002/mp.15312. Epub 2021 Nov 16.
6
MediDRNet: Tackling category imbalance in diabetic retinopathy classification with dual-branch learning and prototypical contrastive learning.MediDRNet:使用双分支学习和原型对比学习解决糖尿病视网膜病变分类中的类别不平衡问题。
Comput Methods Programs Biomed. 2024 Aug;253:108230. doi: 10.1016/j.cmpb.2024.108230. Epub 2024 May 17.
7
UC-stack: a deep learning computer automatic detection system for diabetic retinopathy classification.UC-stack:一种用于糖尿病视网膜病变分类的深度学习计算机自动检测系统。
Phys Med Biol. 2024 Feb 12;69(4). doi: 10.1088/1361-6560/ad22a1.
8
Severity grading of hypertensive retinopathy using hybrid deep learning architecture.使用混合深度学习架构对高血压性视网膜病变进行严重程度分级。
Comput Methods Programs Biomed. 2025 Apr;261:108585. doi: 10.1016/j.cmpb.2025.108585. Epub 2025 Jan 15.
9
Enhanced MobileNet for skin cancer image classification with fused spatial channel attention mechanism.基于融合空间通道注意力机制的增强型 MobileNet 皮肤癌图像分类。
Sci Rep. 2024 Nov 21;14(1):28850. doi: 10.1038/s41598-024-80087-w.
10
Diabetic retinopathy prediction based on vision transformer and modified capsule network.基于视觉转换器和改进胶囊网络的糖尿病视网膜病变预测。
Comput Biol Med. 2024 Jun;175:108523. doi: 10.1016/j.compbiomed.2024.108523. Epub 2024 Apr 25.

引用本文的文献

1
Multi-Modal AI for Multi-Label Retinal Disease Prediction Using OCT and Fundus Images: A Hybrid Approach.使用光学相干断层扫描(OCT)和眼底图像进行多标签视网膜疾病预测的多模态人工智能:一种混合方法。
Sensors (Basel). 2025 Jul 19;25(14):4492. doi: 10.3390/s25144492.
2
Innovative Approach for Diabetic Retinopathy Severity Classification: An AI-Powered Tool using CNN-Transformer Fusion.糖尿病视网膜病变严重程度分类的创新方法:一种使用卷积神经网络-Transformer融合的人工智能工具
J Biomed Phys Eng. 2025 Apr 1;15(2):137-158. doi: 10.31661/jbpe.v0i0.2408-1811. eCollection 2025 Apr.

本文引用的文献

1
Vision transformer with masked autoencoders for referable diabetic retinopathy classification based on large-size retina image.基于大尺寸视网膜图像的可引用糖尿病视网膜病变分类的掩蔽自动编码器视觉转换器。
PLoS One. 2024 Mar 6;19(3):e0299265. doi: 10.1371/journal.pone.0299265. eCollection 2024.
2
Multi-label classification of retinal disease via a novel vision transformer model.通过新型视觉Transformer模型进行视网膜疾病的多标签分类
Front Neurosci. 2024 Jan 8;17:1290803. doi: 10.3389/fnins.2023.1290803. eCollection 2023.
3
Augmentation-Consistent Clustering Network for Diabetic Retinopathy Grading with Fewer Annotations.
基于增强一致性聚类网络的糖尿病视网膜病变分级方法(少标注样本)
J Healthc Eng. 2022 Mar 28;2022:4246239. doi: 10.1155/2022/4246239. eCollection 2022.
4
Clinical Wide-Field Retinal Image Deep Learning Classification of Exudative and Non-Exudative Age-Related Macular Degeneration.渗出性和非渗出性年龄相关性黄斑变性的临床广域视网膜图像深度学习分类
Cureus. 2021 Aug 30;13(8):e17579. doi: 10.7759/cureus.17579. eCollection 2021 Aug.
5
Simple methods for the lesion detection and severity grading of diabetic retinopathy by image processing and transfer learning.基于图像处理和迁移学习的糖尿病视网膜病变病变检测和严重程度分级的简单方法。
Comput Biol Med. 2021 Oct;137:104795. doi: 10.1016/j.compbiomed.2021.104795. Epub 2021 Aug 25.
6
Hybrid Model Structure for Diabetic Retinopathy Classification.用于糖尿病视网膜病变分类的混合模型结构。
J Healthc Eng. 2020 Oct 13;2020:8840174. doi: 10.1155/2020/8840174. eCollection 2020.
7
Novel Low-Shot Deep Learning Approach for Retinal Image Classification With Few Examples.用于少样本视网膜图像分类的新型低样本深度学习方法。
JAMA Ophthalmol. 2020 Oct 1;138(10):1077-1078. doi: 10.1001/jamaophthalmol.2020.3256.
8
Semi-Supervised Medical Image Classification With Relation-Driven Self-Ensembling Model.基于关系驱动自集成模型的半监督医学图像分类
IEEE Trans Med Imaging. 2020 Nov;39(11):3429-3440. doi: 10.1109/TMI.2020.2995518. Epub 2020 Oct 28.
9
CANet: Cross-Disease Attention Network for Joint Diabetic Retinopathy and Diabetic Macular Edema Grading.CANet:用于联合糖尿病性视网膜病变和糖尿病性黄斑水肿分级的跨疾病注意力网络。
IEEE Trans Med Imaging. 2020 May;39(5):1483-1493. doi: 10.1109/TMI.2019.2951844. Epub 2019 Nov 6.
10
IDRiD: Diabetic Retinopathy - Segmentation and Grading Challenge.IDRiD:糖尿病视网膜病变 - 分割与分级挑战赛。
Med Image Anal. 2020 Jan;59:101561. doi: 10.1016/j.media.2019.101561. Epub 2019 Oct 3.