• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于混合注意力的深度学习用于眼底图像多标签眼科疾病检测

Hybrid attention-based deep learning for multi-label ophthalmic disease detection on fundus images.

作者信息

Hanfi Rabiya, Mathur Harsh, Shrivastava Ritu

机构信息

Department of Computer Science & Engineering, Rabindranath Tagore University, Bhopal, India.

Department of Computer Science & Engineering, Sagar Institute of Research and Technology, Bhopal, India.

出版信息

Graefes Arch Clin Exp Ophthalmol. 2025 May 29. doi: 10.1007/s00417-025-06858-x.

DOI:10.1007/s00417-025-06858-x
PMID:40439748
Abstract

BACKGROUND

Ophthalmic diseases significantly impact vision and quality of life. Early diagnosis using fundus images is critical for timely treatment. Traditional deep learning models often lack accuracy, interpretability, and efficiency for multi-label classification tasks in ophthalmology.

METHODS

We propose HAM-DNet, a hybrid deep learning model combining EfficientNetV2 and Vision Transformers (ViT) for multi-label ophthalmic disease detection. The model includes SE (Squeeze-and-Excitation) blocks for attention-based feature refinement and a U-Net-based lesion localization module for improved interpretability. The model was trained and tested on multiple fundus image datasets (ODIR-5 K, Messidor, G1020, and Joint Shantou International Eye Centre).

RESULTS

HAM-DNet achieved superior performance with an accuracy of 95.3%, precision of 96.2%, recall of 97.1%, AUC of 98.42, and F1-score of 96.75, while maintaining low computational cost (9.7 GFLOPS). It outperformed existing models including Shallow CNN and EfficientNet, particularly in handling multi-label classifications and reducing false positives and negatives.

CONCLUSIONS

HAM-DNet offers a robust, accurate, and interpretable solution for automated detection of multiple ophthalmic diseases. Its lightweight architecture makes it suitable for clinical deployment, especially in telemedicine and resource-constrained environments.

摘要

背景

眼科疾病对视力和生活质量有显著影响。利用眼底图像进行早期诊断对于及时治疗至关重要。传统的深度学习模型在眼科多标签分类任务中往往缺乏准确性、可解释性和效率。

方法

我们提出了HAM-DNet,一种结合EfficientNetV2和视觉Transformer(ViT)的混合深度学习模型,用于多标签眼科疾病检测。该模型包括用于基于注意力的特征细化的SE(挤压与激励)块和用于提高可解释性的基于U-Net的病变定位模块。该模型在多个眼底图像数据集(ODIR-5K、Messidor、G1020和汕头国际眼科中心联合数据集)上进行了训练和测试。

结果

HAM-DNet取得了优异的性能,准确率为95.3%,精确率为96.2%,召回率为97.1%,AUC为98.42,F1分数为96.75,同时保持了较低的计算成本(9.7 GFLOPS)。它优于包括浅层卷积神经网络和EfficientNet在内的现有模型,特别是在处理多标签分类以及减少误报和漏报方面。

结论

HAM-DNet为多种眼科疾病的自动检测提供了一种强大、准确且可解释的解决方案。其轻量级架构使其适用于临床部署,尤其是在远程医疗和资源受限的环境中。

相似文献

1
Hybrid attention-based deep learning for multi-label ophthalmic disease detection on fundus images.基于混合注意力的深度学习用于眼底图像多标签眼科疾病检测
Graefes Arch Clin Exp Ophthalmol. 2025 May 29. doi: 10.1007/s00417-025-06858-x.
2
Discriminative kernel convolution network for multi-label ophthalmic disease detection on imbalanced fundus image dataset.用于不平衡眼底图像数据集上多标签眼科疾病检测的判别性核卷积网络。
Comput Biol Med. 2023 Feb;153:106519. doi: 10.1016/j.compbiomed.2022.106519. Epub 2023 Jan 2.
3
Enhanced Pneumonia Detection in Chest X-Rays Using Hybrid Convolutional and Vision Transformer Networks.使用混合卷积和视觉Transformer网络增强胸部X光片中的肺炎检测
Curr Med Imaging. 2025;21:e15734056326685. doi: 10.2174/0115734056326685250101113959.
4
BFENet: A two-stream interaction CNN method for multi-label ophthalmic diseases classification with bilateral fundus images.BFENet:一种用于双侧眼底图像的多标签眼科疾病分类的双流交互 CNN 方法。
Comput Methods Programs Biomed. 2022 Jun;219:106739. doi: 10.1016/j.cmpb.2022.106739. Epub 2022 Mar 11.
5
Multi-Label Fundus Image Classification Using Attention Mechanisms and Feature Fusion.基于注意力机制和特征融合的多标签眼底图像分类
Micromachines (Basel). 2022 Jun 15;13(6):947. doi: 10.3390/mi13060947.
6
Improved diabetic retinopathy severity classification using squeeze-and-excitation and sparse light weight multi-level attention u-net with transfer learning from xception.利用挤压与激励以及稀疏轻量级多级注意力U型网络并结合来自Xception的迁移学习改进糖尿病视网膜病变严重程度分类
Acta Diabetol. 2025 Jan;62(1):95-112. doi: 10.1007/s00592-024-02341-x. Epub 2024 Jul 26.
7
A multi-class fundus disease classification system based on an adaptive scale discriminator and hybrid loss.一种基于自适应尺度判别器和混合损失的多类别眼底疾病分类系统。
Comput Biol Chem. 2024 Dec;113:108241. doi: 10.1016/j.compbiolchem.2024.108241. Epub 2024 Oct 11.
8
Multi-label classification of retinal diseases based on fundus images using Resnet and Transformer.基于眼底图像的 Resnet 和 Transformer 的视网膜疾病多标签分类。
Med Biol Eng Comput. 2024 Nov;62(11):3459-3469. doi: 10.1007/s11517-024-03144-6. Epub 2024 Jun 14.
9
ResViT FusionNet Model: An explainable AI-driven approach for automated grading of diabetic retinopathy in retinal images.ResViT融合网络模型:一种用于视网膜图像中糖尿病视网膜病变自动分级的可解释人工智能驱动方法。
Comput Biol Med. 2025 Mar;186:109656. doi: 10.1016/j.compbiomed.2025.109656. Epub 2025 Jan 16.
10
Multi-label classification of fundus images with graph convolutional network and LightGBM.基于图卷积网络和 LightGBM 的眼底图像多标签分类。
Comput Biol Med. 2022 Oct;149:105909. doi: 10.1016/j.compbiomed.2022.105909. Epub 2022 Aug 11.

引用本文的文献

1
ODDM: Integration of SMOTE Tomek with Deep Learning on Imbalanced Color Fundus Images for Classification of Several Ocular Diseases.ODDM:在不平衡的彩色眼底图像上集成SMOTE Tomek与深度学习以对多种眼部疾病进行分类
J Imaging. 2025 Aug 18;11(8):278. doi: 10.3390/jimaging11080278.

本文引用的文献

1
Advantages of the Utilization of Wide-Field OCT and Wide-Field OCT Angiography in Clinical Practice.临床实践中使用广角光学相干断层扫描(OCT)和广角OCT血管造影的优势。
Diagnostics (Basel). 2024 Feb 1;14(3):321. doi: 10.3390/diagnostics14030321.
2
A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images.一种用于视网膜图像的具有注意力引导图像增强功能的多标签检测深度学习模型。
Micromachines (Basel). 2023 Mar 22;14(3):705. doi: 10.3390/mi14030705.
3
Comprehensive Review on the Use of Artificial Intelligence in Ophthalmology and Future Research Directions.
眼科人工智能应用综合综述及未来研究方向
Diagnostics (Basel). 2022 Dec 29;13(1):100. doi: 10.3390/diagnostics13010100.
4
Discriminative kernel convolution network for multi-label ophthalmic disease detection on imbalanced fundus image dataset.用于不平衡眼底图像数据集上多标签眼科疾病检测的判别性核卷积网络。
Comput Biol Med. 2023 Feb;153:106519. doi: 10.1016/j.compbiomed.2022.106519. Epub 2023 Jan 2.
5
Combining convolutional neural networks and self-attention for fundus diseases identification.结合卷积神经网络和自注意力机制进行眼底疾病识别。
Sci Rep. 2023 Jan 2;13(1):76. doi: 10.1038/s41598-022-27358-6.
6
The current clinical role of optical coherence tomography angiography in neuro-ophthalmological diseases.光学相干断层扫描血管造影在神经眼科疾病中的当前临床作用。
Taiwan J Ophthalmol. 2022 Feb 11;12(3):264-272. doi: 10.4103/tjo.tjo_55_21. eCollection 2022 Jul-Sep.
7
Ophthalmic sensing technologies for ocular disease diagnostics.眼科疾病诊断用眼部传感技术。
Analyst. 2021 Oct 25;146(21):6416-6444. doi: 10.1039/d1an01244d.
8
Artificial intelligence in OCT angiography.OCT 血管造影中的人工智能。
Prog Retin Eye Res. 2021 Nov;85:100965. doi: 10.1016/j.preteyeres.2021.100965. Epub 2021 Mar 22.