• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

nnMobileNet:重新思考用于视网膜病变研究的卷积神经网络

nnMobileNet: Rethinking CNN for Retinopathy Research.

作者信息

Zhu Wenhui, Qiu Peijie, Chen Xiwen, Li Xin, Lepore Natasha, Dumitrascu Oana M, Wang Yalin

机构信息

School of Computing and Augmented Intelligence, Arizona State University.

McKeley School of Engineering, Washington University in St. Louis.

出版信息

Conf Comput Vis Pattern Recognit Workshops. 2024 Jun;2024:2285-2294. doi: 10.1109/CVPRW63382.2024.00234. Epub 2024 Sep 27.

DOI:10.1109/CVPRW63382.2024.00234
PMID:40356800
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12068684/
Abstract

Over the past few decades, convolutional neural networks (CNNs) have been at the forefront of the detection and tracking of various retinal diseases (RD). Despite their success, the emergence of vision transformers (ViT) in the 2020s has shifted the trajectory of RD model development. The leading-edge performance of ViT-based models in RD can be largely credited to their scalability-their ability to improve as more parameters are added. As a result, ViT-based models tend to outshine traditional CNNs in RD applications, albeit at the cost of increased data and computational demands. ViTs also differ from CNNs in their approach to processing images, working with patches rather than local regions, which can complicate the precise localization of small, variably presented lesions in RD. In our study, we revisited and updated the architecture of a CNN model, specifically MobileNet, to enhance its utility in RD diagnostics. We found that an optimized MobileNet, through selective modifications, can surpass ViT-based models in various RD benchmarks, including diabetic retinopathy grading, detection of multiple fundus diseases, and classification of diabetic macular edema. The code is available at https://github.com/Retinal-Research/NN-MOBILENET.

摘要

在过去几十年里,卷积神经网络(CNN)一直处于各种视网膜疾病(RD)检测与跟踪的前沿。尽管取得了成功,但2020年代视觉Transformer(ViT)的出现改变了视网膜疾病模型的发展轨迹。基于ViT的模型在视网膜疾病方面的前沿性能很大程度上归功于其可扩展性——随着添加更多参数而提升性能的能力。因此,基于ViT的模型在视网膜疾病应用中往往比传统的CNN更出色,尽管代价是数据和计算需求增加。ViT在处理图像的方式上也与CNN不同,它处理的是图像块而非局部区域,这可能会使视网膜疾病中小的、呈现方式各异的病变的精确定位变得复杂。在我们的研究中,我们重新审视并更新了一个CNN模型(具体来说是MobileNet)的架构,以增强其在视网膜疾病诊断中的效用。我们发现,通过选择性修改而优化的MobileNet,在包括糖尿病视网膜病变分级、多种眼底疾病检测以及糖尿病黄斑水肿分类等各种视网膜疾病基准测试中,可以超越基于ViT的模型。代码可在https://github.com/Retinal-Research/NN-MOBILENET获取。

相似文献

1
nnMobileNet: Rethinking CNN for Retinopathy Research.nnMobileNet:重新思考用于视网膜病变研究的卷积神经网络
Conf Comput Vis Pattern Recognit Workshops. 2024 Jun;2024:2285-2294. doi: 10.1109/CVPRW63382.2024.00234. Epub 2024 Sep 27.
2
ResViT FusionNet Model: An explainable AI-driven approach for automated grading of diabetic retinopathy in retinal images.ResViT融合网络模型:一种用于视网膜图像中糖尿病视网膜病变自动分级的可解释人工智能驱动方法。
Comput Biol Med. 2025 Mar;186:109656. doi: 10.1016/j.compbiomed.2025.109656. Epub 2025 Jan 16.
3
Detecting Tuberculosis-Consistent Findings in Lateral Chest X-Rays Using an Ensemble of CNNs and Vision Transformers.使用卷积神经网络(CNN)和视觉Transformer的集成在胸部侧位X光片中检测与肺结核一致的表现。
Front Genet. 2022 Feb 24;13:864724. doi: 10.3389/fgene.2022.864724. eCollection 2022.
4
Comparative Analysis of Vision Transformers and Conventional Convolutional Neural Networks in Detecting Referable Diabetic Retinopathy.视觉Transformer与传统卷积神经网络在检测可转诊糖尿病视网膜病变中的对比分析
Ophthalmol Sci. 2024 May 17;4(6):100552. doi: 10.1016/j.xops.2024.100552. eCollection 2024 Nov-Dec.
5
Advancing breast cancer diagnosis: token vision transformers for faster and accurate classification of histopathology images.推进乳腺癌诊断:用于组织病理学图像更快、更准确分类的令牌视觉变换器
Vis Comput Ind Biomed Art. 2025 Jan 8;8(1):1. doi: 10.1186/s42492-024-00181-8.
6
Multi-class Classification of Retinal Eye Diseases from Ophthalmoscopy Images Using Transfer Learning-Based Vision Transformers.基于迁移学习的视觉变换器对眼底镜图像中的视网膜眼病进行多类别分类
J Imaging Inform Med. 2025 Jan 27. doi: 10.1007/s10278-025-01416-7.
7
Vision transformers: The next frontier for deep learning-based ophthalmic image analysis.视觉Transformer:基于深度学习的眼科图像分析的新前沿。
Saudi J Ophthalmol. 2023 Jul 14;37(3):173-178. doi: 10.4103/sjopt.sjopt_91_23. eCollection 2023 Jul-Sep.
8
From Binary to Multi-Class Classification: A Two-Step Hybrid CNN-ViT Model for Chest Disease Classification Based on X-Ray Images.从二分类到多分类:一种基于X射线图像的胸部疾病分类的两步混合卷积神经网络-视觉Transformer模型
Diagnostics (Basel). 2024 Dec 6;14(23):2754. doi: 10.3390/diagnostics14232754.
9
Systematic Review of Hybrid Vision Transformer Architectures for Radiological Image Analysis.用于放射图像分析的混合视觉Transformer架构的系统综述
J Imaging Inform Med. 2025 Jan 27. doi: 10.1007/s10278-024-01322-4.
10
Mobile-friendly under-sampling single-pixel imaging based on a lightweight hybrid CNN-ViT architecture.基于轻量级混合卷积神经网络-视觉Transformer架构的移动友好型欠采样单像素成像
Opt Express. 2024 Dec 30;32(27):48672-48682. doi: 10.1364/OE.546375.

引用本文的文献

1
CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement.CUNSB-RFIE:视网膜眼底图像增强中的上下文感知非配对神经薛定谔桥
IEEE Winter Conf Appl Comput Vis. 2025 Feb-Mar;2025:4502-4511. doi: 10.1109/wacv61041.2025.00442. Epub 2025 Apr 8.
2
TPOT: TOPOLOGY PRESERVING OPTIMAL TRANSPORT IN RETINAL FUNDUS IMAGE ENHANCEMENT.TPOT:视网膜眼底图像增强中的拓扑保持最优传输
Proc IEEE Int Symp Biomed Imaging. 2025 Apr;2025. doi: 10.1109/isbi60581.2025.10981104. Epub 2025 May 12.
3
A comparative study of machine learning models for automated detection and classification of retinal diseases in Ghana.

本文引用的文献

1
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-Scale Attention.CrossFormer++:一种基于跨尺度注意力的通用视觉Transformer
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3123-3136. doi: 10.1109/TPAMI.2023.3341806. Epub 2024 Apr 3.
2
OPTIMAL TRANSPORT GUIDED UNSUPERVISED LEARNING FOR ENHANCING LOW-QUALITY RETINAL IMAGES.用于增强低质量视网膜图像的最优传输引导无监督学习
Proc IEEE Int Symp Biomed Imaging. 2023 Apr;2023. doi: 10.1109/isbi53787.2023.10230719. Epub 2023 Sep 1.
3
OTRE: Where Optimal Transport Guided Unpaired Image-to-Image Translation Meets Regularization by Enhancing.
加纳用于视网膜疾病自动检测和分类的机器学习模型的比较研究。
PLoS One. 2025 Aug 1;20(8):e0327743. doi: 10.1371/journal.pone.0327743. eCollection 2025.
4
RBAD: A Dataset and Benchmark for Retinal Vessels Branching Angle Detection.RBAD:用于视网膜血管分支角度检测的数据集与基准
IEEE EMBS Int Conf Biomed Health Inform. 2024 Nov;2024. doi: 10.1109/bhi62660.2024.10913865.
OTRE:最优传输引导的无配对图像到图像翻译与通过增强实现正则化的结合之处。
Inf Process Med Imaging. 2023 Jun;13939:415-427. doi: 10.1007/978-3-031-34048-2_32. Epub 2023 Jun 8.
4
Self-Supervised Equivariant Regularization Reconciles Multiple Instance Learning: Joint Referable Diabetic Retinopathy Classification and Lesion Segmentation.自监督等变正则化协调多实例学习:联合可参考糖尿病视网膜病变分类与病变分割
Proc SPIE Int Soc Opt Eng. 2022 Nov;12567. doi: 10.1117/12.2669772. Epub 2023 Mar 6.
5
CANet: Cross-Disease Attention Network for Joint Diabetic Retinopathy and Diabetic Macular Edema Grading.CANet:用于联合糖尿病性视网膜病变和糖尿病性黄斑水肿分级的跨疾病注意力网络。
IEEE Trans Med Imaging. 2020 May;39(5):1483-1493. doi: 10.1109/TMI.2019.2951844. Epub 2019 Nov 6.
6
Multi-Task Deep Model With Margin Ranking Loss for Lung Nodule Analysis.多任务深度模型与边缘排序损失在肺结节分析中的应用。
IEEE Trans Med Imaging. 2020 Mar;39(3):718-728. doi: 10.1109/TMI.2019.2934577. Epub 2019 Aug 12.
7
A multi-task deep learning model for the classification of Age-related Macular Degeneration.一种用于年龄相关性黄斑变性分类的多任务深度学习模型。
AMIA Jt Summits Transl Sci Proc. 2019 May 6;2019:505-514. eCollection 2019.
8
Evaluation of a computer-aided diagnosis system for diabetic retinopathy screening on public data.基于公共数据集评估糖尿病视网膜病变筛查的计算机辅助诊断系统。
Invest Ophthalmol Vis Sci. 2011 Jul 1;52(7):4866-71. doi: 10.1167/iovs.10-6633.
9
Retinal Diseases and VISION 2020.视网膜疾病与《视觉2020》
Community Eye Health. 2003;16(46):19-20.
10
Cotton-wool spots.棉絮斑
Retina. 1985 Fall-Winter;5(4):206-14. doi: 10.1097/00006982-198500540-00003.