• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

视觉Transformer架构及其在数字健康中的应用:教程与综述

Vision transformer architecture and applications in digital health: a tutorial and survey.

作者信息

Al-Hammuri Khalid, Gebali Fayez, Kanan Awos, Chelvan Ilamparithi Thirumarai

机构信息

Electrical and Computer Engineering, University of Victoria, Victoria, V8W 2Y2, Canada.

Computer Engineering, Princess Sumaya University for Technology, Amman, 11941, Jordan.

出版信息

Vis Comput Ind Biomed Art. 2023 Jul 10;6(1):14. doi: 10.1186/s42492-023-00140-9.

DOI:10.1186/s42492-023-00140-9
PMID:37428360
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10333157/
Abstract

The vision transformer (ViT) is a state-of-the-art architecture for image recognition tasks that plays an important role in digital health applications. Medical images account for 90% of the data in digital medicine applications. This article discusses the core foundations of the ViT architecture and its digital health applications. These applications include image segmentation, classification, detection, prediction, reconstruction, synthesis, and telehealth such as report generation and security. This article also presents a roadmap for implementing the ViT in digital health systems and discusses its limitations and challenges.

摘要

视觉Transformer(ViT)是一种用于图像识别任务的先进架构,在数字健康应用中发挥着重要作用。医学图像占数字医学应用中数据的90%。本文讨论了ViT架构的核心基础及其数字健康应用。这些应用包括图像分割、分类、检测、预测、重建、合成以及远程医疗,如报告生成和安全。本文还提出了在数字健康系统中实现ViT的路线图,并讨论了其局限性和挑战。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/19e1b3a2b8d2/42492_2023_140_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/3ea791302d0c/42492_2023_140_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/5874e5afebce/42492_2023_140_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/af0a666689f1/42492_2023_140_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/37670b589219/42492_2023_140_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/ba4178b13df2/42492_2023_140_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/386983bf4df7/42492_2023_140_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/111272ecd409/42492_2023_140_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/603e265c5fe8/42492_2023_140_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/7c9d6985de24/42492_2023_140_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/b432add953fd/42492_2023_140_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/d69c03026bdd/42492_2023_140_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/334f61bb63e0/42492_2023_140_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/410b56472548/42492_2023_140_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/f924b9180011/42492_2023_140_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/4328e3cf3e86/42492_2023_140_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/30eb56b809cd/42492_2023_140_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/b0d9b2d7e9c6/42492_2023_140_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/2e67021269b9/42492_2023_140_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/28bff9cf2d8f/42492_2023_140_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/308f43e8cdba/42492_2023_140_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/246cfc6da0fc/42492_2023_140_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/19e1b3a2b8d2/42492_2023_140_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/3ea791302d0c/42492_2023_140_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/5874e5afebce/42492_2023_140_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/af0a666689f1/42492_2023_140_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/37670b589219/42492_2023_140_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/ba4178b13df2/42492_2023_140_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/386983bf4df7/42492_2023_140_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/111272ecd409/42492_2023_140_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/603e265c5fe8/42492_2023_140_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/7c9d6985de24/42492_2023_140_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/b432add953fd/42492_2023_140_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/d69c03026bdd/42492_2023_140_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/334f61bb63e0/42492_2023_140_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/410b56472548/42492_2023_140_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/f924b9180011/42492_2023_140_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/4328e3cf3e86/42492_2023_140_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/30eb56b809cd/42492_2023_140_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/b0d9b2d7e9c6/42492_2023_140_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/2e67021269b9/42492_2023_140_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/28bff9cf2d8f/42492_2023_140_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/308f43e8cdba/42492_2023_140_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/246cfc6da0fc/42492_2023_140_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a90a/10333157/19e1b3a2b8d2/42492_2023_140_Fig22_HTML.jpg

相似文献

1
Vision transformer architecture and applications in digital health: a tutorial and survey.视觉Transformer架构及其在数字健康中的应用:教程与综述
Vis Comput Ind Biomed Art. 2023 Jul 10;6(1):14. doi: 10.1186/s42492-023-00140-9.
2
Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System.基于皮肤镜的无创数字系统中使用视觉Transformer进行自动分析的皮肤癌分割与分类
Int J Biomed Imaging. 2024 Feb 3;2024:3022192. doi: 10.1155/2024/3022192. eCollection 2024.
3
Classification of Mobile-Based Oral Cancer Images Using the Vision Transformer and the Swin Transformer.使用视觉Transformer和Swin Transformer对基于移动设备的口腔癌图像进行分类
Cancers (Basel). 2024 Feb 29;16(5):987. doi: 10.3390/cancers16050987.
4
IEViT: An enhanced vision transformer architecture for chest X-ray image classification.IEViT:一种用于胸部 X 射线图像分类的增强型视觉Transformer 架构。
Comput Methods Programs Biomed. 2022 Nov;226:107141. doi: 10.1016/j.cmpb.2022.107141. Epub 2022 Sep 16.
5
Utilizing adaptive deformable convolution and position embedding for colon polyp segmentation with a visual transformer.利用自适应变形卷积和位置嵌入的视觉Transformer进行结肠息肉分割。
Sci Rep. 2024 Mar 27;14(1):7318. doi: 10.1038/s41598-024-57993-0.
6
RT-ViT: Real-Time Monocular Depth Estimation Using Lightweight Vision Transformers.RT-ViT:基于轻量级视觉Transformer 的实时单目深度估计。
Sensors (Basel). 2022 May 19;22(10):3849. doi: 10.3390/s22103849.
7
PLG-ViT: Vision Transformer with Parallel Local and Global Self-Attention.PLG-ViT:具有并行局部和全局自注意力的视觉 Transformer。
Sensors (Basel). 2023 Mar 25;23(7):3447. doi: 10.3390/s23073447.
8
Gait-ViT: Gait Recognition with Vision Transformer.步态-ViT:基于视觉Transformer 的步态识别。
Sensors (Basel). 2022 Sep 28;22(19):7362. doi: 10.3390/s22197362.
9
CellViT: Vision Transformers for precise cell segmentation and classification.CellViT:用于精确细胞分割和分类的视觉Transformer
Med Image Anal. 2024 May;94:103143. doi: 10.1016/j.media.2024.103143. Epub 2024 Mar 16.
10
ViT-MAENB7: An innovative breast cancer diagnosis model from 3D mammograms using advanced segmentation and classification process.基于先进分割和分类流程的 3D 乳腺 X 线摄影的乳腺癌诊断新模型:ViT-MAENB7。
Comput Methods Programs Biomed. 2024 Dec;257:108373. doi: 10.1016/j.cmpb.2024.108373. Epub 2024 Aug 23.

引用本文的文献

1
Development and evaluation of deep learning models for detecting and classifying various bone tumours in full-field limb radiographs using automated object detection models.使用自动目标检测模型开发和评估用于在全视野肢体X光片中检测和分类各种骨肿瘤的深度学习模型。
Bone Joint Res. 2025 Sep 2;14(9):760-768. doi: 10.1302/2046-3758.149.BJR-2024-0505.R1.
2
Intelligent quality assessment of ultrasound images for fetal nuchal translucency measurement during the first trimester of pregnancy based on deep learning models.基于深度学习模型的孕早期胎儿颈部透明带测量超声图像智能质量评估
BMC Pregnancy Childbirth. 2025 Jul 10;25(1):741. doi: 10.1186/s12884-025-07863-y.
3

本文引用的文献

1
UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation.UNesT:用于高效医学分割的分层转换器的局部空间表示学习。
Med Image Anal. 2023 Dec;90:102939. doi: 10.1016/j.media.2023.102939. Epub 2023 Aug 25.
2
ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer.ReconFormer:基于循环 Transformer 的加速 MRI 重建。
IEEE Trans Med Imaging. 2024 Jan;43(1):582-593. doi: 10.1109/TMI.2023.3314747. Epub 2024 Jan 2.
3
Transformers in medical imaging: A survey.
Toward automated plantar pressure analysis: machine learning-based segmentation and key point detection across multicenter data.
迈向自动化足底压力分析:基于机器学习的跨多中心数据分割与关键点检测。
Front Bioeng Biotechnol. 2025 Jun 19;13:1579072. doi: 10.3389/fbioe.2025.1579072. eCollection 2025.
4
Current application, possibilities, and challenges of artificial intelligence in the management of rheumatoid arthritis, axial spondyloarthritis, and psoriatic arthritis.人工智能在类风湿关节炎、轴性脊柱关节炎和银屑病关节炎管理中的当前应用、可能性及挑战。
Ther Adv Musculoskelet Dis. 2025 Jun 21;17:1759720X251343579. doi: 10.1177/1759720X251343579. eCollection 2025.
5
Lightweight and hybrid transformer-based solution for quick and reliable deepfake detection.基于轻量级和混合变压器的快速可靠深度伪造检测解决方案。
Front Big Data. 2025 Apr 1;8:1521653. doi: 10.3389/fdata.2025.1521653. eCollection 2025.
6
Metaheuristic optimizers integrated with vision transformer model for severity detection and classification via multimodal COVID-19 images.通过多模态新冠肺炎图像,将元启发式优化器与视觉Transformer模型集成用于严重程度检测和分类。
Sci Rep. 2025 Apr 22;15(1):13941. doi: 10.1038/s41598-025-98593-w.
7
Vision Transformer Autoencoders for Unsupervised Representation Learning: Capturing Local and Non-Local Features in Brain Imaging to Reveal Genetic Associations.用于无监督表征学习的视觉Transformer自动编码器:在脑成像中捕捉局部和非局部特征以揭示基因关联。
medRxiv. 2025 Mar 25:2025.03.24.25324549. doi: 10.1101/2025.03.24.25324549.
8
Establishment and evaluation of a deep learning-based tooth wear severity grading system using intraoral photographs.基于口腔内照片的深度学习牙齿磨损严重程度分级系统的建立与评估
J Dent Sci. 2025 Jan;20(1):477-486. doi: 10.1016/j.jds.2024.05.013. Epub 2024 May 21.
9
Research on grading detection methods for diabetic retinopathy based on deep learning.基于深度学习的糖尿病视网膜病变分级检测方法研究
Pak J Med Sci. 2025 Jan;41(1):225-229. doi: 10.12669/pjms.41.1.9171.
10
Decoding Schizophrenia: How AI-Enhanced fMRI Unlocks New Pathways for Precision Psychiatry.解码精神分裂症:人工智能增强的功能磁共振成像如何为精准精神病学开启新途径。
Brain Sci. 2024 Nov 27;14(12):1196. doi: 10.3390/brainsci14121196.
医学成像中的变压器:综述。
Med Image Anal. 2023 Aug;88:102802. doi: 10.1016/j.media.2023.102802. Epub 2023 Apr 5.
4
Tongue Contour Tracking and Segmentation in Lingual Ultrasound for Speech Recognition: A Review.用于语音识别的舌超声中的舌轮廓跟踪与分割:综述
Diagnostics (Basel). 2022 Nov 15;12(11):2811. doi: 10.3390/diagnostics12112811.
5
A transformer-based deep-learning approach for classifying brain metastases into primary organ sites using clinical whole-brain MRI images.一种基于变压器的深度学习方法,用于使用临床全脑MRI图像将脑转移瘤分类到原发器官部位。
Patterns (N Y). 2022 Oct 27;3(11):100613. doi: 10.1016/j.patter.2022.100613. eCollection 2022 Nov 11.
6
Automatic captioning for medical imaging (MIC): a rapid review of literature.医学成像自动字幕(MIC):文献快速综述
Artif Intell Rev. 2023;56(5):4019-4076. doi: 10.1007/s10462-022-10270-w. Epub 2022 Sep 17.
7
Artificial intelligence in histopathology: enhancing cancer research and clinical oncology.人工智能在组织病理学中的应用:增强癌症研究和临床肿瘤学。
Nat Cancer. 2022 Sep;3(9):1026-1038. doi: 10.1038/s43018-022-00436-4. Epub 2022 Sep 22.
8
Unsupervised contrastive learning based transformer for lung nodule detection.基于无监督对比学习的 Transformer 肺结节检测
Phys Med Biol. 2022 Oct 7;67(20). doi: 10.1088/1361-6560/ac92ba.
9
Fast MRI Reconstruction: How Powerful Transformers Are?快速磁共振成像重建:Transformer 有多强大?
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:2066-2070. doi: 10.1109/EMBC48229.2022.9871475.
10
PETR: Rethinking the Capability of Transformer-Based Language Model in Scene Text Recognition.PETR:重新思考基于转换器的语言模型在场景文本识别中的能力。
IEEE Trans Image Process. 2022;31:5585-5598. doi: 10.1109/TIP.2022.3197981. Epub 2022 Aug 30.