• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用Transformer网络的医学图像分割

Medical Image Segmentation Using Transformer Networks.

作者信息

Karimi Davood, Dou Haoran, Gholipour Ali

机构信息

Department of Radiology, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA.

Centre for Computational Imaging & Simulation Technologies in Biomedicine (CISTIB), School of Computing, University of Leeds, Leeds LS2 9JT, U.K.

出版信息

IEEE Access. 2022;10:29322-29332. doi: 10.1109/access.2022.3156894. Epub 2022 Mar 4.

DOI:10.1109/access.2022.3156894
PMID:35656515
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9159704/
Abstract

Deep learning models represent the state of the art in medical image segmentation. Most of these models are fully-convolutional networks (FCNs), namely each layer processes the output of the preceding layer with convolution operations. The convolution operation enjoys several important properties such as sparse interactions, parameter sharing, and translation equivariance. Because of these properties, FCNs possess a strong and useful inductive bias for image modeling and analysis. However, they also have certain important shortcomings, such as performing a fixed and pre-determined operation on a test image regardless of its content and difficulty in modeling long-range interactions. In this work we show that a different deep neural network architecture, based entirely on self-attention between neighboring image patches and without any convolution operations, can achieve more accurate segmentations than FCNs. Our proposed model is based directly on the transformer network architecture. Given a 3D image block, our network divides it into non-overlapping 3D patches and computes a 1D embedding for each patch. The network predicts the segmentation map for the block based on the self-attention between these patch embeddings. Furthermore, in order to address the common problem of scarcity of labeled medical images, we propose methods for pre-training this model on large corpora of unlabeled images. Our experiments show that the proposed model can achieve segmentation accuracies that are better than several state of the art FCN architectures on two datasets. Our proposed network can be trained using only tens of labeled images. Moreover, with the proposed pre-training strategies, our network outperforms FCNs when labeled training data is small.

摘要

深度学习模型代表了医学图像分割的当前技术水平。这些模型大多是全卷积网络(FCN),即每一层都通过卷积操作处理前一层的输出。卷积操作具有几个重要特性,如稀疏交互、参数共享和平移不变性。由于这些特性,FCN在图像建模和分析方面具有强大且有用的归纳偏差。然而,它们也有一些重要缺点,比如无论测试图像的内容和难度如何,都对其执行固定且预先确定的操作,以及在对长距离交互进行建模时存在困难。在这项工作中,我们表明一种完全基于相邻图像块之间的自注意力且没有任何卷积操作的不同深度神经网络架构,能够比FCN实现更准确的分割。我们提出的模型直接基于Transformer网络架构。给定一个3D图像块,我们的网络将其划分为不重叠的3D块,并为每个块计算一个1D嵌入。网络基于这些块嵌入之间的自注意力预测该块的分割图。此外,为了解决标记医学图像稀缺的常见问题,我们提出了在大量未标记图像语料库上对该模型进行预训练的方法。我们的实验表明,所提出的模型在两个数据集上能够实现优于几种当前技术水平的FCN架构的分割精度。我们提出的网络仅使用几十张标记图像就可以进行训练。此外,通过所提出的预训练策略,当标记训练数据较少时,我们的网络优于FCN。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/28d79f2e76ba/nihms-1791234-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/b92cab83aaef/nihms-1791234-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/ba3400bc6b81/nihms-1791234-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/70d3de3a8d1b/nihms-1791234-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/dda9c862f275/nihms-1791234-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/28d79f2e76ba/nihms-1791234-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/b92cab83aaef/nihms-1791234-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/ba3400bc6b81/nihms-1791234-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/70d3de3a8d1b/nihms-1791234-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/dda9c862f275/nihms-1791234-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d08/9159704/28d79f2e76ba/nihms-1791234-f0005.jpg

相似文献

1
Medical Image Segmentation Using Transformer Networks.使用Transformer网络的医学图像分割
IEEE Access. 2022;10:29322-29332. doi: 10.1109/access.2022.3156894. Epub 2022 Mar 4.
2
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
3
Deep learning of the sectional appearances of 3D CT images for anatomical structure segmentation based on an FCN voting method.基于 FCN 投票方法的三维 CT 图像节段外观的深度学习用于解剖结构分割。
Med Phys. 2017 Oct;44(10):5221-5233. doi: 10.1002/mp.12480. Epub 2017 Aug 31.
4
Automated multi-modal Transformer network (AMTNet) for 3D medical images segmentation.用于3D医学图像分割的自动多模态Transformer网络(AMTNet)。
Phys Med Biol. 2023 Jan 9;68(2). doi: 10.1088/1361-6560/aca74c.
5
Robust Automated Tumour Segmentation Network Using 3D Direction-Wise Convolution and Transformer.基于 3D 方向卷积和 Transformer 的稳健自动肿瘤分割网络
J Imaging Inform Med. 2024 Oct;37(5):2444-2453. doi: 10.1007/s10278-024-01131-9. Epub 2024 May 9.
6
MESTrans: Multi-scale embedding spatial transformer for medical image segmentation.MESTrans:用于医学图像分割的多尺度嵌入空间变换器
Comput Methods Programs Biomed. 2023 May;233:107493. doi: 10.1016/j.cmpb.2023.107493. Epub 2023 Mar 17.
7
SW-UNet: a U-Net fusing sliding window transformer block with CNN for segmentation of lung nodules.SW-UNet:一种将滑动窗口变压器模块与卷积神经网络融合用于肺结节分割的U-Net。
Front Med (Lausanne). 2023 Sep 28;10:1273441. doi: 10.3389/fmed.2023.1273441. eCollection 2023.
8
Performance improvement of weakly supervised fully convolutional networks by skip connections for brain structure segmentation.基于 skip connections 的弱监督全卷积网络在脑结构分割中的性能提升。
Med Phys. 2021 Nov;48(11):7215-7227. doi: 10.1002/mp.15192. Epub 2021 Sep 13.
9
A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer.一种用于医学图像分割的新型深度学习模型:结合卷积神经网络和Transformer
Interdiscip Sci. 2023 Dec;15(4):663-677. doi: 10.1007/s12539-023-00585-9. Epub 2023 Sep 4.
10
Collaborative networks of transformers and convolutional neural networks are powerful and versatile learners for accurate 3D medical image segmentation.基于转换器和卷积神经网络的协作网络是精确的 3D 医学图像分割的强大且多功能的学习者。
Comput Biol Med. 2023 Sep;164:107228. doi: 10.1016/j.compbiomed.2023.107228. Epub 2023 Jul 5.

引用本文的文献

1
An approach to building foundation models for brain image analysis.一种构建用于脑图像分析的基础模型的方法。
Med Image Comput Comput Assist Interv. 2024 Oct;15012:421-431. doi: 10.1007/978-3-031-72390-2_40. Epub 2024 Oct 23.
2
Ensemble Learning for Three-dimensional Medical Image Segmentation of Organ at Risk in Brachytherapy Using Double U-Net, Bi-directional ConvLSTM U-Net, and Transformer Network.使用双U-Net、双向卷积长短期记忆网络U-Net和Transformer网络的近距离放射治疗中危及器官的三维医学图像分割集成学习
J Med Phys. 2024 Oct-Dec;49(4):574-582. doi: 10.4103/jmp.jmp_160_24. Epub 2024 Dec 18.
3
Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.

本文引用的文献

1
The Liver Tumor Segmentation Benchmark (LiTS).肝脏肿瘤分割基准(LiTS)。
Med Image Anal. 2023 Feb;84:102680. doi: 10.1016/j.media.2022.102680. Epub 2022 Nov 17.
2
Transfer learning in medical image segmentation: New insights from analysis of the dynamics of model parameters and learned representations.迁移学习在医学图像分割中的应用:基于模型参数和学习表示动态分析的新见解。
Artif Intell Med. 2021 Jun;116:102078. doi: 10.1016/j.artmed.2021.102078. Epub 2021 Apr 23.
3
A Deep Attentive Convolutional Neural Network for Automatic Cortical Plate Segmentation in Fetal MRI.
医学图像分析中视觉转换器与卷积神经网络的比较:系统评价。
J Med Syst. 2024 Sep 12;48(1):84. doi: 10.1007/s10916-024-02105-8.
4
A semi-automatic deep learning model based on biparametric MRI scanning strategy to predict bone metastases in newly diagnosed prostate cancer patients.一种基于双参数MRI扫描策略的半自动深度学习模型,用于预测新诊断前列腺癌患者的骨转移。
Front Oncol. 2024 Jun 11;14:1298516. doi: 10.3389/fonc.2024.1298516. eCollection 2024.
5
Precise and Rapid Whole-Head Segmentation from Magnetic Resonance Images of Older Adults using Deep Learning.使用深度学习从老年人的磁共振图像中进行精确快速的全脑分割
Imaging Neurosci (Camb). 2024 Mar;2. doi: 10.1162/imag_a_00090. Epub 2024 Feb 13.
6
MF-Net: Automated Muscle Fiber Segmentation From Immunofluorescence Images Using a Local-Global Feature Fusion Network.MF-Net:基于局部-全局特征融合网络的免疫荧光图像自动肌纤维分割。
J Digit Imaging. 2023 Dec;36(6):2411-2426. doi: 10.1007/s10278-023-00890-1. Epub 2023 Sep 15.
7
Performance Analysis of Segmentation and Classification of CT-Scanned Ovarian Tumours Using U-Net and Deep Convolutional Neural Networks.使用U-Net和深度卷积神经网络对CT扫描的卵巢肿瘤进行分割和分类的性能分析
Diagnostics (Basel). 2023 Jul 5;13(13):2282. doi: 10.3390/diagnostics13132282.
8
Deep Learning in Ischemic Stroke Imaging Analysis: A Comprehensive Review.深度学习在缺血性脑卒中影像分析中的应用:全面综述。
Biomed Res Int. 2022 Nov 14;2022:2456550. doi: 10.1155/2022/2456550. eCollection 2022.
基于深度关注卷积神经网络的胎儿 MRI 自动皮质板分割
IEEE Trans Med Imaging. 2021 Apr;40(4):1123-1133. doi: 10.1109/TMI.2020.3046579. Epub 2021 Apr 1.
4
UNet++: A Nested U-Net Architecture for Medical Image Segmentation.U-Net++:一种用于医学图像分割的嵌套U-Net架构。
Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. doi: 10.1007/978-3-030-00889-5_1. Epub 2018 Sep 20.
5
Accurate and robust deep learning-based segmentation of the prostate clinical target volume in ultrasound images.基于深度学习的超声图像中前列腺临床靶区的准确且稳健分割
Med Image Anal. 2019 Oct;57:186-196. doi: 10.1016/j.media.2019.07.005. Epub 2019 Jul 15.
6
Deep Learning Techniques for Medical Image Segmentation: Achievements and Challenges.深度学习技术在医学图像分割中的应用:成就与挑战。
J Digit Imaging. 2019 Aug;32(4):582-596. doi: 10.1007/s10278-019-00227-x.
7
Recalibrating Fully Convolutional Networks With Spatial and Channel "Squeeze and Excitation" Blocks.空间和通道“挤压和激励”块的全卷积网络重新校准。
IEEE Trans Med Imaging. 2019 Feb;38(2):540-549. doi: 10.1109/TMI.2018.2867261.
8
Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved?深度学习技术在自动 MRI 心脏多结构分割与诊断中的应用:问题是否已解决?
IEEE Trans Med Imaging. 2018 Nov;37(11):2514-2525. doi: 10.1109/TMI.2018.2837502. Epub 2018 May 17.
9
Automated processing pipeline for neonatal diffusion MRI in the developing Human Connectome Project.新生儿弥散 MRI 在人类连接组计划发展中的自动化处理流水线。
Neuroimage. 2019 Jan 15;185:750-763. doi: 10.1016/j.neuroimage.2018.05.064. Epub 2018 May 28.
10
Prostate segmentation in MRI using a convolutional neural network architecture and training strategy based on statistical shape models.基于统计形状模型的卷积神经网络架构和训练策略的 MRI 前列腺分割。
Int J Comput Assist Radiol Surg. 2018 Aug;13(8):1211-1219. doi: 10.1007/s11548-018-1785-8. Epub 2018 May 15.