• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

UNesT:用于高效医学分割的分层转换器的局部空间表示学习。

UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation.

机构信息

Department of Computer Science, Vanderbilt University, Nashville TN, 37212, USA.

Department of Biomedical Engineering, Vanderbilt University, Nashville, TN, 37212, USA.

出版信息

Med Image Anal. 2023 Dec;90:102939. doi: 10.1016/j.media.2023.102939. Epub 2023 Aug 25.

DOI:10.1016/j.media.2023.102939
PMID:37725868
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11229077/
Abstract

Transformer-based models, capable of learning better global dependencies, have recently demonstrated exceptional representation learning capabilities in computer vision and medical image analysis. Transformer reformats the image into separate patches and realizes global communication via the self-attention mechanism. However, positional information between patches is hard to preserve in such 1D sequences, and loss of it can lead to sub-optimal performance when dealing with large amounts of heterogeneous tissues of various sizes in 3D medical image segmentation. Additionally, current methods are not robust and efficient for heavy-duty medical segmentation tasks such as predicting a large number of tissue classes or modeling globally inter-connected tissue structures. To address such challenges and inspired by the nested hierarchical structures in vision transformer, we proposed a novel 3D medical image segmentation method (UNesT), employing a simplified and faster-converging transformer encoder design that achieves local communication among spatially adjacent patch sequences by aggregating them hierarchically. We extensively validate our method on multiple challenging datasets, consisting of multiple modalities, anatomies, and a wide range of tissue classes, including 133 structures in the brain, 14 organs in the abdomen, 4 hierarchical components in the kidneys, inter-connected kidney tumors and brain tumors. We show that UNesT consistently achieves state-of-the-art performance and evaluate its generalizability and data efficiency. Particularly, the model achieves whole brain segmentation task complete ROI with 133 tissue classes in a single network, outperforming prior state-of-the-art method SLANT27 ensembled with 27 networks. Our model performance increases the mean DSC score of the publicly available Colin and CANDI dataset from 0.7264 to 0.7444 and from 0.6968 to 0.7025, respectively. Code, pre-trained models, and use case pipeline are available at: https://github.com/MASILab/UNesT.

摘要

基于转换器的模型能够更好地学习全局依赖关系,最近在计算机视觉和医学图像分析中表现出了出色的表示学习能力。转换器将图像转换为单独的补丁,并通过自注意力机制实现全局通信。然而,在这种 1D 序列中,补丁之间的位置信息很难保留,当处理 3D 医学图像分割中大量不同大小的异质组织时,丢失位置信息会导致性能不佳。此外,当前的方法在处理大型医疗分割任务(如预测大量组织类别或建模全局互连的组织结构)时不够稳健和高效。为了解决这些挑战,并受到视觉转换器中嵌套层次结构的启发,我们提出了一种新的 3D 医学图像分割方法(UNesT),采用简化且收敛更快的转换器编码器设计,通过分层聚合实现空间相邻补丁序列之间的局部通信。我们在多个具有挑战性的数据集上广泛验证了我们的方法,这些数据集包含多种模态、解剖结构和广泛的组织类别,包括大脑中的 133 个结构、腹部中的 14 个器官、肾脏中的 4 个层次组件、互连的肾脏肿瘤和脑肿瘤。我们表明,UNesT 始终能够实现最先进的性能,并评估其泛化能力和数据效率。特别是,该模型在单个网络中实现了 133 个组织类别的整个大脑分割任务的完整 ROI,优于由 27 个网络集成的先前最先进的 SLANT27 方法。我们的模型性能分别将 Colin 和 CANDI 数据集的平均 DSC 评分从 0.7264 提高到 0.7444,从 0.6968 提高到 0.7025。代码、预训练模型和用例管道可在 https://github.com/MASILab/UNesT 上获得。

相似文献

1
UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation.UNesT:用于高效医学分割的分层转换器的局部空间表示学习。
Med Image Anal. 2023 Dec;90:102939. doi: 10.1016/j.media.2023.102939. Epub 2023 Aug 25.
2
VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation.VSmTrans:一种融合自注意力机制和卷积的 3D 医学图像分割混合范式。
Med Image Anal. 2024 Dec;98:103295. doi: 10.1016/j.media.2024.103295. Epub 2024 Aug 24.
3
Swin Unet3D: a three-dimensional medical image segmentation network combining vision transformer and convolution.Swin Unet3D:一种结合视觉Transformer 和卷积的三维医学图像分割网络。
BMC Med Inform Decis Mak. 2023 Feb 14;23(1):33. doi: 10.1186/s12911-023-02129-z.
4
ETUNet:Exploring efficient transformer enhanced UNet for 3D brain tumor segmentation.ETUNet:探索高效的基于Transformer 的增强型 UNet 进行 3D 脑肿瘤分割。
Comput Biol Med. 2024 Mar;171:108005. doi: 10.1016/j.compbiomed.2024.108005. Epub 2024 Jan 23.
5
MultiTrans: Multi-branch transformer network for medical image segmentation.多分支转换器网络在医学图像分割中的应用。
Comput Methods Programs Biomed. 2024 Sep;254:108280. doi: 10.1016/j.cmpb.2024.108280. Epub 2024 Jun 8.
6
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
7
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
8
3D whole brain segmentation using spatially localized atlas network tiles.使用空间局部化图谱网络瓦片进行 3D 全脑分割。
Neuroimage. 2019 Jul 1;194:105-119. doi: 10.1016/j.neuroimage.2019.03.041. Epub 2019 Mar 23.
9
Adaptive cascaded transformer U-Net for MRI brain tumor segmentation.基于自适应级联变换的 U-Net 模型在 MRI 脑肿瘤分割中的应用。
Phys Med Biol. 2024 May 27;69(11). doi: 10.1088/1361-6560/ad4081.
10
MISSFormer: An Effective Transformer for 2D Medical Image Segmentation.MISSFormer:用于二维医学图像分割的有效 Transformer。
IEEE Trans Med Imaging. 2023 May;42(5):1484-1494. doi: 10.1109/TMI.2022.3230943. Epub 2023 May 2.

引用本文的文献

1
Scalable quality control on processing of large diffusion-weighted and structural magnetic resonance imaging datasets.对大型扩散加权和结构磁共振成像数据集处理的可扩展质量控制
PLoS One. 2025 Aug 1;20(8):e0327388. doi: 10.1371/journal.pone.0327388. eCollection 2025.
2
Assessing the role of volumetric brain information in multiple sclerosis progression.评估脑容量信息在多发性硬化症进展中的作用。
Comput Struct Biotechnol J. 2025 May 12;27:2014-2033. doi: 10.1016/j.csbj.2025.05.003. eCollection 2025.
3
Enhanced nuclear information fusion and visual transformer for pathological breast cancer image classification.

本文引用的文献

1
TotalSegmentator: Robust Segmentation of 104 Anatomic Structures in CT Images.全段分割器:CT图像中104种解剖结构的稳健分割
Radiol Artif Intell. 2023 Jul 5;5(5):e230024. doi: 10.1148/ryai.230024. eCollection 2023 Sep.
2
MISSFormer: An Effective Transformer for 2D Medical Image Segmentation.MISSFormer:用于二维医学图像分割的有效 Transformer。
IEEE Trans Med Imaging. 2023 May;42(5):1484-1494. doi: 10.1109/TMI.2022.3230943. Epub 2023 May 2.
3
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives.
用于病理乳腺癌图像分类的增强核信息融合与视觉Transformer
Sci Rep. 2025 Jun 3;15(1):19490. doi: 10.1038/s41598-025-04344-2.
4
A 4D atlas of diffusion-informed spatial smoothing windows for BOLD signal in white matter.用于白质中BOLD信号的扩散信息空间平滑窗口的4D图谱。
Proc SPIE Int Soc Opt Eng. 2025 Feb;13406. doi: 10.1117/12.3047240. Epub 2025 Apr 11.
5
Multi-contrast computed tomography atlas of healthy pancreas with dense displacement sampling registration.采用密集位移采样配准的健康胰腺多对比计算机断层扫描图谱
J Med Imaging (Bellingham). 2025 Mar;12(2):024006. doi: 10.1117/1.JMI.12.2.024006. Epub 2025 Apr 17.
6
Improving coronary artery segmentation with self-supervised learning and automated pericoronary adipose tissue segmentation: a multi-institutional study on coronary computed tomography angiography images.通过自监督学习和自动冠状动脉周围脂肪组织分割改善冠状动脉分割:一项关于冠状动脉计算机断层扫描血管造影图像的多机构研究。
J Med Imaging (Bellingham). 2025 Jan;12(1):016002. doi: 10.1117/1.JMI.12.1.016002. Epub 2025 Feb 17.
7
Data-Oriented Octree Inverse Hierarchical Order Aggregation Hybrid Transformer-CNN for 3D Medical Segmentation.用于3D医学分割的面向数据的八叉树逆层次顺序聚合混合Transformer-CNN
J Imaging Inform Med. 2025 Jan 7. doi: 10.1007/s10278-024-01299-0.
8
Comparison and calibration of MP2RAGE quantitative T1 values to multi-TI inversion recovery T1 values.MP2RAGE定量T1值与多TI反转恢复T1值的比较与校准。
Magn Reson Imaging. 2025 Apr;117:110322. doi: 10.1016/j.mri.2025.110322. Epub 2025 Jan 3.
9
Automatic jawbone structure segmentation on dental CBCT images via deep learning.基于深度学习的口腔锥形束 CT 图像下颌骨结构自动分割。
Clin Oral Investig. 2024 Nov 28;28(12):663. doi: 10.1007/s00784-024-06061-y.
10
OpenMAP-T1: A Rapid Deep-Learning Approach to Parcellate 280 Anatomical Regions to Cover the Whole Brain.OpenMAP-T1:一种快速深度学习方法,可将 280 个解剖区域分割为覆盖整个大脑的区域。
Hum Brain Mapp. 2024 Nov;45(16):e70063. doi: 10.1002/hbm.70063.
基于 Transformer 的医学影像变革?关键特性、当前进展和未来展望的对比综述。
Med Image Anal. 2023 Apr;85:102762. doi: 10.1016/j.media.2023.102762. Epub 2023 Jan 31.
4
Quantification of muscle, bones, and fat on single slice thigh CT.单层面大腿CT上肌肉、骨骼和脂肪的定量分析。
Proc SPIE Int Soc Opt Eng. 2022 Feb-Mar;12032. doi: 10.1117/12.2611664. Epub 2022 Apr 4.
5
Label efficient segmentation of single slice thigh CT with two-stage pseudo labels.基于两阶段伪标签的单层面大腿CT图像的标签高效分割
J Med Imaging (Bellingham). 2022 Sep;9(5):052405. doi: 10.1117/1.JMI.9.5.052405. Epub 2022 May 19.
6
Renal Cortex, Medulla and Pelvicaliceal System Segmentation on Arterial Phase CT Images with Random Patch-based Networks.基于随机补丁网络的动脉期CT图像上肾皮质、髓质和肾盂肾盏系统分割
Proc SPIE Int Soc Opt Eng. 2021;11596. doi: 10.1117/12.2581101. Epub 2021 Feb 15.
7
High-resolution 3D abdominal segmentation with random patch network fusion.基于随机补丁网络融合的高分辨率三维腹部分割
Med Image Anal. 2021 Apr;69:101894. doi: 10.1016/j.media.2020.101894. Epub 2020 Dec 16.
8
nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation.nnU-Net:一种基于深度学习的生物医学图像分割的自配置方法。
Nat Methods. 2021 Feb;18(2):203-211. doi: 10.1038/s41592-020-01008-z. Epub 2020 Dec 7.
9
Models Genesis.模型起源。
Med Image Anal. 2021 Jan;67:101840. doi: 10.1016/j.media.2020.101840. Epub 2020 Oct 13.
10
The state of the art in kidney and kidney tumor segmentation in contrast-enhanced CT imaging: Results of the KiTS19 challenge.对比增强CT成像中肾脏及肾肿瘤分割的技术现状:KiTS19挑战赛结果
Med Image Anal. 2021 Jan;67:101821. doi: 10.1016/j.media.2020.101821. Epub 2020 Oct 2.