Suppr超能文献

FastSAM3D:一种用于3D体医学图像的高效图像分割模型。

FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images.

作者信息

Shen Yiqing, Li Jingxing, Shao Xinyuan, Romillo Blanca Inigo, Jindal Ankush, Dreizin David, Unberath Mathias

机构信息

Johns Hopkins University, Baltimore, MD 21218, USA.

University of Maryland School of Medicine and R Adams Cowley Shock Trauma Center, Baltimore, MD 21201, USA.

出版信息

Med Image Comput Comput Assist Interv. 2024 Oct;15012:542-552. doi: 10.1007/978-3-031-72390-2_51. Epub 2024 Oct 23.

Abstract

Segment anything models (SAMs) are gaining attention for their zero-shot generalization capability in segmenting objects of unseen classes and in unseen domains when properly prompted. Interactivity is a key strength of SAMs, allowing users to iteratively provide prompts that specify objects of interest to refine outputs. However, to realize the interactive use of SAMs for 3D medical imaging tasks, rapid inference times are necessary. High memory requirements and long processing delays remain constraints that hinder the adoption of SAMs for this purpose. Specifically, while 2D SAMs applied to 3D volumes contend with repetitive computation to process all slices independently, 3D SAMs suffer from an exponential increase in model parameters and FLOPS. To address these challenges, we present FastSAM3D which accelerates SAM inference to 8 milliseconds per 128 × 128 × 128 3D volumetric image on an NVIDIA A100 GPU. This speedup is accomplished through 1) a novel layer-wise progressive distillation scheme that enables knowledge transfer from a complex 12-layer ViT-B to a lightweight 6-layer ViT-Tiny variant encoder without training from scratch; and 2) a novel 3D sparse flash attention to replace vanilla attention operators, substantially reducing memory needs and improving parallelization. Experiments on three diverse datasets reveal that FastSAM3D achieves a remarkable speedup of 527.38× compared to 2D SAMs and 8.75× compared to 3D SAMs on the same volumes without significant performance decline. Thus, FastSAM3D opens the door for low-cost truly interactive SAM-based 3D medical imaging segmentation with commonly used GPU hardware. Code is available at https://github.com/arcadelab/FastSAM3D.

摘要

分割一切模型(SAMs)因其在适当提示下对未见类别和未见领域中的对象进行分割的零样本泛化能力而受到关注。交互性是SAMs的一项关键优势,它允许用户迭代地提供指定感兴趣对象的提示,以优化输出。然而,要实现SAMs在3D医学成像任务中的交互使用,快速推理时间是必要的。高内存需求和长处理延迟仍然是阻碍为此目的采用SAMs的限制因素。具体而言,虽然应用于3D体积的2D SAMs要处理所有切片,存在重复计算的问题,而3D SAMs则面临模型参数和浮点运算次数(FLOPS)呈指数增长的问题。为应对这些挑战,我们提出了FastSAM3D,它在NVIDIA A100 GPU上,将对128×128×128的3D体积图像的SAM推理加速到每幅图像8毫秒。这种加速是通过以下方式实现的:1)一种新颖的逐层渐进式蒸馏方案,该方案能够在无需从头训练的情况下,将复杂的12层ViT-B中的知识转移到轻量级的6层ViT-Tiny变体编码器;2)一种新颖的3D稀疏闪存注意力机制,以取代普通注意力算子,大幅减少内存需求并提高并行化程度。在三个不同数据集上的实验表明,FastSAM3D与2D SAMs相比,在相同体积上实现了527.38倍的显著加速,与3D SAMs相比实现了8.75倍的加速,且性能没有显著下降。因此,FastSAM3D为使用常用GPU硬件进行低成本、基于SAM的真正交互式3D医学成像分割打开了大门。代码可在https://github.com/arcadelab/FastSAM3D获取。

相似文献

1
FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images.FastSAM3D:一种用于3D体医学图像的高效图像分割模型。
Med Image Comput Comput Assist Interv. 2024 Oct;15012:542-552. doi: 10.1007/978-3-031-72390-2_51. Epub 2024 Oct 23.
5
SAMCT: Segment Any CT Allowing Labor-Free Task-Indicator Prompts.SAMCT:支持无人工任务指标提示的任意CT分割
IEEE Trans Med Imaging. 2025 Mar;44(3):1386-1399. doi: 10.1109/TMI.2024.3493456. Epub 2025 Mar 17.

本文引用的文献

1
MoViT: Memorizing Vision Transformers for Medical Image Analysis.MoViT:用于医学图像分析的记忆视觉Transformer
Mach Learn Med Imaging. 2024;14349:205-213. doi: 10.1007/978-3-031-45676-3_21. Epub 2023 Oct 15.
3
Segment anything in medical images.在医学图像中分割任何内容。
Nat Commun. 2024 Jan 22;15(1):654. doi: 10.1038/s41467-024-44824-z.
4
6
A survey on deep learning for skin lesion segmentation.深度学习在皮肤病变分割中的研究综述。
Med Image Anal. 2023 Aug;88:102863. doi: 10.1016/j.media.2023.102863. Epub 2023 Jun 9.
10
State-of-the-Art Methods for Brain Tissue Segmentation: A Review.用于脑组织分割的最新方法:综述。
IEEE Rev Biomed Eng. 2017;10:235-249. doi: 10.1109/RBME.2017.2715350. Epub 2017 Jun 14.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验