FastSAM3D：一种用于3D体医学图像的高效图像分割模型。

FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images.

作者信息

Shen Yiqing, Li Jingxing, Shao Xinyuan, Romillo Blanca Inigo, Jindal Ankush, Dreizin David, Unberath Mathias

机构信息

Johns Hopkins University, Baltimore, MD 21218, USA.

University of Maryland School of Medicine and R Adams Cowley Shock Trauma Center, Baltimore, MD 21201, USA.

出版信息

Med Image Comput Comput Assist Interv. 2024 Oct;15012:542-552. doi: 10.1007/978-3-031-72390-2_51. Epub 2024 Oct 23.

DOI:10.1007/978-3-031-72390-2_51

PMID:40861900

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12377522/

Abstract

Segment anything models (SAMs) are gaining attention for their zero-shot generalization capability in segmenting objects of unseen classes and in unseen domains when properly prompted. Interactivity is a key strength of SAMs, allowing users to iteratively provide prompts that specify objects of interest to refine outputs. However, to realize the interactive use of SAMs for 3D medical imaging tasks, rapid inference times are necessary. High memory requirements and long processing delays remain constraints that hinder the adoption of SAMs for this purpose. Specifically, while 2D SAMs applied to 3D volumes contend with repetitive computation to process all slices independently, 3D SAMs suffer from an exponential increase in model parameters and FLOPS. To address these challenges, we present FastSAM3D which accelerates SAM inference to 8 milliseconds per 128 × 128 × 128 3D volumetric image on an NVIDIA A100 GPU. This speedup is accomplished through 1) a novel layer-wise progressive distillation scheme that enables knowledge transfer from a complex 12-layer ViT-B to a lightweight 6-layer ViT-Tiny variant encoder without training from scratch; and 2) a novel 3D sparse flash attention to replace vanilla attention operators, substantially reducing memory needs and improving parallelization. Experiments on three diverse datasets reveal that FastSAM3D achieves a remarkable speedup of 527.38× compared to 2D SAMs and 8.75× compared to 3D SAMs on the same volumes without significant performance decline. Thus, FastSAM3D opens the door for low-cost truly interactive SAM-based 3D medical imaging segmentation with commonly used GPU hardware. Code is available at https://github.com/arcadelab/FastSAM3D.

摘要

分割一切模型（SAMs）因其在适当提示下对未见类别和未见领域中的对象进行分割的零样本泛化能力而受到关注。交互性是SAMs的一项关键优势，它允许用户迭代地提供指定感兴趣对象的提示，以优化输出。然而，要实现SAMs在3D医学成像任务中的交互使用，快速推理时间是必要的。高内存需求和长处理延迟仍然是阻碍为此目的采用SAMs的限制因素。具体而言，虽然应用于3D体积的2D SAMs要处理所有切片，存在重复计算的问题，而3D SAMs则面临模型参数和浮点运算次数（FLOPS）呈指数增长的问题。为应对这些挑战，我们提出了FastSAM3D，它在NVIDIA A100 GPU上，将对128×128×128的3D体积图像的SAM推理加速到每幅图像8毫秒。这种加速是通过以下方式实现的：1）一种新颖的逐层渐进式蒸馏方案，该方案能够在无需从头训练的情况下，将复杂的12层ViT-B中的知识转移到轻量级的6层ViT-Tiny变体编码器；2）一种新颖的3D稀疏闪存注意力机制，以取代普通注意力算子，大幅减少内存需求并提高并行化程度。在三个不同数据集上的实验表明，FastSAM3D与2D SAMs相比，在相同体积上实现了527.38倍的显著加速，与3D SAMs相比实现了8.75倍的加速，且性能没有显著下降。因此，FastSAM3D为使用常用GPU硬件进行低成本、基于SAM的真正交互式3D医学成像分割打开了大门。代码可在https://github.com/arcadelab/FastSAM3D获取。

相似文献

FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images.FastSAM3D：一种用于3D体医学图像的高效图像分割模型。

Med Image Comput Comput Assist Interv. 2024 Oct;15012:542-552. doi: 10.1007/978-3-031-72390-2_51. Epub 2024 Oct 23.

FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification.FastSAM-3DSlicer：一种用于具有不确定性量化的3D体积分割一切模型的3D Slicer扩展。

Found Models Gen Med AI (2024). 2025;15184:1-9. doi: 10.1007/978-3-031-73471-7_1. Epub 2024 Sep 28.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

A segment anything model-guided and match-based semi-supervised segmentation framework for medical imaging.一种用于医学成像的基于段式分割模型引导和匹配的半监督分割框架。

Med Phys. 2025 Mar 29. doi: 10.1002/mp.17785.

SAMCT: Segment Any CT Allowing Labor-Free Task-Indicator Prompts.SAMCT：支持无人工任务指标提示的任意CT分割

IEEE Trans Med Imaging. 2025 Mar;44(3):1386-1399. doi: 10.1109/TMI.2024.3493456. Epub 2025 Mar 17.

Short-Term Memory Impairment短期记忆障碍

Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。

Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.

Sexual Harassment and Prevention Training性骚扰与预防培训

A segmentation method for oral CBCT image based on Segment Anything Model and semi-supervised teacher-student model.一种基于分割一切模型和半监督师生模型的口腔锥形束计算机断层扫描（CBCT）图像分割方法。

Med Phys. 2025 May 7. doi: 10.1002/mp.17854.

Using Segment Anything Model 2 for Zero-Shot 3D Segmentation of Abdominal Organs in Computed Tomography Scans to Adapt Video Tracking Capabilities for 3D Medical Imaging: Algorithm Development and Validation.使用Segment Anything Model 2对计算机断层扫描中的腹部器官进行零样本三维分割，以适应三维医学成像的视频跟踪能力：算法开发与验证

JMIR AI. 2025 Apr 29;4:e72109. doi: 10.2196/72109.

本文引用的文献

MoViT: Memorizing Vision Transformers for Medical Image Analysis.MoViT：用于医学图像分析的记忆视觉Transformer

Mach Learn Med Imaging. 2024;14349:205-213. doi: 10.1007/978-3-031-45676-3_21. Epub 2023 Oct 15.

Segment anything model for medical image segmentation: Current applications and future directions.用于医学图像分割的分割模型：当前应用与未来方向。

Comput Biol Med. 2024 Mar;171:108238. doi: 10.1016/j.compbiomed.2024.108238. Epub 2024 Feb 27.

Segment anything in medical images.在医学图像中分割任何内容。

Nat Commun. 2024 Jan 22;15(1):654. doi: 10.1038/s41467-024-44824-z.

TotalSegmentator: Robust Segmentation of 104 Anatomic Structures in CT Images.全段分割器：CT图像中104种解剖结构的稳健分割

Radiol Artif Intell. 2023 Jul 5;5(5):e230024. doi: 10.1148/ryai.230024. eCollection 2023 Sep.

Segment anything model for medical image analysis: An experimental study.用于医学图像分析的分割模型：一项实验研究。

Med Image Anal. 2023 Oct;89:102918. doi: 10.1016/j.media.2023.102918. Epub 2023 Aug 2.

A survey on deep learning for skin lesion segmentation.深度学习在皮肤病变分割中的研究综述。

Med Image Anal. 2023 Aug;88:102863. doi: 10.1016/j.media.2023.102863. Epub 2023 Jun 9.

ClusterSeg: A crowd cluster pinpointed nucleus segmentation framework with cross-modality datasets.ClusterSeg：一个具有跨模态数据集的人群聚类精准细胞核分割框架。

Med Image Anal. 2023 Apr;85:102758. doi: 10.1016/j.media.2023.102758. Epub 2023 Jan 24.

Swin Transformer Improves the IDH Mutation Status Prediction of Gliomas Free of MRI-Based Tumor Segmentation.基于无MRI肿瘤分割的Swin Transformer改善了胶质瘤的异柠檬酸脱氢酶（IDH）突变状态预测。

J Clin Med. 2022 Aug 8;11(15):4625. doi: 10.3390/jcm11154625.

Interactive Medical Image Segmentation Using Deep Learning With Image-Specific Fine Tuning.基于图像特定精细调整的深度学习的交互式医学图像分割。

IEEE Trans Med Imaging. 2018 Jul;37(7):1562-1573. doi: 10.1109/TMI.2018.2791721.

State-of-the-Art Methods for Brain Tissue Segmentation: A Review.用于脑组织分割的最新方法：综述。

IEEE Rev Biomed Eng. 2017;10:235-249. doi: 10.1109/RBME.2017.2715350. Epub 2017 Jun 14.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验