

MA-SAM: Modality-agnostic SAM adaptation for 3D medical image segmentation.

Affiliations

Center of Advanced Medical Computing and Analysis, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA.

Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China.

Publication information

Med Image Anal. 2024 Dec;98:103310. doi: 10.1016/j.media.2024.103310. Epub 2024 Aug 22.

Abstract

The Segment Anything Model (SAM), a foundation model for general image segmentation, has demonstrated impressive zero-shot performance across numerous natural image segmentation tasks. However, SAM's performance significantly declines when applied to medical images, primarily due to the substantial disparity between natural and medical image domains. To effectively adapt SAM to medical images, it is important to incorporate critical third-dimensional information, i.e., volumetric or temporal knowledge, during fine-tuning. Simultaneously, we aim to harness SAM's pre-trained weights within its original 2D backbone to the fullest extent. In this paper, we introduce a modality-agnostic SAM adaptation framework, named MA-SAM, that is applicable to various volumetric and video medical data. Our method is rooted in a parameter-efficient fine-tuning strategy that updates only a small portion of weight increments while preserving the majority of SAM's pre-trained weights. By injecting a series of 3D adapters into the transformer blocks of the image encoder, our method enables the pre-trained 2D backbone to extract third-dimensional information from input data. We comprehensively evaluate our method on five medical image segmentation tasks using 11 public datasets across CT, MRI, and surgical video data. Remarkably, without using any prompt, our method consistently outperforms various state-of-the-art 3D approaches, surpassing nnU-Net by 0.9%, 2.6%, and 9.9% in Dice for CT multi-organ segmentation, MRI prostate segmentation, and surgical scene segmentation, respectively. Our model also demonstrates strong generalization, and excels in challenging tumor segmentation when prompts are used. Our code is available at: https://github.com/cchen-cc/MA-SAM.
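The 3D-adapter idea described in the abstract — a small trainable bottleneck inserted into each frozen transformer block, mixing information across the slice (depth) axis — can be sketched as follows. This is an illustrative toy, not the authors' implementation: the function name, shapes, and the neighbor-averaging depth mixer (a stand-in for the 3D convolutions used in the actual MA-SAM adapters) are all assumptions; see the paper and repository for the real module.

```python
import numpy as np

rng = np.random.default_rng(0)

def adapter_3d(x, w_down, w_up):
    """Schematic 3D adapter: down-project, mix activations across the
    depth (slice) axis, up-project, and add a residual connection.
    x: (depth, tokens, channels) activations from the frozen 2D backbone.
    """
    h = np.maximum(x @ w_down, 0.0)  # down-projection + ReLU into a low-rank bottleneck
    # Depth mixing: average each slice with its two neighbors — a crude
    # stand-in for the learned 3D convolution in MA-SAM's adapters.
    mixed = (np.roll(h, 1, axis=0) + h + np.roll(h, -1, axis=0)) / 3.0
    return x + mixed @ w_up          # up-projection + residual

depth, tokens, channels, rank = 8, 16, 64, 8   # rank: bottleneck width
w_down = rng.normal(0.0, 0.02, (channels, rank))
w_up = np.zeros((rank, channels))  # zero-init so the adapter starts as an identity
x = rng.normal(size=(depth, tokens, channels))
y = adapter_3d(x, w_down, w_up)
print(y.shape)            # (8, 16, 64)
print(np.allclose(y, x))  # True: zero-init up-projection preserves pretrained behavior
```

The zero-initialized up-projection reflects the usual parameter-efficient fine-tuning practice: at the start of training the adapter contributes nothing, so the frozen pre-trained backbone's behavior is preserved, and only the small adapter weights are updated.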


Similar articles

SAMCT: Segment Any CT Allowing Labor-Free Task-Indicator Prompts.
IEEE Trans Med Imaging. 2025 Mar;44(3):1386-1399. doi: 10.1109/TMI.2024.3493456. Epub 2025 Mar 17.

Cited by

Leveraging advanced feature extraction for improved kidney biopsy segmentation.
Front Med (Lausanne). 2025 Jun 18;12:1591999. doi: 10.3389/fmed.2025.1591999. eCollection 2025.
PRISM: A Promptable and Robust Interactive Segmentation Model with Visual Prompts.
Med Image Comput Comput Assist Interv. 2024 Oct;15003:389-399. doi: 10.1007/978-3-031-72384-1_37. Epub 2024 Oct 3.

