Wahd Assefa S, Kupper Jessica, Jaremko Jacob L, Hareendranathan Abhilash R
Annu Int Conf IEEE Eng Med Biol Soc. 2024 Jul;2024:1-4. doi: 10.1109/EMBC53108.2024.10782494.
Segment Anything Model (SAM) is a foundation model that can be conditioned on sparse prompts, such as boxes or points, and dense prompts, such as masks. SAM outputs binary masks based on the given prompts but lacks semantic understanding, as it does not predict the class of the output mask. We propose Semantic AutoSAM, a semantic segmentation model that builds on SAM's binary segmentation. Semantic AutoSAM replaces SAM's manual prompt encoder with a lightweight cross-attention module, enabling it to predict prompt embeddings directly from the image features. This eliminates the need for manual prompting. In experiments on the FLAIR 2022 dataset (20 CT scans) and a hip ultrasound dataset (4849 2D images), Semantic AutoSAM matches the performance of ground-truth bounding-box prompting for most organs. Our method achieves a Dice score of 0.62 on the FLAIR dataset, versus 0.7 for MobileSAM prompted with the ground-truth box. On the hip ultrasound dataset, our approach achieves a Dice score of 0.83, surpassing MobileSAM's 0.81 even though MobileSAM had access to the ground-truth box at prediction time. Notably, our method requires no manual prompts at test time.
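The core idea, predicting prompt embeddings from image features via cross-attention instead of encoding manual boxes or points, can be illustrated with a minimal sketch. This is a hypothetical PyTorch module, not the authors' implementation: the number of queries, embedding width, and head count are assumed, and `PromptPredictor` is an illustrative name. Learnable query vectors attend over the flattened image-encoder features to produce prompt embeddings that could be fed to a SAM-style mask decoder.

```python
import torch
import torch.nn as nn

class PromptPredictor(nn.Module):
    """Hypothetical cross-attention prompt predictor (sketch only).

    Learnable queries attend over flattened image features (e.g. from
    SAM's image encoder) to produce prompt embeddings, replacing
    manually supplied box/point prompts.
    """

    def __init__(self, embed_dim: int = 256, num_queries: int = 8, num_heads: int = 8):
        super().__init__()
        # One learnable query per candidate prompt embedding (count is an assumption).
        self.queries = nn.Parameter(torch.randn(num_queries, embed_dim))
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(embed_dim)

    def forward(self, image_feats: torch.Tensor) -> torch.Tensor:
        # image_feats: (B, C, H, W) feature map from the image encoder.
        b, c, h, w = image_feats.shape
        kv = image_feats.flatten(2).transpose(1, 2)      # (B, H*W, C) keys/values
        q = self.queries.unsqueeze(0).expand(b, -1, -1)  # (B, Q, C) queries
        out, _ = self.attn(q, kv, kv)                    # cross-attention over image tokens
        return self.norm(out)                            # (B, Q, C) prompt embeddings

feats = torch.randn(2, 256, 64, 64)  # stand-in for an image-encoder output
prompts = PromptPredictor()(feats)
print(prompts.shape)
```

Because the queries are learned end-to-end with the segmentation loss, each can specialize toward a semantic class, which is one plausible way such a module supplies the class information that vanilla SAM lacks.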