• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于医学分割任务的大型基础模型专业化的必要性和影响。

Necessity and impact of specialization of large foundation model for medical segmentation tasks.

作者信息

Nguyen Eric, Liu Hengjie, Ruan Dan

机构信息

Department of Radiation Oncology, University of California Los Angeles, Los Angeles, California, USA.

Department of Bioengineering, University of California Los Angeles, Los Angeles, California, USA.

出版信息

Med Phys. 2025 Jan;52(1):321-328. doi: 10.1002/mp.17470. Epub 2024 Oct 21.

DOI:10.1002/mp.17470
PMID:39431952
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11699994/
Abstract

BACKGROUND

Large foundation models, such as the Segment Anything Model (SAM), have shown remarkable performance in image segmentation tasks. However, the optimal approach to achieve true utility of these models for domain-specific applications, such as medical image segmentation, remains an open question. Recent studies have released a medical version of the foundation model MedSAM by training on vast medical data, who promised SOTA medical segmentation. Independent community inspection and dissection is needed.

PURPOSE

Foundation models are developed for general purposes. On the other hand, stable delivery of reliable performance is key to clinical utility. This study aims at elucidating the potential advantage and limitations of landing the foundation models in clinical use by assessing the performance of off-the-shelf medical foundation model MedSAM for the segmentation of anatomical structures in pelvic MR images. We also explore the simple remedies by evaluating the dependency on prompting scheme. Finally, we demonstrate the need and performance gain of further specialized fine-tuning.

METHODS

MedSAM and its lightweight version LiteMedSAM were evaluated out-of-the-box on a public MR dataset consisting of 589 pelvic images split 80:20 for training and testing. An nnU-Net model was trained from scratch to serve as a benchmark and to provide bounding box prompts for MedSAM. MedSAM was evaluated using different quality bounding boxes, those derived from ground truth labels, those derived from nnU-Net, and those derived from the former two but with 5-pixel isometric expansion. Lastly, LiteMedSAM was refined on the training set and reevaluated on this task.

RESULTS

Out-of-the-box MedSAM and LiteMedSAM both performed poorly across the structure set, especially for disjoint or non-convex structures. Varying prompt with different bounding box inputs had minimal effect. For example, the mean Dice score and mean Hausdorff distances (in mm) for obturator internus using MedSAM and LiteMedSAM were {0.251 ± 0.110, 0.101 ± 0.079} and {34.142 ± 5.196, 33.688 ± 5.306}, respectively. Fine-tuning of LiteMedSAM led to significant performance gain, improving Dice score and Hausdorff distance for the obturator internus to 0.864 ± 0.123 and 5.022 ± 10.684, on par with nnU-Net with no significant difference in evaluation of most structures. All segmentation structures benefited significantly from specialized refinement, at varying improvement margin.

CONCLUSION

While our study alludes to the potential of deep learning models like MedSAM and LiteMedSAM for medical segmentation, it highlights the need for specialized refinement and adjudication. Off-the-shelf use of such large foundation models is highly likely to be suboptimal, and specialized fine-tuning is often necessary to achieve clinical desired accuracy and stability.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/6f9c0f2b4fae/MP-52-321-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/afcf3d62af77/MP-52-321-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/ebe254af1f73/MP-52-321-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/5d4527004686/MP-52-321-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/557f99ffdd06/MP-52-321-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/6f9c0f2b4fae/MP-52-321-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/afcf3d62af77/MP-52-321-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/ebe254af1f73/MP-52-321-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/5d4527004686/MP-52-321-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/557f99ffdd06/MP-52-321-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d742/11699994/6f9c0f2b4fae/MP-52-321-g004.jpg
摘要

背景

大型基础模型,如分割一切模型(SAM),在图像分割任务中表现出卓越性能。然而,要使这些模型在特定领域应用(如医学图像分割)中真正发挥效用,最佳方法仍是一个悬而未决的问题。最近的研究通过在大量医学数据上进行训练,发布了基础模型MedSAM的医学版本,宣称其具有领先的医学分割性能。需要独立的社区检查和剖析。

目的

基础模型是为通用目的而开发的。另一方面,稳定地提供可靠性能是临床应用的关键。本研究旨在通过评估现成的医学基础模型MedSAM对盆腔磁共振图像中解剖结构的分割性能,阐明将基础模型应用于临床的潜在优势和局限性。我们还通过评估对提示方案的依赖性来探索简单的补救措施。最后,我们证明了进一步进行专门微调的必要性和性能提升。

方法

在一个由589张盆腔图像组成的公共磁共振数据集上对MedSAM及其轻量级版本LiteMedSAM进行开箱即用的评估,该数据集按80:20划分为训练集和测试集。从零开始训练一个nnU-Net模型作为基准,并为MedSAM提供边界框提示。使用不同质量的边界框对MedSAM进行评估,这些边界框分别来自真实标签、nnU-Net,以及由前两者派生但有5像素等距扩展的边界框。最后,在训练集上对LiteMedSAM进行优化,并在该任务上重新评估。

结果

开箱即用的MedSAM和LiteMedSAM在整个结构集上的表现都很差,尤其是对于不连续或非凸结构。使用不同的边界框输入改变提示的效果甚微。例如,使用MedSAM和LiteMedSAM分割闭孔内肌的平均Dice分数和平均豪斯多夫距离(单位:毫米)分别为{0.251 ± 0.110, 0.101 ± 0.079}和{34.142 ± 5.196, 33.688 ± 5.306}。对LiteMedSAM进行微调带来了显著的性能提升,闭孔内肌的Dice分数和豪斯多夫距离分别提高到0.864 ± 0.123和5.022 ± 10. + 684,与nnU-Net相当,在大多数结构的评估中没有显著差异。所有分割结构都从专门的优化中显著受益,提升幅度各不相同。

结论

虽然我们的研究暗示了像MedSAM和LiteMedSAM这样的深度学习模型在医学分割方面的潜力,但它强调了进行专门优化和裁决的必要性。直接使用这种大型基础模型很可能不是最优的,通常需要进行专门的微调才能达到临床所需的准确性和稳定性。

相似文献

1
Necessity and impact of specialization of large foundation model for medical segmentation tasks.用于医学分割任务的大型基础模型专业化的必要性和影响。
Med Phys. 2025 Jan;52(1):321-328. doi: 10.1002/mp.17470. Epub 2024 Oct 21.
2
Sam2Rad: A segmentation model for medical images with learnable prompts.Sam2Rad:一种具有可学习提示的医学图像分割模型。
Comput Biol Med. 2025 Mar;187:109725. doi: 10.1016/j.compbiomed.2025.109725. Epub 2025 Feb 5.
3
Liver Observation Segmentation on Contrast-Enhanced MRI: SAM and MedSAM Performance in Patients With Probable or Definite Hepatocellular Carcinoma.对比增强 MRI 下肝脏观察分割:SAM 和 MedSAM 在疑似或明确肝细胞癌患者中的性能。
Can Assoc Radiol J. 2024 Nov;75(4):771-779. doi: 10.1177/08465371241250215. Epub 2024 May 7.
4
Enhancing Medical Imaging Segmentation with GB-SAM: A Novel Approach to Tissue Segmentation Using Granular Box Prompts.利用GB-SAM增强医学影像分割:一种使用粒度框提示进行组织分割的新方法。
Cancers (Basel). 2024 Jun 28;16(13):2391. doi: 10.3390/cancers16132391.
5
Foundational Segmentation Models and Clinical Data Mining Enable Accurate Computer Vision for Lung Cancer.基础分割模型和临床数据挖掘助力肺癌的精确计算机视觉。
J Imaging Inform Med. 2025 Jun;38(3):1552-1562. doi: 10.1007/s10278-024-01304-6. Epub 2024 Oct 22.
6
Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM.基于框提示MedSAM的点监督脑肿瘤分割
ArXiv. 2024 Aug 1:arXiv:2408.00706v1.
7
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
8
A segment anything model-guided and match-based semi-supervised segmentation framework for medical imaging.一种用于医学成像的基于段式分割模型引导和匹配的半监督分割框架。
Med Phys. 2025 Mar 29. doi: 10.1002/mp.17785.
9
How much data do you need? An analysis of pelvic multi-organ segmentation in a limited data context.你需要多少数据?有限数据背景下盆腔多器官分割的分析。
Phys Eng Sci Med. 2025 Mar;48(1):409-419. doi: 10.1007/s13246-024-01514-w. Epub 2025 Mar 11.
10
Plug-and-play segment anything model improves nnUNet performance.即插即用的分割一切模型提升了nnUNet的性能。
Med Phys. 2025 Feb;52(2):899-912. doi: 10.1002/mp.17481. Epub 2024 Oct 28.

本文引用的文献

1
Segment anything in medical images.在医学图像中分割任何内容。
Nat Commun. 2024 Jan 22;15(1):654. doi: 10.1038/s41467-024-44824-z.
2
Segment anything model for medical images?用于医学图像的图像分割模型?
Med Image Anal. 2024 Feb;92:103061. doi: 10.1016/j.media.2023.103061. Epub 2023 Dec 7.
3
Prototypical few-shot segmentation for cross-institution male pelvic structures with spatial registration.基于空间配准的跨机构男性盆腔结构的典型少样本分割
Med Image Anal. 2023 Dec;90:102935. doi: 10.1016/j.media.2023.102935. Epub 2023 Aug 26.
4
Segment anything model for medical image analysis: An experimental study.用于医学图像分析的分割模型:一项实验研究。
Med Image Anal. 2023 Oct;89:102918. doi: 10.1016/j.media.2023.102918. Epub 2023 Aug 2.
5
Investigation and benchmarking of U-Nets on prostate segmentation tasks.基于 U-Nets 的前列腺分割任务的研究与基准测试。
Comput Med Imaging Graph. 2023 Jul;107:102241. doi: 10.1016/j.compmedimag.2023.102241. Epub 2023 May 12.
6
nnU-Net Deep Learning Method for Segmenting Parenchyma and Determining Liver Volume From Computed Tomography Images.基于计算机断层扫描图像的nnU-Net深度学习方法用于实质分割和肝脏体积测定
Ann Surg Open. 2022 Jun;3(2). doi: 10.1097/as9.0000000000000155. Epub 2022 Mar 30.
7
Transfer learning for medical image classification: a literature review.医学图像分类的迁移学习:文献综述。
BMC Med Imaging. 2022 Apr 13;22(1):69. doi: 10.1186/s12880-022-00793-7.
8
nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation.nnU-Net:一种基于深度学习的生物医学图像分割的自配置方法。
Nat Methods. 2021 Feb;18(2):203-211. doi: 10.1038/s41592-020-01008-z. Epub 2020 Dec 7.