用于医学成像的通用视觉基础模型：以零样本医学分割中的分割一切模型为例

Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation.

作者信息

Shi Peilun, Qiu Jianing, Abaxi Sai Mu Dalike, Wei Hao, Lo Frank P-W, Yuan Wu

机构信息

Department of Biomedical Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China.

Department of Computing, Imperial College London, London SW7 2AZ, UK.

出版信息

Diagnostics (Basel). 2023 Jun 2;13(11):1947. doi: 10.3390/diagnostics13111947.

DOI:10.3390/diagnostics13111947

PMID:37296799

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10252742/

Abstract

Medical image analysis plays an important role in clinical diagnosis. In this paper, we examine the recent Segment Anything Model (SAM) on medical images, and report both quantitative and qualitative zero-shot segmentation results on nine medical image segmentation benchmarks, covering various imaging modalities, such as optical coherence tomography (OCT), magnetic resonance imaging (MRI), and computed tomography (CT), as well as different applications including dermatology, ophthalmology, and radiology. Those benchmarks are representative and commonly used in model development. Our experimental results indicate that while SAM presents remarkable segmentation performance on images from the general domain, its zero-shot segmentation ability remains restricted for out-of-distribution images, e.g., medical images. In addition, SAM exhibits inconsistent zero-shot segmentation performance across different unseen medical domains. For certain structured targets, e.g., blood vessels, the zero-shot segmentation of SAM completely failed. In contrast, a simple fine-tuning of it with a small amount of data could lead to remarkable improvement of the segmentation quality, showing the great potential and feasibility of using fine-tuned SAM to achieve accurate medical image segmentation for a precision diagnostics. Our study indicates the versatility of generalist vision foundation models on medical imaging, and their great potential to achieve desired performance through fine-turning and eventually address the challenges associated with accessing large and diverse medical datasets in support of clinical diagnostics.

摘要

医学图像分析在临床诊断中发挥着重要作用。在本文中，我们研究了最近的医学图像分割模型（SAM），并报告了在九个医学图像分割基准上的定量和定性零样本分割结果，这些基准涵盖了各种成像模态，如光学相干断层扫描（OCT）、磁共振成像（MRI）和计算机断层扫描（CT），以及不同的应用领域，包括皮肤病学、眼科和放射学。这些基准在模型开发中具有代表性且常用。我们的实验结果表明，虽然SAM在通用领域的图像上表现出卓越的分割性能，但其零样本分割能力在分布外图像（如医学图像）上仍然受限。此外，SAM在不同的未见医学领域中表现出不一致的零样本分割性能。对于某些结构化目标，如血管，SAM的零样本分割完全失败。相比之下，用少量数据对其进行简单微调可以显著提高分割质量，这表明使用微调后的SAM实现精确医学图像分割以进行精准诊断具有巨大潜力和可行性。我们的研究表明通用视觉基础模型在医学成像方面的通用性，以及通过微调实现理想性能并最终应对与获取大量多样的医学数据集以支持临床诊断相关挑战的巨大潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0913/10252742/336527d821f0/diagnostics-13-01947-g001.jpg

相似文献

Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation.用于医学成像的通用视觉基础模型：以零样本医学分割中的分割一切模型为例

Diagnostics (Basel). 2023 Jun 2;13(11):1947. doi: 10.3390/diagnostics13111947.

Segment anything model for medical image analysis: An experimental study.用于医学图像分析的分割模型：一项实验研究。

Med Image Anal. 2023 Oct;89:102918. doi: 10.1016/j.media.2023.102918. Epub 2023 Aug 2.

MA-SAM: Modality-agnostic SAM adaptation for 3D medical image segmentation.MA-SAM：用于 3D 医学图像分割的模态无关 SAM 适配。

Med Image Anal. 2024 Dec;98:103310. doi: 10.1016/j.media.2024.103310. Epub 2024 Aug 22.

Enhancing Medical Imaging Segmentation with GB-SAM: A Novel Approach to Tissue Segmentation Using Granular Box Prompts.利用GB-SAM增强医学影像分割：一种使用粒度框提示进行组织分割的新方法。

Cancers (Basel). 2024 Jun 28;16(13):2391. doi: 10.3390/cancers16132391.

Segment anything model for medical images?用于医学图像的图像分割模型？

Med Image Anal. 2024 Feb;92:103061. doi: 10.1016/j.media.2023.103061. Epub 2023 Dec 7.

Improving Existing Segmentators Performance with Zero-Shot Segmentators.利用零样本分割器提高现有分割器的性能

Entropy (Basel). 2023 Oct 30;25(11):1502. doi: 10.3390/e25111502.

Enhancing Agricultural Image Segmentation with an Agricultural Segment Anything Model Adapter.使用农业分割一切模型适配器增强农业图像分割

Sensors (Basel). 2023 Sep 14;23(18):7884. doi: 10.3390/s23187884.

FNPC-SAM: Uncertainty-Guided False Negative/Positive Control for SAM on Noisy Medical Images.FNPC-SAM：用于有噪声医学图像上的SAM的不确定性引导的假阴性/阳性控制

Proc SPIE Int Soc Opt Eng. 2024 Feb;12926. doi: 10.1117/12.3006867. Epub 2024 Apr 2.

AGSAM: Agent-Guided Segment Anything Model for Automatic Segmentation in Few-Shot Scenarios.AGSAM：用于少样本场景下自动分割的智能体引导的“分割一切”模型

Bioengineering (Basel). 2024 Apr 30;11(5):447. doi: 10.3390/bioengineering11050447.

CellSAM: A Foundation Model for Cell Segmentation.CellSAM：一种用于细胞分割的基础模型。

bioRxiv. 2025 Feb 16:2023.11.17.567630. doi: 10.1101/2023.11.17.567630.

引用本文的文献

The generative revolution: AI foundation models in geospatial health-applications, challenges and future research.生成式革命：地理空间健康应用中的人工智能基础模型、挑战与未来研究

Int J Health Geogr. 2025 Apr 2;24(1):6. doi: 10.1186/s12942-025-00391-0.

Artificial intelligence in medical imaging: From task-specific models to large-scale foundation models.医学影像中的人工智能：从特定任务模型到大规模基础模型。

Chin Med J (Engl). 2025 Mar 20;138(6):651-663. doi: 10.1097/CM9.0000000000003489. Epub 2025 Feb 26.

MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification Models.MiMICRI：迈向心血管图像分类模型以领域为中心的反事实解释

FACCT 24 (2024). 2024 Jun;2024:1861-1874. doi: 10.1145/3630106.3659011. Epub 2024 Jun 5.

Improved Generalizability in Medical Computer Vision: Hyperbolic Deep Learning in Multi-Modality Neuroimaging.医学计算机视觉中泛化能力的提升：多模态神经成像中的双曲深度学习

J Imaging. 2024 Dec 12;10(12):319. doi: 10.3390/jimaging10120319.

Multi-domain improves classification in out-of-distribution and data-limited scenarios for medical image analysis.多领域模型提高了医学图像分析中分布外和数据有限场景下的分类性能。

Sci Rep. 2024 Oct 18;14(1):24412. doi: 10.1038/s41598-024-73561-y.

Reducing Training Data Using Pre-Trained Foundation Models: A Case Study on Traffic Sign Segmentation Using the Segment Anything Model.使用预训练基础模型减少训练数据：以使用“分割一切模型”进行交通标志分割为例

J Imaging. 2024 Sep 7;10(9):220. doi: 10.3390/jimaging10090220.

MA-SAM: Modality-agnostic SAM adaptation for 3D medical image segmentation.MA-SAM：用于 3D 医学图像分割的模态无关 SAM 适配。

Med Image Anal. 2024 Dec;98:103310. doi: 10.1016/j.media.2024.103310. Epub 2024 Aug 22.

An efficient segment anything model for the segmentation of medical images.一种用于医学图像分割的高效分割一切模型。

Sci Rep. 2024 Aug 21;14(1):19425. doi: 10.1038/s41598-024-70288-8.

The impact of data augmentation and transfer learning on the performance of deep learning models for the segmentation of the hip on 3D magnetic resonance images.数据增强和迁移学习对基于3D磁共振图像的髋关节分割深度学习模型性能的影响。

Inform Med Unlocked. 2024;45. doi: 10.1016/j.imu.2023.101444. Epub 2024 Jan 6.

Enhancing Meibography Image Analysis Through Artificial Intelligence-Driven Quantification and Standardization for Dry Eye Research.通过人工智能驱动的量化和标准化增强干眼病研究的眼表图像分析。

Transl Vis Sci Technol. 2024 Jun 3;13(6):16. doi: 10.1167/tvst.13.6.16.

本文引用的文献

Large AI Models in Health Informatics: Applications, Challenges, and the Future.大语言模型在健康信息学中的应用、挑战与未来

IEEE J Biomed Health Inform. 2023 Dec;27(12):6074-6087. doi: 10.1109/JBHI.2023.3316750. Epub 2023 Dec 5.

assessment of inflammatory bowel disease in rats with ultrahigh-resolution colonoscopic OCT.用超高分辨率结肠镜光学相干断层扫描评估大鼠炎症性肠病

Biomed Opt Express. 2022 Mar 15;13(4):2091-2102. doi: 10.1364/BOE.453396. eCollection 2022 Apr 1.

DoFE: Domain-Oriented Feature Embedding for Generalizable Fundus Image Segmentation on Unseen Datasets.DoFE：面向领域的特征嵌入在未见数据集上的通用眼底图像分割。

IEEE Trans Med Imaging. 2020 Dec;39(12):4237-4248. doi: 10.1109/TMI.2020.3015224. Epub 2020 Nov 30.

UNet++: A Nested U-Net Architecture for Medical Image Segmentation.U-Net++：一种用于医学图像分割的嵌套U-Net架构。

Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. doi: 10.1007/978-3-030-00889-5_1. Epub 2018 Sep 20.

REFUGE Challenge: A unified framework for evaluating automated methods for glaucoma assessment from fundus photographs.REFUGE 挑战赛：从眼底照片评估青光眼评估自动化方法的统一框架。

Med Image Anal. 2020 Jan;59:101570. doi: 10.1016/j.media.2019.101570. Epub 2019 Oct 8.

The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions.HAM10000 数据集，一个大型的常见色素性皮肤病变多源皮肤镜图像集合。

Sci Data. 2018 Aug 14;5:180161. doi: 10.1038/sdata.2018.161.

Broadband rotary joint for high-speed ultrahigh-resolution endoscopic OCT imaging at 800 nm.用于800纳米高速超高分辨率内窥式光学相干断层扫描成像的宽带旋转接头。

Opt Lett. 2017 Dec 1;42(23):4978-4981. doi: 10.1364/OL.42.004978.

Lung segmentation in chest radiographs using anatomical atlases with nonrigid registration.基于解剖图谱的非刚性配准在胸部 X 光片中的肺分割。

IEEE Trans Med Imaging. 2014 Feb;33(2):577-90. doi: 10.1109/TMI.2013.2290491. Epub 2013 Nov 13.

Automatic tuberculosis screening using chest radiographs.利用胸部 X 光片进行自动结核病筛查。

IEEE Trans Med Imaging. 2014 Feb;33(2):233-45. doi: 10.1109/TMI.2013.2284099. Epub 2013 Oct 1.

An ensemble classification-based approach applied to retinal blood vessel segmentation.基于集成分类的方法在视网膜血管分割中的应用。

IEEE Trans Biomed Eng. 2012 Sep;59(9):2538-48. doi: 10.1109/TBME.2012.2205687. Epub 2012 Jun 22.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于医学成像的通用视觉基础模型：以零样本医学分割中的分割一切模型为例

Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献