• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于文本提示的医学图像大词汇量分割

Large-vocabulary segmentation for medical images with text prompts.

作者信息

Zhao Ziheng, Zhang Yao, Wu Chaoyi, Zhang Xiaoman, Zhou Xiao, Zhang Ya, Wang Yanfeng, Xie Weidi

机构信息

Shanghai Jiao Tong University, Shanghai, China.

Shanghai Artificial Intelligence Laboratory, Shanghai, China.

出版信息

NPJ Digit Med. 2025 Sep 2;8(1):566. doi: 10.1038/s41746-025-01964-w.

DOI:10.1038/s41746-025-01964-w
PMID:40897901
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12405521/
Abstract

This paper aims to build a model that can Segment Anything in 3D medical images, driven by medical terminologies as Text prompts, termed as SAT. Our main contributions are three-fold: (i) We construct the first multimodal knowledge tree on human anatomy, including 6502 anatomical terminologies; Then, we build the largest and most comprehensive segmentation dataset for training, collecting over 22K 3D scans from 72 datasets, across 497 classes, with careful standardization on both image and label space; (ii) We propose to inject medical knowledge into a text encoder via contrastive learning and formulate a large-vocabulary segmentation model that can be prompted by medical terminologies in text form. (iii) We train SAT-Nano (110M parameters) and SAT-Pro (447M parameters). SAT-Pro achieves comparable performance to 72 nnU-Nets-the strongest specialist models trained on each dataset (over 2.2B parameters combined)-over 497 categories. Compared with the interactive approach MedSAM, SAT-Pro consistently outperforms across all 7 human body regions with +7.1% average Dice Similarity Coefficient (DSC) improvement, while showing enhanced scalability and robustness. On 2 external (cross-center) datasets, SAT-Pro achieves higher performance than all baselines (+3.7% average DSC), demonstrating superior generalization ability.

摘要

本文旨在构建一个模型,该模型能够在医学术语作为文本提示的驱动下,对3D医学图像中的任何物体进行分割,称为SAT。我们的主要贡献有三个方面:(i)我们构建了第一个关于人体解剖学的多模态知识树,包括6502个解剖学术语;然后,我们构建了用于训练的最大、最全面的分割数据集,从72个数据集中收集了超过22K的3D扫描数据,涵盖497个类别,并对图像和标签空间进行了仔细的标准化;(ii)我们建议通过对比学习将医学知识注入文本编码器,并制定一个可以由文本形式的医学术语提示的大词汇量分割模型。(iii)我们训练了SAT-Nano(1.1亿参数)和SAT-Pro(4.47亿参数)。SAT-Pro在497个类别上的性能与72个nnU-Net相当,后者是在每个数据集上训练的最强的专业模型(总共超过22亿参数)。与交互式方法MedSAM相比,SAT-Pro在所有7个人体区域上始终表现更优,平均骰子相似度系数(DSC)提高了7.1%,同时显示出更强的可扩展性和鲁棒性。在2个外部(跨中心)数据集上,SAT-Pro的性能高于所有基线(平均DSC提高3.7%),证明了其卓越的泛化能力。

相似文献

1
Large-vocabulary segmentation for medical images with text prompts.基于文本提示的医学图像大词汇量分割
NPJ Digit Med. 2025 Sep 2;8(1):566. doi: 10.1038/s41746-025-01964-w.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Sexual Harassment and Prevention Training性骚扰与预防培训
4
A segment anything model-guided and match-based semi-supervised segmentation framework for medical imaging.一种用于医学成像的基于段式分割模型引导和匹配的半监督分割框架。
Med Phys. 2025 Mar 29. doi: 10.1002/mp.17785.
5
Point-cloud segmentation with in-silico data augmentation for prostate cancer treatment.用于前列腺癌治疗的基于计算机模拟数据增强的点云分割
Med Phys. 2025 Apr 3. doi: 10.1002/mp.17815.
6
Influence of early through late fusion on pancreas segmentation from imperfectly registered multimodal magnetic resonance imaging.早期至晚期融合对来自配准不完善的多模态磁共振成像的胰腺分割的影响。
J Med Imaging (Bellingham). 2025 Mar;12(2):024008. doi: 10.1117/1.JMI.12.2.024008. Epub 2025 Apr 26.
7
Large-scale convolutional neural network for clinical target and multi-organ segmentation in gynecologic brachytherapy via multi-stage learning.基于多阶段学习的大规模卷积神经网络用于妇科近距离放疗中的临床靶区和多器官分割
Med Phys. 2025 Aug;52(8):e18067. doi: 10.1002/mp.18067.
8
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
9
ES-UNet: efficient 3D medical image segmentation with enhanced skip connections in 3D UNet.ES-UNet:通过在3D UNet中增强跳跃连接实现高效的3D医学图像分割
BMC Med Imaging. 2025 Aug 13;25(1):327. doi: 10.1186/s12880-025-01857-0.
10
Neural network dose prediction for cervical brachytherapy: Overcoming data scarcity for applicator-specific models.神经网络在宫颈癌近距离治疗中的剂量预测:克服施源器特异性模型的数据匮乏问题。
Med Phys. 2024 Jul;51(7):4591-4606. doi: 10.1002/mp.17230. Epub 2024 May 30.

本文引用的文献

1
UniBrain: Universal Brain MRI diagnosis with hierarchical knowledge-enhanced pre-training.统一大脑:基于分层知识增强预训练的通用脑部磁共振成像诊断
Comput Med Imaging Graph. 2025 Jun;122:102516. doi: 10.1016/j.compmedimag.2025.102516. Epub 2025 Mar 7.
2
MERIT: Multi-view evidential learning for reliable and interpretable liver fibrosis staging.MERIT:用于可靠且可解释的肝纤维化分期的多视图证据学习
Med Image Anal. 2025 May;102:103507. doi: 10.1016/j.media.2025.103507. Epub 2025 Feb 22.
3
SegRap2023: A benchmark of organs-at-risk and gross tumor volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma.
SegRap2023:鼻咽癌放疗计划中危及器官和大体肿瘤体积分割的基准
Med Image Anal. 2025 Apr;101:103447. doi: 10.1016/j.media.2024.103447. Epub 2025 Jan 2.
4
Large-scale long-tailed disease diagnosis on radiology images.大规模长尾疾病在放射影像中的诊断。
Nat Commun. 2024 Nov 22;15(1):10147. doi: 10.1038/s41467-024-54424-6.
5
A foundation model for joint segmentation, detection and recognition of biomedical objects across nine modalities.一种用于跨九种模态对生物医学对象进行联合分割、检测和识别的基础模型。
Nat Methods. 2025 Jan;22(1):166-176. doi: 10.1038/s41592-024-02499-w. Epub 2024 Nov 18.
6
Unleashing the strengths of unlabelled data in deep learning-assisted pan-cancer abdominal organ quantification: the FLARE22 challenge.在深度学习辅助的泛癌腹部器官定量分析中释放无标签数据的优势:FLARE22 挑战赛。
Lancet Digit Health. 2024 Nov;6(11):e815-e826. doi: 10.1016/S2589-7500(24)00154-7.
7
TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers.TransUNet:通过Transformer 的视角重新思考医学图像分割中的 U-Net 架构设计。
Med Image Anal. 2024 Oct;97:103280. doi: 10.1016/j.media.2024.103280. Epub 2024 Jul 22.
8
Noninvasive assessment of organ-specific and shared pathways in multi-organ fibrosis using T1 mapping.使用T1映射对多器官纤维化中器官特异性和共享通路进行无创评估。
Nat Med. 2024 Jun;30(6):1749-1760. doi: 10.1038/s41591-024-03010-w. Epub 2024 May 28.
9
Metrics reloaded: recommendations for image analysis validation.重新加载指标:图像分析验证的建议。
Nat Methods. 2024 Feb;21(2):195-212. doi: 10.1038/s41592-023-02151-z. Epub 2024 Feb 12.
10
Segment anything in medical images.在医学图像中分割任何内容。
Nat Commun. 2024 Jan 22;15(1):654. doi: 10.1038/s41467-024-44824-z.