• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医学图像分割中故障检测方法的比较基准测试:揭示置信度聚合的作用。

Comparative benchmarking of failure detection methods in medical image segmentation: Unveiling the role of confidence aggregation.

作者信息

Zenk Maximilian, Zimmerer David, Isensee Fabian, Traub Jeremias, Norajitra Tobias, Jäger Paul F, Maier-Hein Klaus

机构信息

German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany; Medical Faculty Heidelberg, Heidelberg University, Heidelberg, Germany.

German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany.

出版信息

Med Image Anal. 2025 Apr;101:103392. doi: 10.1016/j.media.2024.103392. Epub 2024 Nov 30.

DOI:10.1016/j.media.2024.103392
PMID:39657400
Abstract

Semantic segmentation is an essential component of medical image analysis research, with recent deep learning algorithms offering out-of-the-box applicability across diverse datasets. Despite these advancements, segmentation failures remain a significant concern for real-world clinical applications, necessitating reliable detection mechanisms. This paper introduces a comprehensive benchmarking framework aimed at evaluating failure detection methodologies within medical image segmentation. Through our analysis, we identify the strengths and limitations of current failure detection metrics, advocating for the risk-coverage analysis as a holistic evaluation approach. Utilizing a collective dataset comprising five public 3D medical image collections, we assess the efficacy of various failure detection strategies under realistic test-time distribution shifts. Our findings highlight the importance of pixel confidence aggregation and we observe superior performance of the pairwise Dice score (Roy et al., 2019) between ensemble predictions, positioning it as a simple and robust baseline for failure detection in medical image segmentation. To promote ongoing research, we make the benchmarking framework available to the community.

摘要

语义分割是医学图像分析研究的重要组成部分,最近的深度学习算法在各种数据集上都具有开箱即用的适用性。尽管有这些进展,但分割失败仍是实际临床应用中的一个重大问题,因此需要可靠的检测机制。本文介绍了一个全面的基准测试框架,旨在评估医学图像分割中的失败检测方法。通过我们的分析,我们确定了当前失败检测指标的优点和局限性,提倡将风险覆盖分析作为一种整体评估方法。利用一个包含五个公共3D医学图像集的集体数据集,我们在实际测试时分布变化的情况下评估了各种失败检测策略的有效性。我们的研究结果突出了像素置信度聚合的重要性,并且我们观察到集成预测之间的成对Dice分数(Roy等人,2019年)具有卓越的性能,将其定位为医学图像分割中失败检测的一种简单而稳健的基线。为了促进正在进行的研究,我们向社区提供了基准测试框架。

相似文献

1
Comparative benchmarking of failure detection methods in medical image segmentation: Unveiling the role of confidence aggregation.医学图像分割中故障检测方法的比较基准测试:揭示置信度聚合的作用。
Med Image Anal. 2025 Apr;101:103392. doi: 10.1016/j.media.2024.103392. Epub 2024 Nov 30.
2
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
3
ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI.R O O D-MRI:基准测试深度学习分割模型对 MRI 中分布外和损坏数据的鲁棒性。
Neuroimage. 2023 Sep;278:120289. doi: 10.1016/j.neuroimage.2023.120289. Epub 2023 Jul 24.
4
Enhanced brain tumor detection and segmentation using densely connected convolutional networks with stacking ensemble learning.使用带有堆叠集成学习的密集连接卷积网络增强脑肿瘤检测与分割
Comput Biol Med. 2025 Mar;186:109703. doi: 10.1016/j.compbiomed.2025.109703. Epub 2025 Jan 24.
5
ECPC-IDS: A benchmark endometrial cancer PET/CT image dataset for evaluation of semantic segmentation and detection of hypermetabolic regions.ECPC-IDS:用于评估语义分割和检测高代谢区域的子宫内膜癌 PET/CT 图像基准数据集。
Comput Biol Med. 2024 Mar;171:108217. doi: 10.1016/j.compbiomed.2024.108217. Epub 2024 Feb 28.
6
DMSPS: Dynamically mixed soft pseudo-label supervision for scribble-supervised medical image segmentation.DMSPS:用于涂鸦监督的医学图像分割的动态混合软伪标签监督。
Med Image Anal. 2024 Oct;97:103274. doi: 10.1016/j.media.2024.103274. Epub 2024 Jul 15.
7
ATOMMIC: An Advanced Toolbox for Multitask Medical Imaging Consistency to facilitate Artificial Intelligence applications from acquisition to analysis in Magnetic Resonance Imaging.ATOMMIC:一个高级的多任务医学成像一致性工具箱,旨在促进磁共振成像从采集到分析的人工智能应用。
Comput Methods Programs Biomed. 2024 Nov;256:108377. doi: 10.1016/j.cmpb.2024.108377. Epub 2024 Aug 22.
8
Benchmarking of deep learning algorithms for 3D instance segmentation of confocal image datasets.深度学习算法在共聚焦图像数据集 3D 实例分割中的基准测试。
PLoS Comput Biol. 2022 Apr 14;18(4):e1009879. doi: 10.1371/journal.pcbi.1009879. eCollection 2022 Apr.
9
Image-level supervision and self-training for transformer-based cross-modality tumor segmentation.基于 Transformer 的跨模态肿瘤分割的图像级监督和自训练。
Med Image Anal. 2024 Oct;97:103287. doi: 10.1016/j.media.2024.103287. Epub 2024 Jul 31.
10
Advancements in automated nuclei segmentation for histopathology using you only look once-driven approaches: A systematic review.使用仅看一次驱动方法的组织病理学自动细胞核分割进展:一项系统综述。
Comput Biol Med. 2025 May;190:110072. doi: 10.1016/j.compbiomed.2025.110072. Epub 2025 Mar 25.

引用本文的文献

1
Guidance on selecting and evaluating AI auto-segmentation systems in clinical radiotherapy: insights from a six-vendor analysis.临床放射治疗中人工智能自动分割系统的选择与评估指南:来自六家供应商分析的见解
Phys Eng Sci Med. 2025 Mar;48(1):301-316. doi: 10.1007/s13246-024-01513-x. Epub 2025 Jan 13.