• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

数字病理学中分布外检测的预测不确定性估计。

Predictive uncertainty estimation for out-of-distribution detection in digital pathology.

机构信息

Department of Pathology, Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, The Netherlands.

Inify Laboratories AB, Stockholm, Sweden.

出版信息

Med Image Anal. 2023 Jan;83:102655. doi: 10.1016/j.media.2022.102655. Epub 2022 Oct 17.

DOI:10.1016/j.media.2022.102655
PMID:36306568
Abstract

Machine learning model deployment in clinical practice demands real-time risk assessment to identify situations in which the model is uncertain. Once deployed, models should be accurate for classes seen during training while providing informative estimates of uncertainty to flag abnormalities and unseen classes for further analysis. Although recent developments in uncertainty estimation have resulted in an increasing number of methods, a rigorous empirical evaluation of their performance on large-scale digital pathology datasets is lacking. This work provides a benchmark for evaluating prevalent methods on multiple datasets by comparing the uncertainty estimates on both in-distribution and realistic near and far out-of-distribution (OOD) data on a whole-slide level. To this end, we aggregate uncertainty values from patch-based classifiers to whole-slide level uncertainty scores. We show that results found in classical computer vision benchmarks do not always translate to the medical imaging setting. Specifically, we demonstrate that deep ensembles perform best at detecting far-OOD data but can be outperformed on a more challenging near-OOD detection task by multi-head ensembles trained for optimal ensemble diversity. Furthermore, we demonstrate the harmful impact OOD data can have on the performance of deployed machine learning models. Overall, we show that uncertainty estimates can be used to discriminate in-distribution from OOD data with high AUC scores. Still, model deployment might require careful tuning based on prior knowledge of prospective OOD data.

摘要

机器学习模型在临床实践中的部署需要实时风险评估,以识别模型不确定的情况。一旦部署,模型应该对训练期间看到的类别具有准确性,并提供有关不确定性的信息性估计,以标记异常情况和看不见的类别以进行进一步分析。尽管不确定性估计的最新进展导致了越来越多的方法,但缺乏对这些方法在大规模数字病理学数据集上的性能进行严格的实证评估。这项工作通过在整个幻灯片级别比较分布内和现实的近分布外 (OOD) 和远 OOD 数据的不确定性估计,为在多个数据集上评估流行方法提供了基准。为此,我们将基于补丁的分类器的不确定性值聚合到整个幻灯片级别的不确定性得分中。我们表明,在经典计算机视觉基准中找到的结果并不总是适用于医学成像环境。具体来说,我们证明深度集成在检测远 OOD 数据方面表现最佳,但通过针对最佳集成多样性进行训练的多头集成,可以在更具挑战性的近 OOD 检测任务中超越它。此外,我们展示了 OOD 数据对已部署机器学习模型性能的有害影响。总体而言,我们表明不确定性估计可以用于使用高 AUC 分数区分分布内和 OOD 数据。尽管如此,模型部署可能需要根据预期 OOD 数据的先验知识进行仔细调整。

相似文献

1
Predictive uncertainty estimation for out-of-distribution detection in digital pathology.数字病理学中分布外检测的预测不确定性估计。
Med Image Anal. 2023 Jan;83:102655. doi: 10.1016/j.media.2022.102655. Epub 2022 Oct 17.
2
Diffusion models for out-of-distribution detection in digital pathology.扩散模型在数字病理学中的分布外检测。
Med Image Anal. 2024 Apr;93:103088. doi: 10.1016/j.media.2024.103088. Epub 2024 Jan 13.
3
Bayesian uncertainty estimation for detection of long-tailed and unseen conditions in medical images.用于医学图像中长尾和罕见情况检测的贝叶斯不确定性估计。
J Med Imaging (Bellingham). 2023 Sep;10(5):054501. doi: 10.1117/1.JMI.10.5.054501. Epub 2023 Oct 9.
4
Does your dermatology classifier know what it doesn't know? Detecting the long-tail of unseen conditions.你的皮肤科分类器知道它所不知道的东西吗?检测罕见病症的长尾现象。
Med Image Anal. 2022 Jan;75:102274. doi: 10.1016/j.media.2021.102274. Epub 2021 Oct 20.
5
Differentiation of glioblastoma from solitary brain metastasis using deep ensembles: Empirical estimation of uncertainty for clinical reliability.使用深度学习集成对胶质母细胞瘤与单发脑转移瘤进行鉴别:对临床可靠性的不确定性进行经验估计。
Comput Methods Programs Biomed. 2024 Sep;254:108288. doi: 10.1016/j.cmpb.2024.108288. Epub 2024 Jun 21.
6
Out-of-Distribution Detection Algorithms for Robust Insect Classification.用于稳健昆虫分类的分布外检测算法
Plant Phenomics. 2024 Apr 30;6:0170. doi: 10.34133/plantphenomics.0170. eCollection 2024.
7
Investigation of out-of-distribution detection across various models and training methodologies.跨多种模型和训练方法的分布外检测研究。
Neural Netw. 2024 Jul;175:106288. doi: 10.1016/j.neunet.2024.106288. Epub 2024 Apr 4.
8
Augmentation by Counterfactual Explanation - Fixing an Overconfident Classifier.通过反事实解释进行增强——修正过度自信的分类器
IEEE Winter Conf Appl Comput Vis. 2023 Jan;2023:4709-4719. doi: 10.1109/wacv56688.2023.00470. Epub 2023 Feb 6.
9
Out-of-distribution detection with in-distribution voting using the medical example of chest x-ray classification.使用分布内投票进行分布外检测,以胸部 X 射线分类为例。
Med Phys. 2024 Apr;51(4):2721-2732. doi: 10.1002/mp.16790. Epub 2023 Oct 13.
10
Unsupervised out-of-distribution detection for safer robotically guided retinal microsurgery.无监督的离群分布检测以实现更安全的机器人引导视网膜微创手术。
Int J Comput Assist Radiol Surg. 2023 Jun;18(6):1085-1091. doi: 10.1007/s11548-023-02909-y. Epub 2023 May 3.

引用本文的文献

1
A foundational triage system for improving accuracy in moderate acuity level emergency classifications.一种用于提高中度急症级别紧急情况分类准确性的基础分诊系统。
Commun Med (Lond). 2025 Jul 31;5(1):322. doi: 10.1038/s43856-025-01052-w.
2
PRISM: A Promptable and Robust Interactive Segmentation Model with Visual Prompts.PRISM:一种具有视觉提示的可提示且强大的交互式分割模型。
Med Image Comput Comput Assist Interv. 2024 Oct;15003:389-399. doi: 10.1007/978-3-031-72384-1_37. Epub 2024 Oct 3.
3
Assessing the quality of whole slide images in cytology from nuclei features.
从细胞核特征评估细胞学中全玻片图像的质量。
J Pathol Inform. 2025 Feb 11;17:100420. doi: 10.1016/j.jpi.2025.100420. eCollection 2025 Apr.
4
Equipping computational pathology systems with artifact processing pipelines: a showcase for computation and performance trade-offs.为计算病理学系统配备伪影处理管道:计算和性能权衡的展示。
BMC Med Inform Decis Mak. 2024 Oct 7;24(1):288. doi: 10.1186/s12911-024-02676-z.
5
Toward confident prostate cancer detection using ultrasound: a multi-center study.利用超声进行自信的前列腺癌检测:一项多中心研究。
Int J Comput Assist Radiol Surg. 2024 May;19(5):841-849. doi: 10.1007/s11548-024-03119-w. Epub 2024 May 5.
6
LensePro: label noise-tolerant prototype-based network for improving cancer detection in prostate ultrasound with limited annotations.LensePro:一种基于原型的标签噪声容忍网络,用于在有限的标注下提高前列腺超声中的癌症检测。
Int J Comput Assist Radiol Surg. 2024 Jun;19(6):1121-1128. doi: 10.1007/s11548-024-03104-3. Epub 2024 Apr 10.
7
Enhancing a deep learning model for pulmonary nodule malignancy risk estimation in chest CT with uncertainty estimation.利用不确定性估计增强胸部 CT 中肺结节恶性风险估计的深度学习模型。
Eur Radiol. 2024 Oct;34(10):6639-6651. doi: 10.1007/s00330-024-10714-7. Epub 2024 Mar 27.
8
A new method of modeling the multi-stage decision-making process of CRT using machine learning with uncertainty quantification.一种使用带有不确定性量化的机器学习对心脏再同步治疗(CRT)的多阶段决策过程进行建模的新方法。
ArXiv. 2024 Apr 28:arXiv:2309.08415v4.
9
Benefits of spatial uncertainty aggregation for segmentation in digital pathology.数字病理学中分割的空间不确定性聚合的益处。
J Med Imaging (Bellingham). 2024 Jan;11(1):017501. doi: 10.1117/1.JMI.11.1.017501. Epub 2024 Jan 16.
10
Assessing informative tract segmentation and nTMS for pre-operative planning.评估信息束分割和 nTMS 用于术前规划。
J Neurosci Methods. 2023 Aug 1;396:109933. doi: 10.1016/j.jneumeth.2023.109933. Epub 2023 Jul 29.