Unmasking the chameleons: A benchmark for out-of-distribution detection in medical tabular data.

Author information

Azizmalayeri Mohammad, Abu-Hanna Ameen, Cinà Giovanni

Affiliations

Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam UMC, University of Amsterdam, the Netherlands.

Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam UMC, University of Amsterdam, the Netherlands; Institute of Logic, Language and Computation, University of Amsterdam, the Netherlands; Pacmed, Amsterdam, the Netherlands.

Publication information

Int J Med Inform. 2025 Mar;195:105762. doi: 10.1016/j.ijmedinf.2024.105762. Epub 2024 Dec 17.

DOI:10.1016/j.ijmedinf.2024.105762
PMID:39708667
Abstract

BACKGROUND

Machine learning (ML) models often generalize poorly to data that deviates from the training distribution. This raises significant concerns about the reliability of real-world healthcare systems that encounter such inputs, known as out-of-distribution (OOD) data. These concerns can be addressed by detecting OOD inputs in real time. While numerous OOD detection approaches have been proposed in other fields, especially computer vision, it remains unclear whether similar methods are effective on medical tabular data.

OBJECTIVE

To answer this question, we propose an extensive, reproducible benchmark that compares OOD detection methods on medical tabular data across a comprehensive suite of tests.

METHOD

We use four large public medical datasets, including eICU and MIMIC-IV, and consider several kinds of OOD cases within them. For example, we examine OOD data drawn from a dataset that is statistically different from the training set according to the membership model introduced by Debray et al. [1], as well as OOD data obtained by splitting a given dataset on the value of a distinguishing variable. To identify OOD instances, we evaluate 10 density-based methods, which learn the marginal distribution of the data, alongside 17 post-hoc detectors, which are applied on top of prediction models already trained on the data. The prediction models use three architectures: MLP, ResNet, and Transformer.
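The idea behind a density-based detector can be illustrated with a minimal sketch. It assumes a diagonal Gaussian fitted to in-distribution rows, with the negative log-likelihood used as the OOD score. The abstract does not name the 10 density-based methods, so this estimator, the function names, and the toy data are all hypothetical stand-ins rather than the benchmark's implementation.

```python
import math
import random


def fit_diagonal_gaussian(rows):
    """Estimate a per-feature mean and variance from in-distribution rows."""
    n = len(rows)
    dims = len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(dims)]
    # Floor the variance so a constant feature cannot cause division by zero.
    variances = [
        max(sum((r[j] - means[j]) ** 2 for r in rows) / n, 1e-9)
        for j in range(dims)
    ]
    return means, variances


def ood_score(row, means, variances):
    """Negative log-likelihood under the fitted Gaussian; higher means more OOD."""
    return sum(
        0.5 * math.log(2 * math.pi * v) + (x - m) ** 2 / (2 * v)
        for x, m, v in zip(row, means, variances)
    )


# Toy in-distribution data (illustrative only, not from eICU or MIMIC-IV).
random.seed(0)
train = [[random.gauss(0, 1), random.gauss(5, 2)] for _ in range(500)]
means, variances = fit_diagonal_gaussian(train)

typical_row = [0.0, 5.0]   # near the centre of the training data
shifted_row = [6.0, 20.0]  # far from anything seen in training

print(ood_score(typical_row, means, variances))
print(ood_score(shifted_row, means, variances))
```

A row would be flagged as OOD when its score exceeds a threshold chosen on held-out in-distribution data. A post-hoc detector differs in that it derives its score from the outputs or internal features of an already trained prediction model instead of fitting a separate density.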

MAIN RESULTS

In our experiments, when the membership model achieved an AUC of 0.98, indicating a clear distinction between the OOD data and the training set, the OOD detection methods achieved AUC values exceeding 0.95 in distinguishing OOD data. In contrast, in experiments with subtler distribution changes, such as selecting OOD data based on ethnicity or age, many OOD detection methods performed similarly to a random classifier, with AUC values close to 0.5. This suggests a possible correlation between separability, as indicated by the membership model, and OOD detection performance, as indicated by the AUC of the detection model, which warrants future research.
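To make the AUC figures concrete, the sketch below computes AUC as the probability that a randomly chosen OOD sample receives a higher detector score than a randomly chosen in-distribution sample. The score lists are made-up toy values, not results from the benchmark, and the function name is hypothetical.

```python
def auc(in_scores, ood_scores):
    """Fraction of (OOD, in-distribution) pairs ranked correctly; ties count half."""
    wins = 0.0
    for o in ood_scores:
        for i in in_scores:
            if o > i:
                wins += 1.0
            elif o == i:
                wins += 0.5
    return wins / (len(in_scores) * len(ood_scores))


# Toy detector scores (illustrative assumptions only).
in_scores = [0.1, 0.2, 0.3, 0.4]
clear_shift = [0.9, 1.0, 1.1, 1.2]       # every OOD score exceeds every in-distribution score
subtle_shift = [0.05, 0.15, 0.35, 0.45]  # OOD scores interleave with in-distribution scores

print(auc(in_scores, clear_shift))   # 1.0
print(auc(in_scores, subtle_shift))  # 0.5
```

An AUC of 1.0 means the scores separate the two groups perfectly, while 0.5 means the detector ranks OOD and in-distribution samples no better than chance, which is the behaviour reported for the ethnicity and age splits.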

Similar articles

1
Unmasking the chameleons: A benchmark for out-of-distribution detection in medical tabular data.
Int J Med Inform. 2025 Mar;195:105762. doi: 10.1016/j.ijmedinf.2024.105762. Epub 2024 Dec 17.
2
Investigation of out-of-distribution detection across various models and training methodologies.
Neural Netw. 2024 Jul;175:106288. doi: 10.1016/j.neunet.2024.106288. Epub 2024 Apr 4.
3
MOOD 2020: A Public Benchmark for Out-of-Distribution Detection and Localization on Medical Images.
IEEE Trans Med Imaging. 2022 Oct;41(10):2728-2738. doi: 10.1109/TMI.2022.3170077. Epub 2022 Sep 30.
4
Diffusion models for out-of-distribution detection in digital pathology.
Med Image Anal. 2024 Apr;93:103088. doi: 10.1016/j.media.2024.103088. Epub 2024 Jan 13.
5
Post-hoc out-of-distribution detection for cardiac MRI segmentation.
Comput Med Imaging Graph. 2025 Jan;119:102476. doi: 10.1016/j.compmedimag.2024.102476. Epub 2024 Dec 12.
6
Out-of-distribution detection with in-distribution voting using the medical example of chest x-ray classification.
Med Phys. 2024 Apr;51(4):2721-2732. doi: 10.1002/mp.16790. Epub 2023 Oct 13.
7
Robustness to Spurious Correlations Improves Semantic Out-of-Distribution Detection.
Proc AAAI Conf Artif Intell. 2023 Jun 27;37(12):15305-15312. doi: 10.1609/aaai.v37i12.26785.
8
Efficient out-of-distribution detection via layer-adaptive scoring and early stopping.
Front Big Data. 2024 Nov 20;7:1444634. doi: 10.3389/fdata.2024.1444634. eCollection 2024.
9
Predictive uncertainty estimation for out-of-distribution detection in digital pathology.
Med Image Anal. 2023 Jan;83:102655. doi: 10.1016/j.media.2022.102655. Epub 2022 Oct 17.
10
ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI.
Neuroimage. 2023 Sep;278:120289. doi: 10.1016/j.neuroimage.2023.120289. Epub 2023 Jul 24.