• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估存在缺失数据时现有模型的预测模型性能。

Evaluation of predictive model performance of an existing model in the presence of missing data.

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan, USA.

出版信息

Stat Med. 2021 Jul 10;40(15):3477-3498. doi: 10.1002/sim.8978. Epub 2021 Apr 11.

DOI:10.1002/sim.8978
PMID:33843085
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8985431/
Abstract

In medical research, the Brier score (BS) and the area under the receiver operating characteristic (ROC) curves (AUC) are two common metrics used to evaluate prediction models of a binary outcome, such as using biomarkers to predict the risk of developing a disease in the future. The assessment of an existing prediction models using data with missing covariate values is challenging. In this article, we propose inverse probability weighted (IPW) and augmented inverse probability weighted (AIPW) estimates of AUC and BS to handle the missing data. An alternative approach uses multiple imputation (MI), which requires a model for the distribution of the missing variable. We evaluated the performance of IPW and AIPW in comparison with MI in simulation studies under missing completely at random, missing at random, and missing not at random scenarios. When there are missing observations in the data, MI and IPW can be used to obtain unbiased estimates of BS and AUC if the imputation model for the missing variable or the model for the missingness is correctly specified. MI is more efficient than IPW. Our simulation results suggest that AIPW can be more efficient than IPW, and also achieves double robustness from miss-specification of either the missingness model or the imputation model. The outcome variable should be included in the model for the missing variable under all scenarios, while it only needs to be included in missingness model if the missingness depends on the outcome. We illustrate these methods using an example from prostate cancer.

摘要

在医学研究中,Brier 评分(BS)和受试者工作特征(ROC)曲线下面积(AUC)是两种常用的评估二分类结局预测模型的指标,例如使用生物标志物来预测未来患某种疾病的风险。评估使用具有缺失协变量值的数据的现有预测模型具有挑战性。在本文中,我们提出了使用Inverse Probability Weighted (IPW) 和 Augmented Inverse Probability Weighted (AIPW) 来处理缺失数据的 AUC 和 BS 估计值。另一种方法是使用多重插补(MI),它需要缺失变量分布的模型。我们在完全随机缺失、随机缺失和非随机缺失的情况下进行了模拟研究,比较了 IPW 和 AIPW 与 MI 的性能。当数据中存在缺失观测值时,如果缺失变量的插补模型或缺失模型正确指定,则 MI 和 IPW 可用于获得 BS 和 AUC 的无偏估计值。MI 比 IPW 更有效。我们的模拟结果表明,AIPW 比 IPW 更有效,并且还可以从缺失模型或插补模型的错误指定中实现双重稳健性。在所有情况下,缺失变量的模型都应该包含结局变量,而仅当缺失依赖于结局时,缺失模型才需要包含结局变量。我们使用前列腺癌的一个例子来说明这些方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/5763ca171835/nihms-1790198-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/4b82e276571d/nihms-1790198-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/7796f24d5702/nihms-1790198-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/32ec1a8b36e7/nihms-1790198-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/635d23f33f82/nihms-1790198-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/326caa8d5eff/nihms-1790198-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/54b50ede4380/nihms-1790198-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/aa2fb4c33bc5/nihms-1790198-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/05d9d2ff1bfd/nihms-1790198-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/bef51adc9c1c/nihms-1790198-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/6b1a1d9cffd0/nihms-1790198-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/eb9762ae8573/nihms-1790198-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/69ff9324c335/nihms-1790198-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/5763ca171835/nihms-1790198-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/4b82e276571d/nihms-1790198-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/7796f24d5702/nihms-1790198-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/32ec1a8b36e7/nihms-1790198-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/635d23f33f82/nihms-1790198-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/326caa8d5eff/nihms-1790198-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/54b50ede4380/nihms-1790198-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/aa2fb4c33bc5/nihms-1790198-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/05d9d2ff1bfd/nihms-1790198-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/bef51adc9c1c/nihms-1790198-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/6b1a1d9cffd0/nihms-1790198-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/eb9762ae8573/nihms-1790198-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/69ff9324c335/nihms-1790198-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c67/8985431/5763ca171835/nihms-1790198-f0008.jpg

相似文献

1
Evaluation of predictive model performance of an existing model in the presence of missing data.评估存在缺失数据时现有模型的预测模型性能。
Stat Med. 2021 Jul 10;40(15):3477-3498. doi: 10.1002/sim.8978. Epub 2021 Apr 11.
2
On the use of multiple imputation to address data missing by design as well as unintended missing data in case-cohort studies with a binary endpoint.关于在以二分类结局为研究终点的病例-队列研究中,针对设计缺失和非故意缺失数据,采用多重填补方法进行处理。
BMC Med Res Methodol. 2023 Dec 7;23(1):287. doi: 10.1186/s12874-023-02090-5.
3
Evaluation of multiple imputation approaches for handling missing covariate information in a case-cohort study with a binary outcome.评价在二分类结局病例-对照研究中采用多种插补方法处理协变量缺失信息的效果。
BMC Med Res Methodol. 2022 Apr 3;22(1):87. doi: 10.1186/s12874-021-01495-4.
4
Comparison between inverse-probability weighting and multiple imputation in Cox model with missing failure subtype.缺失失效亚组的 Cox 模型中逆概率加权与多重插补的比较
Stat Methods Med Res. 2024 Feb;33(2):344-356. doi: 10.1177/09622802231226328. Epub 2024 Jan 23.
5
Propensity score analysis with partially observed covariates: How should multiple imputation be used?倾向评分分析与部分观测协变量:应如何使用多重插补?
Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.
6
Review of inverse probability weighting for dealing with missing data.逆概率加权法处理缺失数据的综述。
Stat Methods Med Res. 2013 Jun;22(3):278-95. doi: 10.1177/0962280210395740. Epub 2011 Jan 10.
7
Augmented Inverse Probability Weighting and the Double Robustness Property.增强逆概率加权法与双重稳健性特性。
Med Decis Making. 2022 Feb;42(2):156-167. doi: 10.1177/0272989X211027181. Epub 2021 Jul 6.
8
Missing Data in Marginal Structural Models: A Plasmode Simulation Study Comparing Multiple Imputation and Inverse Probability Weighting.边缘结构模型中的缺失数据:比较多种插补和逆概率加权的 Plasmode 模拟研究。
Med Care. 2019 Mar;57(3):237-243. doi: 10.1097/MLR.0000000000001063.
9
Combining multiple imputation and inverse-probability weighting.结合多重填补法和逆概率加权法。
Biometrics. 2012 Mar;68(1):129-37. doi: 10.1111/j.1541-0420.2011.01666.x. Epub 2011 Nov 3.
10
Improving upon the efficiency of complete case analysis when covariates are MNAR.当协变量为非随机缺失时提高完全病例分析的效率。
Biostatistics. 2014 Oct;15(4):719-30. doi: 10.1093/biostatistics/kxu023. Epub 2014 Jun 6.

引用本文的文献

1
Advances in risk prediction models for cancer-related cognitive impairment.癌症相关认知障碍风险预测模型的进展。
Clin Exp Med. 2025 Mar 6;25(1):74. doi: 10.1007/s10238-025-01590-6.

本文引用的文献

1
Handling missing predictor values when validating and applying a prediction model to new patients.在验证预测模型并将其应用于新患者时处理预测变量的缺失值。
Stat Med. 2020 Nov 10;39(25):3591-3607. doi: 10.1002/sim.8682. Epub 2020 Jul 20.
2
The CAPRA Score at 10 Years: Contemporary Perspectives and Analysis of Supporting Studies.CAPRA 评分 10 年回顾:支持性研究的当代观点与分析。
Eur Urol. 2017 May;71(5):705-709. doi: 10.1016/j.eururo.2016.08.065. Epub 2016 Sep 8.
3
The estimation and use of predictions for the assessment of model performance using large samples with multiply imputed data.
使用多重填补数据的大样本对模型性能评估进行预测的估计与应用。
Biom J. 2015 Jul;57(4):614-32. doi: 10.1002/bimj.201400004. Epub 2015 Jan 29.
4
Doubly robust estimators of causal exposure effects with missing data in the outcome, exposure or a confounder.在结局、暴露或混杂因素中存在缺失数据的情况下,因果暴露效应的双重稳健估计量。
Stat Med. 2012 Dec 30;31(30):4382-400. doi: 10.1002/sim.5643. Epub 2012 Oct 22.
5
Multiple imputation using chained equations: Issues and guidance for practice.使用链式方程进行多重插补:实践中的问题和指导。
Stat Med. 2011 Feb 20;30(4):377-99. doi: 10.1002/sim.4067. Epub 2010 Nov 30.
6
Review of inverse probability weighting for dealing with missing data.逆概率加权法处理缺失数据的综述。
Stat Methods Med Res. 2013 Jun;22(3):278-95. doi: 10.1177/0962280210395740. Epub 2011 Jan 10.
7
Robust estimation of area under ROC curve using auxiliary variables in the presence of missing biomarker values.在存在生物标志物值缺失的情况下使用辅助变量对ROC曲线下面积进行稳健估计。
Biometrics. 2011 Jun;67(2):559-67. doi: 10.1111/j.1541-0420.2010.01487.x. Epub 2010 Sep 3.
8
Missing covariate data in medical research: to impute is better than to ignore.医学研究中缺失的协变量数据:填补优于忽略。
J Clin Epidemiol. 2010 Jul;63(7):721-7. doi: 10.1016/j.jclinepi.2009.12.008. Epub 2010 Mar 24.
9
Assessing the performance of prediction models: a framework for traditional and novel measures.评估预测模型的性能:传统和新型指标的框架。
Epidemiology. 2010 Jan;21(1):128-38. doi: 10.1097/EDE.0b013e3181c30fb2.
10
Dealing with missing predictor values when applying clinical prediction models.应用临床预测模型时处理预测变量缺失值的问题。
Clin Chem. 2009 May;55(5):994-1001. doi: 10.1373/clinchem.2008.115345. Epub 2009 Mar 12.