Robust Estimation of Loss-Based Measures of Model Performance under Covariate Shift.

Authors

Morrison Samantha, Gatsonis Constantine, Dahabreh Issa J, Li Bing, Steingrimsson Jon A

Affiliations

Department of Biostatistics, Brown University, Providence, United States.

CAUSALab, Harvard T.H. Chan School of Public Health, Boston, United States, Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, United States, Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, United States.

Publication information

Can J Stat. 2024 Dec;52(4). doi: 10.1002/cjs.11815. Epub 2024 Jul 12.

DOI:10.1002/cjs.11815
PMID:39678170
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11636945/
Abstract

We present methods for estimating loss-based measures of the performance of a prediction model in a target population that differs from the source population in which the model was developed, in settings where outcome and covariate data are available from the source population but only covariate data are available on a simple random sample from the target population. Prior work adjusting for differences between the two populations has used various weighting estimators with inverse odds or density ratio weights. Here, we develop more robust estimators for the target population risk (expected loss) that can be used with data-adaptive (e.g., machine learning-based) estimation of nuisance parameters. We examine the large-sample properties of the estimators and evaluate finite sample performance in simulations. Last, we apply the methods to data from lung cancer screening using nationally representative data from the National Health and Nutrition Examination Survey (NHANES) and extend our methods to account for the complex survey design of the NHANES.
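To make the estimands in the abstract concrete, the following is a minimal sketch, not the authors' implementation, of three standard ways to estimate the target population risk: inverse odds weighting, an outcome-model estimator, and a doubly robust combination of the two. It assumes a single discrete covariate, so both nuisance functions (the inverse odds of being in the source sample and the conditional mean loss in the source sample) can be estimated by simple cell frequencies. It also assumes that the outcome is independent of population membership given the covariate. All function names, the toy data, and the squared-error loss are illustrative assumptions; the abstract does not specify them, and the sketch ignores survey weights.

```python
from collections import Counter, defaultdict


def squared_error(y, pred):
    return (y - pred) ** 2


def cell_nuisances(source, target_x, predict, loss=squared_error):
    """Cell-frequency nuisance estimates; assumes a discrete covariate."""
    n_src = Counter(x for x, _ in source)
    n_tgt = Counter(target_x)
    if set(n_tgt) - set(n_src):
        raise ValueError("positivity violated: target value unseen in source")
    loss_sum = defaultdict(float)
    for x, y in source:
        loss_sum[x] += loss(y, predict(x))
    # Inverse odds of source membership: Pr(S=0 | x) / Pr(S=1 | x)
    odds = {x: n_tgt[x] / n_src[x] for x in n_src}
    # Conditional mean loss in the source sample: E[loss | x, S=1]
    mean_loss = {x: loss_sum[x] / n_src[x] for x in n_src}
    return (lambda x: odds[x]), (lambda x: mean_loss[x])


def target_risk(source, target_x, predict, w, g, loss=squared_error):
    """Three estimators of the expected loss in the target population.

    source   : list of (x, y) pairs from the source sample (S=1)
    target_x : list of covariate values from the target sample (S=0)
    w, g     : estimated inverse odds weights and conditional mean loss
    """
    n0 = len(target_x)
    losses = [(x, loss(y, predict(x))) for x, y in source]
    model_part = sum(g(x) for x in target_x)
    return {
        "weighting": sum(w(x) * l for x, l in losses) / n0,
        "outcome_model": model_part / n0,
        # Weighted source residuals correct the outcome-model average
        "doubly_robust": (sum(w(x) * (l - g(x)) for x, l in losses)
                          + model_part) / n0,
    }


# Hypothetical toy data: the target sample over-represents x = 1
source = [(0, 1.0), (0, 3.0), (0, 2.0), (0, 2.0), (1, 6.0), (1, 4.0)]
target_x = [0, 1, 1, 1]
predict = lambda x: 2.0 if x == 0 else 4.0

w_hat, g_hat = cell_nuisances(source, target_x, predict)
estimates = target_risk(source, target_x, predict, w_hat, g_hat)

# Deliberately misspecify one nuisance function at a time
wrong_g = target_risk(source, target_x, predict, w_hat, lambda x: 0.0)
wrong_w = target_risk(source, target_x, predict, lambda x: 1.0, g_hat)
```

In this toy example all three estimators in `estimates` equal 1.625, whereas the unadjusted average loss in the source sample is 1.0. The `doubly_robust` entry stays at 1.625 in both `wrong_g` and `wrong_w`, which illustrates why combining the two nuisance functions is more robust than relying on either alone. With continuous or high-dimensional covariates the cell-frequency step would be replaced by fitted models, which is the setting the abstract targets.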


