Suppr超能文献

用于风险预测建模的亚队列抽样设计的实证评估。

Empirical evaluation of sub-cohort sampling designs for risk prediction modeling.

作者信息

Lee Myeonggyun, Zeleniuch-Jacquotte Anne, Liu Mengling

机构信息

Department of Population Health, NYU School of Medicine, New York, NY, USA.

Department of Environmental Medicine, NYU School of Medicine, New York, NY, USA.

出版信息

J Appl Stat. 2020 Dec 21;48(8):1374-1401. doi: 10.1080/02664763.2020.1861225. eCollection 2021.

Abstract

Sub-cohort sampling designs, such as nested case-control (NCC) and case-cohort (CC) studies, have been widely used to estimate biomarker-disease associations because of their cost effectiveness. These designs have been well studied and shown to maintain relatively high efficiency compared to full-cohort designs, but their performance of building risk prediction models has been less studied. Moreover, sub-cohort sampling designs often use matching (or stratifying) to further control for confounders or to reduce measurement error. Their predictive performance depends on both the design and matching procedures. Based on a dataset from the NYU Women's Health Study (NYUWHS), we performed Monte Carlo simulations to systematically evaluate risk prediction performance under NCC, CC, and full-cohort studies. Our simulations demonstrate that sub-cohort sampling designs can have predictive accuracy (i.e. discrimination and calibration) similar to that of the full-cohort design, but could be sensitive to the matching procedure used. Our results suggest that researchers can have the option of performing NCC and CC studies with huge potential benefits in cost and resources, but need to pay particular attention to the matching procedure when developing a risk prediction model in biomarker studies.

摘要

亚队列抽样设计,如巢式病例对照(NCC)和病例队列(CC)研究,因其成本效益而被广泛用于估计生物标志物与疾病的关联。这些设计已得到充分研究,与全队列设计相比,显示出相对较高的效率,但其构建风险预测模型的性能研究较少。此外,亚队列抽样设计通常使用匹配(或分层)来进一步控制混杂因素或减少测量误差。它们的预测性能取决于设计和匹配程序。基于纽约大学女性健康研究(NYUWHS)的数据集,我们进行了蒙特卡洛模拟,以系统评估NCC、CC和全队列研究下的风险预测性能。我们的模拟表明,亚队列抽样设计可以具有与全队列设计相似的预测准确性(即区分度和校准度),但可能对所使用的匹配程序敏感。我们的结果表明,研究人员可以选择进行NCC和CC研究,在成本和资源方面有巨大潜在益处,但在生物标志物研究中开发风险预测模型时需要特别注意匹配程序。

相似文献

5

本文引用的文献

2
Using simulation studies to evaluate statistical methods.运用模拟研究评估统计方法。
Stat Med. 2019 May 20;38(11):2074-2102. doi: 10.1002/sim.8086. Epub 2019 Jan 16.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验