• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于风险预测建模的亚队列抽样设计的实证评估。

Empirical evaluation of sub-cohort sampling designs for risk prediction modeling.

作者信息

Lee Myeonggyun, Zeleniuch-Jacquotte Anne, Liu Mengling

机构信息

Department of Population Health, NYU School of Medicine, New York, NY, USA.

Department of Environmental Medicine, NYU School of Medicine, New York, NY, USA.

出版信息

J Appl Stat. 2020 Dec 21;48(8):1374-1401. doi: 10.1080/02664763.2020.1861225. eCollection 2021.

DOI:10.1080/02664763.2020.1861225
PMID:35706464
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9042011/
Abstract

Sub-cohort sampling designs, such as nested case-control (NCC) and case-cohort (CC) studies, have been widely used to estimate biomarker-disease associations because of their cost effectiveness. These designs have been well studied and shown to maintain relatively high efficiency compared to full-cohort designs, but their performance of building risk prediction models has been less studied. Moreover, sub-cohort sampling designs often use matching (or stratifying) to further control for confounders or to reduce measurement error. Their predictive performance depends on both the design and matching procedures. Based on a dataset from the NYU Women's Health Study (NYUWHS), we performed Monte Carlo simulations to systematically evaluate risk prediction performance under NCC, CC, and full-cohort studies. Our simulations demonstrate that sub-cohort sampling designs can have predictive accuracy (i.e. discrimination and calibration) similar to that of the full-cohort design, but could be sensitive to the matching procedure used. Our results suggest that researchers can have the option of performing NCC and CC studies with huge potential benefits in cost and resources, but need to pay particular attention to the matching procedure when developing a risk prediction model in biomarker studies.

摘要

亚队列抽样设计,如巢式病例对照(NCC)和病例队列(CC)研究,因其成本效益而被广泛用于估计生物标志物与疾病的关联。这些设计已得到充分研究,与全队列设计相比,显示出相对较高的效率,但其构建风险预测模型的性能研究较少。此外,亚队列抽样设计通常使用匹配(或分层)来进一步控制混杂因素或减少测量误差。它们的预测性能取决于设计和匹配程序。基于纽约大学女性健康研究(NYUWHS)的数据集,我们进行了蒙特卡洛模拟,以系统评估NCC、CC和全队列研究下的风险预测性能。我们的模拟表明,亚队列抽样设计可以具有与全队列设计相似的预测准确性(即区分度和校准度),但可能对所使用的匹配程序敏感。我们的结果表明,研究人员可以选择进行NCC和CC研究,在成本和资源方面有巨大潜在益处,但在生物标志物研究中开发风险预测模型时需要特别注意匹配程序。

相似文献

1
Empirical evaluation of sub-cohort sampling designs for risk prediction modeling.用于风险预测建模的亚队列抽样设计的实证评估。
J Appl Stat. 2020 Dec 21;48(8):1374-1401. doi: 10.1080/02664763.2020.1861225. eCollection 2021.
2
Weighted metrics are required when evaluating the performance of prediction models in nested case-control studies.在嵌套病例对照研究中评估预测模型的性能时,需要使用加权指标。
BMC Med Res Methodol. 2024 May 17;24(1):115. doi: 10.1186/s12874-024-02213-6.
3
Goodness-of-fit two-phase sampling designs for time-to-event outcomes: a simulation study based on New York University Women's Health Study for breast cancer.基于纽约大学女性健康研究的乳腺癌生存时间两阶段抽样设计的拟合优度:一项模拟研究。
BMC Med Res Methodol. 2023 May 19;23(1):119. doi: 10.1186/s12874-023-01950-4.
4
Risk prediction measures for case-cohort and nested case-control designs: an application to cardiovascular disease.病例-队列和巢式病例对照设计的风险预测指标:在心血管疾病中的应用。
Am J Epidemiol. 2012 Apr 1;175(7):715-24. doi: 10.1093/aje/kwr374. Epub 2012 Mar 6.
5
Evaluating longitudinal markers under two-phase study designs.评估两阶段研究设计下的纵向标志物。
Biostatistics. 2019 Jul 1;20(3):485-498. doi: 10.1093/biostatistics/kxy013.
6
Comparison of cohort and nested case-control designs for estimating the effect of time-varying drug exposure on the risk of adverse event in the presence of ties.在存在关联的情况下,比较队列研究和巢式病例对照研究设计,以估计时变药物暴露对不良事件风险的影响。
Biom J. 2023 Aug;65(6):e2100384. doi: 10.1002/bimj.202100384. Epub 2023 Feb 27.
7
Nonparametric Maximum Likelihood Estimators of Time-Dependent Accuracy Measures for Survival Outcome Under Two-Stage Sampling Designs.两阶段抽样设计下生存结局随时间变化的准确性测量的非参数最大似然估计量
J Am Stat Assoc. 2018;113(522):882-892. doi: 10.1080/01621459.2017.1295866. Epub 2017 Feb 28.
8
Biomarker evaluation under imperfect nested case-control design.在不完全巢式病例对照设计下的生物标志物评估。
Stat Med. 2021 Aug 15;40(18):4035-4052. doi: 10.1002/sim.9012. Epub 2021 Apr 29.
9
Sampling strategies to evaluate the prognostic value of a new biomarker on a time-to-event end-point.采样策略评估时间事件终点新生物标志物的预后价值。
BMC Med Res Methodol. 2021 Apr 30;21(1):93. doi: 10.1186/s12874-021-01283-0.
10
Comparisons of risk prediction methods using nested case-control data.使用巢式病例对照数据对风险预测方法进行比较。
Stat Med. 2017 Feb 10;36(3):455-465. doi: 10.1002/sim.7143. Epub 2016 Oct 13.

引用本文的文献

1
Weighted metrics are required when evaluating the performance of prediction models in nested case-control studies.在嵌套病例对照研究中评估预测模型的性能时,需要使用加权指标。
BMC Med Res Methodol. 2024 May 17;24(1):115. doi: 10.1186/s12874-024-02213-6.
2
Goodness-of-fit two-phase sampling designs for time-to-event outcomes: a simulation study based on New York University Women's Health Study for breast cancer.基于纽约大学女性健康研究的乳腺癌生存时间两阶段抽样设计的拟合优度:一项模拟研究。
BMC Med Res Methodol. 2023 May 19;23(1):119. doi: 10.1186/s12874-023-01950-4.
3
Editorial to special issue Frontiers of Data Analysis.《数据分析前沿》特刊社论
J Appl Stat. 2021 May 21;48(8):1349-1351. doi: 10.1080/02664763.2021.1922853. eCollection 2021.

本文引用的文献

1
Breast cancer risk prediction in women aged 35-50 years: impact of including sex hormone concentrations in the Gail model.35-50 岁女性乳腺癌风险预测:加性模型中纳入性激素浓度的影响。
Breast Cancer Res. 2019 Mar 19;21(1):42. doi: 10.1186/s13058-019-1126-z.
2
Using simulation studies to evaluate statistical methods.运用模拟研究评估统计方法。
Stat Med. 2019 May 20;38(11):2074-2102. doi: 10.1002/sim.8086. Epub 2019 Jan 16.
3
Circulating anti-Müllerian hormone and breast cancer risk: A study in ten prospective cohorts.循环抗苗勒管激素与乳腺癌风险:十项前瞻性队列研究。
Int J Cancer. 2018 Jun 1;142(11):2215-2226. doi: 10.1002/ijc.31249. Epub 2018 Feb 8.
4
Review and evaluation of performance measures for survival prediction models in external validation settings.外部验证环境下生存预测模型性能指标的回顾与评估
BMC Med Res Methodol. 2017 Apr 18;17(1):60. doi: 10.1186/s12874-017-0336-2.
5
Comparisons of risk prediction methods using nested case-control data.使用巢式病例对照数据对风险预测方法进行比较。
Stat Med. 2017 Feb 10;36(3):455-465. doi: 10.1002/sim.7143. Epub 2016 Oct 13.
6
Novel Biomarkers to Improve the Prediction of Cardiovascular Event Risk in Type 2 Diabetes Mellitus.改善2型糖尿病心血管事件风险预测的新型生物标志物
J Am Heart Assoc. 2016 May 31;5(6):e003048. doi: 10.1161/JAHA.115.003048.
7
A Prospective Evaluation of Early Detection Biomarkers for Ovarian Cancer in the European EPIC Cohort.欧洲EPIC队列中卵巢癌早期检测生物标志物的前瞻性评估。
Clin Cancer Res. 2016 Sep 15;22(18):4664-75. doi: 10.1158/1078-0432.CCR-16-0316. Epub 2016 Apr 8.
8
Serum Taurine and Stroke Risk in Women: A Prospective, Nested Case-Control Study.血清牛磺酸与女性中风风险:一项前瞻性巢式病例对照研究。
PLoS One. 2016 Feb 11;11(2):e0149348. doi: 10.1371/journal.pone.0149348. eCollection 2016.
9
A new comparison of nested case-control and case-cohort designs and methods.嵌套病例对照研究与病例队列研究设计及方法的新比较。
Eur J Epidemiol. 2015 Mar;30(3):197-207. doi: 10.1007/s10654-014-9974-4. Epub 2014 Dec 2.
10
A risk prediction model for the assessment and triage of women with hypertensive disorders of pregnancy in low-resourced settings: the miniPIERS (Pre-eclampsia Integrated Estimate of RiSk) multi-country prospective cohort study.一种用于在资源匮乏地区评估和分诊妊娠高血压疾病女性的风险预测模型:小型PIERS(先兆子痫综合风险评估)多国前瞻性队列研究。
PLoS Med. 2014 Jan;11(1):e1001589. doi: 10.1371/journal.pmed.1001589. Epub 2014 Jan 21.