基于歧视的生存时间数据多变量预后模型的样本量计算

Discrimination-based sample size calculations for multivariable prognostic models for time-to-event data.

作者信息

Jinks Rachel C, Royston Patrick, Parmar Mahesh K B

机构信息

MRC Clinical Trials Unit at UCL, Aviation House, 125 Kingsway, London, WC2B 6NH, UK.

出版信息

BMC Med Res Methodol. 2015 Oct 12;15:82. doi: 10.1186/s12874-015-0078-y.

DOI:10.1186/s12874-015-0078-y

PMID:26459415

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4603804/

Abstract

BACKGROUND

Prognostic studies of time-to-event data, where researchers aim to develop or validate multivariable prognostic models in order to predict survival, are commonly seen in the medical literature; however, most are performed retrospectively and few consider sample size prior to analysis. Events per variable rules are sometimes cited, but these are based on bias and coverage of confidence intervals for model terms, which are not of primary interest when developing a model to predict outcome. In this paper we aim to develop sample size recommendations for multivariable models of time-to-event data, based on their prognostic ability.

METHODS

We derive formulae for determining the sample size required for multivariable prognostic models in time-to-event data, based on a measure of discrimination, D, developed by Royston and Sauerbrei. These formulae fall into two categories: either based on the significance of the value of D in a new study compared to a previous estimate, or based on the precision of the estimate of D in a new study in terms of confidence interval width. Using simulation we show that they give the desired power and type I error and are not affected by random censoring. Additionally, we conduct a literature review to collate published values of D in different disease areas.

RESULTS

We illustrate our methods using parameters from a published prognostic study in liver cancer. The resulting sample sizes can be large, and we suggest controlling study size by expressing the desired accuracy in the new study as a relative value as well as an absolute value. To improve usability we use the values of D obtained from the literature review to develop an equation to approximately convert the commonly reported Harrell's c-index to D. A flow chart is provided to aid decision making when using these methods.

CONCLUSION

We have developed a suite of sample size calculations based on the prognostic ability of a survival model, rather than the magnitude or significance of model coefficients. We have taken care to develop the practical utility of the calculations and give recommendations for their use in contemporary clinical research.

摘要

背景

在医学文献中，针对事件发生时间数据的预后研究很常见，研究人员旨在开发或验证多变量预后模型以预测生存率；然而，大多数此类研究是回顾性进行的，很少有研究在分析前考虑样本量。有时会引用每个变量的事件规则，但这些规则基于偏差和模型项置信区间的覆盖范围，而在开发预测结果的模型时，这些并非主要关注点。在本文中，我们旨在根据多变量模型对事件发生时间数据的预后能力制定样本量建议。

方法

我们基于Royston和Sauerbrei开发的一种区分度度量D，推导出用于确定事件发生时间数据多变量预后模型所需样本量的公式。这些公式分为两类：一类基于新研究中D值与先前估计值相比的显著性，另一类基于新研究中D估计值在置信区间宽度方面的精度。通过模拟我们表明，它们能提供所需的检验效能和I型错误，且不受随机删失的影响。此外，我们进行了文献综述，以整理不同疾病领域已发表的D值。

结果

我们使用来自一项已发表的肝癌预后研究的参数来说明我们的方法。得出的样本量可能会很大，我们建议通过将新研究中所需的准确度表示为相对值和绝对值来控制研究规模。为提高实用性，我们利用文献综述中获得的D值来建立一个方程，以近似地将常用的Harrell's c指数转换为D。提供了一个流程图，以帮助在使用这些方法时进行决策。

结论

我们基于生存模型的预后能力开发了一套样本量计算方法，而非基于模型系数的大小或显著性。我们已注意提高这些计算方法的实际效用，并就其在当代临床研究中的应用给出建议。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f47/4603804/5f0d32f5d0b3/12874_2015_78_Fig1_HTML.jpg

相似文献

Discrimination-based sample size calculations for multivariable prognostic models for time-to-event data.

BMC Med Res Methodol. 2015 Oct 12;15:82. doi: 10.1186/s12874-015-0078-y.

Construction and validation of a prognostic model across several studies, with an application in superficial bladder cancer.

Stat Med. 2004 Mar 30;23(6):907-26. doi: 10.1002/sim.1691.

Decision making in surgical treatment of chronic low back pain: the performance of prognostic tests to select patients for lumbar spinal fusion.

Acta Orthop Suppl. 2013 Feb;84(349):1-35. doi: 10.3109/17453674.2012.753565.

Sample size estimation in diagnostic test studies of biomedical informatics.

J Biomed Inform. 2014 Apr;48:193-204. doi: 10.1016/j.jbi.2014.02.013. Epub 2014 Feb 26.

The effect of sample size and bias on the reliability of estimates of error: a comparative study of Dahlberg's formula.

Eur J Orthod. 2012 Apr;34(2):158-63. doi: 10.1093/ejo/cjr010. Epub 2011 Mar 29.

Blinded sample size recalculation for clinical trials with normal data and baseline adjusted analysis.

Pharm Stat. 2011 Jan-Feb;10(1):8-13. doi: 10.1002/pst.398.

Cluster randomised crossover trials with binary data and unbalanced cluster sizes: application to studies of near-universal interventions in intensive care.

Clin Trials. 2015 Feb;12(1):34-44. doi: 10.1177/1740774514559610. Epub 2014 Dec 4.

An evaluation of sample size requirements for developing risk prediction models with binary outcomes.

BMC Med Res Methodol. 2024 Jul 10;24(1):146. doi: 10.1186/s12874-024-02268-5.

Planning sample sizes when effect sizes are uncertain: The power-calibrated effect size approach.

Psychol Methods. 2016 Mar;21(1):47-60. doi: 10.1037/met0000036. Epub 2015 Dec 14.

Minimum sample size calculations for external validation of a clinical prediction model with a time-to-event outcome.

Stat Med. 2022 Mar 30;41(7):1280-1295. doi: 10.1002/sim.9275. Epub 2021 Dec 16.

引用本文的文献

The impact of graded nursing interventions based on quantitative risk assessment on psychological stress responses in patients undergoing resection for primary liver cancer.

BMC Nurs. 2025 Aug 13;24(1):1068. doi: 10.1186/s12912-025-03728-z.

GEMA-Na and MELD 3.0 severity scores to address sex disparities for accessing liver transplantation: a nationwide retrospective cohort study.

EClinicalMedicine. 2024 Jul 18;74:102737. doi: 10.1016/j.eclinm.2024.102737. eCollection 2024 Aug.

Evaluation of clinical prediction models (part 3): calculating the sample size required for an external validation study.

BMJ. 2024 Jan 22;384:e074821. doi: 10.1136/bmj-2023-074821.

Cohort size required for prognostic genes analysis of stage II/III esophageal squamous cell carcinoma.

Pathol Oncol Res. 2023 Feb 7;29:1610909. doi: 10.3389/pore.2023.1610909. eCollection 2023.

Nomogram to predict risk of incident chronic kidney disease in high-risk population of cardiovascular disease in China: community-based cohort study.

BMJ Open. 2021 Nov 12;11(11):e047774. doi: 10.1136/bmjopen-2020-047774.

S-GRAS score for prognostic classification of adrenocortical carcinoma: an international, multicenter ENSAT study.

Eur J Endocrinol. 2021 Nov 30;186(1):25-36. doi: 10.1530/EJE-21-0510.

External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb.

J Clin Epidemiol. 2021 Jul;135:79-89. doi: 10.1016/j.jclinepi.2021.02.011. Epub 2021 Feb 14.

Risk assessment for hospital admission in patients with COPD; a multi-centre UK prospective observational study.

PLoS One. 2020 Feb 10;15(2):e0228940. doi: 10.1371/journal.pone.0228940. eCollection 2020.

Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes.

Stat Med. 2019 Mar 30;38(7):1276-1296. doi: 10.1002/sim.7992. Epub 2018 Oct 24.

Development and validation of a new predictive model for breast cancer survival in New Zealand and comparison to the Nottingham prognostic index.

BMC Cancer. 2018 Sep 17;18(1):897. doi: 10.1186/s12885-018-4791-x.

本文引用的文献

Prognosis Research Strategy (PROGRESS) 2: prognostic factor research.

PLoS Med. 2013;10(2):e1001380. doi: 10.1371/journal.pmed.1001380. Epub 2013 Feb 5.

An evaluation of penalised survival methods for developing prognostic models with rare events.

Stat Med. 2012 May 20;31(11-12):1150-61. doi: 10.1002/sim.4371. Epub 2011 Oct 14.

A simulation study of predictive ability measures in a survival model I: explained variation measures.

Stat Med. 2012 Oct 15;31(23):2627-43. doi: 10.1002/sim.4242. Epub 2011 Apr 26.

External validity of risk models: Use of benchmark values to disentangle a case-mix effect from incorrect coefficients.

Am J Epidemiol. 2010 Oct 15;172(8):971-80. doi: 10.1093/aje/kwq223. Epub 2010 Aug 31.

Reporting methods in studies developing prognostic models in cancer: a review.

BMC Med. 2010 Mar 30;8:20. doi: 10.1186/1741-7015-8-20.

Prognosis and prognostic research: Developing a prognostic model.

BMJ. 2009 Mar 31;338:b604. doi: 10.1136/bmj.b604.

Prognostic models: a methodological framework and review of models for breast cancer.

Cancer Invest. 2009 Mar;27(3):235-43. doi: 10.1080/07357900802572110.

Prognosis and prognostic research: what, why, and how?

BMJ. 2009 Feb 23;338:b375. doi: 10.1136/bmj.b375.

Prognosis of advanced hepatocellular carcinoma: comparison of three staging systems in two French clinical trials.

Ann Oncol. 2008 Jun;19(6):1117-26. doi: 10.1093/annonc/mdn030. Epub 2008 Feb 25.

Relaxing the rule of ten events per variable in logistic and Cox regression.

Am J Epidemiol. 2007 Mar 15;165(6):710-8. doi: 10.1093/aje/kwk052. Epub 2006 Dec 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于歧视的生存时间数据多变量预后模型的样本量计算

Discrimination-based sample size calculations for multivariable prognostic models for time-to-event data.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献