• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

临床预测模型的外部验证:基于模拟的样本量计算比经验法则更可靠。

External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb.

机构信息

Centre for Prognosis Research, School of Medicine, Keele University, Keele, Staffordshire, United Kingdom.

Centre for Prognosis Research, School of Medicine, Keele University, Keele, Staffordshire, United Kingdom.

出版信息

J Clin Epidemiol. 2021 Jul;135:79-89. doi: 10.1016/j.jclinepi.2021.02.011. Epub 2021 Feb 14.

DOI:10.1016/j.jclinepi.2021.02.011
PMID:33596458
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8352630/
Abstract

INTRODUCTION

Sample size "rules-of-thumb" for external validation of clinical prediction models suggest at least 100 events and 100 non-events. Such blanket guidance is imprecise, and not specific to the model or validation setting. We investigate factors affecting precision of model performance estimates upon external validation, and propose a more tailored sample size approach.

METHODS

Simulation of logistic regression prediction models to investigate factors associated with precision of performance estimates. Then, explanation and illustration of a simulation-based approach to calculate the minimum sample size required to precisely estimate a model's calibration, discrimination and clinical utility.

RESULTS

Precision is affected by the model's linear predictor (LP) distribution, in addition to number of events and total sample size. Sample sizes of 100 (or even 200) events and non-events can give imprecise estimates, especially for calibration. The simulation-based calculation accounts for the LP distribution and (mis)calibration in the validation sample. Application identifies 2430 required participants (531 events) for external validation of a deep vein thrombosis diagnostic model.

CONCLUSION

Where researchers can anticipate the distribution of the model's LP (eg, based on development sample, or a pilot study), a simulation-based approach for calculating sample size for external validation offers more flexibility and reliability than rules-of-thumb.

摘要

简介

用于临床预测模型外部验证的样本量“经验法则”建议至少有 100 个事件和 100 个非事件。这种笼统的指导意见不够精确,也不适用于模型或验证设置。我们研究了影响外部验证时模型性能估计精度的因素,并提出了一种更具针对性的样本量方法。

方法

模拟逻辑回归预测模型,以研究与性能估计精度相关的因素。然后,解释并说明了一种基于模拟的方法,用于计算精确估计模型校准、区分度和临床实用性所需的最小样本量。

结果

精度受模型的线性预测器(LP)分布的影响,除了事件数量和总样本量。即使有 100 个(甚至 200 个)事件和非事件,估计也可能不够精确,尤其是在校准方面。基于模拟的计算考虑了验证样本中的 LP 分布和(误)校准。该应用程序确定了 2430 名(531 个事件)参与者用于深静脉血栓形成诊断模型的外部验证。

结论

如果研究人员可以预测模型 LP 的分布(例如,基于开发样本或试点研究),则基于模拟的计算外部验证样本量的方法比经验法则更灵活和可靠。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/d49345ac960f/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/d1ba7b68b41f/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/0f1ae306b59e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/c82f68024c1b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/d49345ac960f/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/d1ba7b68b41f/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/0f1ae306b59e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/c82f68024c1b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212d/8352630/d49345ac960f/gr4.jpg

相似文献

1
External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb.临床预测模型的外部验证:基于模拟的样本量计算比经验法则更可靠。
J Clin Epidemiol. 2021 Jul;135:79-89. doi: 10.1016/j.jclinepi.2021.02.011. Epub 2021 Feb 14.
2
Minimum sample size for external validation of a clinical prediction model with a binary outcome.具有二元结局的临床预测模型外部验证的最小样本量
Stat Med. 2021 Aug 30;40(19):4230-4251. doi: 10.1002/sim.9025. Epub 2021 May 24.
3
Minimum sample size calculations for external validation of a clinical prediction model with a time-to-event outcome.用于时间事件结局的临床预测模型外部验证的最小样本量计算。
Stat Med. 2022 Mar 30;41(7):1280-1295. doi: 10.1002/sim.9275. Epub 2021 Dec 16.
4
Minimum sample size for external validation of a clinical prediction model with a continuous outcome.用于连续结局临床预测模型外部验证的最小样本量。
Stat Med. 2021 Jan 15;40(1):133-146. doi: 10.1002/sim.8766. Epub 2020 Nov 4.
5
Expected Value of Sample Information Calculations for Risk Prediction Model Validation.风险预测模型验证中样本信息计算的期望值
Med Decis Making. 2025 Apr;45(3):232-244. doi: 10.1177/0272989X251314010. Epub 2025 Feb 18.
6
Prognostic models for newly-diagnosed chronic lymphocytic leukaemia in adults: a systematic review and meta-analysis.成人新诊断慢性淋巴细胞白血病的预后模型:一项系统评价和荟萃分析。
Cochrane Database Syst Rev. 2020 Jul 31;7(7):CD012022. doi: 10.1002/14651858.CD012022.pub2.
7
Estimation of required sample size for external validation of risk models for binary outcomes.用于二分类结局风险模型外部验证的样本量估计。
Stat Methods Med Res. 2021 Oct;30(10):2187-2206. doi: 10.1177/09622802211007522. Epub 2021 Apr 21.
8
Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes.建立多变量预测模型的最小样本量:第二部分 - 二分类和生存数据。
Stat Med. 2019 Mar 30;38(7):1276-1296. doi: 10.1002/sim.7992. Epub 2018 Oct 24.
9
The relationship between statistical power and predictor distribution in multilevel logistic regression: a simulation-based approach.多层次逻辑回归中统计功效与预测因子分布的关系:基于模拟的方法。
BMC Med Res Methodol. 2019 May 9;19(1):97. doi: 10.1186/s12874-019-0742-8.
10
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

引用本文的文献

1
TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods: a Korean translation.TRIPOD+AI声明:使用回归或机器学习方法的临床预测模型报告的更新指南:韩文翻译
Ewha Med J. 2025 Jul;48(3):e48. doi: 10.12771/emj.2025.00668. Epub 2025 Jul 31.
2
Clinical Characteristics of Gastrointestinal Malignancy and Validation of the IDIOM Model Among Patients With Iron Deficiency Anemia in Thailand.泰国缺铁性贫血患者胃肠道恶性肿瘤的临床特征及IDIOM模型的验证
JGH Open. 2025 Jul 11;9(7):e70221. doi: 10.1002/jgh3.70221. eCollection 2025 Jul.
3
Extended sample size calculations for evaluation of prediction models using a threshold for classification.

本文引用的文献

1
Calculating the sample size required for developing a clinical prediction model.计算开发临床预测模型所需的样本量。
BMJ. 2020 Mar 18;368:m441. doi: 10.1136/bmj.m441.
2
Calibration: the Achilles heel of predictive analytics.校准:预测分析的阿喀琉斯之踵。
BMC Med. 2019 Dec 16;17(1):230. doi: 10.1186/s12916-019-1466-7.
3
The Integrated Calibration Index (ICI) and related metrics for quantifying the calibration of logistic regression models.综合校准指数(ICI)及其相关指标,用于量化逻辑回归模型的校准。
使用分类阈值评估预测模型的扩展样本量计算
BMC Med Res Methodol. 2025 Jul 1;25(1):170. doi: 10.1186/s12874-025-02592-4.
4
Systematic scoping review of external validation studies of AI pathology models for lung cancer diagnosis.人工智能肺癌诊断病理模型外部验证研究的系统综述
NPJ Precis Oncol. 2025 Jun 7;9(1):166. doi: 10.1038/s41698-025-00940-7.
5
Development and validation of a prediction model for early recurrence in upper tract urothelial carcinoma treated with radical nephroureterectomy.根治性肾输尿管切除术治疗上尿路尿路上皮癌早期复发预测模型的开发与验证
BMC Cancer. 2025 Apr 30;25(1):808. doi: 10.1186/s12885-025-14180-2.
6
Statistical primer: sample size considerations for developing and validating clinical prediction models.统计学入门:开发和验证临床预测模型时的样本量考量
Eur J Cardiothorac Surg. 2025 May 6;67(5). doi: 10.1093/ejcts/ezaf142.
7
Improving radiomics-based models for esophagogastric variceal bleeding risk prediction in cirrhotic patients.改进基于影像组学的模型以预测肝硬化患者食管胃静脉曲张出血风险
World J Gastroenterol. 2025 Mar 21;31(11):101804. doi: 10.3748/wjg.v31.i11.101804.
8
Boosting the accuracy of existing models by updating and extending: using a multicenter COVID-19 ICU cohort as a proxy.通过更新和扩展提高现有模型的准确性:使用多中心 COVID-19 ICU 队列作为替代。
Sci Rep. 2024 Nov 1;14(1):26344. doi: 10.1038/s41598-024-70333-6.
9
Risk of repeat self-harm among individuals presenting to healthcare services: development and validation of a clinical risk assessment model (OxSET).就诊于医疗保健服务的个体再次自我伤害的风险:临床风险评估模型(OxSET)的制定与验证。
BMJ Ment Health. 2024 Oct 15;27(1):e301180. doi: 10.1136/bmjment-2024-301180.
10
Deep brain stimulation versus vagus nerve stimulation for the motor function of poststroke hemiplegia: study protocol for a multicentre randomised controlled trial.深部脑刺激与迷走神经刺激治疗脑卒中后偏瘫的运动功能:一项多中心随机对照试验的研究方案。
BMJ Open. 2024 Oct 8;14(10):e086098. doi: 10.1136/bmjopen-2024-086098.
Stat Med. 2019 Sep 20;38(21):4051-4065. doi: 10.1002/sim.8281. Epub 2019 Jul 3.
4
Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes.建立多变量预测模型的最小样本量:第二部分 - 二分类和生存数据。
Stat Med. 2019 Mar 30;38(7):1276-1296. doi: 10.1002/sim.7992. Epub 2018 Oct 24.
5
Minimum sample size for developing a multivariable prediction model: Part I - Continuous outcomes.建立多变量预测模型的最小样本量:第一部分-连续结局。
Stat Med. 2019 Mar 30;38(7):1262-1275. doi: 10.1002/sim.7993. Epub 2018 Oct 22.
6
A framework for meta-analysis of prediction model studies with binary and time-to-event outcomes.用于二元和事件发生时间结局的预测模型研究的Meta分析框架。
Stat Methods Med Res. 2019 Sep;28(9):2768-2786. doi: 10.1177/0962280218785504. Epub 2018 Jul 23.
7
Sample size for binary logistic prediction models: Beyond events per variable criteria.二项逻辑预测模型的样本量:超越变量标准的事件数。
Stat Methods Med Res. 2019 Aug;28(8):2455-2474. doi: 10.1177/0962280218784726. Epub 2018 Jul 3.
8
Comparing distributions of polygenic risk scores of type 2 diabetes and coronary heart disease within different populations.比较不同人群中2型糖尿病和冠心病多基因风险评分的分布情况。
PLoS One. 2017 Jul 5;12(7):e0179238. doi: 10.1371/journal.pone.0179238. eCollection 2017.
9
Sample size calculation to externally validate scoring systems based on logistic regression models.基于逻辑回归模型进行外部验证评分系统的样本量计算。
PLoS One. 2017 May 1;12(5):e0176726. doi: 10.1371/journal.pone.0176726. eCollection 2017.
10
Prediction of complications in early-onset pre-eclampsia (PREP): development and external multinational validation of prognostic models.早发型子痫前期(PREP)并发症的预测:预后模型的开发与外部多中心验证
BMC Med. 2017 Mar 30;15(1):68. doi: 10.1186/s12916-017-0827-3.