• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测性逻辑回归模型的外部验证研究需要大量有效的样本量。

Substantial effective sample sizes were required for external validation studies of predictive logistic regression models.

作者信息

Vergouwe Yvonne, Steyerberg Ewout W, Eijkemans Marinus J C, Habbema J Dik F

机构信息

Department of Public Health, Erasmus MC, P.O. Box 1738, 3000 DR Rotterdam, The Netherlands.

出版信息

J Clin Epidemiol. 2005 May;58(5):475-83. doi: 10.1016/j.jclinepi.2004.06.017.

DOI:10.1016/j.jclinepi.2004.06.017
PMID:15845334
Abstract

BACKGROUND AND OBJECTIVES

The performance of a prediction model is usually worse in external validation data compared to the development data. We aimed to determine at which effective sample sizes (i.e., number of events) relevant differences in model performance can be detected with adequate power.

METHODS

We used a logistic regression model to predict the probability that residual masses of patients treated for metastatic testicular cancer contained only benign tissue. We performed standard power calculations and Monte Carlo simulations to estimate the numbers of events that are required to detect several types of model invalidity with 80% power at the 5% significance level.

RESULTS

A validation sample with 111 events was required to detect that a model predicted too high probabilities, when predictions were on average 1.5 times too high on the odds scale. A decrease in discriminative ability of the model, indicated by a decrease in the c-statistic from 0.83 to 0.73, required 81 to 106 events, depending on the specific scenario.

CONCLUSION

We suggest a minimum of 100 events and 100 nonevents for external validation samples. Specific hypotheses may, however, require substantially higher effective sample sizes to obtain adequate power.

摘要

背景与目的

与开发数据相比,预测模型在外部验证数据中的表现通常更差。我们旨在确定在何种有效样本量(即事件数)下,能够以足够的检验效能检测到模型性能的相关差异。

方法

我们使用逻辑回归模型来预测接受转移性睾丸癌治疗的患者残留肿块仅包含良性组织的概率。我们进行了标准的效能计算和蒙特卡洛模拟,以估计在5%显著性水平下,以80%的检验效能检测几种类型的模型无效性所需的事件数。

结果

当预测在优势比尺度上平均高1.5倍时,需要一个包含111个事件的验证样本才能检测到模型预测的概率过高。根据具体情况,模型判别能力的下降(由c统计量从0.83降至0.73表示)需要81至106个事件。

结论

我们建议外部验证样本的事件数最少为100个,非事件数最少为100个。然而,特定的假设可能需要实质上更高的有效样本量才能获得足够的检验效能。

相似文献

1
Substantial effective sample sizes were required for external validation studies of predictive logistic regression models.预测性逻辑回归模型的外部验证研究需要大量有效的样本量。
J Clin Epidemiol. 2005 May;58(5):475-83. doi: 10.1016/j.jclinepi.2004.06.017.
2
Validation of a prediction model and its predictors for the histology of residual masses in nonseminomatous testicular cancer.非精原细胞瘤性睾丸癌残留肿块组织学预测模型及其预测因子的验证
J Urol. 2001 Jan;165(1):84-8; discussion 88. doi: 10.1097/00005392-200101000-00021.
3
Polytomous logistic regression analysis could be applied more often in diagnostic research.多分类逻辑回归分析在诊断研究中可以更频繁地应用。
J Clin Epidemiol. 2008 Feb;61(2):125-34. doi: 10.1016/j.jclinepi.2007.03.002. Epub 2007 Jun 29.
4
External validation of prognostic models for critically ill patients required substantial sample sizes.危重症患者预后模型的外部验证需要大量样本量。
J Clin Epidemiol. 2007 May;60(5):491-501. doi: 10.1016/j.jclinepi.2006.08.011. Epub 2007 Feb 5.
5
Validation and updating of predictive logistic regression models: a study on sample size and shrinkage.预测性逻辑回归模型的验证与更新:样本量与收缩的研究
Stat Med. 2004 Aug 30;23(16):2567-86. doi: 10.1002/sim.1844.
6
A simple approach to power and sample size calculations in logistic regression and Cox regression models.逻辑回归和Cox回归模型中功效及样本量计算的一种简单方法。
Stat Med. 2004 Jun 15;23(11):1781-92. doi: 10.1002/sim.1753.
7
A comparison of regression trees, logistic regression, generalized additive models, and multivariate adaptive regression splines for predicting AMI mortality.用于预测急性心肌梗死死亡率的回归树、逻辑回归、广义相加模型和多元自适应回归样条的比较。
Stat Med. 2007 Jul 10;26(15):2937-57. doi: 10.1002/sim.2770.
8
A simulation study of sample size for multilevel logistic regression models.多水平逻辑回归模型样本量的模拟研究
BMC Med Res Methodol. 2007 Jul 16;7:34. doi: 10.1186/1471-2288-7-34.
9
Prognosis following severe head injury: Development and validation of a model for prediction of death, disability, and functional recovery.重度颅脑损伤后的预后:死亡、残疾和功能恢复预测模型的开发与验证
J Trauma. 2006 Dec;61(6):1484-91. doi: 10.1097/01.ta.0000195981.63776.ba.
10
Detailed analysis of the relative power of direct and indirect association studies and the implications for their interpretation.直接关联研究与间接关联研究相对效力的详细分析及其解读的意义。
Hum Hered. 2007;64(1):63-73. doi: 10.1159/000101424. Epub 2007 Apr 27.

引用本文的文献

1
Advancing Prognostics in Oncology: Developing a Machine Learning Model for Predicting 2-Year and 5-Year Survival Rates in Patients with Undifferentiated Pleomorphic Sarcoma.肿瘤学中的预后进展:开发用于预测未分化多形性肉瘤患者2年和5年生存率的机器学习模型
Ann Surg Oncol. 2025 Sep 8. doi: 10.1245/s10434-025-18249-x.
2
Development and validation of a dynamic prediction model for single-dose methotrexate treatment success in tubal ectopic pregnancy: a multicentre cohort study in Chinese hospitals.单剂量甲氨蝶呤治疗输卵管异位妊娠成功的动态预测模型的开发与验证:中国医院的多中心队列研究
BMJ Open. 2025 Sep 1;15(9):e092110. doi: 10.1136/bmjopen-2024-092110.
3
Predicting pain reduction following laparoscopic surgery for endometriosis: a retrospective cohort study using UK national and research databases.
预测子宫内膜异位症腹腔镜手术后的疼痛减轻情况:一项使用英国国家和研究数据库的回顾性队列研究。
BMJ Open. 2025 Aug 27;15(8):e099374. doi: 10.1136/bmjopen-2025-099374.
4
Development and Validation of a Nomogram-Based Risk Prediction Model for Diabetic Retinopathy in Elderly Adults with Type 2 Diabetes Mellitus.2型糖尿病老年患者糖尿病视网膜病变基于列线图的风险预测模型的开发与验证
Diabetes Metab Syndr Obes. 2025 Jul 25;18:2509-2523. doi: 10.2147/DMSO.S530424. eCollection 2025.
5
Development and validation of a novel clinical risk score to predict hypoxaemia in children with pneumonia using the WHO PREPARE dataset.利用世界卫生组织“准备就绪”数据集开发并验证一种预测肺炎患儿低氧血症的新型临床风险评分。
BMJ Glob Health. 2025 Jul 7;10(7):e017256. doi: 10.1136/bmjgh-2024-017256.
6
Development and validation of a nomogram for predicting the risk of cognitive impairment among chronic obstructive pulmonary diseases.慢性阻塞性肺疾病认知障碍风险预测列线图的开发与验证
Ann Med. 2025 Dec;57(1):2528448. doi: 10.1080/07853890.2025.2528448. Epub 2025 Jul 5.
7
Prediction models for intraventricular hemorrhage in very preterm infants: a systematic review.极早产儿脑室内出血的预测模型:一项系统综述
Front Pediatr. 2025 Jun 4;13:1605145. doi: 10.3389/fped.2025.1605145. eCollection 2025.
8
powerROC: An Interactive Web Tool for Sample Size Calculation in Assessing Models' Discriminative Abilities.powerROC:用于评估模型鉴别能力时样本量计算的交互式网络工具。
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:196-204. eCollection 2025.
9
Magnetic resonance enterography to predict subsequent disabling Crohn's disease in newly diagnosed patients (METRIC-EF)-multivariable prediction model, multicentre diagnostic inception cohort.磁共振小肠造影预测新诊断患者中后续致残性克罗恩病(METRIC-EF)-多变量预测模型,多中心诊断性起始队列研究
Eur Radiol. 2025 May 14. doi: 10.1007/s00330-025-11636-8.
10
Statistical primer: sample size considerations for developing and validating clinical prediction models.统计学入门:开发和验证临床预测模型时的样本量考量
Eur J Cardiothorac Surg. 2025 May 6;67(5). doi: 10.1093/ejcts/ezaf142.