• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

回归收缩方法在临床预测模型中并不能保证性能得到改善:模拟研究。

Regression shrinkage methods for clinical prediction models do not guarantee improved performance: Simulation study.

机构信息

Department of Development and Regeneration, KU Leuven, Leuven, Belgium.

Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands.

出版信息

Stat Methods Med Res. 2020 Nov;29(11):3166-3178. doi: 10.1177/0962280220921415. Epub 2020 May 13.

DOI:10.1177/0962280220921415
PMID:32401702
Abstract

When developing risk prediction models on datasets with limited sample size, shrinkage methods are recommended. Earlier studies showed that shrinkage results in better predictive performance on average. This simulation study aimed to investigate the variability of regression shrinkage on predictive performance for a binary outcome. We compared standard maximum likelihood with the following shrinkage methods: uniform shrinkage (likelihood-based and bootstrap-based), penalized maximum likelihood (ridge) methods, LASSO logistic regression, adaptive LASSO, and Firth's correction. In the simulation study, we varied the number of predictors and their strength, the correlation between predictors, the event rate of the outcome, and the events per variable. In terms of results, we focused on the calibration slope. The slope indicates whether risk predictions are too extreme (slope < 1) or not extreme enough (slope > 1). The results can be summarized into three main findings. First, shrinkage improved calibration slopes on average. Second, the between-sample variability of calibration slopes was often increased relative to maximum likelihood. In contrast to other shrinkage approaches, Firth's correction had a small shrinkage effect but showed low variability. Third, the correlation between the estimated shrinkage and the optimal shrinkage to remove overfitting was typically negative, with Firth's correction as the exception. We conclude that, despite improved performance on average, shrinkage often worked poorly in individual datasets, in particular when it was most needed. The results imply that shrinkage methods do not solve problems associated with small sample size or low number of events per variable.

摘要

当在样本量有限的数据集上开发风险预测模型时,建议使用收缩方法。早期的研究表明,收缩平均会提高预测性能。本模拟研究旨在研究二元结果的预测性能上回归收缩的可变性。我们将标准最大似然与以下收缩方法进行了比较:均匀收缩(基于似然和基于引导的)、惩罚最大似然(岭)方法、LASSO 逻辑回归、自适应 LASSO 和 Firth 校正。在模拟研究中,我们改变了预测因子的数量及其强度、预测因子之间的相关性、结果的事件发生率和变量的事件数。在结果方面,我们主要关注校准斜率。斜率表示风险预测是否过于极端(斜率 < 1)或不够极端(斜率 > 1)。结果可以总结为三个主要发现。首先,收缩平均提高了校准斜率。其次,与最大似然相比,校准斜率的样本间可变性通常增加。与其他收缩方法不同,Firth 校正的收缩效果较小,但变异性较低。第三,估计的收缩与去除过拟合的最佳收缩之间的相关性通常为负,Firth 校正则是例外。我们得出结论,尽管平均性能有所提高,但收缩方法在单个数据集上的表现通常不佳,尤其是在最需要时。结果表明,收缩方法并不能解决样本量小或每个变量的事件数少的问题。

相似文献

1
Regression shrinkage methods for clinical prediction models do not guarantee improved performance: Simulation study.回归收缩方法在临床预测模型中并不能保证性能得到改善:模拟研究。
Stat Methods Med Res. 2020 Nov;29(11):3166-3178. doi: 10.1177/0962280220921415. Epub 2020 May 13.
2
To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets.调参还是不调参,小数据集或稀疏数据集的岭 logistic 回归案例研究。
BMC Med Res Methodol. 2021 Sep 30;21(1):199. doi: 10.1186/s12874-021-01374-y.
3
Firth's logistic regression with rare events: accurate effect estimates and predictions?针对罕见事件的费思逻辑回归:准确的效应估计与预测?
Stat Med. 2017 Jun 30;36(14):2302-2317. doi: 10.1002/sim.7273. Epub 2017 Mar 12.
4
Penalized Regression Methods With Modified Cross-Validation and Bootstrap Tuning Produce Better Prediction Models.惩罚回归方法通过修正的交叉验证和引导调整可以产生更好的预测模型。
Biom J. 2024 Jul;66(5):e202300245. doi: 10.1002/bimj.202300245.
5
Sample size considerations and predictive performance of multinomial logistic prediction models.多分类逻辑回归预测模型的样本量考虑因素和预测性能。
Stat Med. 2019 Apr 30;38(9):1601-1619. doi: 10.1002/sim.8063. Epub 2019 Jan 6.
6
Re-evaluation of the comparative effectiveness of bootstrap-based optimism correction methods in the development of multivariable clinical prediction models.基于 Bootstrap 的校正方法在多变量临床预测模型构建中的校正效能再评价。
BMC Med Res Methodol. 2021 Jan 7;21(1):9. doi: 10.1186/s12874-020-01201-w.
7
Developing clinical prediction models when adhering to minimum sample size recommendations: The importance of quantifying bootstrap variability in tuning parameters and predictive performance.在遵守最小样本量建议的情况下开发临床预测模型:在调整参数和预测性能时量化引导变异性的重要性。
Stat Methods Med Res. 2021 Dec;30(12):2545-2561. doi: 10.1177/09622802211046388. Epub 2021 Oct 8.
8
Review and evaluation of penalised regression methods for risk prediction in low-dimensional data with few events.低事件数低维数据中风险预测的惩罚回归方法综述与评估
Stat Med. 2016 Mar 30;35(7):1159-77. doi: 10.1002/sim.6782. Epub 2015 Oct 29.
9
On estimation for accelerated failure time models with small or rare event survival data.小样本或稀有事件生存数据的加速失效时间模型估计。
BMC Med Res Methodol. 2022 Jun 11;22(1):169. doi: 10.1186/s12874-022-01638-1.
10
Reparametrized Firth's Logistic Regressions for Dose-Finding Study With the Biased-Coin Design.重参数化 Firth 的逻辑回归在偏币设计剂量发现研究中的应用。
Pharm Stat. 2024 Nov-Dec;23(6):1117-1127. doi: 10.1002/pst.2423. Epub 2024 Jul 16.

引用本文的文献

1
Adaption of the Memorial Sloan Kettering Cancer Center Nomograms for the Prediction of Prostate Cancer-specific Death in Sweden: A Population-based Cohort Study.纪念斯隆凯特琳癌症中心列线图在瑞典预测前列腺癌特异性死亡中的应用:一项基于人群的队列研究
Eur Urol Open Sci. 2025 Jul 14;78:41-50. doi: 10.1016/j.euros.2025.06.003. eCollection 2025 Aug.
2
Early detection of ICU-acquired infections using high-frequency electronic health record data.利用高频电子健康记录数据早期检测重症监护病房获得性感染
BMC Med Inform Decis Mak. 2025 Jul 21;25(1):273. doi: 10.1186/s12911-025-03031-6.
3
A decomposition of Fisher's information to inform sample size for developing or updating fair and precise clinical prediction models for individual risk-part 1: binary outcomes.
分解费舍尔信息以确定样本量,用于开发或更新针对个体风险的公平且精确的临床预测模型——第1部分:二元结局
Diagn Progn Res. 2025 Jul 8;9(1):14. doi: 10.1186/s41512-025-00193-9.
4
Alternatives to default shrinkage methods can improve prediction accuracy, calibration, and coverage: A methods comparison study.默认收缩方法的替代方法可提高预测准确性、校准和覆盖率:一项方法比较研究。
Stat Methods Med Res. 2025 Jul;34(7):1342-1355. doi: 10.1177/09622802251338440. Epub 2025 May 29.
5
Statistical primer: sample size considerations for developing and validating clinical prediction models.统计学入门:开发和验证临床预测模型时的样本量考量
Eur J Cardiothorac Surg. 2025 May 6;67(5). doi: 10.1093/ejcts/ezaf142.
6
When the whole is greater than the sum of its parts: why machine learning and conventional statistics are complementary for predicting future health outcomes.当整体大于部分之和:为何机器学习与传统统计学在预测未来健康结果方面相辅相成。
Clin Kidney J. 2025 Feb 20;18(4):sfaf059. doi: 10.1093/ckj/sfaf059. eCollection 2025 Apr.
7
James-Stein Estimator Improves Accuracy and Sample Efficiency in Human Kinematic and Metabolic Data.詹姆斯 - 斯坦估计器提高了人体运动学和代谢数据的准确性及样本效率。
Ann Biomed Eng. 2025 Apr 16. doi: 10.1007/s10439-025-03718-x.
8
Development and external validation of prediction risk scores (STRISK and NOFA) to predict immediate surgical need in adhesive small bowel obstruction: an observational prospective multicentre study.预测粘连性小肠梗阻即刻手术需求的预测风险评分(STRISK和NOFA)的开发与外部验证:一项观察性前瞻性多中心研究
Br J Surg. 2025 Mar 4;112(3). doi: 10.1093/bjs/znaf025.
9
Overlooked and underpowered: a meta-research addressing sample size in radiomics prediction models for binary outcomes.被忽视且样本量不足:一项针对二元结局的放射组学预测模型样本量的元研究。
Eur Radiol. 2025 Mar;35(3):1146-1156. doi: 10.1007/s00330-024-11331-0. Epub 2025 Jan 9.
10
James-Stein estimator improves accuracy and sample efficiency in human kinematic and metabolic data.詹姆斯-斯坦估计器提高了人类运动学和代谢数据的准确性和样本效率。
bioRxiv. 2024 Oct 17:2024.10.07.616339. doi: 10.1101/2024.10.07.616339.