• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于生存数据预测的超级学习器。

Super Learner for Survival Data Prediction.

作者信息

Golmakani Marzieh K, Polley Eric C

机构信息

Pfizer Inc., San Diego, CA, USA.

Health Science Research, Mayo Clinic Minnesota, Rochester, Minnesota, USA.

出版信息

Int J Biostat. 2020 Feb 22. doi: 10.1515/ijb-2019-0065.

DOI:10.1515/ijb-2019-0065
PMID:32097120
Abstract

Survival analysis is a widely used method to establish a connection between a time to event outcome and a set of potential covariates. Accurately predicting the time of an event of interest is of primary importance in survival analysis. Many different algorithms have been proposed for survival prediction. However, for a given prediction problem it is rarely, if ever, possible to know in advance which algorithm will perform the best. In this paper we propose two algorithms for constructing super learners in survival data prediction where the individual algorithms are based on proportional hazards. A super learner is a flexible approach to statistical learning that finds the best weighted ensemble of the individual algorithms. Finding the optimal combination of the individual algorithms through minimizing cross-validated risk controls for over-fitting of the final ensemble learner. Candidate algorithms may range from a basic Cox model to tree-based machine learning algorithms, assuming all candidate algorithms are based on the proportional hazards framework. The ensemble weights are estimated by minimizing the cross-validated negative log partial likelihood. We compare the performance of the proposed super learners with existing models through extensive simulation studies. In all simulation scenarios, the proposed super learners are either the best fit or near the best fit. The performances of the newly proposed algorithms are also demonstrated with clinical data examples.

摘要

生存分析是一种广泛使用的方法,用于建立事件发生时间结果与一组潜在协变量之间的联系。在生存分析中,准确预测感兴趣事件的发生时间至关重要。已经提出了许多不同的算法用于生存预测。然而,对于给定的预测问题,很少有可能(如果有的话)提前知道哪种算法将表现最佳。在本文中,我们提出了两种在生存数据预测中构建超级学习器的算法,其中各个算法基于比例风险模型。超级学习器是一种灵活的统计学习方法,它能找到各个算法的最佳加权组合。通过最小化交叉验证风险来控制最终集成学习器的过拟合,从而找到各个算法的最优组合。候选算法的范围可以从基本的Cox模型到基于树的机器学习算法,假设所有候选算法都基于比例风险框架。通过最小化交叉验证的负对数偏似然来估计集成权重。我们通过广泛的模拟研究比较了所提出的超级学习器与现有模型的性能。在所有模拟场景中,所提出的超级学习器要么是最佳拟合,要么接近最佳拟合。新提出算法的性能也通过临床数据示例得到了证明。

相似文献

1
Super Learner for Survival Data Prediction.用于生存数据预测的超级学习器。
Int J Biostat. 2020 Feb 22. doi: 10.1515/ijb-2019-0065.
2
Super learner.超级学习者。
Stat Appl Genet Mol Biol. 2007;6:Article25. doi: 10.2202/1544-6115.1309. Epub 2007 Sep 16.
3
A hybrid super ensemble learning model for the early-stage prediction of diabetes risk.一种用于糖尿病风险早期预测的混合超级集成学习模型。
Med Biol Eng Comput. 2023 Mar;61(3):785-797. doi: 10.1007/s11517-022-02749-z. Epub 2023 Jan 5.
4
Stacked generalization: an introduction to super learning.堆叠泛化:超级学习导论。
Eur J Epidemiol. 2018 May;33(5):459-464. doi: 10.1007/s10654-018-0390-z. Epub 2018 Apr 10.
5
Practical considerations for specifying a super learner.指定超级学习者的实用考虑因素。
Int J Epidemiol. 2023 Aug 2;52(4):1276-1285. doi: 10.1093/ije/dyad023.
6
Can Hyperparameter Tuning Improve the Performance of a Super Learner?: A Case Study.超参数调优能否提高超级学习者的性能?:一项案例研究。
Epidemiology. 2019 Jul;30(4):521-531. doi: 10.1097/EDE.0000000000001027.
7
Super learning: an application to the prediction of HIV-1 drug resistance.超级学习:在预测HIV-1耐药性方面的应用。
Stat Appl Genet Mol Biol. 2007;6:Article7. doi: 10.2202/1544-6115.1240. Epub 2007 Feb 23.
8
Mortality risk score prediction in an elderly population using machine learning.基于机器学习的老年人群死亡率风险评分预测。
Am J Epidemiol. 2013 Mar 1;177(5):443-52. doi: 10.1093/aje/kws241. Epub 2013 Jan 29.
9
Propensity score prediction for electronic healthcare databases using Super Learner and High-dimensional Propensity Score Methods.使用超级学习器和高维倾向评分方法对电子医疗数据库进行倾向评分预测。
J Appl Stat. 2019;46(12):2216-2236. doi: 10.1080/02664763.2019.1582614. Epub 2019 Feb 22.
10
The Balance Super Learner: A robust adaptation of the Super Learner to improve estimation of the average treatment effect in the treated based on propensity score matching.平衡超级学习者:超级学习者的稳健自适应方法,可提高基于倾向评分匹配的处理组平均处理效应估计的稳健性。
Stat Methods Med Res. 2018 Aug;27(8):2504-2518. doi: 10.1177/0962280216682055. Epub 2016 Dec 15.

引用本文的文献

1
A novel non-negative Bayesian stacking modeling method for Cancer survival prediction using high-dimensional omics data.一种使用高维组学数据进行癌症生存预测的新型非负贝叶斯堆叠建模方法。
BMC Med Res Methodol. 2024 May 3;24(1):105. doi: 10.1186/s12874-024-02232-3.
2
Optimizing cardiovascular disease mortality prediction: a super learner approach in the tehran lipid and glucose study.优化心血管疾病死亡率预测:特兰脂质和血糖研究中的超级学习者方法。
BMC Med Inform Decis Mak. 2024 Apr 16;24(1):97. doi: 10.1186/s12911-024-02489-0.
3
Accelerating Elastic Property Prediction in Fe-C Alloys through Coupling of Molecular Dynamics and Machine Learning.
通过分子动力学与机器学习相结合加速铁碳合金弹性性能预测
Materials (Basel). 2024 Jan 26;17(3):601. doi: 10.3390/ma17030601.
4
Using the Super Learner algorithm to predict risk of major adverse cardiovascular events after percutaneous coronary intervention in patients with myocardial infarction.应用超级学习者算法预测心肌梗死患者经皮冠状动脉介入治疗后发生主要不良心血管事件的风险。
BMC Med Res Methodol. 2024 Mar 8;24(1):59. doi: 10.1186/s12874-024-02179-5.
5
Determinants of COVID-19 Outcome as Predictors of Delayed Healthcare Services among Adults ≥50 Years during the Pandemic: 2006-2020 Health and Retirement Study.COVID-19 结局的决定因素预测了疫情期间 50 岁及以上成年人延迟医疗服务的情况:2006-2020 年健康与退休研究。
Int J Environ Res Public Health. 2022 Sep 23;19(19):12059. doi: 10.3390/ijerph191912059.
6
Individual dynamic prediction of clinical endpoint from large dimensional longitudinal biomarker history: a landmark approach.从大型纵向生物标志物历史中对临床终点进行个体动态预测:一种里程碑方法。
BMC Med Res Methodol. 2022 Jul 11;22(1):188. doi: 10.1186/s12874-022-01660-3.
7
Uncertainty in lung cancer stage for survival estimation via set-valued classification.基于集值分类的肺癌分期对生存估计的不确定性。
Stat Med. 2022 Aug 30;41(19):3772-3788. doi: 10.1002/sim.9448. Epub 2022 Jun 8.
8
A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data.一项范围性方法学综述:比较统计方法和机器学习方法对事件发生时间数据进行风险预测的模拟研究。
Diagn Progn Res. 2022 Jun 2;6(1):10. doi: 10.1186/s41512-022-00124-y.
9
Predictors of Covid-19 level of concern among older adults from the health and retirement study.预测老年人对新冠病毒担忧程度的因素:来自健康与退休研究。
Sci Rep. 2022 Mar 15;12(1):4396. doi: 10.1038/s41598-022-08332-8.
10
The promise of automated machine learning for the genetic analysis of complex traits.自动化机器学习在复杂性状的遗传分析中的应用前景。
Hum Genet. 2022 Sep;141(9):1529-1544. doi: 10.1007/s00439-021-02393-x. Epub 2021 Oct 28.