• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用预测误差曲线评估随机森林用于生存分析

Evaluating Random Forests for Survival Analysis using Prediction Error Curves.

作者信息

Mogensen Ulla B, Ishwaran Hemant, Gerds Thomas A

机构信息

Department of Biostatistics, University of Copenhagen, Denmark.

Department of Epidemiology and Public Health, University of Miami, USA.

出版信息

J Stat Softw. 2012 Sep;50(11):1-23. doi: 10.18637/jss.v050.i11.

DOI:10.18637/jss.v050.i11
PMID:25317082
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4194196/
Abstract

Prediction error curves are increasingly used to assess and compare predictions in survival analysis. This article surveys the R package which provides a set of functions for efficient computation of prediction error curves. The software implements inverse probability of censoring weights to deal with right censored data and several variants of cross-validation to deal with the apparent error problem. In principle, all kinds of prediction models can be assessed, and the package readily supports most traditional regression modeling strategies, like Cox regression or additive hazard regression, as well as state of the art machine learning methods such as random forests, a nonparametric method which provides promising alternatives to traditional strategies in low and high-dimensional settings. We show how the functionality of can be extended to yet unsupported prediction models. As an example, we implement support for random forest prediction models based on the R-packages and . Using data of the Copenhagen Stroke Study we use to compare random forests to a Cox regression model derived from stepwise variable selection. Reproducible results on the user level are given for publicly available data from the German breast cancer study group.

摘要

预测误差曲线在生存分析中越来越多地用于评估和比较预测结果。本文介绍了一个R包,它提供了一组用于高效计算预测误差曲线的函数。该软件实现了用于处理右删失数据的删失权重逆概率以及用于处理明显误差问题的几种交叉验证变体。原则上,可以评估各种预测模型,并且该包很容易支持大多数传统回归建模策略,如Cox回归或加法风险回归,以及诸如随机森林等先进的机器学习方法,随机森林是一种非参数方法,在低维和高维设置中为传统策略提供了有前景的替代方案。我们展示了如何将该包的功能扩展到尚未得到支持的预测模型。例如,我们基于R包和实现了对随机森林预测模型的支持。使用哥本哈根中风研究的数据,我们使用该包将随机森林与通过逐步变量选择得出的Cox回归模型进行比较。针对德国乳腺癌研究组的公开可用数据,在用户层面给出了可重现的结果。

相似文献

1
Evaluating Random Forests for Survival Analysis using Prediction Error Curves.使用预测误差曲线评估随机森林用于生存分析
J Stat Softw. 2012 Sep;50(11):1-23. doi: 10.18637/jss.v050.i11.
2
Survival prediction models: an introduction to discrete-time modeling.生存预测模型:离散时间建模简介。
BMC Med Res Methodol. 2022 Jul 26;22(1):207. doi: 10.1186/s12874-022-01679-6.
3
A Comparison of Random Forest Variable Selection Methods for Classification Prediction Modeling.用于分类预测建模的随机森林变量选择方法比较
Expert Syst Appl. 2019 Nov 15;134:93-101. doi: 10.1016/j.eswa.2019.05.028. Epub 2019 May 23.
4
Random forest methodology for model-based recursive partitioning: the mobForest package for R.基于模型的递归分割的随机森林方法:R 中的 mobForest 包。
BMC Bioinformatics. 2013 Apr 11;14:125. doi: 10.1186/1471-2105-14-125.
5
Survival analysis in breast cancer: evaluating ensemble learning techniques for prediction.乳腺癌生存分析:评估用于预测的集成学习技术
PeerJ Comput Sci. 2024 Jul 10;10:e2147. doi: 10.7717/peerj-cs.2147. eCollection 2024.
6
Block Forests: random forests for blocks of clinical and omics covariate data.块森林:用于临床和组学协变量数据块的随机森林。
BMC Bioinformatics. 2019 Jun 27;20(1):358. doi: 10.1186/s12859-019-2942-y.
7
Personalized Risk Prediction in Clinical Oncology Research: Applications and Practical Issues Using Survival Trees and Random Forests.临床肿瘤学研究中的个性化风险预测:使用生存树和随机森林的应用及实际问题
J Biopharm Stat. 2018;28(2):333-349. doi: 10.1080/10543406.2017.1377730. Epub 2017 Oct 19.
8
A comparative study of forest methods for time-to-event data: variable selection and predictive performance.森林方法在生存时间数据中的比较研究:变量选择和预测性能。
BMC Med Res Methodol. 2021 Sep 25;21(1):193. doi: 10.1186/s12874-021-01386-8.
9
Evaluating the performance of machine learning methods and variable selection methods for predicting difficult-to-measure traits in Holstein dairy cattle using milk infrared spectral data.利用牛奶近红外光谱数据评估机器学习方法和变量选择方法在荷斯坦奶牛中预测难以测量性状的性能。
J Dairy Sci. 2021 Jul;104(7):8107-8121. doi: 10.3168/jds.2020-19861. Epub 2021 Apr 15.
10
OBLIQUE RANDOM SURVIVAL FORESTS.倾斜随机生存森林
Ann Appl Stat. 2019 Sep;13(3):1847-1883. doi: 10.1214/19-aoas1261. Epub 2019 Oct 17.

引用本文的文献

1
Disulfidptosis-associated gene signature predicts prognosis and radioresistance in NSCLC.二硫化物诱导细胞程序性坏死相关基因特征预测非小细胞肺癌的预后和放射抗性。
Transl Oncol. 2025 Aug 20;61:102496. doi: 10.1016/j.tranon.2025.102496.
2
Microbiome-based prediction of allogeneic hematopoietic stem cell transplantation outcome.基于微生物组对异基因造血干细胞移植结果的预测
Genome Med. 2025 Jul 17;17(1):80. doi: 10.1186/s13073-025-01507-8.
3
Effects of mental fatigue on biomechanical characteristics and risk associated with non-contact anterior cruciate ligament injuries during landing.

本文引用的文献

1
Estimating a time-dependent concordance index for survival prediction models with covariate dependent censoring.估计具有协变量相关删失的生存预测模型的时依一致性指数。
Stat Med. 2013 Jun 15;32(13):2173-84. doi: 10.1002/sim.5681. Epub 2012 Nov 22.
2
Confidence scores for prediction models.预测模型的置信度分数。
Biom J. 2011 Mar;53(2):259-74. doi: 10.1002/bimj.201000157. Epub 2011 Feb 17.
3
Testing the prediction error difference between 2 predictors.测试两个预测指标之间的预测误差差异。
精神疲劳对着陆过程中生物力学特征及非接触性前交叉韧带损伤相关风险的影响。
Front Bioeng Biotechnol. 2025 May 27;13:1582873. doi: 10.3389/fbioe.2025.1582873. eCollection 2025.
4
Competing risk nomogram for predicting cancer-specific survival in patients with primary bone diffuse large B-cell lymphoma: a SEER-based retrospective study.预测原发性骨弥漫性大B细胞淋巴瘤患者癌症特异性生存的竞争风险列线图:一项基于监测、流行病学和最终结果(SEER)数据库的回顾性研究
Front Med (Lausanne). 2025 May 12;12:1572919. doi: 10.3389/fmed.2025.1572919. eCollection 2025.
5
Lifetime analysis with monotonic degradation: a boosted first hitting time model based on a homogeneous gamma process.具有单调退化的寿命分析:基于齐次伽马过程的增强首次击中时间模型。
Lifetime Data Anal. 2025 Apr;31(2):300-339. doi: 10.1007/s10985-025-09648-z. Epub 2025 Apr 5.
6
A review of survival stacking: a method to cast survival regression analysis as a classification problem.生存堆叠综述:一种将生存回归分析转化为分类问题的方法。
Int J Biostat. 2025 Mar 28;21(1):37-51. doi: 10.1515/ijb-2022-0055. eCollection 2025 May 1.
7
Using prognostic signatures and machine learning to identify core features associated with response to CDK4/6 inhibitor-based therapy in metastatic breast cancer.利用预后特征和机器学习来识别与转移性乳腺癌中基于CDK4/6抑制剂治疗反应相关的核心特征。
Oncogene. 2025 May;44(19):1387-1399. doi: 10.1038/s41388-025-03308-0. Epub 2025 Feb 26.
8
A prognostic model for lung adenocarcinoma based on cuproptosis and disulfidptosis related genes revealing the key prognostic role of FURIN.一种基于铜死亡和二硫键化死亡相关基因的肺腺癌预后模型,揭示了弗林蛋白酶(FURIN)的关键预后作用。
Sci Rep. 2025 Feb 19;15(1):6057. doi: 10.1038/s41598-025-90653-5.
9
On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.关于用于评估生存预测误差的删失权重的逆概率估计
PLoS One. 2025 Jan 31;20(1):e0318349. doi: 10.1371/journal.pone.0318349. eCollection 2025.
10
Challenges in multinational rare disease clinical studies during COVID-19: regulatory assessment of cipaglucosidase alfa plus miglustat in adults with late-onset Pompe disease.2019冠状病毒病期间跨国罕见病临床研究面临的挑战:阿加糖苷酶α联合米格列醇治疗晚发型庞贝病成人患者的监管评估
J Neurol. 2025 Jan 7;272(1):103. doi: 10.1007/s00415-024-12843-x.
Biostatistics. 2009 Jul;10(3):550-60. doi: 10.1093/biostatistics/kxp011. Epub 2009 Apr 20.
4
Adapting prediction error estimates for biased complexity selection in high-dimensional bootstrap samples.在高维自助抽样样本中针对有偏复杂度选择调整预测误差估计值。
Stat Appl Genet Mol Biol. 2008;7(1):Article12. doi: 10.2202/1544-6115.1346. Epub 2008 Mar 14.
5
Sex differences in stroke survival: 10-year follow-up of the Copenhagen stroke study cohort.中风存活的性别差异:哥本哈根中风研究队列的10年随访
J Stroke Cerebrovasc Dis. 2005 Sep-Oct;14(5):215-20. doi: 10.1016/j.jstrokecerebrovasdis.2005.06.002.
6
Efron-type measures of prediction error for survival analysis.用于生存分析的预测误差的埃弗龙型度量。
Biometrics. 2007 Dec;63(4):1283-7. doi: 10.1111/j.1541-0420.2007.00832.x. Epub 2007 Jul 25.
7
A comparison of bootstrap methods and an adjusted bootstrap approach for estimating the prediction error in microarray classification.用于估计微阵列分类中预测误差的自助法与调整后的自助法的比较。
Stat Med. 2007 Dec 20;26(29):5320-34. doi: 10.1002/sim.2968.
8
Consistent estimation of the expected Brier score in general survival models with right-censored event times.在具有右删失事件时间的一般生存模型中对预期Brier评分进行一致估计。
Biom J. 2006 Dec;48(6):1029-40. doi: 10.1002/bimj.200610301.
9
Survival ensembles.生存集成法。
Biostatistics. 2006 Jul;7(3):355-73. doi: 10.1093/biostatistics/kxj011. Epub 2005 Dec 12.
10
Prediction error estimation: a comparison of resampling methods.预测误差估计:重采样方法的比较
Bioinformatics. 2005 Aug 1;21(15):3301-7. doi: 10.1093/bioinformatics/bti499. Epub 2005 May 19.