Flexible boosting of accelerated failure time models.

Authors

Schmid Matthias, Hothorn Torsten

Affiliation

Institut für Medizininformatik, Biometrie und Epidemiologie, Friedrich-Alexander-Universität Erlangen-Nürnberg, Waldstrasse 6, D-91054 Erlangen, Germany.

Publication

BMC Bioinformatics. 2008 Jun 6;9:269. doi: 10.1186/1471-2105-9-269.

DOI:10.1186/1471-2105-9-269

PMID:18538026

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2453145/

Abstract

BACKGROUND

When boosting algorithms are used for building survival models from high-dimensional data, it is common to fit a Cox proportional hazards model or to use least squares techniques for fitting semiparametric accelerated failure time models. There are cases, however, where fitting a fully parametric accelerated failure time model is a good alternative to these methods, especially when the proportional hazards assumption is not justified. Boosting algorithms for the estimation of parametric accelerated failure time models have not been developed so far, since these models require the estimation of a model-specific scale parameter which traditional boosting algorithms are not able to deal with.
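To make the role of the scale parameter concrete, the block below writes out the textbook form of a parametric accelerated failure time model and its censored negative log-likelihood. The notation ($f$, $\sigma$, $W$, $\delta_i$, $z_i$) is an assumption chosen for illustration and is not taken from the abstract.

```latex
% AFT model: the log survival time is a predictor plus a scaled noise term
\log T = f(x) + \sigma W

% Observed data: y_i = \min(\log T_i, \log C_i), \quad \delta_i = 1 if the event was observed, 0 if censored
% Standardized residual:
z_i = \frac{y_i - f(x_i)}{\sigma}

% Negative log-likelihood (up to terms that do not depend on f or \sigma),
% with \varphi_W the density and S_W the survival function of W:
-\ell(f, \sigma) = -\sum_{i=1}^{n} \Big[ \delta_i \big( -\log \sigma + \log \varphi_W(z_i) \big)
                   + (1 - \delta_i) \log S_W(z_i) \Big]
```

Choosing $W$ as standard extreme value, logistic, or normal gives the Weibull, log-logistic, or log-normal model, respectively. Because $\sigma$ enters every term through $z_i$, the loss cannot be evaluated as a function of $f$ alone, which is the difficulty the paragraph above refers to.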

RESULTS

We introduce a new boosting algorithm for censored time-to-event data which is suitable for fitting parametric accelerated failure time models. Estimation of the predictor function is carried out simultaneously with the estimation of the scale parameter, so that the negative log likelihood of the survival distribution can be used as a loss function for the boosting algorithm. The estimation of the scale parameter does not affect the favorable properties of boosting with respect to variable selection.
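The idea of updating the predictor and the scale parameter together can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a Weibull model (extreme-value errors), componentwise linear base-learners, a step length of 0.1, and a golden-section search for the scale update. All function names (`weibull_nll`, `golden_section`, `boost_weibull_aft`, `simulate`) and the simulated data are hypothetical.

```python
import numpy as np

def weibull_nll(y, delta, f, sigma):
    """Negative log-likelihood of log-times y under a Weibull AFT model.
    delta is 1 for observed events and 0 for right-censored observations."""
    z = (y - f) / sigma
    return float(np.sum(delta * (np.log(sigma) - z) + np.exp(z)))

def golden_section(fun, lo, hi, tol=1e-6):
    """Minimise a unimodal one-dimensional function on [lo, hi]."""
    g = (np.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c, d = b - g * (b - a), a + g * (b - a)
    while b - a > tol:
        if fun(c) < fun(d):
            b, d = d, c
            c = b - g * (b - a)
        else:
            a, c = c, d
            d = a + g * (b - a)
    return (a + b) / 2.0

def boost_weibull_aft(X, y, delta, n_iter=300, nu=0.1):
    """Componentwise gradient boosting with a scale update in every iteration."""
    n, p = X.shape
    Z = np.column_stack([np.ones(n), X])  # column 0 acts as an intercept base-learner
    col_ss = np.sum(Z ** 2, axis=0)
    coef = np.zeros(p + 1)
    f = np.zeros(n)

    def update_log_sigma(f_now):
        # Assumption: search log(sigma) on a fixed interval, predictor held fixed.
        return golden_section(
            lambda s: weibull_nll(y, delta, f_now, np.exp(s)), -3.0, 3.0)

    log_sigma = update_log_sigma(f)
    path = []
    for _ in range(n_iter):
        sigma = np.exp(log_sigma)
        # Negative gradient of the loss with respect to the predictor f.
        u = (np.exp((y - f) / sigma) - delta) / sigma
        beta = Z.T @ u / col_ss                # least-squares fit of u on each column
        j = int(np.argmax(beta ** 2 * col_ss))  # column with the largest RSS reduction
        coef[j] += nu * beta[j]                # update only the selected component
        f = Z @ coef
        log_sigma = update_log_sigma(f)        # re-estimate the scale given the new f
        path.append(weibull_nll(y, delta, f, np.exp(log_sigma)))
    return coef, float(np.exp(log_sigma)), np.array(path)

def simulate(n=300, p=10, seed=0):
    """Hypothetical data: two informative covariates, true scale 1, random censoring."""
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, p))
    beta = np.zeros(p)
    beta[:2] = [1.0, -1.0]
    w = np.log(rng.exponential(size=n))        # standard extreme-value errors
    log_t = 1.0 + X @ beta + 1.0 * w
    log_c = 3.0 + np.log(rng.exponential(size=n))
    y = np.minimum(log_t, log_c)
    delta = (log_t <= log_c).astype(float)
    return X, y, delta

X, y, delta = simulate()
coef, sigma, path = boost_weibull_aft(X, y, delta)
print("scale estimate:", round(sigma, 3))
print("intercept and first two coefficients:", np.round(coef[:3], 3))
```

Because only one component is updated per iteration, covariates that never reduce the residual sum of squares the most keep a coefficient of exactly zero. This is how the sketch retains the variable-selection behaviour mentioned above while still estimating the scale.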

CONCLUSION

The analysis of a high-dimensional set of microarray data demonstrates that the new algorithm is able to outperform boosting with the Cox partial likelihood when the proportional hazards assumption is questionable. In low-dimensional settings, i.e., when classical likelihood estimation of a parametric accelerated failure time model is possible, simulations show that the new boosting algorithm closely approximates the estimates obtained from the maximum likelihood method.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1bd/2453145/304dd4c85654/1471-2105-9-269-1.jpg
