A Selective Review on Random Survival Forests for High Dimensional Data.

Authors

Wang Hong, Li Gang

Affiliations

School of Mathematics and Statistics, Central South University, Hunan 410083, China.

Department of Biostatistics and Biomathematics, School of Public Health, University of California at Los Angeles, CA 90095, USA.

Publication information

Quant Biosci. 2017;36(2):85-96. doi: 10.22283/qbs.2017.36.2.85.

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6364686/

Abstract

Over the past decades, there has been considerable interest in applying statistical machine learning methods to survival analysis. Ensemble-based approaches, especially random survival forests, have been developed in a variety of contexts because of their high precision and non-parametric nature. This article aims to provide a timely review of recent developments in, and applications of, random survival forests for time-to-event data with high-dimensional covariates. The review begins with an introduction to the random survival forest framework, followed by a survey of recent developments in splitting criteria, variable selection, and other advanced topics for random survival forests in high-dimensional settings. We also discuss potential directions for future research.
