Wang Hong, Li Gang
School of Mathematics and Statistics, Central South University, Hunan 410083, China.
Department of Biostatistics and Biomathematics, School of Public Health, University of California at Los Angeles, CA 90095, USA.
Quant Biosci. 2017;36(2):85-96. doi: 10.22283/qbs.2017.36.2.85.
Over the past decades, there has been considerable interest in applying statistical machine learning methods in survival analysis. Ensemble based approaches, especially random survival forests, have been developed in a variety of contexts due to their high precision and non-parametric nature. This article aims to provide a timely review on recent developments and applications of random survival forests for time-to-event data with high dimensional covariates. This selective review begins with an introduction to the random survival forest framework, followed by a survey of recent developments on splitting criteria, variable selection, and other advanced topics of random survival forests for time-to-event data in high dimensional settings. We also discuss potential research directions for future research.
在过去几十年中,人们对将统计机器学习方法应用于生存分析产生了浓厚兴趣。基于集成的方法,特别是随机生存森林,由于其高精度和非参数性质,已在各种背景下得到发展。本文旨在及时综述随机生存森林在具有高维协变量的事件发生时间数据方面的最新进展和应用。这篇选择性综述首先介绍随机生存森林框架,随后概述在高维环境下针对事件发生时间数据的随机生存森林在分裂标准、变量选择及其他高级主题方面的最新进展。我们还讨论了未来研究的潜在方向。