Groll Andreas, Hastie Trevor, Tutz Gerhard
Department of Statistics, Ludwig-Maximilians-University Munich, Akademiestraße 1, 80799 Munich, Germany.
Department of Statistics, University of Stanford, 390 Serra Mall, Sequoia Hall, California 94305, U.S.A.
Biometrics. 2017 Sep;73(3):846-856. doi: 10.1111/biom.12637. Epub 2017 Jan 13.
In all sorts of regression problems, it has become more and more important to deal with high-dimensional data with lots of potentially influential covariates. A possible solution is to apply estimation methods that aim at the detection of the relevant effect structure by using penalization methods. In this article, the effect structure in the Cox frailty model, which is the most widely used model that accounts for heterogeneity in survival data, is investigated. Since in survival models one has to account for possible variation of the effect strength over time the selection of the relevant features has to distinguish between several cases, covariates can have time-varying effects, time-constant effects, or be irrelevant. A penalization approach is proposed that is able to distinguish between these types of effects to obtain a sparse representation that includes the relevant effects in a proper form. It is shown in simulations that the method works well. The method is applied to model the time until pregnancy, illustrating that the complexity of the influence structure can be strongly reduced by using the proposed penalty approach.
在各类回归问题中,处理具有大量潜在影响协变量的高维数据变得越来越重要。一种可能的解决方案是应用旨在通过使用惩罚方法来检测相关效应结构的估计方法。在本文中,研究了Cox脆弱模型中的效应结构,该模型是用于解释生存数据异质性的最广泛使用的模型。由于在生存模型中必须考虑效应强度随时间的可能变化,相关特征的选择必须区分几种情况,协变量可以具有随时间变化的效应、时间恒定的效应或不相关。提出了一种惩罚方法,该方法能够区分这些类型的效应,以获得以适当形式包含相关效应的稀疏表示。模拟结果表明该方法效果良好。该方法被应用于对怀孕前的时间进行建模,说明使用所提出的惩罚方法可以大大降低影响结构的复杂性。