Department of Statistics, University of Florida, Gainesville, Florida, USA.
Genetics Institute, University of Florida, Gainesville, Florida, USA.
Stat Med. 2022 Mar 15;41(6):933-949. doi: 10.1002/sim.9264. Epub 2022 Jan 11.
Semiparametric accelerated failure time (AFT) models are a useful alternative to Cox proportional hazards models, especially when the assumption of constant hazard ratios is untenable. However, rank-based criteria for fitting AFT models are often nondifferentiable, which poses a computational challenge in high-dimensional settings. In this article, we propose a new alternating direction method of multipliers algorithm for fitting semiparametric AFT models by minimizing a penalized rank-based loss function. Our algorithm scales well in both the number of subjects and number of predictors, and can easily accommodate a wide range of popular penalties. To improve the selection of tuning parameters, we propose a new criterion which avoids some common problems in cross-validation with censored responses. Through extensive simulation studies, we show that our algorithm and software is much faster than existing methods (which can only be applied to special cases), and we show that estimators which minimize a penalized rank-based criterion often outperform alternative estimators which minimize penalized weighted least squares criteria. Application to nine cancer datasets further demonstrates that rank-based estimators of semiparametric AFT models are competitive with estimators assuming proportional hazards in high-dimensional settings, whereas weighted least squares estimators are often not. A software package implementing the algorithm, along with a set of auxiliary functions, is available for download at github.com/ajmolstad/penAFT.
半参数加速失效时间(AFT)模型是 Cox 比例风险模型的一种有用替代方法,特别是在假设危险比不变时不可行的情况下。然而,用于拟合 AFT 模型的基于秩的准则通常不可微,这在高维环境中带来了计算挑战。在本文中,我们提出了一种新的交替方向乘子算法,通过最小化惩罚基于秩的损失函数来拟合半参数 AFT 模型。我们的算法在受试者数量和预测变量数量上都具有良好的扩展性,并且可以轻松适应广泛的流行惩罚。为了改善调整参数的选择,我们提出了一种新的准则,避免了带有删失响应的交叉验证中的一些常见问题。通过广泛的模拟研究,我们表明我们的算法和软件比现有方法(只能应用于特殊情况)快得多,并且表明最小化惩罚基于秩准则的估计量通常优于最小化惩罚加权最小二乘准则的替代估计量。对九个癌症数据集的应用进一步表明,在高维环境中,半参数 AFT 模型的基于秩的估计量与假设比例风险的估计量具有竞争力,而加权最小二乘估计量则往往不具有竞争力。实现该算法的软件包以及一组辅助函数可在 github.com/ajmolstad/penAFT 上下载。