Cui Erjia, Crainiceanu Ciprian M, Leroux Andrew
Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, USA.
Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, USA; Department of Biostatistics and Bioinformatics, University of Colorado, Anschutz Medical Campus, USA.
J Comput Graph Stat. 2021;30(3):780-793. doi: 10.1080/10618600.2020.1853550. Epub 2021 Jan 1.
We propose the Additive Functional Cox Model to flexibly quantify the association between functional covariates and time to event data. The model extends the linear functional proportional hazards model by allowing the association between the functional covariate and log hazard to vary non-linearly in both the functional domain and the value of the functional covariate. Additionally, we introduce critical transformations of the functional covariate which address the weak model identifiability in areas of information sparsity and discuss their impact on interpretation and inference. We also introduce a novel estimation procedure that accounts for identifiability constraints directly during model fitting. Methods are applied to the National Health and Nutrition Examination Survey (NHANES) 2003-2006 accelerometry data and quantify new and interpretable circadian patterns of physical activity that are associated with all-cause mortality. We also introduce a simple and novel simulation framework for generating survival data with functional predictors which resemble the observed data. The accompanying inferential R software is fast, open source and publicly available. Our data application and simulations are fully reproducible through the accompanying vignette.
我们提出了加性函数Cox模型,以灵活地量化函数协变量与事件发生时间数据之间的关联。该模型扩展了线性函数比例风险模型,允许函数协变量与对数风险之间的关联在函数域和函数协变量的值中呈非线性变化。此外,我们引入了函数协变量的关键变换,解决了信息稀疏区域中模型可识别性较弱的问题,并讨论了它们对解释和推断的影响。我们还引入了一种新颖的估计程序,在模型拟合过程中直接考虑可识别性约束。方法应用于2003 - 2006年美国国家健康与营养检查调查(NHANES)的加速度计数据,并量化了与全因死亡率相关的新的且可解释的身体活动昼夜模式。我们还引入了一个简单新颖的模拟框架,用于生成具有类似于观测数据的函数预测变量的生存数据。随附的用于推断的R软件快速、开源且可公开获取。通过随附的 vignette,我们的数据应用和模拟是完全可重现的。