Zheng Yu, Cai Tianxi
Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts 02115, U.S.A.
Biometrics. 2017 Dec;73(4):1169-1178. doi: 10.1111/biom.12683. Epub 2017 Mar 10.
Reliable and accurate risk prediction is fundamental for successful management of clinical conditions. Estimating comprehensive risk prediction models precisely, however, is a difficult task, especially when the outcome of interest is time to a rare event and the number of candidate predictors, p, is not very small. Another challenge in developing accurate risk models arises from potential model misspecification. Time-specific generalized linear models estimated with inverse censoring probability weighting are robust to model misspecification, but may be inefficient in the rare event setting. To improve the efficiency of such robust estimation procedures, various augmentation methods have been proposed in the literature. These procedures can also leverage auxiliary variables such as intermediate outcomes that are predictive of event risk. However, most existing methods do not perform well in the rare event setting, especially when p is not small. In this article, we propose a two-step, imputation-based augmentation procedure that can improve estimation efficiency and that is robust to model misspecification. We also develop regularized augmentation procedures for settings where p is not small, along with procedures to improve the estimation of individualized treatment effect in risk reduction. Numerical studies suggest that our proposed methods substantially outperform existing methods in efficiency gains. The proposed methods are applied to an AIDS clinical trial for treating HIV-infected patients.
可靠且准确的风险预测是成功管理临床病症的基础。然而,精确估计综合风险预测模型是一项艰巨的任务,尤其是当关注的结果是罕见事件发生的时间且候选预测变量的数量(p)不是很小时。开发准确风险模型的另一个挑战源于潜在的模型误设。使用逆删失概率加权估计的特定时间广义线性模型对模型误设具有鲁棒性,但在罕见事件情况下可能效率不高。为了提高此类鲁棒估计程序的效率,文献中提出了各种增强方法。这些程序还可以利用辅助变量,如预测事件风险的中间结果。然而,大多数现有方法在罕见事件情况下表现不佳,尤其是当(p)不小的时候。在本文中,我们提出了一种基于插补的两步增强程序,它可以提高估计效率并且对模型误设具有鲁棒性。我们还针对(p)不小的情况开发了正则化增强程序,以及改进风险降低中个体化治疗效果估计的程序。数值研究表明,我们提出的方法在效率提升方面明显优于现有方法。所提出的方法应用于一项治疗艾滋病毒感染患者的艾滋病临床试验。