Wolock Charles J, Gilbert Peter B, Simon Noah, Carone Marco
Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania.
Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center.
J Comput Graph Stat. 2024;33(3):1098-1108. doi: 10.1080/10618600.2024.2304070. Epub 2024 Feb 13.
The conditional survival function of a time-to-event outcome subject to censoring and truncation is a common target of estimation in survival analysis. This parameter may be of scientific interest and also often appears as a nuisance in nonparametric and semiparametric problems. In addition to classical parametric and semiparametric methods (e.g., based on the Cox proportional hazards model), flexible machine learning approaches have been developed to estimate the conditional survival function. However, many of these methods are either implicitly or explicitly targeted toward risk stratification rather than overall survival function estimation. Others apply only to discrete-time settings or require inverse probability of censoring weights, which can be as difficult to estimate as the outcome survival function itself. Here, we employ a decomposition of the conditional survival function in terms of observable regression models in which censoring and truncation play no role. This allows application of an array of flexible regression and classification methods rather than only approaches that explicitly handle the complexities inherent to survival data. We outline estimation procedures based on this decomposition, empirically assess their performance, and demonstrate their use on data from an HIV vaccine trial. Supplementary materials for this article are available online.
对于受删失和截断影响的事件发生时间结局,其条件生存函数是生存分析中常见的估计目标。该参数可能具有科学研究价值,并且在非参数和半参数问题中也常常作为一个干扰因素出现。除了经典的参数和半参数方法(例如基于Cox比例风险模型)之外,还开发了灵活的机器学习方法来估计条件生存函数。然而,这些方法中的许多要么隐式地要么显式地针对风险分层,而不是总体生存函数估计。其他方法仅适用于离散时间设置,或者需要删失权重的逆概率,而这可能与结局生存函数本身一样难以估计。在此,我们根据可观测回归模型对条件生存函数进行分解,其中删失和截断不起作用。这允许应用一系列灵活的回归和分类方法,而不仅仅是那些明确处理生存数据固有复杂性的方法。我们概述了基于这种分解的估计程序,通过实证评估它们的性能,并展示它们在一项HIV疫苗试验数据中的应用。本文的补充材料可在线获取。