Yi Yanyao, Ye Ting, Yu Menggang, Shao Jun
KLATASDS-MOE, School of Statistics, East China Normal University, Shanghai, China.
Department of Statistics, University of Wisconsin, Madison, Wisconsin.
Biometrics. 2020 Jun;76(2):460-471. doi: 10.1111/biom.13155. Epub 2019 Nov 18.
Analysis with time-to-event data in clinical and epidemiological studies often encounters missing covariate values, and the missing at random assumption is commonly adopted, which assumes that missingness depends on the observed data, including the observed outcome which is the minimum of survival and censoring time. However, it is conceivable that in certain settings, missingness of covariate values is related to the survival time but not to the censoring time. This is especially so when covariate missingness is related to an unmeasured variable affected by the patient's illness and prognosis factors at baseline. If this is the case, then the covariate missingness is not at random as the survival time is censored, and it creates a challenge in data analysis. In this article, we propose an approach to deal with such survival-time-dependent covariate missingness based on the well known Cox proportional hazard model. Our method is based on inverse propensity weighting with the propensity estimated by nonparametric kernel regression. Our estimators are consistent and asymptotically normal, and their finite-sample performance is examined through simulation. An application to a real-data example is included for illustration.
在临床和流行病学研究中,对事件发生时间数据进行分析时常常会遇到协变量值缺失的情况,通常采用随机缺失假设,该假设认为缺失情况取决于观测数据,包括作为生存时间和删失时间最小值的观测结局。然而,可以想象在某些情况下,协变量值的缺失与生存时间相关,但与删失时间无关。当协变量缺失与一个在基线时受患者疾病和预后因素影响的未测量变量相关时,尤其如此。如果是这种情况,那么随着生存时间被删失,协变量缺失并非随机,这给数据分析带来了挑战。在本文中,我们基于著名的Cox比例风险模型提出一种方法来处理这种与生存时间相关的协变量缺失。我们的方法基于逆概率加权,其中倾向得分由非参数核回归估计。我们的估计量是一致的且渐近正态,并且通过模拟检验了它们的有限样本性能。还给出了一个实际数据例子的应用以作说明。