Division of Epidemiology and Biostatistics, College of Public Health, University of Arizona, Tucson, AZ 85724, USA.
BMC Med Res Methodol. 2009 Sep 29;9:66. doi: 10.1186/1471-2288-9-66.
Due to early colonoscopy for some participants, interval-censored observations can be introduced into the data of a colorectal polyp prevention trial. The censoring could be dependent of risk of recurrence if the reasons of having early colonoscopy are associated with recurrence. This can complicate estimation of the recurrence rate.
We propose to use midpoint imputation to convert interval-censored data problems to right censored data problems. To adjust for potential dependent censoring, we use information from auxiliary variables to define risk groups to perform the weighted Kaplan-Meier estimation to the midpoint imputed data. The risk groups are defined using two risk scores derived from two working proportional hazards models with the auxiliary variables as the covariates. One is for the recurrence time and the other is for the censoring time. The method described here is explored by simulation and illustrated with an example from a colorectal polyp prevention trial.
We first show that midpoint imputation under an assumption of independent censoring will produce an unbiased estimate of recurrence rate at the end of the trial, which is often the main interest of a colorectal polyp prevention trial, and then show in simulations that the weighted Kaplan-Meier method using the information from auxiliary variables based on the midpoint imputed data can improve efficiency in a situation with independent censoring and reduce bias in a situation with dependent censoring compared to the conventional methods, while estimating the recurrence rate at the end of the trial.
The research in this paper uses midpoint imputation to handle interval-censored observations and then uses the information from auxiliary variables to adjust for dependent censoring by incorporating them into the weighted Kaplan-Meier estimation. This approach can handle a situation with multiple auxiliary variables by deriving two risk scores from two working PH models. Although the idea of this approach might appear simple, the results do show that the weighted Kaplan-Meier approach can gain efficiency and reduce bias due to dependent censoring.
由于部分参与者进行了早期结肠镜检查,结直肠息肉预防试验的数据中可能会出现区间删失观测。如果早期结肠镜检查的原因与复发相关,那么删失可能是依赖于复发风险的。这会使复发率的估计变得复杂。
我们建议使用中点插补将区间删失数据问题转换为右删失数据问题。为了调整潜在的依赖删失,我们使用辅助变量的信息来定义风险组,对中点插补数据进行加权 Kaplan-Meier 估计。风险组是使用两个源自两个带有辅助变量作为协变量的工作比例风险模型的风险评分来定义的。一个用于复发时间,另一个用于删失时间。这里描述的方法通过模拟进行了探索,并通过结直肠息肉预防试验的一个实例进行了说明。
我们首先证明了在独立删失假设下,中点插补将产生试验结束时的复发率的无偏估计,这通常是结直肠息肉预防试验的主要关注点,然后在模拟中表明,使用基于中点插补数据的辅助变量信息的加权 Kaplan-Meier 方法可以在独立删失情况下提高效率,在依赖删失情况下减少偏倚,同时估计试验结束时的复发率。
本文的研究使用中点插补处理区间删失观测,然后使用辅助变量的信息通过将其纳入加权 Kaplan-Meier 估计来调整依赖删失。通过从两个工作 PH 模型中得出两个风险评分,该方法可以处理具有多个辅助变量的情况。虽然这种方法的想法似乎很简单,但结果确实表明,加权 Kaplan-Meier 方法可以由于依赖删失而获得效率和减少偏差。