Sun Yifei, Wang Mei-Cheng
Department of Biostatistics, School of Public Health, Johns Hopkins University, Baltimore, MD 21205 (
J Am Stat Assoc. 2017;112(518):745-756. doi: 10.1080/01621459.2016.1166113. Epub 2017 Apr 12.
In follow-up studies, utility marker measurements are usually collected upon the occurrence of recurrent events until a terminal event such as death takes place. In this article, we define the recurrent marker process to characterize utility accumulation over time. For example, with medical cost and repeated hospitalizations being treated as marker and recurrent events respectively, the recurrent marker process is the trajectory of cumulative cost, which stops to increase after death. In many applications, competing risks arise as subjects are at risk of more than one mutually exclusive terminal event, such as death from different causes, and modeling the recurrent marker process for each failure type is often of interest. However, censoring creates challenges in the methodological development, because for censored subjects, both failure type and recurrent marker process after censoring are unobserved. To circumvent this problem, we propose a nonparametric framework for recurrent marker process with competing terminal events. In the presence of competing risks, we start with an estimator by using marker information from uncensored subjects. As a result, the estimator can be inefficient under heavy censoring. To improve efficiency, we propose a second estimator by combining the first estimator with auxiliary information from the estimate under non-competing risks model. The large sample properties and optimality of the second estimator is established. Simulation studies and an application to the SEER-Medicare linked data are presented to illustrate the proposed methods. Supplemental materials are available online.
在随访研究中,效用指标测量通常在复发事件发生时收集,直至发生死亡等终末事件。在本文中,我们定义了复发指标过程来描述效用随时间的累积情况。例如,将医疗费用和反复住院分别视为指标和复发事件,复发指标过程就是累积费用的轨迹,在死亡后停止增加。在许多应用中,由于受试者面临不止一种相互排斥的终末事件的风险,如死于不同原因,对每种失败类型的复发指标过程进行建模通常很有意义。然而,删失在方法学发展中带来了挑战,因为对于删失的受试者,删失后的失败类型和复发指标过程都是未观察到的。为了规避这个问题,我们提出了一个用于具有竞争终末事件的复发指标过程的非参数框架。在存在竞争风险的情况下,我们首先使用未删失受试者的指标信息构建一个估计量。结果,在严重删失的情况下,该估计量可能效率不高。为了提高效率,我们通过将第一个估计量与来自非竞争风险模型下估计的辅助信息相结合,提出了第二个估计量。建立了第二个估计量的大样本性质和最优性。给出了模拟研究以及对SEER - Medicare链接数据的应用,以说明所提出的方法。补充材料可在线获取。