Binder Nadine, Herrnböck Anne-Sophie, Schumacher Martin
Freiburg Center for Data Analysis and Modeling, University of Freiburg, Eckerstr. 1, 79104, Freiburg, Germany.
Institute for Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Stefan-Meier-Str. 26, 79104, Freiburg, Germany.
Biom J. 2017 Mar;59(2):251-269. doi: 10.1002/bimj.201500167. Epub 2016 Nov 21.
In clinical and epidemiological studies information on the primary outcome of interest, that is, the disease status, is usually collected at a limited number of follow-up visits. The disease status can often only be retrieved retrospectively in individuals who are alive at follow-up, but will be missing for those who died before. Right-censoring the death cases at the last visit (ad-hoc analysis) yields biased hazard ratio estimates of a potential risk factor, and the bias can be substantial and occur in either direction. In this work, we investigate three different approaches that use the same likelihood contributions derived from an illness-death multistate model in order to more adequately estimate the hazard ratio by including the death cases into the analysis: a parametric approach, a penalized likelihood approach, and an imputation-based approach. We investigate to which extent these approaches allow for an unbiased regression analysis by evaluating their performance in simulation studies and on a real data example. In doing so, we use the full cohort with complete illness-death data as reference and artificially induce missing information due to death by setting discrete follow-up visits. Compared to an ad-hoc analysis, all considered approaches provide less biased or even unbiased results, depending on the situation studied. In the real data example, the parametric approach is seen to be too restrictive, whereas the imputation-based approach could almost reconstruct the original event history information.
在临床和流行病学研究中,关于主要关注结局(即疾病状态)的信息通常是在有限次数的随访中收集的。疾病状态往往只能在随访时仍存活的个体中进行回顾性获取,而对于那些在随访前死亡的个体则会缺失该信息。在最后一次访视时对死亡病例进行右删失(即临时分析)会导致对潜在风险因素的风险比估计产生偏差,而且偏差可能很大,并且偏差方向不定。在这项研究中,我们探究了三种不同的方法,这些方法使用源自疾病 - 死亡多状态模型的相同似然贡献,以便通过将死亡病例纳入分析来更充分地估计风险比:一种参数方法、一种惩罚似然方法和一种基于插补的方法。我们通过在模拟研究和一个实际数据示例中评估它们的性能,来探究这些方法在多大程度上能够实现无偏回归分析。在此过程中,我们将具有完整疾病 - 死亡数据的全队列作为参考,并通过设置离散的随访访视人为地引入因死亡导致的缺失信息。与临时分析相比,根据所研究的情况,所有考虑的方法都能提供偏差较小甚至无偏的结果。在实际数据示例中,参数方法显得过于受限,而基于插补的方法几乎可以重建原始事件历史信息。