Štupnik Tomaž, Pohar Perme Maja
Department of Thoracic Surgery, Univerzitetni Klinični Center Ljubljana, Zaloška 7, SI-1000, Ljubljana, Slovenia.
Institute for Biostatistics and Medical Informatics, Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1104, Ljubljana, Slovenia.
Stat Med. 2016 Mar 30;35(7):1130-43. doi: 10.1002/sim.6766. Epub 2015 Oct 13.
When analyzing time to disease recurrence, we sometimes need to work with data where all the recurrences are recorded, but no information is available on the possible deaths. This may occur when studying diseases of benign nature where patients are only seen at disease recurrences or in poorly-designed registries of benign diseases or medical device implantations without sufficient patient identifiers to obtain their dead/alive status at a later date. When the average time to disease recurrence is long enough in comparison with the expected survival of the patients, statistical analysis of such data can be significantly biased. Under the assumption that the expected survival of an individual is not influenced by the disease itself, general population mortality tables may be used to remove this bias. We show why the intuitive solution of simply imputing the patient's expected survival time does not give unbiased estimates of the usual quantities of interest in survival analysis and further explain that cumulative incidence function analysis does not require additional assumptions on general population mortality. We provide an alternative framework that allows unbiased estimation and introduce two new approaches: an iterative imputation method and a mortality adjusted at risk function. Their properties are carefully studied, with the results supported by simulations and illustrated on a real-world example.
在分析疾病复发时间时,我们有时需要处理这样的数据:所有复发情况都有记录,但关于可能的死亡情况却没有相关信息。这种情况可能出现在研究良性疾病时,患者仅在疾病复发时接受检查;或者出现在设计不佳的良性疾病或医疗设备植入登记系统中,这些系统没有足够的患者标识符来获取患者后续的生死状态。当疾病复发的平均时间与患者的预期生存期相比足够长时,对此类数据的统计分析可能会产生显著偏差。在个体的预期生存期不受疾病本身影响这一假设下,可以使用一般人群死亡率表来消除这种偏差。我们展示了为何简单地估算患者预期生存时间这种直观的解决方法无法给出生存分析中常用感兴趣量的无偏估计,并进一步解释累积发病率函数分析不需要对一般人群死亡率做额外假设。我们提供了一个能进行无偏估计的替代框架,并引入了两种新方法:迭代插补法和风险调整死亡率函数。我们仔细研究了它们的性质,模拟结果支持了这些结果,并通过一个实际例子进行了说明。