Department of Radiological Science, Oncology and Human Pathology, "Sapienza" University of Rome, Policlinico Umberto I, 00161 Rome, Italy.
Alef-Advanced Laboratory Economics and Finance, 00198 Rome, Italy.
Curr Oncol. 2023 Feb 8;30(2):2105-2126. doi: 10.3390/curroncol30020163.
We address the problem of how COVID-19 deaths observed in an oncology clinical trial can be consistently taken into account in typical survival estimates. We refer to oncological patients since there is empirical evidence of strong correlation between COVID-19 and cancer deaths, which implies that COVID-19 deaths cannot be treated simply as non-informative censoring, a property usually required by the classical survival estimators. We consider the problem in the framework of the widely used Kaplan-Meier (KM) estimator. Through a counterfactual approach, an algorithmic method is developed allowing to include COVID-19 deaths in the observed data by mean-imputation. The procedure can be seen in the class of the (EM) algorithms and will be referred to as (CoDMI) algorithm. We discuss the CoDMI underlying assumptions and the convergence issue. The algorithm provides a completed lifetime data set, where each Covid-death time is replaced by a point estimate of the corresponding virtual lifetime. This complete data set is naturally equipped with the corresponding KM survival function estimate and all available statistical tools can be applied to these data. However, mean-imputation requires an increased variance of the estimates. We then propose a natural extension of the classical Greenwood's formula, thus obtaining expanded confidence intervals for the survival function estimate. To illustrate how the algorithm works, CoDMI is applied to real medical data extended by the addition of artificial Covid-death observations. The results are compared with the estimates provided by the two naïve approaches which count COVID-19 deaths as censoring or as deaths by the disease under study. In order to evaluate the predictive performances of CoDMI an extensive simulation study is carried out. The results indicate that in the simulated scenarios CoDMI is roughly unbiased and outperforms the estimates obtained by the naïve approaches. A user-friendly version of CoDMI programmed in R is freely available.
我们解决了如何在典型的生存估计中一致考虑肿瘤学临床试验中观察到的 COVID-19 死亡的问题。我们提到肿瘤患者,因为有经验证据表明 COVID-19 与癌症死亡之间存在很强的相关性,这意味着 COVID-19 死亡不能简单地视为非信息性删失,这是经典生存估计器通常需要的属性。我们在广泛使用的 Kaplan-Meier(KM)估计器的框架内考虑这个问题。通过反事实方法,开发了一种算法方法,允许通过均值插补将 COVID-19 死亡纳入观察数据。该过程可以看作是(EM)算法类中的一种,并将其称为(CoDMI)算法。我们讨论了 CoDMI 的基本假设和收敛问题。该算法提供了一个完整的生存数据集,其中每个 Covid 死亡时间都由相应虚拟寿命的点估计值替换。这个完整的数据集自然配备了相应的 KM 生存函数估计值,并且可以将所有可用的统计工具应用于这些数据。然而,均值插补需要增加估计值的方差。然后,我们提出了经典 Greenwood 公式的自然扩展,从而获得了生存函数估计值的扩展置信区间。为了说明算法的工作原理,将 CoDMI 应用于通过添加人工 Covid 死亡观察值扩展的真实医学数据。将结果与将 COVID-19 死亡视为删失或研究疾病死亡的两种简单方法提供的估计值进行比较。为了评估 CoDMI 的预测性能,进行了广泛的模拟研究。结果表明,在模拟场景中,CoDMI 大致无偏并且优于简单方法提供的估计值。CoDMI 的一个用户友好版本以 R 语言编写,可免费使用。