Chen Shuai, Zhao Hongwei
Department of Statistics, Texas A&M University, College Station, Texas, USA.
Department of Epidemiology and Biostatistics, Texas A&M Health Science Center, College Station, Texas, USA.
J Stat Theory Pract. 2013 Apr 1;7(2). doi: 10.1080/15598608.2013.771556.
Medical cost estimation is a challenging task when censoring of data is present. Although researchers have proposed methods for estimating mean costs, these are often derived from theory and are not always easy to understand. We provide an alternative method, based on a replace-from-the-right algorithm, for estimating mean costs more efficiently. We show that our estimator is equivalent to an existing one that is based on the inverse probability weighting principle and semiparametric efficiency theory. We also propose an alternative method for estimating the survival function of costs, based on the redistribute-to-the-right algorithm, that was originally used for explaining the Kaplan-Meier estimator. We show that this second proposed estimator is equivalent to a simple weighted survival estimator of costs. Finally, we develop a more efficient survival estimator of costs, using the same redistribute-to-the-right principle. This estimator is naturally monotone, more efficient than some existing survival estimators, and has a quite small bias in many realistic settings. We conduct numerical studies to examine the finite sample property of the survival estimators for costs, and show that our new estimator has small mean squared errors when the sample size is not too large. We apply both existing and new estimators to a data example from a randomized cardiovascular clinical trial.
当存在数据删失时,医疗成本估计是一项具有挑战性的任务。尽管研究人员已经提出了估计平均成本的方法,但这些方法往往源于理论,并不总是易于理解。我们提供了一种基于从右替换算法的替代方法,以更有效地估计平均成本。我们表明,我们的估计器等同于基于逆概率加权原理和半参数效率理论的现有估计器。我们还提出了一种基于向右重新分配算法的估计成本生存函数的替代方法,该算法最初用于解释Kaplan-Meier估计器。我们表明,这第二个提出的估计器等同于成本的简单加权生存估计器。最后,我们使用相同的向右重新分配原理开发了一种更有效的成本生存估计器。该估计器自然是单调的,比一些现有的生存估计器更有效,并且在许多实际情况下偏差相当小。我们进行了数值研究,以检验成本生存估计器的有限样本性质,并表明当样本量不是太大时,我们的新估计器具有较小的均方误差。我们将现有估计器和新估计器都应用于一项随机心血管临床试验的数据示例。