van der Laan Mark, Gruber Susan
Int J Biostat. 2016 May 1;12(1):351-78. doi: 10.1515/ijb-2015-0054.
Consider a study in which one observes n independent and identically distributed random variables whose probability distribution is known to be an element of a particular statistical model, and one is concerned with estimation of a particular real valued pathwise differentiable target parameter of this data probability distribution. The targeted maximum likelihood estimator (TMLE) is an asymptotically efficient substitution estimator obtained by constructing a so called least favorable parametric submodel through an initial estimator with score, at zero fluctuation of the initial estimator, that spans the efficient influence curve, and iteratively maximizing the corresponding parametric likelihood till no more updates occur, at which point the updated initial estimator solves the so called efficient influence curve equation. In this article we construct a one-dimensional universal least favorable submodel for which the TMLE only takes one step, and thereby requires minimal extra data fitting to achieve its goal of solving the efficient influence curve equation. We generalize these to universal least favorable submodels through the relevant part of the data distribution as required for targeted minimum loss-based estimation. Finally, remarkably, given a multidimensional target parameter, we develop a universal canonical one-dimensional submodel such that the one-step TMLE, only maximizing the log-likelihood over a univariate parameter, solves the multivariate efficient influence curve equation. This allows us to construct a one-step TMLE based on a one-dimensional parametric submodel through the initial estimator, that solves any multivariate desired set of estimating equations.
观察到(n)个独立同分布的随机变量,其概率分布已知是特定统计模型中的一个元素,并且关注该数据概率分布的特定实值逐点可微目标参数的估计。目标最大似然估计器(TMLE)是一种渐近有效替代估计器,它通过一个初始估计器构建一个所谓的最不利参数子模型来获得,该初始估计器在其零波动时具有得分,该得分跨越有效影响曲线,并迭代地最大化相应的参数似然,直到不再有更新发生,此时更新后的初始估计器求解所谓的有效影响曲线方程。在本文中,我们构建了一个一维通用最不利子模型,对于该子模型,TMLE只需要一步,从而只需最少的额外数据拟合就能实现其求解有效影响曲线方程的目标。我们根据基于目标最小损失估计所需的数据分布相关部分,将这些推广到通用最不利子模型。最后,值得注意的是,对于多维目标参数,我们开发了一个通用规范一维子模型,使得一步TMLE(仅在单变量参数上最大化对数似然)能够求解多元有效影响曲线方程。这使我们能够通过初始估计器基于一维参数子模型构建一步TMLE,该TMLE能求解任何多元期望的估计方程组。