Mao Lu
Department of Biostatistics and Medical Informatics, 207A WARF Office Building, 610 Walnut St., University of Wisconsin-Madison.
Electron J Stat. 2024;18(2):4640-4666. doi: 10.1214/24-ejs2311. Epub 2024 Nov 22.
The marginal inference of an outcome variable can be improved by closely related covariates with a structured distribution. This differs from standard covariate adjustment in randomized trials, which exploits covariate-treatment independence rather than knowledge on the covariate distribution. Yet it can also be done robustly against misspecification of the outcome-covariate relationship. Starting with a standard estimating function involving only the outcome, we first use a working regression model to compute its conditional expectation given the covariates, and then remove the uninformative part under the covariate distribution model. This effectively projects the initial function onto the joint tangent space of the full data, thereby achieving local efficiency when the regression model is correct. Importantly, even with a faulty working model, the estimator remains unbiased as the subtracted term is always asymptotically centered. Further improvement is possible if the outcome distribution also has its own structure. To demonstrate the process, we consider three examples: one with fully parametric covariates, one with a covariate following a partial parametric model against others, and another with mutually independent covariates.
通过具有结构化分布的密切相关协变量,可以改善结果变量的边际推断。这与随机试验中的标准协变量调整不同,后者利用协变量与治疗的独立性,而非协变量分布的知识。然而,它也可以针对结果 - 协变量关系的错误设定进行稳健处理。从仅涉及结果的标准估计函数开始,我们首先使用工作回归模型来计算给定协变量时它的条件期望,然后在协变量分布模型下去除无信息部分。这有效地将初始函数投影到完整数据的联合切空间上,从而在回归模型正确时实现局部效率。重要的是,即使工作模型有缺陷,估计量仍然是无偏的,因为减去的项总是渐近中心化的。如果结果分布也有其自身结构,则可能会有进一步的改进。为了演示这个过程,我们考虑三个例子:一个具有完全参数化协变量,一个具有相对于其他变量遵循部分参数模型的协变量,另一个具有相互独立的协变量。