Chen Jacob M, Malinsky Daniel, Bhattacharya Rohit
Department of Computer Science, Williams College.
Department of Biostatistics, Columbia University.
Proc Mach Learn Res. 2023 Aug;216:358-368.
We consider missingness in the context of causal inference when the outcome of interest may be missing. If the outcome directly affects its own missingness status, i.e., it is "self-censoring", this may lead to severely biased causal effect estimates. Miao et al. [2015] proposed the shadow variable method to correct for bias due to self-censoring; however, verifying the required model assumptions can be difficult. Here, we propose a test based on a randomized incentive variable offered to encourage reporting of the outcome that can be used to verify identification assumptions that are sufficient to correct for both self-censoring and confounding bias. Concretely, the test confirms whether a given set of pre-treatment covariates is sufficient to block all backdoor paths between the treatment and outcome as well as all paths between the treatment and missingness indicator after conditioning on the outcome. We show that under these conditions, the causal effect is identified by using the treatment as a shadow variable, and it leads to an intuitive inverse probability weighting estimator that uses a product of the treatment and response weights. We evaluate the efficacy of our test and downstream estimator via simulations.
当感兴趣的结果可能缺失时,我们在因果推断的背景下考虑缺失问题。如果结果直接影响其自身的缺失状态,即它是“自我删失”的,这可能会导致因果效应估计出现严重偏差。Miao等人[2015年]提出了影子变量方法来校正由于自我删失导致的偏差;然而,验证所需的模型假设可能很困难。在此,我们提出一种基于提供随机激励变量的检验方法,以鼓励报告结果,该检验可用于验证足以校正自我删失和混杂偏差的识别假设。具体而言,该检验确认给定的一组预处理协变量是否足以阻断处理与结果之间的所有后门路径以及在以结果为条件后处理与缺失指示符之间的所有路径。我们表明,在这些条件下,通过将处理用作影子变量来识别因果效应,并且它会导致一个直观的逆概率加权估计量,该估计量使用处理权重和响应权重的乘积。我们通过模拟评估我们的检验和下游估计量的功效。