Li Qian, Wang Zhichao, Liu Shaowu, Li Gang, Xu Guandong
IEEE Trans Neural Netw Learn Syst. 2023 Aug;34(8):4083-4095. doi: 10.1109/TNNLS.2021.3118542. Epub 2023 Aug 4.
Treatment effect estimation helps answer questions, such as whether a specific treatment affects the outcome of interest. One fundamental issue in this research is to alleviate the treatment assignment bias among those treated units and controlled units. Classical causal inference methods resort to the propensity score estimation, which unfortunately tends to be misspecified when only limited overlapping exists between the treated and the controlled units. Moreover, existing supervised methods mainly consider the treatment assignment information underlying the factual space, and thus, their performance of counterfactual inference may be degraded due to overfitting of the factual results. To alleviate those issues, we build on the optimal transport theory and propose a novel causal optimal transport (CausalOT) model to estimate an individual treatment effect (ITE). With the proposed propensity measure, CausalOT can infer the counterfactual outcome by solving a novel regularized optimal transport problem, which allows the utilization of global information on observational covariates to alleviate the issue of limited overlapping. In addition, a novel counterfactual loss is designed for CausalOT to align the factual outcome distribution with the counterfactual outcome distribution. Most importantly, we prove the theoretical generalization bound for the counterfactual error of CausalOT. Empirical studies on benchmark datasets confirm that the proposed CausalOT outperforms state-of-the-art causal inference methods.
治疗效果估计有助于回答一些问题,比如特定治疗是否会影响感兴趣的结果。该研究中的一个基本问题是减轻治疗组和对照组中治疗分配的偏差。经典的因果推断方法依赖于倾向得分估计,然而,当治疗组和对照组之间仅存在有限重叠时,这种方法往往会出现错误设定。此外,现有的监督方法主要考虑事实空间中的治疗分配信息,因此,由于对事实结果的过拟合,它们的反事实推断性能可能会下降。为了缓解这些问题,我们基于最优传输理论,提出了一种新颖的因果最优传输(CausalOT)模型来估计个体治疗效果(ITE)。通过所提出的倾向度量,CausalOT可以通过解决一个新颖的正则化最优传输问题来推断反事实结果,这允许利用观测协变量的全局信息来缓解有限重叠的问题。此外,为CausalOT设计了一种新颖的反事实损失,以使事实结果分布与反事实结果分布对齐。最重要的是,我们证明了CausalOT反事实误差的理论泛化界。在基准数据集上的实证研究证实,所提出的CausalOT优于现有的因果推断方法。