Cattaneo Matias D, Feng Yingjie, Titiunik Rocio
Department of Operations Research and Financial Engineering, Princeton University.
School of Economics and Management, Tsinghua University.
J Am Stat Assoc. 2021;116(536):1865-1880. doi: 10.1080/01621459.2021.1979561. Epub 2021 Dec 2.
Uncertainty quantification is a fundamental problem in the analysis and interpretation of synthetic control (SC) methods. We develop conditional prediction intervals in the SC framework, and provide conditions under which these intervals offer finite-sample probability guarantees. Our method allows for covariate adjustment and non-stationary data. The construction begins by noting that the statistical uncertainty of the SC prediction is governed by two distinct sources of randomness: one coming from the construction of the (likely misspecified) SC weights in the pre-treatment period, and the other coming from the unobservable stochastic error in the post-treatment period when the treatment effect is analyzed. Accordingly, our proposed prediction intervals are constructed taking into account both sources of randomness. For implementation, we propose a simulation-based approach along with finite-sample-based probability bound arguments, naturally leading to principled sensitivity analysis methods. We illustrate the numerical performance of our methods using empirical applications and a small simulation study. Python, R and Stata software packages implementing our methodology are available.
不确定性量化是合成控制(SC)方法分析与解释中的一个基本问题。我们在SC框架中开发了条件预测区间,并给出了这些区间提供有限样本概率保证的条件。我们的方法允许进行协变量调整和处理非平稳数据。构建过程始于注意到SC预测的统计不确定性由两个不同的随机源决定:一个来自预处理期(可能设定错误的)SC权重的构建,另一个来自分析处理效应时处理期不可观测的随机误差。因此,我们提出的预测区间在构建时考虑了这两个随机源。为了实现,我们提出了一种基于模拟的方法以及基于有限样本的概率界论证,自然地引出了有原则的敏感性分析方法。我们通过实证应用和一个小型模拟研究来说明我们方法的数值性能。实现我们方法的Python、R和Stata软件包均可获取。