DeMaris Alfred
Bowling Green State University.
Psychol Methods. 2014 Sep;19(3):380-97. doi: 10.1037/a0037416. Epub 2014 Aug 11.
Unmeasured confounding is the principal threat to unbiased estimation of treatment "effects" (i.e., regression parameters for binary regressors) in nonexperimental research. It refers to unmeasured characteristics of individuals that lead them both to be in a particular "treatment" category and to register higher or lower values than others on a response variable. In this article, I introduce readers to 2 econometric techniques designed to control the problem, with a particular emphasis on the Heckman selection model (HSM). Both techniques can be used with only cross-sectional data. Using a Monte Carlo experiment, I compare the performance of instrumental-variable regression (IVR) and HSM to that of ordinary least squares (OLS) under conditions with treatment and unmeasured confounding both present and absent. I find HSM generally to outperform IVR with respect to mean-square-error of treatment estimates, as well as power for detecting either a treatment effect or unobserved confounding. However, both HSM and IVR require a large sample to be fully effective. The use of HSM and IVR in tandem with OLS to untangle unobserved confounding bias in cross-sectional data is further demonstrated with an empirical application. Using data from the 2006-2010 General Social Survey (National Opinion Research Center, 2014), I examine the association between being married and subjective well-being.
在非实验性研究中,未测量的混杂因素是对治疗“效果”(即二元回归变量的回归参数)进行无偏估计的主要威胁。它指的是个体的未测量特征,这些特征使他们既处于特定的“治疗”类别中,又在响应变量上比其他人记录更高或更低的值。在本文中,我向读者介绍两种旨在控制该问题的计量经济学技术,特别强调赫克曼选择模型(HSM)。这两种技术都只能用于横截面数据。通过蒙特卡罗实验,我比较了工具变量回归(IVR)和HSM与普通最小二乘法(OLS)在存在和不存在治疗及未测量混杂因素的条件下的性能。我发现,就治疗估计的均方误差以及检测治疗效果或未观察到的混杂因素的功效而言,HSM通常优于IVR。然而,HSM和IVR都需要大样本才能充分发挥作用。通过一个实证应用,进一步展示了将HSM和IVR与OLS结合使用以消除横截面数据中未观察到的混杂偏差的情况。利用2006 - 2010年综合社会调查(国家民意研究中心,2014年)的数据,我研究了婚姻状况与主观幸福感之间的关联。