Li Yun, Lee Yoonseok, Wolfe Robert A, Morgenstern Hal, Zhang Jinyao, Port Friedrich K, Robinson Bruce M
Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, 48109, U.S.A.; Arbor Research Collaborative for Health, Ann Arbor, MI, 48104, U.S.A.
Stat Med. 2015 Mar 30;34(7):1150-68. doi: 10.1002/sim.6404. Epub 2014 Dec 29.
Treatment preferences of groups (e.g., clinical centers) have often been proposed as instruments to control for unmeasured confounding-by-indication in instrumental variable (IV) analyses. However, formal evaluations of these group-preference-based instruments are lacking. Unique challenges include the following: (i) correlations between outcomes within groups; (ii) the multi-value nature of the instruments; (iii) unmeasured confounding occurring between and within groups. We introduce the framework of between-group and within-group confounding to assess assumptions required for the group-preference-based IV analyses. Our work illustrates that, when unmeasured confounding effects exist only within groups but not between groups, preference-based IVs can satisfy assumptions required for valid instruments. We then derive a closed-form expression of asymptotic bias of the two-stage generalized ordinary least squares estimator when the IVs are valid. Simulations demonstrate that the asymptotic bias formula approximates bias in finite samples quite well, particularly when the number of groups is moderate to large. The bias formula shows that when the cluster size is finite, the IV estimator is asymptotically biased; only when both the number of groups and cluster size go to infinity, the bias disappears. However, the IV estimator remains advantageous in reducing bias from confounding-by-indication. The bias assessment provides practical guidance for preference-based IV analyses. To increase their performance, one should adjust for as many measured confounders as possible, consider groups that have the most random variation in treatment assignment and increase cluster size. To minimize the likelihood for these IVs to be invalid, one should minimize unmeasured between-group confounding.
在工具变量(IV)分析中,常常有人提出将群体(如临床中心)的治疗偏好作为控制未测量的指示性混杂因素的工具。然而,目前缺乏对这些基于群体偏好的工具进行正式评估。独特的挑战包括:(i)组内结果之间的相关性;(ii)工具的多值性质;(iii)组间和组内存在未测量的混杂因素。我们引入组间和组内混杂的框架来评估基于群体偏好的IV分析所需的假设。我们的研究表明,当未测量的混杂效应仅存在于组内而非组间时,基于偏好的IV可以满足有效工具所需的假设。然后,我们推导了IV有效时两阶段广义普通最小二乘估计量的渐近偏差的闭式表达式。模拟结果表明,渐近偏差公式在有限样本中能很好地近似偏差,特别是当组数为中等至较大时。偏差公式表明,当聚类大小有限时,IV估计量存在渐近偏差;只有当组数和聚类大小都趋于无穷大时,偏差才会消失。然而,IV估计量在减少指示性混杂偏差方面仍然具有优势。偏差评估为基于偏好的IV分析提供了实际指导。为了提高其性能,应尽可能多地调整已测量的混杂因素,考虑治疗分配中随机变化最大的组,并增加聚类大小。为了最小化这些IV无效的可能性,应尽量减少未测量的组间混杂。