Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, USA.
Department of Biomedical Data Science, Dartmouth College, Hanover, New Hampshire, USA.
Genet Epidemiol. 2022 Feb;46(1):17-31. doi: 10.1002/gepi.22435. Epub 2021 Oct 21.
Mendelian randomization (MR) is an application of instrumental variable (IV) methods to observational data in which the IV is a genetic variant. MR methods applicable to the general exponential family of distributions are currently not well characterized. We adapt a general linear model framework to the IV setting and propose a general MR method applicable to any full-rank distribution from the exponential family. Empirical bias and coverage are estimated via simulations. The proposed method is compared to several existing MR methods. Real data analyses are performed using data from the REGARDS study to estimate the potential causal effect of smoking frequency on stroke risk in African Americans. In simulations with binary variates and very weak instruments the proposed method had the lowest median [Q , Q ] bias (0.10 [-3.68 to 3.62]); compared with 2SPS (0.27 [-3.74 to 4.26]) and the Wald method (-0.69 [-1.72 to 0.35]). Low bias was observed throughout other simulation scenarios; as well as more than 90% coverage for the proposed method. In simulations with count variates, the proposed method performed comparably to 2SPS; the Wald method maintained the most consistent low bias; and 2SRI was biased towards the null. Real data analyses find no evidence for a causal effect of smoking frequency on stroke risk. The proposed MR method has low bias and acceptable coverage across a wide range of distributional scenarios and instrument strengths; and provides a more parsimonious framework for asymptotic hypothesis testing compared to existing two-stage procedures.
孟德尔随机化(MR)是一种将工具变量(IV)方法应用于观察性数据的方法,其中 IV 是一种遗传变异。目前,适用于一般指数分布族的 MR 方法尚未得到很好的描述。我们将一般线性模型框架应用于 IV 设置,并提出了一种适用于指数家族中任何满秩分布的通用 MR 方法。通过模拟估计经验偏差和覆盖范围。将提出的方法与几种现有的 MR 方法进行了比较。使用 REGARDS 研究的数据进行真实数据分析,以估计吸烟频率对非裔美国人中风风险的潜在因果影响。在具有二进制变量和非常弱的工具的模拟中,提出的方法具有最低的中位数[Q, Q ]偏差(0.10 [-3.68 至 3.62]);与 2SPS(0.27 [-3.74 至 4.26])和 Wald 方法(-0.69 [-1.72 至 0.35])相比。在其他模拟场景中观察到低偏差;以及超过 90%的拟议方法的覆盖率。在计数变量的模拟中,提出的方法与 2SPS 表现相当;Wald 方法保持了最一致的低偏差;而 2SRI 则偏向于零。真实数据分析没有发现吸烟频率对中风风险有因果影响的证据。提出的 MR 方法在广泛的分布情况和工具强度范围内具有低偏差和可接受的覆盖率;与现有的两阶段程序相比,为渐近假设检验提供了更简洁的框架。