Wang Peiyao, Lin Zhaotong, Pan Wei
Division of Biostatistics and Health Data Science, University of Minnesota, Minneapolis, MN, USA.
Department of Statistics, Florida State University, Tallahassee, FL, USA.
HGG Adv. 2025 Apr 10;6(2):100412. doi: 10.1016/j.xhgg.2025.100412. Epub 2025 Jan 30.
Mendelian randomization (MR) facilitates causal inference with observational data using publicly available genome-wide association study (GWAS) results. In a GWAS, one or more heritable covariates may be adjusted for to estimate the direct effects of SNPs on a focal trait or to improve statistical power, which may introduce collider bias in SNP-trait association estimates, thus affecting downstream MR analyses. Numerical studies suggested that using covariate-adjusted GWAS summary data might introduce bias in univariable Mendelian randomization (UVMR), which can be mitigated by multivariable Mendelian randomization (MVMR). However, it remains unclear and even mysterious why/how MVMR works; a rigorous theory is needed to explain and substantiate the above empirical observation. In this paper, we derive some analytical results when multiple covariates are adjusted for in the GWAS of exposure and/or the GWAS of outcome, thus supporting and explaining the empirical results. Our analytical results offer insights to how bias arises in UVMR and how it is avoided in MVMR, regardless of whether collider bias is present. We also consider applying UVMR or MVMR methods after collider-bias correction. We conducted extensive simulations to demonstrate that with covariate-adjusted GWAS summary data, MVMR had an advantage over UVMR by producing nearly unbiased causal estimates; however, in some situations it is advantageous to apply UVMR after bias correction. In real data analyses of the GWAS data with body mass index (BMI) being adjusted for metabolomic principal components, we examined the causal effect of BMI on blood pressure, confirming the above points.
孟德尔随机化(MR)利用公开可用的全基因组关联研究(GWAS)结果,促进对观测数据的因果推断。在GWAS中,可能会对一个或多个可遗传协变量进行调整,以估计单核苷酸多态性(SNP)对目标性状的直接影响或提高统计功效,这可能会在SNP-性状关联估计中引入对撞机偏差,从而影响下游的MR分析。数值研究表明,使用经协变量调整的GWAS汇总数据可能会在单变量孟德尔随机化(UVMR)中引入偏差,而多变量孟德尔随机化(MVMR)可以减轻这种偏差。然而,MVMR为何/如何起作用仍不明确甚至很神秘;需要一个严谨的理论来解释和证实上述实证观察结果。在本文中,当在暴露的GWAS和/或结局的GWAS中对多个协变量进行调整时,我们得出了一些分析结果,从而支持并解释了实证结果。我们的分析结果深入探讨了UVMR中偏差是如何产生的,以及在MVMR中如何避免偏差,无论是否存在对撞机偏差。我们还考虑在对撞机偏差校正后应用UVMR或MVMR方法。我们进行了广泛的模拟,以证明对于经协变量调整的GWAS汇总数据,MVMR通过产生几乎无偏差的因果估计,比UVMR具有优势;然而,在某些情况下,在偏差校正后应用UVMR是有利的。在对代谢组学主成分进行调整后的BMI的GWAS数据的实际数据分析中,我们研究了BMI对血压的因果效应,证实了上述观点。