Mandl Maximilian M, Boulesteix Anne-Laure, Burgess Stephen, Zuber Verena
Institute for Medical Information Processing, Biometry, and Epidemiology, Faculty of Medicine, Ludwig-Maximilians-Universität, München, Germany.
Munich Center for Machine Learning, Munich, Germany.
Stat Med. 2025 Jul;44(15-17):e70143. doi: 10.1002/sim.70143.
Mendelian randomization (MR) uses genetic variants as instrumental variables to infer causal effects of exposures on an outcome. One key assumption of MR is that the genetic variants used as instrumental variables are independent of the outcome conditional on the risk factor and unobserved confounders. Violations of this assumption, that is, the effect of the instrumental variables on the outcome through a path other than the risk factor included in the model (which can be caused by pleiotropy), are common phenomena in human genetics. Genetic variants, which deviate from this assumption, appear as outliers to the MR model fit and can be detected by the general heterogeneity statistics proposed in the literature, which are known to suffer from overdispersion, that is, too many genetic variants are declared as false outliers. We propose a method that corrects for overdispersion of the heterogeneity statistics in uni- and multivariable MR analysis by making use of the estimated inflation factor to correctly remove outlying instruments and therefore account for pleiotropic effects. Our method is applicable to summary-level data.
孟德尔随机化(MR)使用基因变异作为工具变量来推断暴露因素对结局的因果效应。MR的一个关键假设是,用作工具变量的基因变异在风险因素和未观察到的混杂因素的条件下与结局无关。违反这一假设,即工具变量通过模型中包含的风险因素以外的路径对结局产生影响(这可能由基因多效性引起),是人类遗传学中的常见现象。偏离这一假设的基因变异在MR模型拟合中表现为异常值,可以通过文献中提出的一般异质性统计量检测到,而这些统计量已知存在过度离散问题,即有太多基因变异被判定为假异常值。我们提出了一种方法,通过利用估计的膨胀因子来正确去除异常工具变量,从而校正单变量和多变量MR分析中异质性统计量的过度离散,进而考虑基因多效性效应。我们的方法适用于汇总水平的数据。