Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA.
Center for Perinatal, Pediatric and Environmental Epidemiology, Yale School of Public Health, New Haven, CT, USA.
J Hum Genet. 2024 Jul;69(7):301-309. doi: 10.1038/s10038-024-01232-x. Epub 2024 Mar 25.
Identification of pleiotropy at the single nucleotide polymorphism (SNP) level provides valuable insights into shared genetic signals among phenotypes. One approach to study these signals is through mediation analysis, which dissects the total effect of a SNP on the outcome into a direct effect and an indirect effect through a mediator. However, estimated effects from mediation analysis can be confounded by the genetic correlation between phenotypes, leading to inaccurate results. To address this confounding effect in the context of genetic mediation analysis, we propose a restricted-maximum-likelihood (REML)-based mediation analysis framework called REML-mediation, which can be applied to either individual-level or summary statistics data. Simulations demonstrated that REML-mediation provides unbiased estimates of the true cross-trait causal effect, assuming certain assumptions, albeit with a slightly inflated standard error compared to traditional linear regression. To validate the effectiveness of REML-mediation, we applied it to UK Biobank data and analyzed several mediator-outcome trait pairs along with their corresponding sets of pleiotropic SNPs. REML-mediation successfully identified and corrected for genetic confounding effects in these trait pairs, with correction magnitudes ranging from 7% to 39%. These findings highlight the presence of genetic confounding effects in cross-trait epidemiological studies and underscore the importance of accounting for them in data analysis.
在单核苷酸多态性 (SNP) 水平上识别多效性为表型之间共享遗传信号提供了有价值的见解。研究这些信号的一种方法是通过中介分析,该分析将 SNP 对结果的总效应分解为通过中介的直接效应和间接效应。然而,中介分析中估计的效应可能会受到表型之间遗传相关性的混淆,导致结果不准确。为了解决遗传中介分析中这种混淆的影响,我们提出了一种基于限制极大似然 (REML) 的中介分析框架,称为 REML-中介分析,它可以应用于个体水平或汇总统计数据。模拟表明,REML-中介分析在某些假设下,提供了真实跨性状因果效应的无偏估计,尽管与传统线性回归相比,标准误差略有膨胀。为了验证 REML-中介分析的有效性,我们将其应用于英国生物库数据,并分析了几个中介-结局性状对以及它们相应的多效性 SNP 集。REML-中介分析成功地识别并纠正了这些性状对中的遗传混杂效应,校正幅度从 7%到 39%不等。这些发现强调了跨性状流行病学研究中存在遗传混杂效应,并强调了在数据分析中考虑这些效应的重要性。