Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK; MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK; UK Dementia Research Institute at Imperial College, Imperial College London, London, UK.
National Institute of Agricultural Botany, Cambridge, UK.
Am J Hum Genet. 2022 May 5;109(5):767-782. doi: 10.1016/j.ajhg.2022.04.001. Epub 2022 Apr 21.
Mendelian randomization and colocalization are two statistical approaches that can be applied to summarized data from genome-wide association studies (GWASs) to understand relationships between traits and diseases. However, despite similarities in scope, they are different in their objectives, implementation, and interpretation, in part because they were developed to serve different scientific communities. Mendelian randomization assesses whether genetic predictors of an exposure are associated with the outcome and interprets an association as evidence that the exposure has a causal effect on the outcome, whereas colocalization assesses whether two traits are affected by the same or distinct causal variants. When considering genetic variants in a single genetic region, both approaches can be performed. While a positive colocalization finding typically implies a non-zero Mendelian randomization estimate, the reverse is not generally true: there are several scenarios which would lead to a non-zero Mendelian randomization estimate but lack evidence for colocalization. These include the existence of distinct but correlated causal variants for the exposure and outcome, which would violate the Mendelian randomization assumptions, and a lack of strong associations with the outcome. As colocalization was developed in the GWAS tradition, typically evidence for colocalization is concluded only when there is strong evidence for associations with both traits. In contrast, a non-zero estimate from Mendelian randomization can be obtained despite only nominally significant genetic associations with the outcome at the locus. In this review, we discuss how the two approaches can provide complementary information on potential therapeutic targets.
孟德尔随机化和共定位是两种可应用于全基因组关联研究(GWAS)汇总数据的统计方法,用于理解特征与疾病之间的关系。然而,尽管它们在范围上有相似之处,但在目标、实施和解释方面存在差异,部分原因是它们是为不同的科学社区开发的。孟德尔随机化评估暴露的遗传预测因子是否与结果相关,并将相关性解释为暴露对结果有因果影响的证据,而共定位评估两个特征是否受相同或不同的因果变异影响。当考虑单个遗传区域中的遗传变异时,可以同时进行这两种方法。虽然阳性共定位发现通常意味着孟德尔随机化估计值非零,但反之则不一定正确:存在几种情况会导致孟德尔随机化估计值非零但缺乏共定位证据。这些情况包括暴露和结果存在不同但相关的因果变异,这将违反孟德尔随机化假设,以及与结果缺乏强关联。由于共定位是在 GWAS 传统中开发的,通常只有当两个特征都有强烈的关联证据时,才会得出共定位的证据。相比之下,即使在该基因座与结果仅有名义上显著的遗传关联的情况下,也可以从孟德尔随机化中获得非零估计值。在这篇综述中,我们讨论了这两种方法如何为潜在的治疗靶点提供互补信息。