Groningen Bioinformatics Centre, University of Groningen, The Netherlands.
Trends Genet. 2010 Dec;26(12):493-8. doi: 10.1016/j.tig.2010.09.002. Epub 2010 Oct 15.
Genome-wide linkage and association studies of tens of thousands of clinical and molecular traits are currently underway, offering rich data for inferring causality between traits and genetic variation. However, the inference process is based on discovering subtle patterns in the correlation between traits and is therefore challenging and could create a flood of untrustworthy causal inferences. Here we introduce the concerns and show that they are already valid in simple scenarios of two traits linked to or associated with the same genomic region. We argue that more comprehensive analysis and Bayesian reasoning are needed and that these can overcome some of the pitfalls, although not in every conceivable case. We conclude that causal inference methods can still be of use in the iterative process of mathematical modeling and biological validation.
目前,正在进行数以万计的临床和分子特征的全基因组连锁和关联研究,为推断特征与遗传变异之间的因果关系提供了丰富的数据。然而,这种推断过程是基于发现特征之间相关性的细微模式,因此具有挑战性,并且可能会产生大量不可信的因果推断。在这里,我们介绍了这些关注点,并表明它们在两个特征与同一基因组区域相关或关联的简单情况下已经成立。我们认为需要更全面的分析和贝叶斯推理,并且这些方法可以克服一些陷阱,尽管不是在每种情况下都可以。我们的结论是,因果推断方法仍然可以在数学建模和生物验证的迭代过程中发挥作用。