Department of Animal Sciences, Federal University of Minas Gerais, Belo Horizonte, MG 30123-970, Brazil.
Genetics. 2010 Jun;185(2):633-44. doi: 10.1534/genetics.109.112979. Epub 2010 Mar 29.
Biology is characterized by complex interactions between phenotypes, such as recursive and simultaneous relationships between substrates and enzymes in biochemical systems. Structural equation models (SEMs) can be used to study such relationships in multivariate analyses, e.g., with multiple traits in a quantitative genetics context. Nonetheless, the number of different recursive causal structures that can be used for fitting a SEM to multivariate data can be huge, even when only a few traits are considered. In recent applications of SEMs in mixed-model quantitative genetics settings, causal structures were preselected on the basis of prior biological knowledge alone. Therefore, the wide range of possible causal structures has not been properly explored. Alternatively, causal structure spaces can be explored using algorithms that, using data-driven evidence, can search for structures that are compatible with the joint distribution of the variables under study. However, the search cannot be performed directly on the joint distribution of the phenotypes as it is possibly confounded by genetic covariance among traits. In this article we propose to search for recursive causal structures among phenotypes using the inductive causation (IC) algorithm after adjusting the data for genetic effects. A standard multiple-trait model is fitted using Bayesian methods to obtain a posterior covariance matrix of phenotypes conditional to unobservable additive genetic effects, which is then used as input for the IC algorithm. As an illustrative example, the proposed methodology was applied to simulated data related to multiple traits measured on a set of inbred lines.
生物学的特点是表型之间存在复杂的相互作用,例如生化系统中底物和酶之间的递归和同时关系。结构方程模型(SEM)可用于在多元分析中研究此类关系,例如在定量遗传学背景下具有多个性状。尽管如此,即使只考虑少数几个性状,用于将 SEM 拟合到多元数据的递归因果结构的数量也可能非常庞大。在 SEM 在混合模型定量遗传学设置中的最近应用中,因果结构仅基于先前的生物学知识进行了预选。因此,尚未适当探索广泛的可能因果结构。或者,可以使用算法来探索因果结构空间,该算法使用数据驱动的证据,可以搜索与所研究变量的联合分布兼容的结构。但是,由于性状之间的遗传协方差,不能直接在表型的联合分布上进行搜索。在本文中,我们建议在调整数据以适应遗传效应后,使用归纳因果(IC)算法在表型之间搜索递归因果结构。使用贝叶斯方法拟合标准的多性状模型,以获得条件于不可观测的加性遗传效应的表型后验协方差矩阵,然后将其用作 IC 算法的输入。作为说明性示例,将所提出的方法应用于与一组近交系上测量的多个性状相关的模拟数据。