Monneret Gilles, Jaffrézic Florence, Rau Andrea, Zerjal Tatiana, Nuel Grégory
UMR GABI, AgroParisTech, INRA, Université Paris-Saclay, 78350 Jouy-en-Josas, France.
LPMA, UMR CNRS 7599, UPMC, Sorbonne Universités, 4 place Jussieu, 75005 Paris, France.
PLoS One. 2017 Mar 16;12(3):e0171142. doi: 10.1371/journal.pone.0171142. eCollection 2017.
Causal network inference is an important methodological challenge in biology as well as other areas of application. Although several causal network inference methods have been proposed in recent years, they are typically applicable for only a small number of genes, due to the large number of parameters to be estimated and the limited number of biological replicates available. In this work, we consider the specific case of transcriptomic studies made up of both observational and interventional data in which a single gene of biological interest is knocked out. We focus on a marginal causal estimation approach, based on the framework of Gaussian directed acyclic graphs, to infer causal relationships between the knocked-out gene and a large set of other genes. In a simulation study, we found that our proposed method accurately differentiates between downstream causal relationships and those that are upstream or simply associative. It also enables an estimation of the total causal effects between the gene of interest and the remaining genes. Our method performed very similarly to a classical differential analysis for experiments with a relatively large number of biological replicates, but has the advantage of providing a formal causal interpretation. Our proposed marginal causal approach is computationally efficient and may be applied to several thousands of genes simultaneously. In addition, it may help highlight subsets of genes of interest for a more thorough subsequent causal network inference. The method is implemented in an R package called MarginalCausality (available on GitHub).
因果网络推断在生物学以及其他应用领域都是一项重要的方法学挑战。尽管近年来已经提出了几种因果网络推断方法,但由于需要估计的参数数量众多以及可用的生物学重复样本数量有限,它们通常仅适用于少数基因。在这项工作中,我们考虑由观测数据和干预数据组成的转录组学研究的特定情况,其中一个感兴趣的单基因被敲除。我们专注于一种基于高斯有向无环图框架的边际因果估计方法,以推断被敲除基因与大量其他基因之间的因果关系。在一项模拟研究中,我们发现我们提出的方法能够准确区分下游因果关系与上游或仅仅是关联的关系。它还能够估计感兴趣基因与其余基因之间的总因果效应。对于具有相对大量生物学重复样本的实验,我们的方法与经典的差异分析表现非常相似,但具有提供正式因果解释的优势。我们提出的边际因果方法计算效率高,可同时应用于数千个基因。此外,它可能有助于突出感兴趣的基因子集,以便进行更深入的后续因果网络推断。该方法在一个名为MarginalCausality的R包中实现(可在GitHub上获取)。