IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):56-70. doi: 10.1109/TCBB.2018.2853728. Epub 2018 Jul 6.
Causality inference is the use of computational techniques to predict possible causal relationships for a set of variables, thereby forming a directed network. Causality inference in Gene Regulatory Networks (GRNs) is an important, yet challenging task due to the limits of available data and lack of efficiency in existing causality inference techniques. A number of techniques have been proposed and applied to infer causal relationships in various domains, although they are not specific to regulatory network inference. In this paper, we assess the effectiveness of methods for inferring causal GRNs. We introduce seven different inference methods and apply them to infer directed edges in GRNs. We use time-series expression data from the DREAM challenges to assess the methods in terms of quality of inference and rank them based on performance. The best method is applied to Breast Cancer data to infer a causal network. Experimental results show that Causation Entropy is best, however, highly time-consuming and not feasible to use in a relatively large network. We infer Breast Cancer GRN with the second-best method, Transfer Entropy. The topological analysis of the network reveals that top out-degree genes such as SLC39A5 which are considered central genes, play important role in cancer progression.
因果推断是使用计算技术来预测一组变量的可能因果关系,从而形成有向网络。由于可用数据的限制和现有因果推断技术的效率低下,基因调控网络 (GRN) 中的因果推断是一项重要但具有挑战性的任务。已经提出并应用了许多技术来推断各种领域的因果关系,但它们不是专门针对调节网络推断的。在本文中,我们评估了推断因果 GRN 的方法的有效性。我们介绍了七种不同的推断方法,并将它们应用于推断 GRN 中的有向边。我们使用来自 DREAM 挑战的时间序列表达数据,根据推断质量评估方法,并根据性能对其进行排名。最好的方法应用于乳腺癌数据来推断因果网络。实验结果表明,因果熵是最好的,但非常耗时,在相对较大的网络中不可行。我们使用第二种最佳方法,传递熵,推断乳腺癌 GRN。网络的拓扑分析表明,被认为是中心基因的顶级出度基因(如 SLC39A5)在癌症进展中起着重要作用。