Joshi Anagha, Beck Yvonne, Michoel Tom
1 Division of Developmental Biology, The Roslin Institute, The University of Edinburgh , Midlothian, Scotland, United Kingdom .
J Comput Biol. 2015 Apr;22(4):253-65. doi: 10.1089/cmb.2014.0290.
Gene regulatory network inference uses genome-wide transcriptome measurements in response to genetic, environmental, or dynamic perturbations to predict causal regulatory influences between genes. We hypothesized that evolution also acts as a suitable network perturbation and that integration of data from multiple closely related species can lead to improved reconstruction of gene regulatory networks. To test this hypothesis, we predicted networks from temporal gene expression data for 3,610 genes measured during early embryonic development in six Drosophila species and compared predicted networks to gold standard networks of ChIP-chip and ChIP-seq interactions for developmental transcription factors in five species. We found that (i) the performance of single-species networks was independent of the species where the gold standard was measured; (ii) differences between predicted networks reflected the known phylogeny and differences in biology between the species; (iii) an integrative consensus network that minimized the total number of edge gains and losses with respect to all single-species networks performed better than any individual network. Our results show that in an evolutionarily conserved system, integration of data from comparable experiments in multiple species improves the inference of gene regulatory networks. They provide a basis for future studies on the numerous multispecies gene expression datasets for other biological processes available in the literature.
基因调控网络推断利用全基因组转录组测量结果(针对遗传、环境或动态扰动)来预测基因之间的因果调控影响。我们推测,进化也可作为一种合适的网络扰动,并且整合来自多个密切相关物种的数据能够改进基因调控网络的重建。为了验证这一假设,我们根据六个果蝇物种早期胚胎发育过程中测量的3610个基因的时间基因表达数据预测网络,并将预测的网络与五个物种中发育转录因子的ChIP-chip和ChIP-seq相互作用的金标准网络进行比较。我们发现:(i)单物种网络的性能与测量金标准的物种无关;(ii)预测网络之间的差异反映了已知的系统发育以及物种之间生物学上的差异;(iii)相对于所有单物种网络,使边的增加和减少总数最小化的整合共识网络比任何单个网络表现更好。我们的结果表明,在一个进化上保守的系统中,整合来自多个物种可比实验的数据能够改进基因调控网络的推断。它们为未来研究文献中可得的其他生物过程的众多多物种基因表达数据集提供了基础。