Omranian Nooshin, Eloundou-Mbebi Jeanne M O, Mueller-Roeber Bernd, Nikoloski Zoran
Systems Biology and Mathematical Modelling Group, Max Planck Institute for Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam, Germany.
Department of Molecular Biology, University of Potsdam, Karl-Liebknecht-Str. 24-25, Haus 20, 14476 Potsdam, Germany.
Sci Rep. 2016 Feb 11;6:20533. doi: 10.1038/srep20533.
Devising computational methods to accurately reconstruct gene regulatory networks given gene expression data is key to systems biology applications. Here we propose a method for reconstructing gene regulatory networks by simultaneous consideration of data sets from different perturbation experiments and corresponding controls. The method imposes three biologically meaningful constraints: (1) expression levels of each gene should be explained by the expression levels of a small number of transcription factor coding genes, (2) networks inferred from different data sets should be similar with respect to the type and number of regulatory interactions, and (3) relationships between genes which exhibit similar differential behavior over the considered perturbations should be favored. We demonstrate that these constraints can be transformed in a fused LASSO formulation for the proposed method. The comparative analysis on transcriptomics time-series data from prokaryotic species, Escherichia coli and Mycobacterium tuberculosis, as well as a eukaryotic species, mouse, demonstrated that the proposed method has the advantages of the most recent approaches for regulatory network inference, while obtaining better performance and assigning higher scores to the true regulatory links. The study indicates that the combination of sparse regression techniques with other biologically meaningful constraints is a promising framework for gene regulatory network reconstructions.
设计能够根据基因表达数据准确重建基因调控网络的计算方法是系统生物学应用的关键。在此,我们提出一种通过同时考虑来自不同扰动实验及其相应对照的数据集来重建基因调控网络的方法。该方法施加了三个具有生物学意义的约束条件:(1)每个基因的表达水平应由少数转录因子编码基因的表达水平来解释;(2)从不同数据集推断出的网络在调控相互作用的类型和数量方面应相似;(3)在所考虑的扰动下表现出相似差异行为的基因之间的关系应更受青睐。我们证明这些约束条件可以转化为所提方法的融合套索(fused LASSO)公式。对来自原核生物大肠杆菌和结核分枝杆菌以及真核生物小鼠的转录组学时间序列数据的比较分析表明,所提方法具有最新调控网络推断方法的优点,同时能获得更好的性能,并为真实的调控链接赋予更高的分数。该研究表明,将稀疏回归技术与其他具有生物学意义的约束条件相结合是基因调控网络重建的一个有前景的框架。