Poirel Christopher L, Rodrigues Richard R, Chen Katherine C, Tyson John J, Murali T M
Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA.
J Comput Biol. 2013 May;20(5):409-18. doi: 10.1089/cmb.2012.0274.
Top-down analyses in systems biology can automatically find correlations among genes and proteins in large-scale datasets. However, it is often difficult to design experiments from these results. In contrast, bottom-up approaches painstakingly craft detailed models that can be simulated computationally to suggest wet lab experiments. However, developing the models is a manual process that can take many years. These approaches have largely been developed independently. We present LINKER, an efficient and automated data-driven method that can analyze molecular interactomes to propose extensions to models that can be simulated. LINKER combines teleporting random walks and k-shortest path computations to discover connections from a source protein to a set of proteins collectively involved in a particular cellular process. We evaluate the efficacy of LINKER by applying it to a well-known dynamic model of the cell division cycle in Saccharomyces cerevisiae. Compared to other state-of-the-art methods, subnetworks computed by LINKER are heavily enriched in Gene Ontology (GO) terms relevant to the cell cycle. Finally, we highlight how networks computed by LINKER elucidate the role of a protein kinase (Cdc5) in the mitotic exit network of a dynamic model of the cell cycle.
系统生物学中的自上而下分析能够在大规模数据集中自动找出基因与蛋白质之间的相关性。然而,依据这些结果设计实验往往很困难。相比之下,自下而上的方法精心构建详细模型,这些模型能够通过计算模拟来为湿实验室实验提供建议。然而,开发这些模型是一个可能需要多年时间的手动过程。这些方法在很大程度上是独立发展起来的。我们提出了LINKER,这是一种高效且自动化的数据驱动方法,它能够分析分子相互作用组,为可模拟的模型提出扩展建议。LINKER结合了瞬移随机游走和k最短路径计算,以发现从源蛋白质到共同参与特定细胞过程的一组蛋白质之间的联系。我们通过将LINKER应用于酿酒酵母细胞分裂周期的一个著名动态模型来评估其功效。与其他最先进的方法相比,LINKER计算出的子网在与细胞周期相关的基因本体(GO)术语中高度富集。最后,我们强调了LINKER计算出的网络如何阐明一种蛋白激酶(Cdc5)在细胞周期动态模型的有丝分裂退出网络中的作用。