Rahman Ahsanur, Poirel Christopher L, Badger David J, Estep Craig, Murali T M
Virginia Tech, Blacksburg.
IEEE/ACM Trans Comput Biol Bioinform. 2013 Sep-Oct;10(5):1113-24. doi: 10.1109/TCBB.2013.71.
Analysis of molecular interaction networks is pervasive in systems biology. This research relies almost entirely on graphs for modeling interactions. However, edges in graphs cannot represent multiway interactions among molecules, which occur very often within cells. Hypergraphs may be better representations for networks having such interactions, since hyperedges can naturally represent relationships among multiple molecules. Here, we propose using hypergraphs to capture the uncertainty inherent in reverse engineering gene-gene networks. Some subsets of nodes may induce highly varying subgraphs across an ensemble of networks inferred by a reverse engineering algorithm. We provide a novel formulation of hyperedges to capture this uncertainty in network topology. We propose a clustering-based approach to discover hyperedges. We show that our approach can recover hyperedges planted in synthetic data sets with high precision and recall, even for moderate amount of noise. We apply our techniques to a data set of pathways inferred from genetic interaction data in S. cerevisiae related to the unfolded protein response. Our approach discovers several hyperedges that capture the uncertain connectivity of genes in relevant protein complexes, suggesting that further experiments may be required to precisely discern their interaction patterns. We also show that these complexes are not discovered by an algorithm that computes frequent and dense subgraphs.
分子相互作用网络分析在系统生物学中普遍存在。这项研究几乎完全依赖于图来对相互作用进行建模。然而,图中的边无法表示分子间的多路相互作用,而这种相互作用在细胞内经常发生。对于具有此类相互作用的网络,超图可能是更好的表示方式,因为超边可以自然地表示多个分子之间的关系。在此,我们提议使用超图来捕捉逆向工程基因 - 基因网络中固有的不确定性。在通过逆向工程算法推断出的一组网络中,某些节点子集可能会诱导出高度变化的子图。我们提供了一种新颖的超边公式来捕捉网络拓扑中的这种不确定性。我们提出一种基于聚类的方法来发现超边。我们表明,即使对于适度的噪声量,我们的方法也能高精度且高召回率地恢复植入合成数据集中的超边。我们将我们的技术应用于从酿酒酵母中与未折叠蛋白反应相关的遗传相互作用数据推断出的通路数据集。我们的方法发现了几个超边,这些超边捕捉了相关蛋白质复合物中基因的不确定连接性,这表明可能需要进一步的实验来精确辨别它们的相互作用模式。我们还表明,一个计算频繁且密集子图的算法无法发现这些复合物。