Ma Wenji, McAnulla Craig, Wang Lusheng
Department of Computer Science, City University of Hong Kong, Hong Kong.
Biochim Biophys Acta. 2012 Dec;1824(12):1418-24. doi: 10.1016/j.bbapap.2012.06.009. Epub 2012 Jul 3.
With the development of high-throughput methods for identifying protein-protein interactions, large scale interaction networks are available. Computational methods to analyze the networks to detect functional modules as protein complexes are becoming more important. However, most of the existing methods only make use of the protein-protein interaction networks without considering the structural limitations of proteins to bind together. In this paper, we design a new protein complex prediction method by extending the idea of using domain-domain interaction information. Here we formulate the problem into a maximum matching problem (which can be solved in polynomial time) instead of the binary integer linear programming approach (which can be NP-hard in the worst case). We also add a step to predict domain-domain interactions which first searches the database Pfam using the hidden Markov model and then predicts the domain-domain interactions based on the database DOMINE and InterDom which contain confirmed DDIs. By adding the domain-domain interaction prediction step, we have more edges in the DDI graph and the recall value is increased significantly (at least doubled) comparing with the method of Ozawa et al. (2010) [1] while the average precision value is slightly better. We also combine our method with three other existing methods, such as COACH, MCL and MCODE. Experiments show that the precision of the combined method is improved. This article is part of a Special Issue entitled: Computational Methods for Protein Interaction and Structural Prediction.
随着用于识别蛋白质 - 蛋白质相互作用的高通量方法的发展,大规模相互作用网络已可获得。通过计算方法分析这些网络以检测作为蛋白质复合物的功能模块正变得越来越重要。然而,现有的大多数方法仅利用蛋白质 - 蛋白质相互作用网络,而没有考虑蛋白质结合在一起的结构限制。在本文中,我们通过扩展使用结构域 - 结构域相互作用信息的思路,设计了一种新的蛋白质复合物预测方法。在这里,我们将该问题表述为一个最大匹配问题(可在多项式时间内求解),而不是二元整数线性规划方法(在最坏情况下可能是NP难的)。我们还添加了一个预测结构域 - 结构域相互作用的步骤,该步骤首先使用隐马尔可夫模型搜索Pfam数据库,然后基于包含已确认DDI的DOMINE和InterDom数据库预测结构域 - 结构域相互作用。通过添加结构域 - 结构域相互作用预测步骤,我们在DDI图中有了更多边,与Ozawa等人(2010年)[1]的方法相比,召回值显著提高(至少翻倍),而平均精度值略好。我们还将我们的方法与其他三种现有方法,如COACH、MCL和MCODE相结合。实验表明,组合方法的精度得到了提高。本文是名为:蛋白质相互作用和结构预测的计算方法的特刊的一部分。