Brief Bioinform. 2014 Sep;15(5):823-38. doi: 10.1093/bib/bbt014. Epub 2013 Mar 19.
With the increase of available protein-protein interaction (PPI) data, more and more efforts have been put to PPI network modeling, and a number of models of PPI networks have been proposed. Roughly speaking, good models of PPI networks should be able to accurately describe PPI mechanisms, and thus reproduce the structures of PPI networks. With such models, theoretical and/or computational biologists can efficiently explore the evolution and dynamics of PPI networks. However, a theoretical and/or computational biologist may feel confused when she/he has to choose a proper PPI model for her/his research work from a dozen of candidate models, while there is no guideline available to help her/him. To tackle this problem, in this article, we carry out a comprehensive performance comparison study on 12 existing models over PPI datasets of four species (yeast, mouse, fruit fly and nematode), by comparing the global and local statistical properties of the original PPI networks and the model-reproduced ones. To draw more convincing conclusions, we use the mean reciprocal rank to combine the ranks of a certain model on all statistical properties. Our experimental results indicate that the PS_Seed model [Solé and Pastor-Satorras (PS) model with seed] the STICKY model and the DD_Seed model (Duplication-Divergence model with seed) fit best with the test PPI datasets. By analyzing the underlying mechanisms of the models with better fitting ability, our analysis shows that the evolutionary mechanism of node duplication and link dynamics and the mechanisms with 'degree-weighted' behaviors seem to be able to describe the PPI networks better.
随着蛋白质-蛋白质相互作用 (PPI) 数据的增加,越来越多的人致力于 PPI 网络建模,并且已经提出了许多 PPI 网络模型。大致来说,良好的 PPI 网络模型应该能够准确地描述 PPI 机制,从而再现 PPI 网络的结构。有了这样的模型,理论和/或计算生物学家可以有效地探索 PPI 网络的进化和动态。然而,当理论和/或计算生物学家不得不从十几个候选模型中为她/他的研究工作选择一个合适的 PPI 模型时,她/他可能会感到困惑,而没有可用的指导方针来帮助她/他。为了解决这个问题,在本文中,我们对 12 种现有的模型在四个物种(酵母、老鼠、果蝇和线虫)的 PPI 数据集上进行了全面的性能比较研究,通过比较原始 PPI 网络和模型再现网络的全局和局部统计特性。为了得出更有说服力的结论,我们使用平均倒数排名来组合某个模型在所有统计特性上的排名。我们的实验结果表明,PS_Seed 模型 [带有种子的 Solé 和 Pastor-Satorras (PS) 模型]、STICKY 模型和 DD_Seed 模型(带有种子的复制-发散模型)与测试 PPI 数据集拟合得最好。通过分析具有更好拟合能力的模型的潜在机制,我们的分析表明,节点复制和链接动态的进化机制以及具有“度加权”行为的机制似乎能够更好地描述 PPI 网络。