Shen Xianjun, Yi Li, Yi Yang, Yang Jincai, He Tingting, Hu Xiaohua
BMC Bioinformatics. 2015;16 Suppl 12(Suppl 12):S5. doi: 10.1186/1471-2105-16-S12-S5. Epub 2015 Aug 25.
The identification of protein functional modules would be a great aid in furthering our knowledge of the principles of cellular organization. Most existing algorithms for identifying protein functional modules have a common defect -- once a protein node is assigned to a functional module, there is no chance to move the protein to the other functional modules during the follow-up processes, which lead the erroneous partitioning occurred at previous step to accumulate till to the end.
In this paper, we design a new algorithm ADM (Adaptive Density Modularity) to detect protein functional modules based on adaptive density modularity. In ADM algorithm, according to the comparison between external closely associated degree and internal closely associated degree, the partitioning of a protein-protein interaction network into functional modules always evolves quickly to increase the density modularity of the network. The integration of density modularity into the new algorithm not only overcomes the drawback mentioned above, but also contributes to identifying protein functional modules more effectively.
The experimental result reveals that the performance of ADM algorithm is superior to many state-of-the-art protein functional modules detection techniques in aspect of the accuracy of prediction. Moreover, the identified protein functional modules are statistically significant in terms of "Biological Process" annotated in Gene Ontology, which provides substantial support for revealing the principles of cellular organization.
蛋白质功能模块的识别将极大地有助于我们进一步了解细胞组织原理。大多数现有的识别蛋白质功能模块的算法都有一个共同缺陷——一旦一个蛋白质节点被分配到一个功能模块,在后续过程中就没有机会将该蛋白质移动到其他功能模块,这导致在先前步骤中出现的错误划分一直累积到最后。
在本文中,我们设计了一种新算法ADM(自适应密度模块化),基于自适应密度模块化来检测蛋白质功能模块。在ADM算法中,根据外部紧密关联度和内部紧密关联度的比较,将蛋白质-蛋白质相互作用网络划分为功能模块的过程总是快速演进,以提高网络的密度模块化。将密度模块化集成到新算法中,不仅克服了上述缺点,还有助于更有效地识别蛋白质功能模块。
实验结果表明,ADM算法在预测准确性方面优于许多先进的蛋白质功能模块检测技术。此外,所识别的蛋白质功能模块在基因本体论注释的“生物过程”方面具有统计学意义,这为揭示细胞组织原理提供了有力支持。