Hashemifar Somaye, Xu Jinbo
Toyota Technological Institute at Chicago, Chicago, IL 60637, USA.
Bioinformatics. 2014 Sep 1;30(17):i438-44. doi: 10.1093/bioinformatics/btu450.
High-throughput experimental techniques have produced a large amount of protein-protein interaction (PPI) data. The study of PPI networks, such as comparative analysis, shall benefit the understanding of life process and diseases at the molecular level. One way of comparative analysis is to align PPI networks to identify conserved or species-specific subnetwork motifs. A few methods have been developed for global PPI network alignment, but it still remains challenging in terms of both accuracy and efficiency.
This paper presents a novel global network alignment algorithm, denoted as HubAlign, that makes use of both network topology and sequence homology information, based upon the observation that topologically important proteins in a PPI network usually are much more conserved and thus, more likely to be aligned. HubAlign uses a minimum-degree heuristic algorithm to estimate the topological and functional importance of a protein from the global network topology information. Then HubAlign aligns topologically important proteins first and gradually extends the alignment to the whole network. Extensive tests indicate that HubAlign greatly outperforms several popular methods in terms of both accuracy and efficiency, especially in detecting functionally similar proteins.
HubAlign is available freely for non-commercial purposes at http://ttic.uchicago.edu/∼hashemifar/software/HubAlign.zip.
Supplementary data are available at Bioinformatics online.
高通量实验技术产生了大量蛋白质-蛋白质相互作用(PPI)数据。对PPI网络的研究,如比较分析,将有助于在分子水平上理解生命过程和疾病。比较分析的一种方法是比对PPI网络以识别保守的或物种特异性的子网基序。已经开发了一些用于全局PPI网络比对的方法,但在准确性和效率方面仍然具有挑战性。
本文提出了一种新颖的全局网络比对算法,称为HubAlign,它利用网络拓扑和序列同源性信息,基于这样的观察:PPI网络中拓扑重要的蛋白质通常更保守,因此更有可能被比对。HubAlign使用最小度启发式算法从全局网络拓扑信息估计蛋白质的拓扑和功能重要性。然后HubAlign首先比对拓扑重要的蛋白质,并逐渐将比对扩展到整个网络。广泛的测试表明,HubAlign在准确性和效率方面都大大优于几种流行的方法,特别是在检测功能相似的蛋白质方面。
HubAlign可在http://ttic.uchicago.edu/∼hashemifar/software/HubAlign.zip上免费用于非商业目的。
补充数据可在《生物信息学》在线获取。