ClusterM：一种用于跨多个蛋白质相互作用网络对保守蛋白质复合物进行计算预测的可扩展算法。

ClusterM: a scalable algorithm for computational prediction of conserved protein complexes across multiple protein interaction networks.

作者信息

Wang Yijie, Jeong Hyundoo, Yoon Byung-Jun, Qian Xiaoning

机构信息

School of Informatics, Computing and Engineering, Indiana University, Bloomington, 47405, IN, USA.

Department of Mechatronics Engineering, Incheon National University, Incheon, 22012, South Korea.

出版信息

BMC Genomics. 2020 Nov 18;21(Suppl 10):615. doi: 10.1186/s12864-020-07010-1.

DOI:10.1186/s12864-020-07010-1

PMID:33208103

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7677834/

Abstract

BACKGROUND

The current computational methods on identifying conserved protein complexes across multiple Protein-Protein Interaction (PPI) networks suffer from the lack of explicit modeling of the desired topological properties within conserved protein complexes as well as their scalability.

RESULTS

To overcome those issues, we propose a scalable algorithm-ClusterM-for identifying conserved protein complexes across multiple PPI networks through the integration of network topology and protein sequence similarity information. ClusterM overcomes the computational barrier that existed in previous methods, where the complexity escalates exponentially when handling an increasing number of PPI networks; and it is able to detect conserved protein complexes with both topological separability and cohesive protein sequence conservation. On two independent compendiums of PPI networks from Saccharomyces cerevisiae (Sce, yeast), Drosophila melanogaster (Dme, fruit fly), Caenorhabditis elegans (Cel, worm), and Homo sapiens (Hsa, human), we demonstrate that ClusterM outperforms other state-of-the-art algorithms by a significant margin and is able to identify de novo conserved protein complexes across four species that are missed by existing algorithms.

CONCLUSIONS

ClusterM can better capture the desired topological property of a typical conserved protein complex, which is densely connected within the complex while being well-separated from the rest of the networks. Furthermore, our experiments have shown that ClusterM is highly scalable and efficient when analyzing multiple PPI networks.

摘要

背景

当前用于识别多个蛋白质-蛋白质相互作用（PPI）网络中保守蛋白质复合物的计算方法，存在缺乏对保守蛋白质复合物中所需拓扑特性进行显式建模以及可扩展性不足的问题。

结果

为克服这些问题，我们提出了一种可扩展算法ClusterM，用于通过整合网络拓扑和蛋白质序列相似性信息来识别多个PPI网络中的保守蛋白质复合物。ClusterM克服了先前方法中存在的计算障碍，即在处理越来越多的PPI网络时复杂度呈指数级增长；并且它能够检测具有拓扑可分离性和凝聚性蛋白质序列保守性的保守蛋白质复合物。在来自酿酒酵母（Sce，酵母）、黑腹果蝇（Dme，果蝇）、秀丽隐杆线虫（Cel，线虫）和智人（Hsa，人类）的两个独立的PPI网络汇编数据集上，我们证明ClusterM显著优于其他现有算法，并且能够识别现有算法遗漏的跨四个物种的全新保守蛋白质复合物。

结论

ClusterM能够更好地捕捉典型保守蛋白质复合物所需的拓扑特性，即复合物内部紧密连接，同时与网络的其余部分良好分离。此外，我们的实验表明，ClusterM在分析多个PPI网络时具有高度的可扩展性和效率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e380/7677834/5acf6a3264b2/12864_2020_7010_Fig1_HTML.jpg

相似文献

ClusterM: a scalable algorithm for computational prediction of conserved protein complexes across multiple protein interaction networks.ClusterM：一种用于跨多个蛋白质相互作用网络对保守蛋白质复合物进行计算预测的可扩展算法。

BMC Genomics. 2020 Nov 18;21(Suppl 10):615. doi: 10.1186/s12864-020-07010-1.

ConnectedAlign: a PPI network alignment method for identifying conserved protein complexes across multiple species.ConnectedAlign：一种用于识别多个物种中保守蛋白质复合物的 PPI 网络对齐方法。

BMC Bioinformatics. 2018 Aug 13;19(Suppl 9):286. doi: 10.1186/s12859-018-2271-6.

Global alignment of multiple protein interaction networks with application to functional orthology detection.多个蛋白质相互作用网络的全局比对及其在功能直系同源检测中的应用。

Proc Natl Acad Sci U S A. 2008 Sep 2;105(35):12763-8. doi: 10.1073/pnas.0806627105. Epub 2008 Aug 25.

Identifying conserved protein complexes between species by constructing interolog networks.通过构建种间同源蛋白互作网络来鉴定物种间保守的蛋白质复合物。

BMC Bioinformatics. 2013;14 Suppl 16(Suppl 16):S8. doi: 10.1186/1471-2105-14-S16-S8. Epub 2013 Oct 22.

LePrimAlign: local entropy-based alignment of PPI networks to predict conserved modules.LePrimAlign：基于局部信息熵的蛋白质相互作用网络比对方法，用于预测保守模块。

BMC Genomics. 2019 Dec 24;20(Suppl 9):964. doi: 10.1186/s12864-019-6271-3.

A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks.一种用于从多个异构网络中检测蛋白质复合物的多网络聚类方法。

BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):463. doi: 10.1186/s12859-017-1877-4.

Global Biological Network Alignment by Using Efficient Memetic Algorithm.利用高效的Memetic 算法进行全球生物网络比对。

IEEE/ACM Trans Comput Biol Bioinform. 2016 Nov;13(6):1117-1129. doi: 10.1109/TCBB.2015.2511741. Epub 2015 Dec 23.

Joint clustering of protein interaction networks through Markov random walk.通过马尔可夫随机游走对蛋白质相互作用网络进行联合聚类。

BMC Syst Biol. 2014;8 Suppl 1(Suppl 1):S9. doi: 10.1186/1752-0509-8-S1-S9. Epub 2014 Jan 24.

Global alignment of protein-protein interaction networks.蛋白质-蛋白质相互作用网络的全局比对

Methods Mol Biol. 2013;939:21-34. doi: 10.1007/978-1-62703-107-3_3.

A methodology for detecting the orthology signal in a PPI network at a functional complex level.一种在功能复合体水平上检测蛋白质-蛋白质相互作用网络中的直系同源信号的方法。

BMC Bioinformatics. 2012 Jun 25;13 Suppl 10(Suppl 10):S18. doi: 10.1186/1471-2105-13-S10-S18.

引用本文的文献

SAMNA: accurate alignment of multiple biological networks based on simulated annealing.SAMNA：基于模拟退火的多个生物网络的精确对齐。

J Integr Bioinform. 2023 Dec 14;20(4). doi: 10.1515/jib-2023-0006. eCollection 2023 Dec 1.

Autism Spectrum Disorder: A Neuro-Immunometabolic Hypothesis of the Developmental Origins.自闭症谱系障碍：发育起源的神经免疫代谢假说

Biology (Basel). 2023 Jun 26;12(7):914. doi: 10.3390/biology12070914.

A New Method for Recognizing Protein Complexes Based on Protein Interaction Networks and GO Terms.一种基于蛋白质相互作用网络和基因本体术语识别蛋白质复合物的新方法。

Front Genet. 2021 Dec 13;12:792265. doi: 10.3389/fgene.2021.792265. eCollection 2021.

本文引用的文献

Indexing a protein-protein interaction network expedites network alignment.对蛋白质-蛋白质相互作用网络进行索引可加快网络比对。

BMC Bioinformatics. 2015 Oct 9;16:326. doi: 10.1186/s12859-015-0756-0.

Accurate multiple network alignment through context-sensitive random walk.通过上下文敏感随机游走实现精确的多网络对齐

BMC Syst Biol. 2015;9 Suppl 1(Suppl 1):S7. doi: 10.1186/1752-0509-9-S1-S7. Epub 2015 Jan 21.

A multiobjective memetic algorithm for PPI network alignment.一种用于蛋白质相互作用网络比对的多目标进化算法。

Bioinformatics. 2015 Jun 15;31(12):1988-98. doi: 10.1093/bioinformatics/btv063. Epub 2015 Feb 9.

STRING v10: protein-protein interaction networks, integrated over the tree of life.STRING v10：整合了整个生命之树的蛋白质-蛋白质相互作用网络。

Nucleic Acids Res. 2015 Jan;43(Database issue):D447-52. doi: 10.1093/nar/gku1003. Epub 2014 Oct 28.

Joint clustering of protein interaction networks through Markov random walk.通过马尔可夫随机游走对蛋白质相互作用网络进行联合聚类。

BMC Syst Biol. 2014;8 Suppl 1(Suppl 1):S9. doi: 10.1186/1752-0509-8-S1-S9. Epub 2014 Jan 24.

SMETANA: accurate and scalable algorithm for probabilistic alignment of large-scale biological networks.斯梅塔纳：用于大规模生物网络概率对齐的准确且可扩展的算法。

PLoS One. 2013 Jul 12;8(7):e67995. doi: 10.1371/journal.pone.0067995. Print 2013.

SPINAL: scalable protein interaction network alignment.SPINAL：可扩展的蛋白质相互作用网络比对。

Bioinformatics. 2013 Apr 1;29(7):917-24. doi: 10.1093/bioinformatics/btt071. Epub 2013 Feb 14.

Identifying functional modules in interaction networks through overlapping Markov clustering.通过重叠 Markov 聚类识别交互网络中的功能模块。

Bioinformatics. 2012 Sep 15;28(18):i473-i479. doi: 10.1093/bioinformatics/bts370.

AlignNemo: a local network alignment method to integrate homology and topology.AlignNemo：一种整合同源性和拓扑结构的局部网络比对方法。

PLoS One. 2012;7(6):e38107. doi: 10.1371/journal.pone.0038107. Epub 2012 Jun 12.

Affinity-purification coupled to mass spectrometry: basic principles and strategies.亲和纯化结合质谱法：基本原理和策略。

Proteomics. 2012 May;12(10):1576-90. doi: 10.1002/pmic.201100523.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

ClusterM：一种用于跨多个蛋白质相互作用网络对保守蛋白质复合物进行计算预测的可扩展算法。

ClusterM: a scalable algorithm for computational prediction of conserved protein complexes across multiple protein interaction networks.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献