Protein complex detection in PPI networks based on data integration and supervised learning method.

Authors

Yu Feng, Yang Zhi, Hu Xiao, Sun Yuan, Lin Hong, Wang Jian

Publication information

BMC Bioinformatics. 2015;16 Suppl 12(Suppl 12):S3. doi: 10.1186/1471-2105-16-S12-S3. Epub 2015 Aug 25.

DOI:10.1186/1471-2105-16-S12-S3

PMID:26329886

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4705505/

Abstract

BACKGROUND

Revealing protein complexes is important for understanding the principles of cellular organization and function. High-throughput experimental techniques have produced a large number of protein interactions, which makes it possible to predict protein complexes from protein-protein interaction (PPI) networks. However, the small number of known physical interactions may limit protein complex detection.

METHODS

New PPI networks are constructed by integrating existing PPI datasets with the large volume of readily available PPI data from the biomedical literature. Less reliable PPIs are then filtered out based on the semantic similarity and topological similarity of the two interacting proteins. Finally, supervised learning protein complex detection (SLPC), which makes full use of the information in known complexes, is applied to detect protein complexes on the new PPI networks.
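The integrate-then-filter step can be sketched in Python as follows. This is only an illustration under stated assumptions: the abstract does not say which topological measure, threshold, or data format the authors use, so the Jaccard coefficient over closed neighborhoods, the threshold of 0.6, the toy protein names, and the function names `integrate` and `jaccard_filter` are all hypothetical. The semantic-similarity filter is omitted because it requires ontology annotations that are not described here.

```python
from collections import defaultdict


def integrate(*edge_lists):
    """Union several PPI edge lists into one set of undirected edges."""
    edges = set()
    for edge_list in edge_lists:
        for a, b in edge_list:
            if a != b:  # ignore self-interactions
                edges.add(frozenset((a, b)))
    return edges


def closed_neighborhoods(edges):
    """Map each protein to its neighbors plus itself."""
    nbrs = defaultdict(set)
    for edge in edges:
        a, b = tuple(edge)
        nbrs[a].update((a, b))
        nbrs[b].update((a, b))
    return nbrs


def jaccard_filter(edges, threshold):
    """Keep edges whose endpoints share enough of their neighborhoods.

    Assumption: Jaccard similarity stands in for the paper's
    (unspecified) topological similarity.
    """
    nbrs = closed_neighborhoods(edges)
    kept = set()
    for edge in edges:
        a, b = tuple(edge)
        score = len(nbrs[a] & nbrs[b]) / len(nbrs[a] | nbrs[b])
        if score >= threshold:
            kept.add(edge)
    return kept


# Toy data (hypothetical): one experimental dataset, one literature-mined dataset.
experimental = [("A", "B"), ("B", "C"), ("A", "C")]
literature = [("C", "D"), ("B", "A")]

network = integrate(experimental, literature)      # 4 distinct edges
denoised = jaccard_filter(network, threshold=0.6)  # drops the weakly supported C-D edge
```

In this toy network the C-D edge scores 0.5 because D has no other neighbors, so it falls below the assumed threshold, while the edges of the A-B-C triangle are retained.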

RESULTS

Experiments with SLPC on two different categories of yeast PPI networks demonstrate the effectiveness of the approach. Compared with the original PPI networks, the best average improvements in F-score, accuracy, and maximum matching ratio (MMR) are 4.76, 6.81, and 15.75 percentage points, respectively. Compared with the denoised PPI networks, the best average improvements in F-score, accuracy, and MMR are 3.91, 4.61, and 12.10 percentage points, respectively. Compared with ClusterONE, the state-of-the-art complex detection method, on the denoised extended PPI networks, the average improvements in F-score and MMR are 26.02 and 22.40 percentage points, respectively.
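To make the F-score figures concrete, the sketch below shows one way such a score can be computed for predicted complexes against a reference set. The abstract does not define how a predicted complex is matched to a known one; the neighborhood-affinity score and the threshold `omega=0.25` used here are a convention common in the complex-detection literature and are assumed, not taken from the source. The function names and toy complexes are hypothetical, and complexes are assumed to be non-empty.

```python
def neighborhood_affinity(a, b):
    """Overlap score |A & B|^2 / (|A| * |B|) between two non-empty complexes."""
    a, b = set(a), set(b)
    return len(a & b) ** 2 / (len(a) * len(b))


def f_score(predicted, known, omega=0.25):
    """Harmonic mean of precision and recall under affinity matching.

    Assumption: a predicted and a known complex match when their
    neighborhood affinity is at least omega.
    """
    matched_pred = sum(
        any(neighborhood_affinity(p, k) >= omega for k in known)
        for p in predicted
    )
    matched_known = sum(
        any(neighborhood_affinity(p, k) >= omega for p in predicted)
        for k in known
    )
    precision = matched_pred / len(predicted) if predicted else 0.0
    recall = matched_known / len(known) if known else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy example (hypothetical complexes).
predicted = [{"A", "B", "C"}, {"X", "Y"}]
known = [{"A", "B", "C", "D"}, {"E", "F", "G"}]
score = f_score(predicted, known)  # precision 0.5, recall 0.5
```

An improvement of 4.76 percentage points in this metric would correspond, for example, to a score moving from 0.5000 to 0.5476.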

CONCLUSIONS

The experimental results show that the performance of SLPC improves substantially when newly available PPI data from the biomedical literature are integrated into the original and denoised PPI networks. In addition, our protein complex detection method achieves better performance than ClusterONE.
