RocSampler：在蛋白质-蛋白质相互作用网络中对重叠蛋白质复合物进行正则化

RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks.

作者信息

Maruyama Osamu, Kuwahara Yuki

机构信息

Institute of Mathematics for Industry, Kyushu University, 744 Motooka, Nishi-ku, Fukuoka, 819-0395, Japan.

Graduate School of Mathematics, Kyushu University, 744 Motooka, Nishi-ku, Fukuoka, 819-0395, Japan.

出版信息

BMC Bioinformatics. 2017 Dec 6;18(Suppl 15):491. doi: 10.1186/s12859-017-1920-5.

DOI:10.1186/s12859-017-1920-5

PMID:29244010

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5731504/

Abstract

BACKGROUND

In recent years, protein-protein interaction (PPI) networks have been well recognized as important resources to elucidate various biological processes and cellular mechanisms. In this paper, we address the problem of predicting protein complexes from a PPI network. This problem has two difficulties. One is related to small complexes, which contains two or three components. It is relatively difficult to identify them due to their simpler internal structure, but unfortunately complexes of such sizes are dominant in major protein complex databases, such as CYC2008. Another difficulty is how to model overlaps between predicted complexes, that is, how to evaluate different predicted complexes sharing common proteins because CYC2008 and other databases include such protein complexes. Thus, it is critical how to model overlaps between predicted complexes to identify them simultaneously.

RESULTS

In this paper, we propose a sampling-based protein complex prediction method, RocSampler (Regularizing Overlapping Complexes), which exploits, as part of the whole scoring function, a regularization term for the overlaps of predicted complexes and that for the distribution of sizes of predicted complexes. We have implemented RocSampler in MATLAB and its executable file for Windows is available at the site, http://imi.kyushu-u.ac.jp/~om/software/RocSampler/ .

CONCLUSIONS

We have applied RocSampler to five yeast PPI networks and shown that it is superior to other existing methods. This implies that the design of scoring functions including regularization terms is an effective approach for protein complex prediction.

摘要

背景

近年来，蛋白质 - 蛋白质相互作用（PPI）网络已被公认为是阐明各种生物过程和细胞机制的重要资源。在本文中，我们解决了从PPI网络预测蛋白质复合物的问题。这个问题存在两个难点。一个与小复合物有关，即包含两个或三个组分的复合物。由于其内部结构较为简单，识别它们相对困难，但不幸的是，这种规模的复合物在主要的蛋白质复合物数据库（如CYC2008）中占主导地位。另一个难点是如何对预测复合物之间的重叠进行建模，也就是说，如何评估共享共同蛋白质的不同预测复合物，因为CYC2008和其他数据库中都包含此类蛋白质复合物。因此，如何对预测复合物之间的重叠进行建模以同时识别它们至关重要。

结果

在本文中，我们提出了一种基于采样的蛋白质复合物预测方法RocSampler（正则化重叠复合物），该方法在整个评分函数中利用了一个针对预测复合物重叠的正则化项以及一个针对预测复合物大小分布的正则化项。我们已在MATLAB中实现了RocSampler，其Windows可执行文件可在网站http://imi.kyushu-u.ac.jp/~om/software/RocSampler/获取。

结论

我们将RocSampler应用于五个酵母PPI网络，并表明它优于其他现有方法。这意味着包含正则化项的评分函数设计是蛋白质复合物预测的一种有效方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b24/5731504/f1b41a652115/12859_2017_1920_Fig1_HTML.jpg

相似文献

RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks.RocSampler：在蛋白质-蛋白质相互作用网络中对重叠蛋白质复合物进行正则化

BMC Bioinformatics. 2017 Dec 6;18(Suppl 15):491. doi: 10.1186/s12859-017-1920-5.

Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物：进化增强的马尔可夫聚类。

Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.

Sampling strategy for protein complex prediction using cluster size frequency.基于簇大小频率的蛋白质复合物预测抽样策略。

Gene. 2013 Apr 10;518(1):152-8. doi: 10.1016/j.gene.2012.11.050. Epub 2012 Dec 9.

An effective approach to detecting both small and large complexes from protein-protein interaction networks.一种从蛋白质-蛋白质相互作用网络中检测大小复合物的有效方法。

BMC Bioinformatics. 2017 Oct 16;18(Suppl 12):419. doi: 10.1186/s12859-017-1820-8.

Protein Complexes Prediction Method Based on Core-Attachment Structure and Functional Annotations.基于核心附着结构和功能注释的蛋白质复合物预测方法。

Int J Mol Sci. 2017 Sep 6;18(9):1910. doi: 10.3390/ijms18091910.

Predicting overlapping protein complexes from weighted protein interaction graphs by gradually expanding dense neighborhoods.通过逐步扩展密集邻域从加权蛋白质相互作用图预测重叠蛋白质复合物。

Artif Intell Med. 2016 Jul;71:62-9. doi: 10.1016/j.artmed.2016.05.006. Epub 2016 Jun 28.

Protein complex prediction via dense subgraphs and false positive analysis.通过密集子图和误报分析进行蛋白质复合物预测

PLoS One. 2017 Sep 22;12(9):e0183460. doi: 10.1371/journal.pone.0183460. eCollection 2017.

Determining the minimum number of protein-protein interactions required to support known protein complexes.确定支持已知蛋白质复合物所需的蛋白质-蛋白质相互作用的最小数量。

PLoS One. 2018 Apr 26;13(4):e0195545. doi: 10.1371/journal.pone.0195545. eCollection 2018.

From Function to Interaction: A New Paradigm for Accurately Predicting Protein Complexes Based on Protein-to-Protein Interaction Networks.从功能到相互作用：基于蛋白质-蛋白质相互作用网络准确预测蛋白质复合物的新范式。

IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):616-27. doi: 10.1109/TCBB.2014.2306825.

Heterodimeric protein complex identification by naïve Bayes classifiers.基于朴素贝叶斯分类器的异源二聚体蛋白质复合物鉴定

BMC Bioinformatics. 2013 Dec 3;14:347. doi: 10.1186/1471-2105-14-347.

引用本文的文献

Detecting complexes from edge-weighted PPI networks via genes expression analysis.通过基因表达分析从边加权蛋白质-蛋白质相互作用网络中检测复合物。

BMC Syst Biol. 2018 Apr 24;12(Suppl 4):40. doi: 10.1186/s12918-018-0565-y.

本文引用的文献

From the static interactome to dynamic protein complexes: Three challenges.从静态相互作用组到动态蛋白质复合物：三大挑战。

J Bioinform Comput Biol. 2015 Apr;13(2):1571001. doi: 10.1142/S0219720015710018. Epub 2015 Jan 7.

Repulsive parallel MCMC algorithm for discovering diverse motifs from large sequence sets.用于从大型序列集中发现多样基序的排斥并行马尔可夫链蒙特卡罗算法。

Bioinformatics. 2015 May 15;31(10):1561-8. doi: 10.1093/bioinformatics/btv017. Epub 2015 Jan 11.

Discovery of small protein complexes from PPI networks with size-specific supervised weighting.通过大小特异性监督加权从蛋白质-蛋白质相互作用网络中发现小蛋白质复合物。

BMC Syst Biol. 2014;8 Suppl 5(Suppl 5):S3. doi: 10.1186/1752-0509-8-S5-S3. Epub 2014 Dec 12.

ReSAPP: predicting overlapping protein complexes by merging multiple-sampled partitions of proteins.ReSAPP：通过合并蛋白质的多个采样分区来预测重叠蛋白质复合物

J Bioinform Comput Biol. 2014 Dec;12(6):1442004. doi: 10.1142/S0219720014420049.

PPSampler2: predicting protein complexes more accurately and efficiently by sampling.PPSampler2：通过采样更准确高效地预测蛋白质复合物

BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S14. doi: 10.1186/1752-0509-7-S6-S14. Epub 2013 Dec 13.

A survey of computational methods for protein complex prediction from protein interaction networks.从蛋白质相互作用网络预测蛋白质复合物的计算方法综述。

J Bioinform Comput Biol. 2013 Apr;11(2):1230002. doi: 10.1142/S021972001230002X. Epub 2012 Nov 7.

Detecting overlapping protein complexes in protein-protein interaction networks.检测蛋白质-蛋白质相互作用网络中的重叠蛋白质复合物。

Nat Methods. 2012 Mar 18;9(5):471-2. doi: 10.1038/nmeth.1938.

NWE: Node-weighted expansion for protein complex prediction using random walk distances.NWE：基于随机游走距离的节点加权扩展的蛋白质复合物预测方法。

Proteome Sci. 2011 Oct 14;9 Suppl 1(Suppl 1):S14. doi: 10.1186/1477-5956-9-S1-S14.

SPICi: a fast clustering algorithm for large biological networks.SPICi：一种用于大型生物网络的快速聚类算法。

Bioinformatics. 2010 Apr 15;26(8):1105-11. doi: 10.1093/bioinformatics/btq078. Epub 2010 Feb 24.

Computational approaches for detecting protein complexes from protein interaction networks: a survey.从蛋白质相互作用网络中检测蛋白质复合物的计算方法：综述。

BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2164-11-S1-S3.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

RocSampler：在蛋白质-蛋白质相互作用网络中对重叠蛋白质复合物进行正则化

RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献