• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于贝叶斯非负矩阵分解的加权集成聚类检测蛋白质复合物。

Protein complex detection via weighted ensemble clustering based on Bayesian nonnegative matrix factorization.

机构信息

Center for Computer Vision and Department of Mathematics, Sun Yat-Sen University, Guangzhou, China.

出版信息

PLoS One. 2013 May 2;8(5):e62158. doi: 10.1371/journal.pone.0062158. Print 2013.

DOI:10.1371/journal.pone.0062158
PMID:23658709
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3642239/
Abstract

Detecting protein complexes from protein-protein interaction (PPI) networks is a challenging task in computational biology. A vast number of computational methods have been proposed to undertake this task. However, each computational method is developed to capture one aspect of the network. The performance of different methods on the same network can differ substantially, even the same method may have different performance on networks with different topological characteristic. The clustering result of each computational method can be regarded as a feature that describes the PPI network from one aspect. It is therefore desirable to utilize these features to produce a more accurate and reliable clustering. In this paper, a novel Bayesian Nonnegative Matrix Factorization (NMF)-based weighted Ensemble Clustering algorithm (EC-BNMF) is proposed to detect protein complexes from PPI networks. We first apply different computational algorithms on a PPI network to generate some base clustering results. Then we integrate these base clustering results into an ensemble PPI network, in the form of weighted combination. Finally, we identify overlapping protein complexes from this network by employing Bayesian NMF model. When generating an ensemble PPI network, EC-BNMF can automatically optimize the values of weights such that the ensemble algorithm can deliver better results. Experimental results on four PPI networks of Saccharomyces cerevisiae well verify the effectiveness of EC-BNMF in detecting protein complexes. EC-BNMF provides an effective way to integrate different clustering results for more accurate and reliable complex detection. Furthermore, EC-BNMF has a high degree of flexibility in the choice of base clustering results. It can be coupled with existing clustering methods to identify protein complexes.

摘要

从蛋白质-蛋白质相互作用(PPI)网络中检测蛋白质复合物是计算生物学中的一项具有挑战性的任务。已经提出了大量的计算方法来完成这项任务。然而,每种计算方法都是为了捕捉网络的一个方面而开发的。不同方法在同一网络上的性能可能有很大差异,即使同一方法在拓扑特征不同的网络上也可能有不同的性能。每个计算方法的聚类结果可以看作是从一个方面描述 PPI 网络的特征。因此,希望利用这些特征来产生更准确和可靠的聚类。在本文中,提出了一种新的基于贝叶斯非负矩阵分解(NMF)的加权集成聚类算法(EC-BNMF),用于从 PPI 网络中检测蛋白质复合物。我们首先将不同的计算算法应用于 PPI 网络,以生成一些基本的聚类结果。然后,我们将这些基本聚类结果集成到一个集成的 PPI 网络中,以加权组合的形式。最后,我们通过贝叶斯 NMF 模型从这个网络中识别重叠的蛋白质复合物。在生成集成 PPI 网络时,EC-BNMF 可以自动优化权重值,使集成算法能够产生更好的结果。在四个酿酒酵母 PPI 网络上的实验结果很好地验证了 EC-BNMF 在检测蛋白质复合物方面的有效性。EC-BNMF 为集成不同的聚类结果提供了一种有效的方法,以实现更准确和可靠的复合物检测。此外,EC-BNMF 在选择基本聚类结果方面具有很高的灵活性。它可以与现有的聚类方法结合使用,以识别蛋白质复合物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/5f76f24beeee/pone.0062158.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/2b5fb1ec8fc6/pone.0062158.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/28ae783e2a7f/pone.0062158.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/4ec0e8b4ea82/pone.0062158.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/67735e41f90f/pone.0062158.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/9034559b3fbb/pone.0062158.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/a8374d784e84/pone.0062158.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/9fb3b96a1694/pone.0062158.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/48369ddbf35f/pone.0062158.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/3c25f8bb68d8/pone.0062158.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/5f76f24beeee/pone.0062158.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/2b5fb1ec8fc6/pone.0062158.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/28ae783e2a7f/pone.0062158.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/4ec0e8b4ea82/pone.0062158.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/67735e41f90f/pone.0062158.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/9034559b3fbb/pone.0062158.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/a8374d784e84/pone.0062158.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/9fb3b96a1694/pone.0062158.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/48369ddbf35f/pone.0062158.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/3c25f8bb68d8/pone.0062158.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7efe/3642239/5f76f24beeee/pone.0062158.g010.jpg

相似文献

1
Protein complex detection via weighted ensemble clustering based on Bayesian nonnegative matrix factorization.基于贝叶斯非负矩阵分解的加权集成聚类检测蛋白质复合物。
PLoS One. 2013 May 2;8(5):e62158. doi: 10.1371/journal.pone.0062158. Print 2013.
2
A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks.一种用于从多个异构网络中检测蛋白质复合物的多网络聚类方法。
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):463. doi: 10.1186/s12859-017-1877-4.
3
MCL-CAw: a refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure.MCL-CAw:一种改进的 MCL 方法,用于通过整合核心附着结构,从加权 PPI 网络中检测酵母复合物。
BMC Bioinformatics. 2010 Oct 12;11:504. doi: 10.1186/1471-2105-11-504.
4
Joint clustering of protein interaction networks through Markov random walk.通过马尔可夫随机游走对蛋白质相互作用网络进行联合聚类。
BMC Syst Biol. 2014;8 Suppl 1(Suppl 1):S9. doi: 10.1186/1752-0509-8-S1-S9. Epub 2014 Jan 24.
5
Protein Complex Detection via Effective Integration of Base Clustering Solutions and Co-Complex Affinity Scores.通过有效整合基本聚类解决方案和共复合体亲和力得分进行蛋白质复合体检测
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):733-739. doi: 10.1109/TCBB.2016.2552176. Epub 2016 Apr 8.
6
Decision tree classifier based on topological characteristics of subgraph for the mining of protein complexes from large scale PPI networks.基于子图拓扑特征的决策树分类器用于从大规模 PPI 网络中挖掘蛋白质复合物。
Comput Biol Chem. 2023 Oct;106:107935. doi: 10.1016/j.compbiolchem.2023.107935. Epub 2023 Jul 25.
7
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物:进化增强的马尔可夫聚类。
Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.
8
Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources.通过整合不同物理相互作用数据源重建大肠杆菌的高可信度蛋白质-蛋白质相互作用网络来改进蛋白质复合物预测。
BMC Bioinformatics. 2017 Jan 3;18(1):10. doi: 10.1186/s12859-016-1422-x.
9
DPCMNE: Detecting Protein Complexes From Protein-Protein Interaction Networks Via Multi-Level Network Embedding.DPCMNE:通过多层次网络嵌入从蛋白质-蛋白质相互作用网络中检测蛋白质复合物。
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1592-1602. doi: 10.1109/TCBB.2021.3050102. Epub 2022 Jun 3.
10
Discovering overlapped protein complexes from weighted PPI networks by removing inter-module hubs.从加权 PPI 网络中去除模块间枢纽发现重叠蛋白复合物。
Sci Rep. 2017 Jun 12;7(1):3247. doi: 10.1038/s41598-017-03268-w.

引用本文的文献

1
Identifying Protein Complexes With Clear Module Structure Using Pairwise Constraints in Protein Interaction Networks.利用蛋白质相互作用网络中的成对约束识别具有清晰模块结构的蛋白质复合物
Front Genet. 2021 Aug 27;12:664786. doi: 10.3389/fgene.2021.664786. eCollection 2021.
2
EC-PGMGR: Ensemble Clustering Based on Probability Graphical Model With Graph Regularization for Single-Cell RNA-seq Data.EC-PGMGR:基于概率图模型和图正则化的单细胞RNA测序数据集成聚类
Front Genet. 2020 Nov 4;11:572242. doi: 10.3389/fgene.2020.572242. eCollection 2020.
3
Overlapping functional modules detection in PPI network with pair-wise constrained non-negative matrix tri-factorisation.

本文引用的文献

1
Automatic relevance determination in nonnegative matrix factorization with the β-divergence.基于β散度的非负矩阵分解中的自动相关性确定。
IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1592-605. doi: 10.1109/TPAMI.2012.240.
2
Exploring overlapping functional units with various structure in protein interaction networks.探索蛋白质相互作用网络中具有不同结构的重叠功能单元。
PLoS One. 2012;7(8):e43092. doi: 10.1371/journal.pone.0043092. Epub 2012 Aug 20.
3
Consensus clustering in complex networks.复杂网络中的共识聚类。
基于成对约束非负矩阵三因式分解的蛋白质-蛋白质相互作用网络中重叠功能模块检测
IET Syst Biol. 2018 Apr;12(2):45-54. doi: 10.1049/iet-syb.2017.0084.
4
A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks.一种用于从多个异构网络中检测蛋白质复合物的多网络聚类方法。
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):463. doi: 10.1186/s12859-017-1877-4.
5
Protein Complexes Prediction Method Based on Core-Attachment Structure and Functional Annotations.基于核心附着结构和功能注释的蛋白质复合物预测方法。
Int J Mol Sci. 2017 Sep 6;18(9):1910. doi: 10.3390/ijms18091910.
6
Protein complex detection based on partially shared multi-view clustering.基于部分共享多视图聚类的蛋白质复合物检测
BMC Bioinformatics. 2016 Sep 13;17(1):371. doi: 10.1186/s12859-016-1164-9.
7
BiCAMWI: A Genetic-Based Biclustering Algorithm for Detecting Dynamic Protein Complexes.BiCAMWI:一种用于检测动态蛋白质复合物的基于遗传的双聚类算法。
PLoS One. 2016 Jul 27;11(7):e0159923. doi: 10.1371/journal.pone.0159923. eCollection 2016.
8
A two-layer integration framework for protein complex detection.一种用于蛋白质复合物检测的双层集成框架。
BMC Bioinformatics. 2016 Feb 24;17:100. doi: 10.1186/s12859-016-0939-3.
9
Microbial community pattern detection in human body habitats via ensemble clustering framework.通过集成聚类框架检测人体栖息地中的微生物群落模式
BMC Syst Biol. 2014;8 Suppl 4(Suppl 4):S7. doi: 10.1186/1752-0509-8-S4-S7. Epub 2014 Dec 8.
10
Detecting temporal protein complexes from dynamic protein-protein interaction networks.从动态蛋白质-蛋白质相互作用网络中检测瞬时蛋白质复合物。
BMC Bioinformatics. 2014 Oct 4;15(1):335. doi: 10.1186/1471-2105-15-335.
Sci Rep. 2012;2:336. doi: 10.1038/srep00336. Epub 2012 Mar 27.
4
Detecting overlapping protein complexes in protein-protein interaction networks.检测蛋白质-蛋白质相互作用网络中的重叠蛋白质复合物。
Nat Methods. 2012 Mar 18;9(5):471-2. doi: 10.1038/nmeth.1938.
5
Ensemble manifold regularization.集成流形正则化。
IEEE Trans Pattern Anal Mach Intell. 2012 Jun;34(6):1227-33. doi: 10.1109/TPAMI.2012.57.
6
Protein complexes discovery based on protein-protein interaction data via a regularized sparse generative network model.基于正则化稀疏生成网络模型的蛋白质-蛋白质相互作用数据的蛋白质复合物发现。
IEEE/ACM Trans Comput Biol Bioinform. 2012 May-Jun;9(3):857-70. doi: 10.1109/TCBB.2012.20.
7
A framework for incorporating functional interrelationships into protein function prediction algorithms.将功能相互关系纳入蛋白质功能预测算法的框架。
IEEE/ACM Trans Comput Biol Bioinform. 2012 May-Jun;9(3):740-53. doi: 10.1109/TCBB.2011.148.
8
Multifunctional proteins revealed by overlapping clustering in protein interaction network.蛋白质相互作用网络中重叠聚类揭示的多功能蛋白
Bioinformatics. 2012 Jan 1;28(1):84-90. doi: 10.1093/bioinformatics/btr621. Epub 2011 Nov 10.
9
Overlapping community detection using Bayesian non-negative matrix factorization.使用贝叶斯非负矩阵分解的重叠社区检测
Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Jun;83(6 Pt 2):066114. doi: 10.1103/PhysRevE.83.066114. Epub 2011 Jun 22.
10
MINE: Module Identification in Networks.矿:网络中的模块识别。
BMC Bioinformatics. 2011 May 23;12:192. doi: 10.1186/1471-2105-12-192.