• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过有效整合基本聚类解决方案和共复合体亲和力得分进行蛋白质复合体检测

Protein Complex Detection via Effective Integration of Base Clustering Solutions and Co-Complex Affinity Scores.

作者信息

Wu Min, Ou-Yang Le, Li Xiao-Li

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):733-739. doi: 10.1109/TCBB.2016.2552176. Epub 2016 Apr 8.

DOI:10.1109/TCBB.2016.2552176
PMID:27071190
Abstract

With the increasing availability of protein interaction data, various computational methods have been developed to predict protein complexes. However, different computational methods may have their own advantages and limitations. Ensemble clustering has thus been studied to minimize the potential bias and risk of individual methods and generate prediction results with better coverage and accuracy. In this paper, we extend the traditional ensemble clustering by taking into account the co-complex affinity scores and present an Ensemble H ierarchical Clustering framework (EnsemHC) to detect protein complexes. First, we construct co-cluster matrices by integrating the clustering results with the co-complex evidences. Second, we sum up the constructed co-cluster matrices to derive a final ensemble matrix via a novel iterative weighting scheme. Finally, we apply the hierarchical clustering to generate protein complexes from the final ensemble matrix. Experimental results demonstrate that our EnsemHC performs better than its base clustering methods and various existing integrative methods. In addition, we also observed that integrating the clusters and co-complex affinity scores from different data sources will improve the prediction performance, e.g., integrating the clusters from TAP data and co-complex affinities from binary PPI data achieved the best performance in our experiments.

摘要

随着蛋白质相互作用数据的日益丰富,人们开发了各种计算方法来预测蛋白质复合物。然而,不同的计算方法可能各有优缺点。因此,人们研究了集成聚类,以尽量减少单个方法的潜在偏差和风险,并生成具有更好覆盖范围和准确性的预测结果。在本文中,我们通过考虑共复合物亲和力得分扩展了传统的集成聚类,并提出了一种集成层次聚类框架(EnsemHC)来检测蛋白质复合物。首先,我们通过将聚类结果与共复合物证据相结合来构建共聚类矩阵。其次,我们通过一种新颖的迭代加权方案对构建的共聚类矩阵求和,以得到最终的集成矩阵。最后,我们应用层次聚类从最终的集成矩阵中生成蛋白质复合物。实验结果表明,我们的EnsemHC比其基础聚类方法和各种现有的综合方法表现更好。此外,我们还观察到,整合来自不同数据源的聚类和共复合物亲和力得分将提高预测性能,例如,在我们的实验中,整合来自TAP数据的聚类和来自二元PPI数据的共复合物亲和力获得了最佳性能。

相似文献

1
Protein Complex Detection via Effective Integration of Base Clustering Solutions and Co-Complex Affinity Scores.通过有效整合基本聚类解决方案和共复合体亲和力得分进行蛋白质复合体检测
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):733-739. doi: 10.1109/TCBB.2016.2552176. Epub 2016 Apr 8.
2
A two-layer integration framework for protein complex detection.一种用于蛋白质复合物检测的双层集成框架。
BMC Bioinformatics. 2016 Feb 24;17:100. doi: 10.1186/s12859-016-0939-3.
3
Identifying protein complexes from heterogeneous biological data.从异构生物数据中鉴定蛋白质复合物。
Proteins. 2013 Nov;81(11):2023-33. doi: 10.1002/prot.24365. Epub 2013 Aug 23.
4
Protein complex detection via weighted ensemble clustering based on Bayesian nonnegative matrix factorization.基于贝叶斯非负矩阵分解的加权集成聚类检测蛋白质复合物。
PLoS One. 2013 May 2;8(5):e62158. doi: 10.1371/journal.pone.0062158. Print 2013.
5
Protein complex detection based on partially shared multi-view clustering.基于部分共享多视图聚类的蛋白质复合物检测
BMC Bioinformatics. 2016 Sep 13;17(1):371. doi: 10.1186/s12859-016-1164-9.
6
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物:进化增强的马尔可夫聚类。
Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.
7
Ensembling of Gene Clusters Utilizing Deep Learning and Protein-Protein Interaction Information.利用深度学习和蛋白质-蛋白质相互作用信息进行基因簇组装。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):2005-2016. doi: 10.1109/TCBB.2019.2918523. Epub 2020 Dec 8.
8
Complex discovery from weighted PPI networks.基于加权 PPI 网络的复杂发现。
Bioinformatics. 2009 Aug 1;25(15):1891-7. doi: 10.1093/bioinformatics/btp311. Epub 2009 May 12.
9
Knowledge based cluster ensemble for cancer discovery from biomolecular data.基于知识的聚类集成在生物分子数据中的癌症发现。
IEEE Trans Nanobioscience. 2011 Jun;10(2):76-85. doi: 10.1109/TNB.2011.2144997. Epub 2011 Jul 7.
10
From Function to Interaction: A New Paradigm for Accurately Predicting Protein Complexes Based on Protein-to-Protein Interaction Networks.从功能到相互作用:基于蛋白质-蛋白质相互作用网络准确预测蛋白质复合物的新范式。
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):616-27. doi: 10.1109/TCBB.2014.2306825.

引用本文的文献

1
Computational Approaches to Predict Protein-Protein Interactions in Crowded Cellular Environments.计算方法在拥挤细胞环境中预测蛋白质-蛋白质相互作用。
Chem Rev. 2024 Apr 10;124(7):3932-3977. doi: 10.1021/acs.chemrev.3c00550. Epub 2024 Mar 27.
2
Efficient and accurate identification of protein complexes from protein-protein interaction networks based on the clustering coefficient.基于聚类系数从蛋白质-蛋白质相互作用网络中高效准确地识别蛋白质复合物。
Comput Struct Biotechnol J. 2021 Sep 20;19:5255-5263. doi: 10.1016/j.csbj.2021.09.014. eCollection 2021.
3
Maximizing cohesion and separation for detecting protein functional modules in protein-protein interaction networks.
最大化蛋白质-蛋白质相互作用网络中检测蛋白质功能模块的内聚性和分离度。
PLoS One. 2020 Oct 13;15(10):e0240628. doi: 10.1371/journal.pone.0240628. eCollection 2020.