• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用核心与剥离方法对大型蛋白质-蛋白质相互作用网络进行蛋白质复合物预测。

Protein complex prediction for large protein protein interaction networks with the Core&Peel method.

作者信息

Pellegrini Marco, Baglioni Miriam, Geraci Filippo

机构信息

Laboratory for Integrative Systems Medicine - Istituto di Informatica e Telematica and Istituto di Fisiologia Clinica del CNR, via Moruzzi 1, Pisa, 56124, Italy.

出版信息

BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):372. doi: 10.1186/s12859-016-1191-6.

DOI:10.1186/s12859-016-1191-6
PMID:28185552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5123419/
Abstract

BACKGROUND

Biological networks play an increasingly important role in the exploration of functional modularity and cellular organization at a systemic level. Quite often the first tools used to analyze these networks are clustering algorithms. We concentrate here on the specific task of predicting protein complexes (PC) in large protein-protein interaction networks (PPIN). Currently, many state-of-the-art algorithms work well for networks of small or moderate size. However, their performance on much larger networks, which are becoming increasingly common in modern proteome-wise studies, needs to be re-assessed.

RESULTS AND DISCUSSION

We present a new fast algorithm for clustering large sparse networks: Core&Peel, which runs essentially in time and storage O(a(G)m+n) for a network G of n nodes and m arcs, where a(G) is the arboricity of G (which is roughly proportional to the maximum average degree of any induced subgraph in G). We evaluated Core&Peel on five PPI networks of large size and one of medium size from both yeast and homo sapiens, comparing its performance against those of ten state-of-the-art methods. We demonstrate that Core&Peel consistently outperforms the ten competitors in its ability to identify known protein complexes and in the functional coherence of its predictions. Our method is remarkably robust, being quite insensible to the injection of random interactions. Core&Peel is also empirically efficient attaining the second best running time over large networks among the tested algorithms.

CONCLUSIONS

Our algorithm Core&Peel pushes forward the state-of the-art in PPIN clustering providing an algorithmic solution with polynomial running time that attains experimentally demonstrable good output quality and speed on challenging large real networks.

摘要

背景

生物网络在系统层面探索功能模块性和细胞组织方面发挥着越来越重要的作用。聚类算法常常是用于分析这些网络的首批工具。我们在此专注于在大型蛋白质 - 蛋白质相互作用网络(PPIN)中预测蛋白质复合物(PC)这一特定任务。目前,许多先进算法在中小型网络中运行良好。然而,它们在大得多的网络上的性能需要重新评估,而这种大网络在现代全蛋白质组研究中越来越常见。

结果与讨论

我们提出了一种用于聚类大型稀疏网络的新的快速算法:核心&剥离算法(Core&Peel),对于一个具有n个节点和m条弧的网络G,该算法的运行时间和存储空间本质上为O(a(G)m + n),其中a(G)是G的树密度(大致与G中任何诱导子图的最大平均度成正比)。我们在来自酵母和智人的五个大型PPI网络和一个中型PPI网络上评估了核心&剥离算法,将其性能与十种先进方法的性能进行了比较。我们证明,在识别已知蛋白质复合物的能力以及预测的功能连贯性方面,核心&剥离算法始终优于这十种竞争对手。我们的方法非常稳健,对随机相互作用的注入相当不敏感。在测试算法中,核心&剥离算法在大型网络上的运行时间也经验性地达到了第二快。

结论

我们的算法核心&剥离算法推动了PPIN聚类技术的发展,提供了一种具有多项式运行时间的算法解决方案,在具有挑战性的大型真实网络上实现了实验上可证明的良好输出质量和速度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/15cc22705ed6/12859_2016_1191_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/54c8ab667294/12859_2016_1191_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/dd99e9ba91c8/12859_2016_1191_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/5f156d0f0e0c/12859_2016_1191_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/695303206fc5/12859_2016_1191_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/6f43a8f57f42/12859_2016_1191_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/e18dfaf79c6e/12859_2016_1191_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/e5224c009344/12859_2016_1191_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/0ad1b878ce4d/12859_2016_1191_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/5b517a60e587/12859_2016_1191_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/d76e78c78f91/12859_2016_1191_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/77f78c261dd3/12859_2016_1191_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/0b528c782a00/12859_2016_1191_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/944d0dcb039e/12859_2016_1191_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/d971fdd534b6/12859_2016_1191_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/9f224ee066ac/12859_2016_1191_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/19bad784eea3/12859_2016_1191_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/acfa072197c0/12859_2016_1191_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/0c5c828028e6/12859_2016_1191_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/a8862174f7de/12859_2016_1191_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/be2f0e16a662/12859_2016_1191_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/7253f7ff84f9/12859_2016_1191_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/15cc22705ed6/12859_2016_1191_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/54c8ab667294/12859_2016_1191_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/dd99e9ba91c8/12859_2016_1191_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/5f156d0f0e0c/12859_2016_1191_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/695303206fc5/12859_2016_1191_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/6f43a8f57f42/12859_2016_1191_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/e18dfaf79c6e/12859_2016_1191_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/e5224c009344/12859_2016_1191_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/0ad1b878ce4d/12859_2016_1191_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/5b517a60e587/12859_2016_1191_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/d76e78c78f91/12859_2016_1191_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/77f78c261dd3/12859_2016_1191_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/0b528c782a00/12859_2016_1191_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/944d0dcb039e/12859_2016_1191_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/d971fdd534b6/12859_2016_1191_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/9f224ee066ac/12859_2016_1191_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/19bad784eea3/12859_2016_1191_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/acfa072197c0/12859_2016_1191_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/0c5c828028e6/12859_2016_1191_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/a8862174f7de/12859_2016_1191_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/be2f0e16a662/12859_2016_1191_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/7253f7ff84f9/12859_2016_1191_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3b34/5123419/15cc22705ed6/12859_2016_1191_Fig22_HTML.jpg

相似文献

1
Protein complex prediction for large protein protein interaction networks with the Core&Peel method.使用核心与剥离方法对大型蛋白质-蛋白质相互作用网络进行蛋白质复合物预测。
BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):372. doi: 10.1186/s12859-016-1191-6.
2
A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks.一种用于从多个异构网络中检测蛋白质复合物的多网络聚类方法。
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):463. doi: 10.1186/s12859-017-1877-4.
3
Protein complex prediction via dense subgraphs and false positive analysis.通过密集子图和误报分析进行蛋白质复合物预测
PLoS One. 2017 Sep 22;12(9):e0183460. doi: 10.1371/journal.pone.0183460. eCollection 2017.
4
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物:进化增强的马尔可夫聚类。
Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.
5
An effective approach to detecting both small and large complexes from protein-protein interaction networks.一种从蛋白质-蛋白质相互作用网络中检测大小复合物的有效方法。
BMC Bioinformatics. 2017 Oct 16;18(Suppl 12):419. doi: 10.1186/s12859-017-1820-8.
6
Topological and functional comparison of community detection algorithms in biological networks.生物网络中社团检测算法的拓扑和功能比较。
BMC Bioinformatics. 2019 Apr 27;20(1):212. doi: 10.1186/s12859-019-2746-0.
7
MCL-CAw: a refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure.MCL-CAw:一种改进的 MCL 方法,用于通过整合核心附着结构,从加权 PPI 网络中检测酵母复合物。
BMC Bioinformatics. 2010 Oct 12;11:504. doi: 10.1186/1471-2105-11-504.
8
Protein Complexes Prediction Method Based on Core-Attachment Structure and Functional Annotations.基于核心附着结构和功能注释的蛋白质复合物预测方法。
Int J Mol Sci. 2017 Sep 6;18(9):1910. doi: 10.3390/ijms18091910.
9
Predicting overlapping protein complexes based on core-attachment and a local modularity structure.基于核心附着和局部模块结构预测重叠蛋白质复合物。
BMC Bioinformatics. 2018 Aug 22;19(1):305. doi: 10.1186/s12859-018-2309-9.
10
Efficient and accurate Greedy Search Methods for mining functional modules in protein interaction networks.高效准确的贪心法在蛋白质相互作用网络中挖掘功能模块。
BMC Bioinformatics. 2012 Jun 25;13 Suppl 10(Suppl 10):S19. doi: 10.1186/1471-2105-13-S10-S19.

引用本文的文献

1
Computational Approaches to Predict Protein-Protein Interactions in Crowded Cellular Environments.计算方法在拥挤细胞环境中预测蛋白质-蛋白质相互作用。
Chem Rev. 2024 Apr 10;124(7):3932-3977. doi: 10.1021/acs.chemrev.3c00550. Epub 2024 Mar 27.
2
Computational identification of protein complexes from network interactions: Present state, challenges, and the way forward.从网络相互作用中进行蛋白质复合物的计算识别:现状、挑战及未来方向。
Comput Struct Biotechnol J. 2022 May 27;20:2699-2712. doi: 10.1016/j.csbj.2022.05.049. eCollection 2022.
3
Gene Expression Profiling of Glioblastoma to Recognize Potential Biomarker Candidates.

本文引用的文献

1
Identification of Protein Complexes Using Weighted PageRank-Nibble Algorithm and Core-Attachment Structure.使用加权PageRank-Nibble算法和核心-附属结构识别蛋白质复合物
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):179-92. doi: 10.1109/TCBB.2014.2343954.
2
Dynamic identifying protein functional modules based on adaptive density modularity in protein-protein interaction networks.基于蛋白质-蛋白质相互作用网络中自适应密度模块度的动态识别蛋白质功能模块
BMC Bioinformatics. 2015;16 Suppl 12(Suppl 12):S5. doi: 10.1186/1471-2105-16-S12-S5. Epub 2015 Aug 25.
3
Inferring drug-disease associations based on known protein complexes.
胶质母细胞瘤的基因表达谱分析以识别潜在的生物标志物候选物。
Front Genet. 2022 Apr 27;13:832742. doi: 10.3389/fgene.2022.832742. eCollection 2022.
4
AI applications in functional genomics.人工智能在功能基因组学中的应用。
Comput Struct Biotechnol J. 2021 Oct 11;19:5762-5790. doi: 10.1016/j.csbj.2021.10.009. eCollection 2021.
5
Efficient and accurate identification of protein complexes from protein-protein interaction networks based on the clustering coefficient.基于聚类系数从蛋白质-蛋白质相互作用网络中高效准确地识别蛋白质复合物。
Comput Struct Biotechnol J. 2021 Sep 20;19:5255-5263. doi: 10.1016/j.csbj.2021.09.014. eCollection 2021.
6
Accurate prediction of breast cancer survival through coherent voting networks with gene expression profiling.通过基因表达谱的相干投票网络准确预测乳腺癌的生存。
Sci Rep. 2021 Jul 19;11(1):14645. doi: 10.1038/s41598-021-94243-z.
7
Performance improvement for a 2D convolutional neural network by using SSC encoding on protein-protein interaction tasks.利用 SSC 编码提高二维卷积神经网络在蛋白质相互作用任务上的性能。
BMC Bioinformatics. 2021 Apr 12;22(1):184. doi: 10.1186/s12859-021-04111-w.
8
Finding disease modules for cancer and COVID-19 in gene co-expression networks with the Core&Peel method.利用 Core&Peel 方法在基因共表达网络中寻找癌症和 COVID-19 的疾病模块。
Sci Rep. 2020 Oct 19;10(1):17628. doi: 10.1038/s41598-020-74705-6.
9
Challenges in the construction of knowledge bases for human microbiome-disease associations.人类微生物组-疾病关联知识库构建面临的挑战。
Microbiome. 2019 Sep 5;7(1):129. doi: 10.1186/s40168-019-0742-2.
10
Integrating data and knowledge to identify functional modules of genes: a multilayer approach.整合数据和知识以识别基因的功能模块:一种多层方法。
BMC Bioinformatics. 2019 May 2;20(1):225. doi: 10.1186/s12859-019-2800-y.
基于已知蛋白质复合物推断药物-疾病关联。
BMC Med Genomics. 2015;8 Suppl 2(Suppl 2):S2. doi: 10.1186/1755-8794-8-S2-S2. Epub 2015 May 29.
4
Methods for protein complex prediction and their contributions towards understanding the organisation, function and dynamics of complexes.蛋白质复合物预测方法及其对理解复合物的组织、功能和动力学的贡献。
FEBS Lett. 2015 Sep 14;589(19 Pt A):2590-602. doi: 10.1016/j.febslet.2015.04.026. Epub 2015 Apr 23.
5
Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes.泛癌网络分析确定了跨通路和蛋白质复合物的罕见体细胞突变组合。
Nat Genet. 2015 Feb;47(2):106-14. doi: 10.1038/ng.3168. Epub 2014 Dec 15.
6
A proteome-scale map of the human interactome network.人类相互作用组网络的蛋白质组规模图谱。
Cell. 2014 Nov 20;159(5):1212-1226. doi: 10.1016/j.cell.2014.10.050.
7
ComPPI: a cellular compartment-specific database for protein-protein interaction network analysis.ComPPI:用于蛋白质-蛋白质相互作用网络分析的细胞区室特异性数据库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D485-93. doi: 10.1093/nar/gku1007. Epub 2014 Oct 27.
8
Detecting protein complexes in protein interaction networks using a ranking algorithm with a refined merging procedure.使用一种带有改进合并过程的排序算法在蛋白质相互作用网络中检测蛋白质复合物。
BMC Bioinformatics. 2014 Jun 19;15:204. doi: 10.1186/1471-2105-15-204.
9
PPSampler2: predicting protein complexes more accurately and efficiently by sampling.PPSampler2:通过采样更准确高效地预测蛋白质复合物
BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S14. doi: 10.1186/1752-0509-7-S6-S14. Epub 2013 Dec 13.
10
Identifying conserved protein complexes between species by constructing interolog networks.通过构建种间同源蛋白互作网络来鉴定物种间保守的蛋白质复合物。
BMC Bioinformatics. 2013;14 Suppl 16(Suppl 16):S8. doi: 10.1186/1471-2105-14-S16-S8. Epub 2013 Oct 22.