• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图的聚类对随机网络变化的稳健性。

On the Robustness of Graph-Based Clustering to Random Network Alterations.

机构信息

Michael Smith Laboratories, University of British Columbia, Vancouver, Canada.

Michael Smith Laboratories, University of British Columbia, Vancouver, Canada.

出版信息

Mol Cell Proteomics. 2021;20:100002. doi: 10.1074/mcp.RA120.002275. Epub 2020 Nov 24.

DOI:10.1074/mcp.RA120.002275
PMID:33592499
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7896145/
Abstract

Biological functions emerge from complex and dynamic networks of protein-protein interactions. Because these protein-protein interaction networks, or interactomes, represent pairwise connections within a hierarchically organized system, it is often useful to identify higher-order associations embedded within them, such as multimember protein complexes. Graph-based clustering techniques are widely used to accomplish this goal, and dozens of field-specific and general clustering algorithms exist. However, interactomes can be prone to errors, especially when inferred from high-throughput biochemical assays. Therefore, robustness to network-level noise is an important criterion. Here, we tested the robustness of a range of graph-based clustering algorithms in the presence of noise, including algorithms common across domains and those specific to protein networks. Strikingly, we found that all of the clustering algorithms tested here markedly amplified network-level noise. Randomly rewiring only 1% of network edges yielded more than a 50% change in clustering results. Moreover, we found the impact of network noise on individual clusters was not uniform: some clusters were consistently robust to injected noise, whereas others were not. Therefore we developed the clust.perturb R package and Shiny web application to measure the reproducibility of clusters by randomly perturbing the network. We show that clust.perturb results are predictive of real-world cluster stability: poorly reproducible clusters as identified by clust.perturb are significantly less likely to be reclustered across experiments. We conclude that graph-based clustering amplifies noise in protein interaction networks, but quantifying the robustness of a cluster to network noise can separate stable protein complexes from spurious associations.

摘要

生物功能源自于蛋白质-蛋白质相互作用的复杂和动态网络。由于这些蛋白质-蛋白质相互作用网络(即互作组)代表了分层组织系统内的两两连接,因此通常有用的是识别其中嵌入的更高阶关联,例如多成员蛋白质复合物。基于图的聚类技术被广泛用于实现这一目标,并且存在数十种特定于领域和通用的聚类算法。然而,互作组可能容易出错,特别是当从高通量生化测定中推断出来时。因此,对网络级噪声的鲁棒性是一个重要的标准。在这里,我们在存在噪声的情况下测试了一系列基于图的聚类算法的鲁棒性,包括跨领域和特定于蛋白质网络的算法。引人注目的是,我们发现这里测试的所有聚类算法都明显放大了网络级噪声。随机重连网络边缘的 1%,聚类结果就会发生超过 50%的变化。此外,我们发现网络噪声对单个簇的影响并不均匀:一些簇始终对注入的噪声具有鲁棒性,而其他簇则不然。因此,我们开发了 clust.perturb R 包和 Shiny 网络应用程序,通过随机扰动网络来测量簇的可重复性。我们表明,clust.perturb 的结果可预测真实世界的簇稳定性:clust.perturb 识别的可重复性差的簇在跨实验中重新聚类的可能性显著降低。我们得出结论,基于图的聚类放大了蛋白质相互作用网络中的噪声,但量化簇对网络噪声的鲁棒性可以将稳定的蛋白质复合物与虚假关联区分开来。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/db69cbcc71cd/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/d2ed6d680f71/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/8a2b601fcc20/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/47a6af27561e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/c57cf4f2fd21/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/21296c9f87b1/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/8953a518c598/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/577ffa11855d/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/e73517b3f162/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/db69cbcc71cd/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/d2ed6d680f71/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/8a2b601fcc20/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/47a6af27561e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/c57cf4f2fd21/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/21296c9f87b1/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/8953a518c598/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/577ffa11855d/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/e73517b3f162/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8245/7896145/db69cbcc71cd/gr8.jpg

相似文献

1
On the Robustness of Graph-Based Clustering to Random Network Alterations.基于图的聚类对随机网络变化的稳健性。
Mol Cell Proteomics. 2021;20:100002. doi: 10.1074/mcp.RA120.002275. Epub 2020 Nov 24.
2
Evaluation of clustering algorithms for protein-protein interaction networks.蛋白质-蛋白质相互作用网络聚类算法的评估
BMC Bioinformatics. 2006 Nov 6;7:488. doi: 10.1186/1471-2105-7-488.
3
Markov clustering versus affinity propagation for the partitioning of protein interaction graphs.用于蛋白质相互作用图划分的马尔可夫聚类与亲和传播算法
BMC Bioinformatics. 2009 Mar 30;10:99. doi: 10.1186/1471-2105-10-99.
4
Clust&See: a Cytoscape plugin for the identification, visualization and manipulation of network clusters.Clust&See:一款用于识别、可视化和操作网络簇的Cytoscape插件。
Biosystems. 2013 Aug;113(2):91-5. doi: 10.1016/j.biosystems.2013.05.010. Epub 2013 Jun 3.
5
Resolving the structure of interactomes with hierarchical agglomerative clustering.利用层次凝聚聚类解析互作组学结构。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S44. doi: 10.1186/1471-2105-12-S1-S44.
6
Identifying protein complexes by reducing noise in interaction networks.通过降低相互作用网络中的噪声来识别蛋白质复合物。
Protein Pept Lett. 2014 Jul;21(7):688-95. doi: 10.2174/0929866521666140320111720.
7
A proximity-based graph clustering method for the identification and application of transcription factor clusters.一种基于邻近性的图聚类方法,用于转录因子簇的识别与应用。
BMC Bioinformatics. 2017 Nov 29;18(1):530. doi: 10.1186/s12859-017-1935-y.
8
Evolutionary Graph Clustering for Protein Complex Identification.基于进化的图聚类算法在蛋白质复合物识别中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):892-904. doi: 10.1109/TCBB.2016.2642107. Epub 2016 Dec 20.
9
Protein complexes predictions within protein interaction networks using genetic algorithms.利用遗传算法预测蛋白质相互作用网络中的蛋白质复合物
BMC Bioinformatics. 2016 Jul 25;17 Suppl 7(Suppl 7):269. doi: 10.1186/s12859-016-1096-4.
10
ScGSLC: An unsupervised graph similarity learning framework for single-cell RNA-seq data clustering.ScGSLC:一种用于单细胞 RNA-seq 数据聚类的无监督图相似性学习框架。
Comput Biol Chem. 2021 Feb;90:107415. doi: 10.1016/j.compbiolchem.2020.107415. Epub 2020 Nov 18.

引用本文的文献

1
CTEC: a cross-tabulation ensemble clustering approach for single-cell RNA sequencing data analysis.CTEC:一种用于单细胞 RNA 测序数据分析的交叉制表集成聚类方法。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae130.
2
Semi-automated approaches for interrogating spatial heterogeneity of tissue samples.半自动化方法用于探究组织样本的空间异质性。
Sci Rep. 2024 Feb 29;14(1):5025. doi: 10.1038/s41598-024-55387-w.
3
Comparative Clustering (CompaCt) of eukaryote complexomes identifies novel interactions and sheds light on protein complex evolution.

本文引用的文献

1
Dual proteome-scale networks reveal cell-specific remodeling of the human interactome.双重蛋白质组尺度网络揭示了人类相互作用组的细胞特异性重塑。
Cell. 2021 May 27;184(11):3022-3040.e28. doi: 10.1016/j.cell.2021.04.011. Epub 2021 May 6.
2
Dynamic rewiring of the human interactome by interferon signaling.干扰素信号动态重塑人类相互作用组。
Genome Biol. 2020 Jun 15;21(1):140. doi: 10.1186/s13059-020-02050-y.
3
A reference map of the human binary protein interactome.人类二进制蛋白质相互作用组参考图谱。
真核生物复合物的比较聚类(CompaCt)鉴定了新的相互作用,并揭示了蛋白质复合物的进化。
PLoS Comput Biol. 2023 Aug 7;19(8):e1011090. doi: 10.1371/journal.pcbi.1011090. eCollection 2023 Aug.
4
Overview of methods for characterization and visualization of a protein-protein interaction network in a multi-omics integration context.多组学整合背景下蛋白质-蛋白质相互作用网络的表征与可视化方法概述。
Front Mol Biosci. 2022 Sep 8;9:962799. doi: 10.3389/fmolb.2022.962799. eCollection 2022.
5
Meta-analysis defines principles for the design and analysis of co-fractionation mass spectrometry experiments.元分析为共馏分质谱实验的设计和分析定义了原则。
Nat Methods. 2021 Jul;18(7):806-815. doi: 10.1038/s41592-021-01194-4. Epub 2021 Jul 1.
6
Mapping the multiscale structure of biological systems.绘制生物系统的多尺度结构。
Cell Syst. 2021 Jun 16;12(6):622-635. doi: 10.1016/j.cels.2021.05.012.
Nature. 2020 Apr;580(7803):402-408. doi: 10.1038/s41586-020-2188-x. Epub 2020 Apr 8.
4
EPIC: software toolkit for elution profile-based inference of protein complexes.EPIC:基于洗脱轮廓的蛋白质复合物推断的软件工具包。
Nat Methods. 2019 Aug;16(8):737-742. doi: 10.1038/s41592-019-0461-4. Epub 2019 Jul 15.
5
From Louvain to Leiden: guaranteeing well-connected communities.从鲁汶到莱顿:保障互联互通的社区。
Sci Rep. 2019 Mar 26;9(1):5233. doi: 10.1038/s41598-019-41695-z.
6
The BioGRID interaction database: 2019 update.生物相互作用数据库(BioGRID):2019 年更新版。
Nucleic Acids Res. 2019 Jan 8;47(D1):D529-D541. doi: 10.1093/nar/gky1079.
7
CORUM: the comprehensive resource of mammalian protein complexes-2019.CORUM:哺乳动物蛋白质复合物综合资源-2019 年版。
Nucleic Acids Res. 2019 Jan 8;47(D1):D559-D563. doi: 10.1093/nar/gky973.
8
Context-specific interactions in literature-curated protein interaction databases.文献策管蛋白质交互作用数据库中的语境特定交互作用。
BMC Genomics. 2018 Oct 19;19(1):758. doi: 10.1186/s12864-018-5139-2.
9
Genomic data integration systematically biases interactome mapping.基因组数据整合系统地影响互作图谱绘制。
PLoS Comput Biol. 2018 Oct 17;14(10):e1006474. doi: 10.1371/journal.pcbi.1006474. eCollection 2018 Oct.
10
Comparison of clustering tools in R for medium-sized 10x Genomics single-cell RNA-sequencing data.用于中等规模10x基因组学单细胞RNA测序数据的R语言聚类工具比较
F1000Res. 2018 Aug 15;7:1297. doi: 10.12688/f1000research.15809.2. eCollection 2018.