• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过密集子图和误报分析进行蛋白质复合物预测

Protein complex prediction via dense subgraphs and false positive analysis.

作者信息

Hernandez Cecilia, Mella Carlos, Navarro Gonzalo, Olivera-Nappa Alvaro, Araya Jaime

机构信息

Computer Science, University of Concepción, Concepción, Chile.

Center for Biotechnology and Bioengineering (CeBiB), Department of Computer Science, University of Chile, Santiago, Chile.

出版信息

PLoS One. 2017 Sep 22;12(9):e0183460. doi: 10.1371/journal.pone.0183460. eCollection 2017.

DOI:10.1371/journal.pone.0183460
PMID:28937982
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5609739/
Abstract

Many proteins work together with others in groups called complexes in order to achieve a specific function. Discovering protein complexes is important for understanding biological processes and predict protein functions in living organisms. Large-scale and throughput techniques have made possible to compile protein-protein interaction networks (PPI networks), which have been used in several computational approaches for detecting protein complexes. Those predictions might guide future biologic experimental research. Some approaches are topology-based, where highly connected proteins are predicted to be complexes; some propose different clustering algorithms using partitioning, overlaps among clusters for networks modeled with unweighted or weighted graphs; and others use density of clusters and information based on protein functionality. However, some schemes still require much processing time or the quality of their results can be improved. Furthermore, most of the results obtained with computational tools are not accompanied by an analysis of false positives. We propose an effective and efficient mining algorithm for discovering highly connected subgraphs, which is our base for defining protein complexes. Our representation is based on transforming the PPI network into a directed acyclic graph that reduces the number of represented edges and the search space for discovering subgraphs. Our approach considers weighted and unweighted PPI networks. We compare our best alternative using PPI networks from Saccharomyces cerevisiae (yeast) and Homo sapiens (human) with state-of-the-art approaches in terms of clustering, biological metrics and execution times, as well as three gold standards for yeast and two for human. Furthermore, we analyze false positive predicted complexes searching the PDBe (Protein Data Bank in Europe) database in order to identify matching protein complexes that have been purified and structurally characterized. Our analysis shows that more than 50 yeast protein complexes and more than 300 human protein complexes found to be false positives according to our prediction method, i.e., not described in the gold standard complex databases, in fact contain protein complexes that have been characterized structurally and documented in PDBe. We also found that some of these protein complexes have recently been classified as part of a Periodic Table of Protein Complexes. The latest version of our software is publicly available at http://doi.org/10.6084/m9.figshare.5297314.v1.

摘要

许多蛋白质会与其他蛋白质以称为复合物的组合形式协同工作,以实现特定功能。发现蛋白质复合物对于理解生物过程以及预测生物体中的蛋白质功能至关重要。大规模和高通量技术使得编译蛋白质-蛋白质相互作用网络(PPI网络)成为可能,这些网络已被用于多种检测蛋白质复合物的计算方法中。这些预测可能会指导未来的生物学实验研究。一些方法基于拓扑结构,其中高度连接的蛋白质被预测为复合物;一些方法提出了不同的聚类算法,使用划分、聚类之间的重叠来处理未加权或加权图建模的网络;还有一些方法使用聚类密度和基于蛋白质功能的信息。然而,一些方案仍然需要大量处理时间,或者其结果质量有待提高。此外,大多数通过计算工具获得的结果都没有伴随对假阳性的分析。我们提出了一种有效且高效的挖掘算法来发现高度连接的子图,这是我们定义蛋白质复合物的基础。我们的表示方法基于将PPI网络转换为有向无环图,这减少了表示的边数以及发现子图的搜索空间。我们的方法考虑了加权和未加权的PPI网络。我们将使用来自酿酒酵母(酵母)和智人(人类)的PPI网络的最佳替代方案与最先进的方法在聚类、生物学指标和执行时间方面进行比较,同时还与酵母的三个金标准和人类的两个金标准进行比较。此外,我们通过搜索欧洲蛋白质数据库(PDBe)来分析预测的假阳性复合物,以识别已被纯化并进行结构表征的匹配蛋白质复合物。我们的分析表明,根据我们的预测方法,超过50个酵母蛋白质复合物和超过300个人类蛋白质复合物被发现为假阳性,即在金标准复合物数据库中未被描述,但实际上包含已在PDBe中进行结构表征和记录的蛋白质复合物。我们还发现,其中一些蛋白质复合物最近被归类为蛋白质复合物周期表的一部分。我们软件的最新版本可在http://doi.org/10.6084/m9.figshare.5297314.v1上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/580cff7837a0/pone.0183460.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/19741ba62e35/pone.0183460.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/a9bf7b26556a/pone.0183460.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/65ec9031684b/pone.0183460.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/580cff7837a0/pone.0183460.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/19741ba62e35/pone.0183460.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/a9bf7b26556a/pone.0183460.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/65ec9031684b/pone.0183460.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9308/5609739/580cff7837a0/pone.0183460.g004.jpg

相似文献

1
Protein complex prediction via dense subgraphs and false positive analysis.通过密集子图和误报分析进行蛋白质复合物预测
PLoS One. 2017 Sep 22;12(9):e0183460. doi: 10.1371/journal.pone.0183460. eCollection 2017.
2
Detection of Complexes in Biological Networks Through Diversified Dense Subgraph Mining.通过多样化密集子图挖掘检测生物网络中的复合物
J Comput Biol. 2017 Sep;24(9):923-941. doi: 10.1089/cmb.2017.0037. Epub 2017 Jun 1.
3
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物:进化增强的马尔可夫聚类。
Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.
4
MCL-CAw: a refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure.MCL-CAw:一种改进的 MCL 方法,用于通过整合核心附着结构,从加权 PPI 网络中检测酵母复合物。
BMC Bioinformatics. 2010 Oct 12;11:504. doi: 10.1186/1471-2105-11-504.
5
RedNemo: topology-based PPI network reconstruction via repeated diffusion with neighborhood modifications.RedNemo:通过带邻域修改的重复扩散进行基于拓扑的蛋白质-蛋白质相互作用网络重建。
Bioinformatics. 2017 Feb 15;33(4):537-544. doi: 10.1093/bioinformatics/btw655.
6
Predicting overlapping protein complexes from weighted protein interaction graphs by gradually expanding dense neighborhoods.通过逐步扩展密集邻域从加权蛋白质相互作用图预测重叠蛋白质复合物。
Artif Intell Med. 2016 Jul;71:62-9. doi: 10.1016/j.artmed.2016.05.006. Epub 2016 Jun 28.
7
Discovering functional interdependence relationship in PPI networks for protein complex identification.发现蛋白质相互作用网络中的功能相互依赖关系,用于蛋白质复合物识别。
IEEE Trans Biomed Eng. 2012 Apr;59(4):899-908. doi: 10.1109/TBME.2010.2093524. Epub 2010 Nov 18.
8
Identifying protein complexes based on node embeddings obtained from protein-protein interaction networks.基于从蛋白质-蛋白质相互作用网络中获得的节点嵌入来识别蛋白质复合物。
BMC Bioinformatics. 2018 Sep 21;19(1):332. doi: 10.1186/s12859-018-2364-2.
9
Detecting Protein Complexes Based on Uncertain Graph Model.基于不确定图模型的蛋白质复合物检测
IEEE/ACM Trans Comput Biol Bioinform. 2014 May-Jun;11(3):486-97. doi: 10.1109/TCBB.2013.2297915.
10
Identification of Protein Complexes Using Weighted PageRank-Nibble Algorithm and Core-Attachment Structure.使用加权PageRank-Nibble算法和核心-附属结构识别蛋白质复合物
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):179-92. doi: 10.1109/TCBB.2014.2343954.

引用本文的文献

1
PhyberSIM: a tool for the generation of ground truth to evaluate brain fiber clustering algorithms.PhyberSIM:一种用于生成评估脑纤维聚类算法的地面真值的工具。
Front Neurosci. 2024 May 30;18:1396518. doi: 10.3389/fnins.2024.1396518. eCollection 2024.
2
An informatic workflow for the enhanced annotation of excretory/secretory proteins of .一种用于增强对……的排泄/分泌蛋白注释的信息学工作流程。 (原文中“of”后面似乎缺失了具体内容)
Comput Struct Biotechnol J. 2023 Mar 18;21:2696-2704. doi: 10.1016/j.csbj.2023.03.025. eCollection 2023.
3
Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning.

本文引用的文献

1
Protein complex prediction for large protein protein interaction networks with the Core&Peel method.使用核心与剥离方法对大型蛋白质-蛋白质相互作用网络进行蛋白质复合物预测。
BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):372. doi: 10.1186/s12859-016-1191-6.
2
Identification of protein complexes from multi-relationship protein interaction networks.从多重关系蛋白质相互作用网络中识别蛋白质复合物。
Hum Genomics. 2016 Jul 25;10 Suppl 2(Suppl 2):17. doi: 10.1186/s40246-016-0069-z.
3
Principles of assembly reveal a periodic table of protein complexes.
基于序列的深度学习同时提高稳定性、准确性和假阳性率的蛋白质功能注释。
Brief Bioinform. 2020 Jul 15;21(4):1437-1447. doi: 10.1093/bib/bbz081.
4
PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks.PS-MCL:基于并行鸟枪法的蛋白质相互作用网络粗化马尔可夫聚类。
BMC Bioinformatics. 2019 Jul 24;20(Suppl 13):381. doi: 10.1186/s12859-019-2856-8.
组装原则揭示了蛋白质复合物的元素周期表。
Science. 2015 Dec 11;350(6266):aaa2245. doi: 10.1126/science.aaa2245.
4
A density-based clustering approach for identifying overlapping protein complexes with functional preferences.一种基于密度的聚类方法,用于识别具有功能偏好的重叠蛋白质复合物。
BMC Bioinformatics. 2015 May 27;16:174. doi: 10.1186/s12859-015-0583-3.
5
Structure, dynamics, assembly, and evolution of protein complexes.蛋白质复合物的结构、动态、组装和进化。
Annu Rev Biochem. 2015;84:551-75. doi: 10.1146/annurev-biochem-060614-034142. Epub 2014 Dec 8.
6
Detecting overlapping protein complexes based on a generative model with functional and topological properties.基于具有功能和拓扑特性的生成模型检测重叠蛋白质复合物。
BMC Bioinformatics. 2014 Jun 13;15:186. doi: 10.1186/1471-2105-15-186.
7
A novel algorithm for detecting protein complexes with the breadth first search.一种用于通过广度优先搜索检测蛋白质复合物的新算法。
Biomed Res Int. 2014;2014:354539. doi: 10.1155/2014/354539. Epub 2014 Apr 10.
8
Going the distance for protein function prediction: a new distance metric for protein interaction networks.为蛋白质功能预测全力以赴:一种用于蛋白质相互作用网络的新距离度量
PLoS One. 2013 Oct 23;8(10):e76339. doi: 10.1371/journal.pone.0076339. eCollection 2013.
9
Utilizing both topological and attribute information for protein complex identification in PPI networks.在蛋白质-蛋白质相互作用网络中利用拓扑和属性信息进行蛋白质复合物识别。
IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):780-92. doi: 10.1109/TCBB.2013.37.
10
Protein complexes are under evolutionary selection to assemble via ordered pathways.蛋白质复合物是在进化选择的作用下通过有序的途径组装的。
Cell. 2013 Apr 11;153(2):461-70. doi: 10.1016/j.cell.2013.02.044.