• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于进化的图聚类算法在蛋白质复合物识别中的应用

Evolutionary Graph Clustering for Protein Complex Identification.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):892-904. doi: 10.1109/TCBB.2016.2642107. Epub 2016 Dec 20.

DOI:10.1109/TCBB.2016.2642107
PMID:28029628
Abstract

This paper presents a graph clustering algorithm, called EGCPI, to discover protein complexes in protein-protein interaction (PPI) networks. In performing its task, EGCPI takes into consideration both network topologies and attributes of interacting proteins, both of which have been shown to be important for protein complex discovery. EGCPI formulates the problem as an optimization problem and tackles it with evolutionary clustering. Given a PPI network, EGCPI first annotates each protein with corresponding attributes that are provided in Gene Ontology database. It then adopts a similarity measure to evaluate how similar the connected proteins are taking into consideration the network topology. Given this measure, EGCPI then discovers a number of graph clusters within which proteins are densely connected, based on an evolutionary strategy. At last, EGCPI identifies protein complexes in each discovered cluster based on the homogeneity of attributes performed by pairwise proteins. EGCPI has been tested with several real data sets and the experimental results show EGCPI is very effective on protein complex discovery, and the evolutionary clustering is helpful to identify protein complexes in PPI networks. The software of EGCPI can be downloaded via: https://github.com/hetiantian1985/EGCPI.

摘要

本文提出了一种名为 EGCPI 的图聚类算法,用于发现蛋白质 - 蛋白质相互作用(PPI)网络中的蛋白质复合物。在执行任务时,EGCPI 同时考虑了网络拓扑结构和相互作用蛋白质的属性,这两者都被证明对蛋白质复合物发现很重要。EGCPI 将问题表述为优化问题,并采用进化聚类来解决它。给定一个 PPI 网络,EGCPI 首先使用基因本体数据库中提供的相应属性注释每个蛋白质。然后,它采用相似性度量来评估考虑网络拓扑结构时连接蛋白质的相似程度。有了这个度量标准,EGCPI 然后根据进化策略在蛋白质密集连接的多个图聚类中发现。最后,EGCPI 根据成对蛋白质执行的属性同质性在每个发现的聚类中识别蛋白质复合物。EGCPI 已经在几个真实数据集上进行了测试,实验结果表明 EGCPI 在蛋白质复合物发现方面非常有效,进化聚类有助于识别 PPI 网络中的蛋白质复合物。EGCPI 的软件可以通过以下网址下载:https://github.com/hetiantian1985/EGCPI。

相似文献

1
Evolutionary Graph Clustering for Protein Complex Identification.基于进化的图聚类算法在蛋白质复合物识别中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2018 May-Jun;15(3):892-904. doi: 10.1109/TCBB.2016.2642107. Epub 2016 Dec 20.
2
From Function to Interaction: A New Paradigm for Accurately Predicting Protein Complexes Based on Protein-to-Protein Interaction Networks.从功能到相互作用:基于蛋白质-蛋白质相互作用网络准确预测蛋白质复合物的新范式。
IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):616-27. doi: 10.1109/TCBB.2014.2306825.
3
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物:进化增强的马尔可夫聚类。
Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.
4
A degree-distribution based hierarchical agglomerative clustering algorithm for protein complexes identification.基于度分布的层次凝聚聚类算法用于蛋白质复合物识别。
Comput Biol Chem. 2011 Oct 12;35(5):298-307. doi: 10.1016/j.compbiolchem.2011.07.005. Epub 2011 Jul 20.
5
A multi-network clustering method for detecting protein complexes from multiple heterogeneous networks.一种用于从多个异构网络中检测蛋白质复合物的多网络聚类方法。
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):463. doi: 10.1186/s12859-017-1877-4.
6
Utilizing both topological and attribute information for protein complex identification in PPI networks.在蛋白质-蛋白质相互作用网络中利用拓扑和属性信息进行蛋白质复合物识别。
IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):780-92. doi: 10.1109/TCBB.2013.37.
7
Protein Complexes Detection Based on Semi-Supervised Network Embedding Model.基于半监督网络嵌入模型的蛋白质复合物检测。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):797-803. doi: 10.1109/TCBB.2019.2944809. Epub 2021 Apr 8.
8
A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks.一种基于种子扩展图聚类的蛋白质互作网络中蛋白质复合物检测方法。
Molecules. 2017 Dec 8;22(12):2179. doi: 10.3390/molecules22122179.
9
Identifying protein complexes based on brainstorming strategy.基于头脑风暴策略识别蛋白质复合物。
Methods. 2016 Nov 1;110:44-53. doi: 10.1016/j.ymeth.2016.07.006. Epub 2016 Jul 9.
10
A density-based clustering approach for identifying overlapping protein complexes with functional preferences.一种基于密度的聚类方法,用于识别具有功能偏好的重叠蛋白质复合物。
BMC Bioinformatics. 2015 May 27;16:174. doi: 10.1186/s12859-015-0583-3.

引用本文的文献

1
Small protein complex prediction algorithm based on protein-protein interaction network segmentation.基于蛋白质-蛋白质相互作用网络分割的小蛋白复合物预测算法。
BMC Bioinformatics. 2022 Sep 30;23(1):405. doi: 10.1186/s12859-022-04960-z.
2
Overview of methods for characterization and visualization of a protein-protein interaction network in a multi-omics integration context.多组学整合背景下蛋白质-蛋白质相互作用网络的表征与可视化方法概述。
Front Mol Biosci. 2022 Sep 8;9:962799. doi: 10.3389/fmolb.2022.962799. eCollection 2022.
3
An Ensemble Learning Framework for Detecting Protein Complexes From PPI Networks.
一种用于从蛋白质-蛋白质相互作用网络中检测蛋白质复合物的集成学习框架。
Front Genet. 2022 Feb 24;13:839949. doi: 10.3389/fgene.2022.839949. eCollection 2022.
4
CLCA4 and MS4A12 as the significant gene biomarkers of primary colorectal cancer.CLCA4 和 MS4A12 作为原发性结直肠癌的重要基因标志物。
Biosci Rep. 2020 Aug 28;40(8). doi: 10.1042/BSR20200963.
5
Integrating data and knowledge to identify functional modules of genes: a multilayer approach.整合数据和知识以识别基因的功能模块:一种多层方法。
BMC Bioinformatics. 2019 May 2;20(1):225. doi: 10.1186/s12859-019-2800-y.
6
A density-based approach for detecting complexes in weighted PPI networks by semantic similarity.一种基于密度的方法,用于通过语义相似性在加权蛋白质-蛋白质相互作用网络中检测复合物。
PLoS One. 2017 Jul 12;12(7):e0180570. doi: 10.1371/journal.pone.0180570. eCollection 2017.