Identification of protein complexes by integrating multiple alignment of protein interaction networks.

Authors

Ma Cheng-Yu, Chen Yi-Ping Phoebe, Berger Bonnie, Liao Chung-Shou

Affiliations

Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan.

Department of Computer Science and Computer Engineering, La Trobe University, Melbourne, Vic, Australia.

Publication information

Bioinformatics. 2017 Jun 1;33(11):1681-1688. doi: 10.1093/bioinformatics/btx043.

Abstract

MOTIVATION

Protein complexes are one of the keys to studying the behavior of a cell system. Many biological functions are carried out by protein complexes. During the past decade, the main strategy used to identify protein complexes from high-throughput network data has been to extract near-cliques or highly dense subgraphs from a single protein-protein interaction (PPI) network. Although experimental PPI data have increased significantly over recent years, most PPI networks still have many false positive interactions and false negative edge loss due to the limitations of high-throughput experiments. In particular, the false negative errors restrict the search space of such conventional protein complex identification approaches. Thus, it has become one of the most challenging tasks in systems biology to automatically identify protein complexes.

RESULTS

In this study, we propose a new algorithm, NEOComplex (NECC- and Ortholog-based Complex identification by multiple network alignment), which integrates functional orthology information that can be obtained from different types of multiple network alignment (MNA) approaches to expand the search space of protein complex detection. As part of our approach, we also define a new edge clustering coefficient (NECC) to assign weights to interaction edges in PPI networks so that protein complexes can be identified more accurately. The NECC is based on the intuition that there is functional information captured in the common neighbors of the common neighbors as well. Our results show that our algorithm outperforms well-known protein complex identification tools in a balance between precision and recall on three eukaryotic species: human, yeast, and fly. As a result of MNAs of the species, the proposed approach can tolerate edge loss in PPI networks and even discover sparse protein complexes which have traditionally been a challenge to predict.
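
The abstract does not give the NECC formula, so the following is only a rough sketch of the general idea of weighting an interaction edge by shared neighborhood. It uses a commonly seen form of the edge clustering coefficient (common neighbors divided by the smaller endpoint degree minus one) and adds a purely illustrative second-order count of interactions among those common neighbors. The function names, the second-order term, and the toy network are all assumptions made for illustration, not the authors' definition or data.

```python
from itertools import combinations


def build_graph(edges):
    """Build an undirected adjacency map {node: set(neighbors)}."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return adj


def common_neighbors(adj, u, v):
    """Proteins that interact with both u and v."""
    return adj[u] & adj[v]


def edge_clustering_coefficient(adj, u, v):
    """First-order score: shared neighbors over the maximum possible.

    This is a commonly used simplified form, NOT the paper's NECC.
    """
    denom = min(len(adj[u]) - 1, len(adj[v]) - 1)
    if denom <= 0:
        return 0.0
    return len(common_neighbors(adj, u, v)) / denom


def second_order_support(adj, u, v):
    """Hypothetical second-order term (an assumption for illustration):
    the number of interactions among the common neighbors of u and v."""
    shared = common_neighbors(adj, u, v)
    return sum(1 for a, b in combinations(sorted(shared), 2) if b in adj[a])


# Toy network (made up): a 4-clique A-B-C-D plus a pendant protein E.
edges = [("A", "B"), ("A", "C"), ("A", "D"),
         ("B", "C"), ("B", "D"), ("C", "D"), ("D", "E")]
adj = build_graph(edges)

for u, v in [("A", "B"), ("D", "E")]:
    print(u, v,
          edge_clustering_coefficient(adj, u, v),
          second_order_support(adj, u, v))
```

In this toy example the edge inside the clique scores high on both measures, while the pendant edge D-E scores zero, which is the kind of contrast an edge weighting scheme is meant to expose before dense regions are extracted.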

AVAILABILITY AND IMPLEMENTATION

http://acolab.ie.nthu.edu.tw/bionetwork/NEOComplex.

CONTACT

bab@csail.mit.edu or csliao@ie.nthu.edu.tw.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
