• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PLW:用于从蛋白质相互作用网络中检测蛋白质复合物的概率局部游走

PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks.

作者信息

Wong Daniel, Li Xiao-Li, Wu Min, Zheng Jie, Ng See-Kiong

出版信息

BMC Genomics. 2013;14 Suppl 5(Suppl 5):S15. doi: 10.1186/1471-2164-14-S5-S15. Epub 2013 Oct 16.

DOI:10.1186/1471-2164-14-S5-S15
PMID:24564427
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3852146/
Abstract

BACKGROUND

Many biological processes are carried out by proteins interacting with each other in the form of protein complexes. However, large-scale detection of protein complexes has remained constrained by experimental limitations. As such, computational detection of protein complexes by applying clustering algorithms on the abundantly available protein-protein interaction (PPI) networks is an important alternative. However, many current algorithms have overlooked the importance of selecting seeds for expansion into clusters without excluding important proteins and including many noisy ones, while ensuring a high degree of functional homogeneity amongst the proteins detected for the complexes.

RESULTS

We designed a novel method called Probabilistic Local Walks (PLW) which clusters regions in a PPI network with high functional similarity to find protein complex cores with high precision and efficiency in O (|V| log |V| + |E|) time. A seed selection strategy, which prioritises seeds with dense neighbourhoods, was devised. We defined a topological measure, called common neighbour similarity, to estimate the functional similarity of two proteins given the number of their common neighbours.

CONCLUSIONS

Our proposed PLW algorithm achieved the highest F-measure (recall and precision) when compared to 11 state-of-the-art methods on yeast protein interaction data, with an improvement of 16.7% over the next highest score. Our experiments also demonstrated that our seed selection strategy is able to increase algorithm precision when applied to three previous protein complex mining techniques.

AVAILABILITY

The software, datasets and predicted complexes are available at http://wonglkd.github.io/PLW.

摘要

背景

许多生物学过程是由蛋白质以蛋白质复合物的形式相互作用来完成的。然而,蛋白质复合物的大规模检测一直受到实验限制的制约。因此,通过在大量可用的蛋白质-蛋白质相互作用(PPI)网络上应用聚类算法来进行蛋白质复合物的计算检测是一种重要的替代方法。然而,当前许多算法在选择用于扩展成簇的种子时忽略了其重要性,在不排除重要蛋白质的同时却包含了许多噪声蛋白质,同时还要确保为复合物检测出的蛋白质之间具有高度的功能同质性。

结果

我们设计了一种名为概率局部游走(PLW)的新方法,该方法在PPI网络中对具有高功能相似性的区域进行聚类,从而在O(|V| log |V| + |E|)时间内高精度、高效率地找到蛋白质复合物核心。我们设计了一种种子选择策略,该策略优先选择具有密集邻域的种子。我们定义了一种拓扑度量,称为共同邻居相似性,以根据两个蛋白质的共同邻居数量来估计它们的功能相似性。

结论

与11种最先进的方法相比,我们提出的PLW算法在酵母蛋白质相互作用数据上实现了最高的F值(召回率和精确率),比次高得分提高了16.7%。我们的实验还表明,我们的种子选择策略应用于之前的三种蛋白质复合物挖掘技术时能够提高算法的精确率。

可用性

该软件、数据集和预测的复合物可在http://wonglkd.github.io/PLW获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/9bdb7d259ccd/1471-2164-14-S5-S15-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/0b14f41283bc/1471-2164-14-S5-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/f908f96bf635/1471-2164-14-S5-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/1926208afdb0/1471-2164-14-S5-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/2679c6ea0994/1471-2164-14-S5-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/e0f7b84220f6/1471-2164-14-S5-S15-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/70c46ffa761d/1471-2164-14-S5-S15-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/bdbd584d20dc/1471-2164-14-S5-S15-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/7bade6869df3/1471-2164-14-S5-S15-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/9bdb7d259ccd/1471-2164-14-S5-S15-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/0b14f41283bc/1471-2164-14-S5-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/f908f96bf635/1471-2164-14-S5-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/1926208afdb0/1471-2164-14-S5-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/2679c6ea0994/1471-2164-14-S5-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/e0f7b84220f6/1471-2164-14-S5-S15-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/70c46ffa761d/1471-2164-14-S5-S15-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/bdbd584d20dc/1471-2164-14-S5-S15-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/7bade6869df3/1471-2164-14-S5-S15-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bcc/3852146/9bdb7d259ccd/1471-2164-14-S5-S15-9.jpg

相似文献

1
PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks.PLW:用于从蛋白质相互作用网络中检测蛋白质复合物的概率局部游走
BMC Genomics. 2013;14 Suppl 5(Suppl 5):S15. doi: 10.1186/1471-2164-14-S5-S15. Epub 2013 Oct 16.
2
Detection of protein complexes using a protein ranking algorithm.使用蛋白质排序算法检测蛋白质复合物。
Proteins. 2012 Oct;80(10):2459-68. doi: 10.1002/prot.24130. Epub 2012 Jul 7.
3
Neighbor Affinity-Based Core-Attachment Method to Detect Protein Complexes in Dynamic PPI Networks.基于邻居亲和力的核心附着方法在动态蛋白质-蛋白质相互作用网络中检测蛋白质复合物
Molecules. 2017 Jul 24;22(7):1223. doi: 10.3390/molecules22071223.
4
k-Partite cliques of protein interactions: A novel subgraph topology for functional coherence analysis on PPI networks.蛋白质相互作用的 k-分划团簇:一种用于 PPI 网络功能一致性分析的新子图拓扑结构。
J Theor Biol. 2014 Jan 7;340:146-54. doi: 10.1016/j.jtbi.2013.09.013. Epub 2013 Sep 19.
5
DPCMNE: Detecting Protein Complexes From Protein-Protein Interaction Networks Via Multi-Level Network Embedding.DPCMNE:通过多层次网络嵌入从蛋白质-蛋白质相互作用网络中检测蛋白质复合物。
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1592-1602. doi: 10.1109/TCBB.2021.3050102. Epub 2022 Jun 3.
6
Decision tree classifier based on topological characteristics of subgraph for the mining of protein complexes from large scale PPI networks.基于子图拓扑特征的决策树分类器用于从大规模 PPI 网络中挖掘蛋白质复合物。
Comput Biol Chem. 2023 Oct;106:107935. doi: 10.1016/j.compbiolchem.2023.107935. Epub 2023 Jul 25.
7
Discovering protein complexes in protein interaction networks via exploring the weak ties effect.通过探索弱关系效应在蛋白质相互作用网络中发现蛋白质复合物。
BMC Syst Biol. 2012;6 Suppl 1(Suppl 1):S6. doi: 10.1186/1752-0509-6-S1-S6. Epub 2012 Jul 16.
8
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物:进化增强的马尔可夫聚类。
Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.
9
Identifying conserved protein complexes between species by constructing interolog networks.通过构建种间同源蛋白互作网络来鉴定物种间保守的蛋白质复合物。
BMC Bioinformatics. 2013;14 Suppl 16(Suppl 16):S8. doi: 10.1186/1471-2105-14-S16-S8. Epub 2013 Oct 22.
10
Protein complex prediction in large ontology attributed protein-protein interaction networks.大型本体属性蛋白质 - 蛋白质相互作用网络中的蛋白质复合物预测
IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):729-41. doi: 10.1109/TCBB.2013.86.

引用本文的文献

1
Identifying protein complexes based on an edge weight algorithm and core-attachment structure.基于边权重算法和核心附着结构识别蛋白质复合物。
BMC Bioinformatics. 2019 Sep 14;20(1):471. doi: 10.1186/s12859-019-3007-y.
2
A new two-stage method for revealing missing parts of edges in protein-protein interaction networks.一种揭示蛋白质-蛋白质相互作用网络中边缘缺失部分的新型两阶段方法。
PLoS One. 2017 May 11;12(5):e0177029. doi: 10.1371/journal.pone.0177029. eCollection 2017.
3
Protein complex prediction for large protein protein interaction networks with the Core&Peel method.使用核心与剥离方法对大型蛋白质-蛋白质相互作用网络进行蛋白质复合物预测。
BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):372. doi: 10.1186/s12859-016-1191-6.

本文引用的文献

1
Identifying functional modules in interaction networks through overlapping Markov clustering.通过重叠 Markov 聚类识别交互网络中的功能模块。
Bioinformatics. 2012 Sep 15;28(18):i473-i479. doi: 10.1093/bioinformatics/bts370.
2
Detecting overlapping protein complexes in protein-protein interaction networks.检测蛋白质-蛋白质相互作用网络中的重叠蛋白质复合物。
Nat Methods. 2012 Mar 18;9(5):471-2. doi: 10.1038/nmeth.1938.
3
Discovery of protein complexes with core-attachment structures from Tandem Affinity Purification (TAP) data.从串联亲和纯化(TAP)数据中发现具有核心-附着结构的蛋白质复合物。
J Comput Biol. 2012 Sep;19(9):1027-42. doi: 10.1089/cmb.2010.0293. Epub 2011 Jul 21.
4
Construction of co-complex score matrix for protein complex prediction from AP-MS data.基于 AP-MS 数据的蛋白质复合物预测的共复合物评分矩阵构建。
Bioinformatics. 2011 Jul 1;27(13):i159-66. doi: 10.1093/bioinformatics/btr212.
5
Discover protein complexes in protein-protein interaction networks using parametric local modularity.使用参数化局部模块性发现蛋白质-蛋白质相互作用网络中的蛋白质复合物。
BMC Bioinformatics. 2010 Oct 19;11:521. doi: 10.1186/1471-2105-11-521.
6
MCL-CAw: a refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure.MCL-CAw:一种改进的 MCL 方法,用于通过整合核心附着结构,从加权 PPI 网络中检测酵母复合物。
BMC Bioinformatics. 2010 Oct 12;11:504. doi: 10.1186/1471-2105-11-504.
7
SPICi: a fast clustering algorithm for large biological networks.SPICi:一种用于大型生物网络的快速聚类算法。
Bioinformatics. 2010 Apr 15;26(8):1105-11. doi: 10.1093/bioinformatics/btq078. Epub 2010 Feb 24.
8
Refining Markov Clustering for protein complex prediction by incorporating core-attachment structure.通过整合核心-附属结构优化马尔可夫聚类用于蛋白质复合物预测
Genome Inform. 2009 Oct;23(1):159-68.
9
Computational approaches for detecting protein complexes from protein interaction networks: a survey.从蛋白质相互作用网络中检测蛋白质复合物的计算方法:综述。
BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2164-11-S1-S3.
10
RRW: repeated random walks on genome-scale protein networks for local cluster discovery.RRW:基于全基因组尺度蛋白质网络的重复随机游走用于局部簇发现。
BMC Bioinformatics. 2009 Sep 9;10:283. doi: 10.1186/1471-2105-10-283.