• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从亲和纯化/质谱数据中检测蛋白质复合物。

Detection of protein complexes from affinity purification/mass spectrometry data.

作者信息

Cai Bingjing, Wang Haiying, Zheng Huiru, Wang Hui

机构信息

School of Computing and Mathematics, Computer Sciences Research Institute, University of Ulster, N. Ireland, BT37 0QB, UK.

出版信息

BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S4. doi: 10.1186/1752-0509-6-S3-S4. Epub 2012 Dec 17.

DOI:10.1186/1752-0509-6-S3-S4
PMID:23282282
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3524315/
Abstract

BACKGROUND

Recent advances in molecular biology have led to the accumulation of large amounts of data on protein-protein interaction networks in different species. An important challenge for the analysis of these data is to extract functional modules such as protein complexes and biological processes from networks which are characterised by the present of a significant number of false positives. Various computational techniques have been applied in recent years. However, most of them treat protein interaction as binary. Co-complex relations derived from affinity purification/mass spectrometry (AP-MS) experiments have been largely ignored.

METHODS

This paper presents a new algorithm for detecting protein complexes from AP-MS data. The algorithm intends to detect groups of prey proteins that are significantly co-associated with the same set of bait proteins. We first construct AP-MS data as a bipartite network, where one set of nodes consists of bait proteins and the other set is composed of prey proteins. We then calculate pair-wise similarities of bait proteins based on the number of their commonly shared neighbours. A hierarchical clustering algorithm is employed to cluster bait proteins based on the similarities and thus a set of 'seed' clusters is obtained. Starting from these 'seed' clusters, an expansion process is developed to identify prey proteins which are significantly associated with the same set of bait proteins. Then, a set of complete protein complexes is derived. In application to two real AP-MS datasets, we validate biological significance of predicted protein complexes by using curated protein complexes and well-characterized cellular component annotation from Gene Ontology (GO). Several statistical metrics have been applied for evaluation.

RESULTS

Experimental results show that, the proposed algorithm achieves significant improvement in detecting protein complexes from AP-MS data. In comparison to the well-known MCL algorithm, our algorithm improves the accuracy rate by about 20% in detecting protein complexes in both networks and increases the F-Measure value by about 50% in Krogan_2006 network. Greater precision and better accuracy have been achieved and the identified complexes are demonstrated to match well with existing curated protein complexes.

CONCLUSIONS

Our study highlights the significance of taking co-complex relations into account when extracting protein complexes from AP-MS data. The algorithm proposed in this paper can be easily extended to the analysis of other biological networks which can be conveniently represented by bipartite graphs such as drug-target networks.

摘要

背景

分子生物学的最新进展使得不同物种中蛋白质-蛋白质相互作用网络的大量数据得以积累。分析这些数据面临的一个重要挑战是从存在大量假阳性的网络中提取诸如蛋白质复合物和生物过程等功能模块。近年来已应用了各种计算技术。然而,它们大多将蛋白质相互作用视为二元关系。源自亲和纯化/质谱(AP-MS)实验的共复合物关系在很大程度上被忽视了。

方法

本文提出了一种从AP-MS数据中检测蛋白质复合物的新算法。该算法旨在检测与同一组诱饵蛋白显著共关联的猎物蛋白组。我们首先将AP-MS数据构建为一个二分网络,其中一组节点由诱饵蛋白组成,另一组由猎物蛋白组成。然后,我们根据诱饵蛋白共同拥有的邻居数量计算它们之间的成对相似度。采用层次聚类算法基于这些相似度对诱饵蛋白进行聚类,从而得到一组“种子”簇。从这些“种子”簇开始,开发一个扩展过程来识别与同一组诱饵蛋白显著相关的猎物蛋白。然后,得到一组完整的蛋白质复合物。在应用于两个真实的AP-MS数据集时,我们通过使用经过整理的蛋白质复合物和来自基因本体(GO)的特征明确的细胞成分注释来验证预测蛋白质复合物的生物学意义。应用了几种统计指标进行评估。

结果

实验结果表明,所提出的算法在从AP-MS数据中检测蛋白质复合物方面取得了显著改进。与著名的MCL算法相比,我们的算法在检测两个网络中的蛋白质复合物时,准确率提高了约20%,在Krogan_2006网络中F-Measure值提高了约50%。实现了更高的精度和更好的准确性,并且所识别的复合物与现有的经过整理的蛋白质复合物匹配良好。

结论

我们的研究强调了在从AP-MS数据中提取蛋白质复合物时考虑共复合物关系的重要性。本文提出的算法可以很容易地扩展到其他可以方便地用二分图表示的生物网络的分析,如药物-靶标网络。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42dd/3524315/d0983a462775/1752-0509-6-S3-S4-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42dd/3524315/fbc592ea9e78/1752-0509-6-S3-S4-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42dd/3524315/d0983a462775/1752-0509-6-S3-S4-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42dd/3524315/fbc592ea9e78/1752-0509-6-S3-S4-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42dd/3524315/d0983a462775/1752-0509-6-S3-S4-2.jpg

相似文献

1
Detection of protein complexes from affinity purification/mass spectrometry data.从亲和纯化/质谱数据中检测蛋白质复合物。
BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S4. doi: 10.1186/1752-0509-6-S3-S4. Epub 2012 Dec 17.
2
Integrating domain similarity to improve protein complexes identification in TAP-MS data.整合领域相似性以提高 TAP-MS 数据中的蛋白质复合物鉴定。
Proteome Sci. 2013 Nov 7;11(Suppl 1):S2. doi: 10.1186/1477-5956-11-S1-S2.
3
Markov clustering versus affinity propagation for the partitioning of protein interaction graphs.用于蛋白质相互作用图划分的马尔可夫聚类与亲和传播算法
BMC Bioinformatics. 2009 Mar 30;10:99. doi: 10.1186/1471-2105-10-99.
4
Identification of Protein Complexes from Tandem Affinity Purification/Mass Spectrometry Data via Biased Random Walk.通过有偏随机游走从串联亲和纯化/质谱数据中鉴定蛋白质复合物
IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):455-66. doi: 10.1109/TCBB.2014.2352616.
5
Computational detection of protein complexes in AP-MS experiments.基于 AP-MS 实验的蛋白质复合物计算检测
Proteomics. 2012 May;12(10):1663-8. doi: 10.1002/pmic.201100508.
6
Scoring Large-Scale Affinity Purification Mass Spectrometry Datasets with MiST.使用MiST对大规模亲和纯化质谱数据集进行评分。
Curr Protoc Bioinformatics. 2015 Mar 9;49:8.19.1-8.19.16. doi: 10.1002/0471250953.bi0819s49.
7
ROCS: a reproducibility index and confidence score for interaction proteomics studies.ROCS:交互蛋白质组学研究的可重复性指数和置信得分。
BMC Bioinformatics. 2012 Jun 8;13:128. doi: 10.1186/1471-2105-13-128.
8
Analysis of protein complexes through model-based biclustering of label-free quantitative AP-MS data.通过基于模型的无标签定量 AP-MS 数据的双聚类分析蛋白质复合物。
Mol Syst Biol. 2010 Jun 22;6:385. doi: 10.1038/msb.2010.41.
9
Computational framework for analysis of prey-prey associations in interaction proteomics identifies novel human protein-protein interactions and networks.用于分析相互作用蛋白质组学中猎物-猎物关联的计算框架确定了新的人类蛋白质-蛋白质相互作用和网络。
J Proteome Res. 2012 Sep 7;11(9):4476-87. doi: 10.1021/pr300227y. Epub 2012 Aug 21.
10
A two-step framework for inferring direct protein-protein interaction network from AP-MS data.一种从亲和纯化-质谱(AP-MS)数据推断直接蛋白质-蛋白质相互作用网络的两步框架。
BMC Syst Biol. 2017 Sep 21;11(Suppl 4):82. doi: 10.1186/s12918-017-0452-y.

引用本文的文献

1
Mapping the Bibliometrics Landscape of AI in Medicine: Methodological Study.人工智能在医学中的文献计量学图谱绘制:方法研究。
J Med Internet Res. 2023 Dec 8;25:e45815. doi: 10.2196/45815.
2
Towards a Hierarchical Strategy to Explore Multi-Scale IP/MS Data for Protein Complexes.迈向一种用于探索蛋白质复合物多尺度IP/MS数据的分层策略。
PLoS One. 2015 Oct 8;10(10):e0139704. doi: 10.1371/journal.pone.0139704. eCollection 2015.
3
Integrating domain similarity to improve protein complexes identification in TAP-MS data.整合领域相似性以提高 TAP-MS 数据中的蛋白质复合物鉴定。

本文引用的文献

1
Identification of protein complexes from co-immunoprecipitation data.从共免疫沉淀数据中鉴定蛋白质复合物。
Bioinformatics. 2011 Jan 1;27(1):111-7. doi: 10.1093/bioinformatics/btq652. Epub 2010 Nov 25.
2
How and when should interactome-derived clusters be used to predict functional modules and protein function?应当如何以及何时使用互作组学衍生的聚类来预测功能模块和蛋白质功能?
Bioinformatics. 2009 Dec 1;25(23):3143-50. doi: 10.1093/bioinformatics/btp551. Epub 2009 Sep 21.
3
RRW: repeated random walks on genome-scale protein networks for local cluster discovery.
Proteome Sci. 2013 Nov 7;11(Suppl 1):S2. doi: 10.1186/1477-5956-11-S1-S2.
4
Advances in systems biology: computational algorithms and applications.系统生物学进展:计算算法与应用
BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S1. doi: 10.1186/1752-0509-6-S3-S1. Epub 2012 Dec 17.
RRW:基于全基因组尺度蛋白质网络的重复随机游走用于局部簇发现。
BMC Bioinformatics. 2009 Sep 9;10:283. doi: 10.1186/1471-2105-10-283.
4
A core-attachment based method to detect protein complexes in PPI networks.一种基于核心附着的方法来检测蛋白质-蛋白质相互作用网络中的蛋白质复合物。
BMC Bioinformatics. 2009 Jun 2;10:169. doi: 10.1186/1471-2105-10-169.
5
Up-to-date catalogues of yeast protein complexes.最新的酵母蛋白质复合物目录。
Nucleic Acids Res. 2009 Feb;37(3):825-31. doi: 10.1093/nar/gkn1005. Epub 2008 Dec 18.
6
The yeast GID complex, a novel ubiquitin ligase (E3) involved in the regulation of carbohydrate metabolism.酵母GID复合物,一种参与碳水化合物代谢调节的新型泛素连接酶(E3)。
Mol Biol Cell. 2008 Aug;19(8):3323-33. doi: 10.1091/mbc.e08-03-0328. Epub 2008 May 28.
7
Identifying functional modules in the physical interactome of Saccharomyces cerevisiae.在酿酒酵母的物理相互作用组中识别功能模块。
Proteomics. 2007 Mar;7(6):944-60. doi: 10.1002/pmic.200600636.
8
Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae.迈向酿酒酵母物理相互作用组的综合图谱。
Mol Cell Proteomics. 2007 Mar;6(3):439-50. doi: 10.1074/mcp.M600381-MCP200. Epub 2007 Jan 2.
9
Evaluation of clustering algorithms for protein-protein interaction networks.蛋白质-蛋白质相互作用网络聚类算法的评估
BMC Bioinformatics. 2006 Nov 6;7:488. doi: 10.1186/1471-2105-7-488.
10
Development and implementation of an algorithm for detection of protein complexes in large interaction networks.用于在大型相互作用网络中检测蛋白质复合物的算法的开发与实现。
BMC Bioinformatics. 2006 Apr 14;7:207. doi: 10.1186/1471-2105-7-207.