• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将蛋白质的模块性、保守性和相互作用相结合,显著提高了蛋白质功能预测的准确性和覆盖范围。

Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction.

机构信息

Knowledge Management in Bioinformatics, Humboldt-Universitat zu Berlin Unter den Linden 6, 10099 Berlin, Germany.

出版信息

BMC Genomics. 2010 Dec 20;11:717. doi: 10.1186/1471-2164-11-717.

DOI:10.1186/1471-2164-11-717
PMID:21171995
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3017542/
Abstract

BACKGROUND

While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method that predicts function based on a combination of information about protein interactions, orthology, and the conservation of protein networks in different species.

RESULTS

We show that aggregation of these independent sources of evidence leads to a drastic increase in number and quality of predictions when compared to baselines and other methods reported in the literature. For instance, our method generates more than 12,000 novel protein functions for human with an estimated precision of ~76%, among which are 7,500 new functional annotations for 1,973 human proteins that previously had zero or only one function annotated. We also verified our predictions on a set of genes that play an important role in colorectal cancer (MLH1, PMS2, EPHB4 ) and could confirm more than 73% of them based on evidence in the literature.

CONCLUSIONS

The combination of different methods into a single, comprehensive prediction method infers thousands of protein functions for every species included in the analysis at varying, yet always high levels of precision and very good coverage.

摘要

背景

虽然新测序的基因组和基因数量不断增加,但阐明它们的功能仍然是一项费力且耗时的任务。这导致了广泛的方法来预测蛋白质的功能。我们报告了一种新的方法,该方法基于蛋白质相互作用、同源性和不同物种中蛋白质网络的保守性信息的组合来预测功能。

结果

我们表明,将这些独立证据来源聚合在一起,与基线和文献中报道的其他方法相比,会导致预测数量和质量的显著提高。例如,我们的方法为人类生成了超过 12000 个新的蛋白质功能,估计精度约为 76%,其中有 1973 个人类蛋白质的 7500 个新功能注释,这些蛋白质之前只有零个或只有一个功能注释。我们还在一组在结直肠癌中起重要作用的基因(MLH1、PMS2、EPHB4)上验证了我们的预测,根据文献中的证据,我们可以确认其中超过 73%的基因。

结论

将不同的方法组合成一种单一的、综合的预测方法,可以在不同的、但始终保持高精度和非常好的覆盖度的水平上,为分析中包含的每一个物种推断出数千种蛋白质功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/296bcda5b61c/1471-2164-11-717-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/d1863b7fe860/1471-2164-11-717-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/10aca186170f/1471-2164-11-717-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/fd08b08cd74a/1471-2164-11-717-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/982135512986/1471-2164-11-717-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/7bb21a15839c/1471-2164-11-717-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/2c0c81bd0c15/1471-2164-11-717-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/296bcda5b61c/1471-2164-11-717-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/d1863b7fe860/1471-2164-11-717-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/10aca186170f/1471-2164-11-717-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/fd08b08cd74a/1471-2164-11-717-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/982135512986/1471-2164-11-717-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/7bb21a15839c/1471-2164-11-717-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/2c0c81bd0c15/1471-2164-11-717-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de23/3017542/296bcda5b61c/1471-2164-11-717-7.jpg

相似文献

1
Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction.将蛋白质的模块性、保守性和相互作用相结合,显著提高了蛋白质功能预测的准确性和覆盖范围。
BMC Genomics. 2010 Dec 20;11:717. doi: 10.1186/1471-2164-11-717.
2
Truncation of the C-terminus of human MLH1 blocks intracellular stabilization of PMS2 and disrupts DNA mismatch repair.人源MLH1 C末端的截短会阻碍PMS2在细胞内的稳定,并破坏DNA错配修复。
DNA Repair (Amst). 2006 Mar 7;5(3):347-61. doi: 10.1016/j.dnarep.2005.11.001. Epub 2005 Dec 9.
3
Joint evolutionary trees: a large-scale method to predict protein interfaces based on sequence sampling.联合进化树:一种基于序列采样预测蛋白质界面的大规模方法。
PLoS Comput Biol. 2009 Jan;5(1):e1000267. doi: 10.1371/journal.pcbi.1000267. Epub 2009 Jan 23.
4
Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context.基因组比对、原核生物基因组组织的进化以及利用基因组背景预测基因功能。
Genome Res. 2001 Mar;11(3):356-72. doi: 10.1101/gr.gr-1619r.
5
6
Contribution of human mlh1 and pms2 ATPase activities to DNA mismatch repair.人源MLH1和PMS2 ATP酶活性对DNA错配修复的作用
J Biol Chem. 2002 Jun 14;277(24):21801-9. doi: 10.1074/jbc.M111342200. Epub 2002 Mar 15.
7
Evolutionarily conserved substrate substructures for automated annotation of enzyme superfamilies.用于酶超家族自动注释的进化保守底物亚结构
PLoS Comput Biol. 2008 Aug 1;4(8):e1000142. doi: 10.1371/journal.pcbi.1000142.
8
Algorithms to detect multiprotein modularity conserved during evolution.用于检测进化过程中保守的多蛋白模块性的算法。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1046-58. doi: 10.1109/TCBB.2011.125.
9
MLH3: a DNA mismatch repair gene associated with mammalian microsatellite instability.MLH3:一种与哺乳动物微卫星不稳定性相关的DNA错配修复基因。
Nat Genet. 2000 Jan;24(1):27-35. doi: 10.1038/71643.
10
Conservation of hot regions in protein-protein interaction in evolution.蛋白质-蛋白质相互作用中热点区域在进化过程中的保守性。
Methods. 2016 Nov 1;110:73-80. doi: 10.1016/j.ymeth.2016.06.020. Epub 2016 Jun 21.

引用本文的文献

1
The Protein Interactome of and Bacterial Meta-interactomes Improve Function Predictions.和细菌元相互作用组的蛋白质相互作用组改善功能预测。
mSystems. 2017 Jun 6;2(3). doi: 10.1128/mSystems.00019-17. eCollection 2017 May-Jun.
2
Protein-protein interaction networks (PPI) and complex diseases.蛋白质-蛋白质相互作用网络(PPI)与复杂疾病。
Gastroenterol Hepatol Bed Bench. 2014 Winter;7(1):17-31.
3
A combined approach for genome wide protein function annotation/prediction.一种用于全基因组蛋白质功能注释/预测的联合方法。

本文引用的文献

1
Evolution of biomolecular networks: lessons from metabolic and protein interactions.生物分子网络的演化:来自代谢和蛋白质相互作用的经验教训。
Nat Rev Mol Cell Biol. 2009 Nov;10(11):791-803. doi: 10.1038/nrm2787.
2
How and when should interactome-derived clusters be used to predict functional modules and protein function?应当如何以及何时使用互作组学衍生的聚类来预测功能模块和蛋白质功能?
Bioinformatics. 2009 Dec 1;25(23):3143-50. doi: 10.1093/bioinformatics/btp551. Epub 2009 Sep 21.
3
The Gene Ontology's Reference Genome Project: a unified framework for functional annotation across species.
Proteome Sci. 2013 Nov 7;11(Suppl 1):S1. doi: 10.1186/1477-5956-11-S1-S1.
4
Hidden information revealed by optimal community structure from a protein-complex bipartite network improves protein function prediction.最优社区结构揭示蛋白质复合物二分网络中的隐藏信息,提高蛋白质功能预测。
PLoS One. 2013;8(4):e60372. doi: 10.1371/journal.pone.0060372. Epub 2013 Apr 5.
5
Global network alignment using multiscale spectral signatures.使用多尺度谱特征进行全球网络对齐。
Bioinformatics. 2012 Dec 1;28(23):3105-14. doi: 10.1093/bioinformatics/bts592. Epub 2012 Oct 9.
基因本体论参考基因组计划:跨物种功能注释的统一框架。
PLoS Comput Biol. 2009 Jul;5(7):e1000431. doi: 10.1371/journal.pcbi.1000431. Epub 2009 Jul 3.
4
ESG: extended similarity group method for automated protein function prediction.ESG:用于蛋白质功能自动预测的扩展相似性分组方法。
Bioinformatics. 2009 Jul 15;25(14):1739-45. doi: 10.1093/bioinformatics/btp309. Epub 2009 May 12.
5
Preferential induction of EphB4 over EphB2 and its implication in colorectal cancer progression.EphB4相对于EphB2的优先诱导及其在结直肠癌进展中的意义。
Cancer Res. 2009 May 1;69(9):3736-45. doi: 10.1158/0008-5472.CAN-08-3232. Epub 2009 Apr 14.
6
Protein function annotation by homology-based inference.基于同源性推断的蛋白质功能注释。
Genome Biol. 2009 Feb 2;10(2):207. doi: 10.1186/gb-2009-10-2-207.
7
Annotating proteins with generalized functional linkages.用广义功能连接注释蛋白质。
Proc Natl Acad Sci U S A. 2008 Nov 18;105(46):17700-5. doi: 10.1073/pnas.0809583105. Epub 2008 Nov 12.
8
The rough guide to in silico function prediction, or how to use sequence and structure information to predict protein function.计算机模拟功能预测的简要指南,或如何利用序列和结构信息预测蛋白质功能。
PLoS Comput Biol. 2008 Oct;4(10):e1000160. doi: 10.1371/journal.pcbi.1000160. Epub 2008 Oct 31.
9
PFP: Automated prediction of gene ontology functional annotations with confidence scores using protein sequence data.PFP:利用蛋白质序列数据自动预测具有置信度分数的基因本体功能注释。
Proteins. 2009 Feb 15;74(3):566-82. doi: 10.1002/prot.22172.
10
Predicting protein function from domain content.从结构域组成预测蛋白质功能。
Bioinformatics. 2008 Aug 1;24(15):1681-7. doi: 10.1093/bioinformatics/btn312. Epub 2008 Jun 30.