• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于贝叶斯系统发育基因组学的蛋白质分子功能预测

Protein molecular function prediction by Bayesian phylogenomics.

作者信息

Engelhardt Barbara E, Jordan Michael I, Muratore Kathryn E, Brenner Steven E

机构信息

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California, United States of America.

出版信息

PLoS Comput Biol. 2005 Oct;1(5):e45. doi: 10.1371/journal.pcbi.0010045. Epub 2005 Oct 7.

DOI:10.1371/journal.pcbi.0010045
PMID:16217548
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1246806/
Abstract

We present a statistical graphical model to infer specific molecular function for unannotated protein sequences using homology. Based on phylogenomic principles, SIFTER (Statistical Inference of Function Through Evolutionary Relationships) accurately predicts molecular function for members of a protein family given a reconciled phylogeny and available function annotations, even when the data are sparse or noisy. Our method produced specific and consistent molecular function predictions across 100 Pfam families in comparison to the Gene Ontology annotation database, BLAST, GOtcha, and Orthostrapper. We performed a more detailed exploration of functional predictions on the adenosine-5'-monophosphate/adenosine deaminase family and the lactate/malate dehydrogenase family, in the former case comparing the predictions against a gold standard set of published functional characterizations. Given function annotations for 3% of the proteins in the deaminase family, SIFTER achieves 96% accuracy in predicting molecular function for experimentally characterized proteins as reported in the literature. The accuracy of SIFTER on this dataset is a significant improvement over other currently available methods such as BLAST (75%), GeneQuiz (64%), GOtcha (89%), and Orthostrapper (11%). We also experimentally characterized the adenosine deaminase from Plasmodium falciparum, confirming SIFTER's prediction. The results illustrate the predictive power of exploiting a statistical model of function evolution in phylogenomic problems. A software implementation of SIFTER is available from the authors.

摘要

我们提出了一种统计图形模型,用于利用同源性推断未注释蛋白质序列的特定分子功能。基于系统发育基因组学原理,SIFTER(通过进化关系进行功能的统计推断)在给定协调的系统发育和可用功能注释的情况下,能够准确预测蛋白质家族成员的分子功能,即使数据稀疏或有噪声。与基因本体注释数据库、BLAST、GOtcha和Orthostrapper相比,我们的方法在100个Pfam家族中产生了特定且一致的分子功能预测。我们对腺苷-5'-单磷酸/腺苷脱氨酶家族和乳酸/苹果酸脱氢酶家族的功能预测进行了更详细的探索,在前一种情况下,将预测结果与一组已发表的功能特征的黄金标准进行比较。对于脱氨酶家族中3%的蛋白质给出功能注释的情况下,SIFTER在预测文献中报道的实验表征蛋白质的分子功能方面达到了96%的准确率。SIFTER在该数据集上的准确率比其他现有方法(如BLAST(75%)、GeneQuiz(64%)、GOtcha(89%)和Orthostrapper(11%))有显著提高。我们还对恶性疟原虫的腺苷脱氨酶进行了实验表征,证实了SIFTER的预测。结果说明了在系统发育基因组学问题中利用功能进化统计模型的预测能力。作者提供了SIFTER的软件实现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/671f555a4e95/pcbi.0010045.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/eb65a3fca120/pcbi.0010045.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/143ef83f664a/pcbi.0010045.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/03d0ddee02e6/pcbi.0010045.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/7e0f2169175e/pcbi.0010045.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/639fc5fc4a33/pcbi.0010045.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/671f555a4e95/pcbi.0010045.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/eb65a3fca120/pcbi.0010045.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/143ef83f664a/pcbi.0010045.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/03d0ddee02e6/pcbi.0010045.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/7e0f2169175e/pcbi.0010045.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/639fc5fc4a33/pcbi.0010045.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6390/1274288/671f555a4e95/pcbi.0010045.g006.jpg

相似文献

1
Protein molecular function prediction by Bayesian phylogenomics.基于贝叶斯系统发育基因组学的蛋白质分子功能预测
PLoS Comput Biol. 2005 Oct;1(5):e45. doi: 10.1371/journal.pcbi.0010045. Epub 2005 Oct 7.
2
SIFTER-T: a scalable and optimized framework for the SIFTER phylogenomic method of probabilistic protein domain annotation.SIFTER-T:一种用于概率性蛋白质结构域注释的SIFTER系统发育基因组学方法的可扩展且优化的框架。
Biotechniques. 2015 Mar 1;58(3):140-2. doi: 10.2144/000114266. eCollection 2015 Mar.
3
Genome-scale phylogenetic function annotation of large and diverse protein families.大规模和多样化蛋白质家族的基因组尺度系统发育功能注释。
Genome Res. 2011 Nov;21(11):1969-80. doi: 10.1101/gr.104687.109. Epub 2011 Jul 22.
4
Clustering the annotation space of proteins.对蛋白质的注释空间进行聚类。
BMC Bioinformatics. 2005 Feb 9;6:24. doi: 10.1186/1471-2105-6-24.
5
SIFTER search: a web server for accurate phylogeny-based protein function prediction.SIFTER搜索:一个用于基于系统发育的蛋白质功能准确预测的网络服务器。
Nucleic Acids Res. 2015 Jul 1;43(W1):W141-7. doi: 10.1093/nar/gkv461. Epub 2015 May 15.
6
AVID: an integrative framework for discovering functional relationships among proteins.AVID:一个用于发现蛋白质间功能关系的综合框架。
BMC Bioinformatics. 2005 Jun 1;6:136. doi: 10.1186/1471-2105-6-136.
7
Berkeley Phylogenomics Group web servers: resources for structural phylogenomic analysis.伯克利系统发育基因组学小组网络服务器:结构系统发育基因组分析资源。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W27-32. doi: 10.1093/nar/gkm325. Epub 2007 May 8.
8
PFP: Automated prediction of gene ontology functional annotations with confidence scores using protein sequence data.PFP:利用蛋白质序列数据自动预测具有置信度分数的基因本体功能注释。
Proteins. 2009 Feb 15;74(3):566-82. doi: 10.1002/prot.22172.
9
Evaluation of methods for predicting the topology of beta-barrel outer membrane proteins and a consensus prediction method.β-桶状外膜蛋白拓扑结构预测方法的评估及一种共识预测方法
BMC Bioinformatics. 2005 Jan 12;6:7. doi: 10.1186/1471-2105-6-7.
10
SplitTester: software to identify domains responsible for functional divergence in protein family.SplitTester:用于识别蛋白质家族中功能差异相关结构域的软件。
BMC Bioinformatics. 2005 Jun 1;6:137. doi: 10.1186/1471-2105-6-137.

引用本文的文献

1
Improving enzyme functional annotation by integrating in vitro and in silico approaches: The example of histidinol phosphate phosphatases.通过整合体外和计算方法来改善酶功能注释:以肌醇磷酸磷酸酶为例。
Protein Sci. 2024 Feb;33(2):e4899. doi: 10.1002/pro.4899.
2
Phylogenetic inference of the emergence of sequence modules and protein-protein interactions in the ADAMTS-TSL family.ADAMTS-TSL 家族中序列模块和蛋白-蛋白相互作用出现的系统发育推断。
PLoS Comput Biol. 2023 Aug 31;19(8):e1011404. doi: 10.1371/journal.pcbi.1011404. eCollection 2023 Aug.
3
Bioinformatics approaches for classification and investigation of the evolution of the Na/K-ATPase alpha-subunit.

本文引用的文献

1
GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes.GOtcha:一种通过七个基因组注释评估蛋白质功能预测的新方法。
BMC Bioinformatics. 2004 Nov 18;5:178. doi: 10.1186/1471-2105-5-178.
2
Automated prediction of protein function and detection of functional sites from structure.从结构自动预测蛋白质功能并检测功能位点。
Proc Natl Acad Sci U S A. 2004 Oct 12;101(41):14754-9. doi: 10.1073/pnas.0404569101. Epub 2004 Sep 29.
3
Call for an enzyme genomics initiative.呼吁开展一项酶基因组学计划。
生物信息学方法在钠离子/钾离子-ATP 酶 α 亚基分类和进化研究中的应用。
BMC Ecol Evol. 2022 Oct 26;22(1):122. doi: 10.1186/s12862-022-02071-0.
4
Multiple Profile Models Extract Features from Protein Sequence Data and Resolve Functional Diversity of Very Different Protein Families.多剖面模型从蛋白质序列数据中提取特征,并解决非常不同蛋白质家族的功能多样性。
Mol Biol Evol. 2022 Apr 10;39(4). doi: 10.1093/molbev/msac070.
5
Schistosomiasis Drug Discovery in the Era of Automation and Artificial Intelligence.自动化和人工智能时代的血吸虫病药物发现。
Front Immunol. 2021 May 31;12:642383. doi: 10.3389/fimmu.2021.642383. eCollection 2021.
6
Bayesian parameter estimation for automatic annotation of gene functions using observational data and phylogenetic trees.贝叶斯参数估计在使用观测数据和系统发生树自动注释基因功能中的应用。
PLoS Comput Biol. 2021 Feb 18;17(2):e1007948. doi: 10.1371/journal.pcbi.1007948. eCollection 2021 Feb.
7
PhyloGenes: An online phylogenetics and functional genomics resource for plant gene function inference.PhyloGenes:一个用于植物基因功能推断的在线系统发育学和功能基因组学资源。
Plant Direct. 2020 Dec 30;4(12):e00293. doi: 10.1002/pld3.293. eCollection 2020 Dec.
8
The ortholog conjecture revisited: the value of orthologs and paralogs in function prediction.重新审视直系同源推断假说:直系同源物和旁系同源物在功能预测中的价值。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i219-i226. doi: 10.1093/bioinformatics/btaa468.
9
The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.CAFA 挑战赛报告称,通过实验筛选,提高了数百个基因的蛋白质功能预测和新的功能注释。
Genome Biol. 2019 Nov 19;20(1):244. doi: 10.1186/s13059-019-1835-8.
10
SamPler - a novel method for selecting parameters for gene functional annotation routines.SamPler-一种用于选择基因功能注释例程参数的新方法。
BMC Bioinformatics. 2019 Sep 5;20(1):454. doi: 10.1186/s12859-019-3038-4.
Genome Biol. 2004;5(8):401. doi: 10.1186/gb-2004-5-8-401. Epub 2004 Jul 30.
4
GOblet: a platform for Gene Ontology annotation of anonymous sequence data.GOblet:一个用于对匿名序列数据进行基因本体注释的平台。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W313-7. doi: 10.1093/nar/gkh406.
5
The Bayesian revolution in genetics.遗传学中的贝叶斯革命。
Nat Rev Genet. 2004 Apr;5(4):251-61. doi: 10.1038/nrg1318.
6
Identifying protein function--a call for community action.识别蛋白质功能——呼吁社区行动。
PLoS Biol. 2004 Mar;2(3):E42. doi: 10.1371/journal.pbio.0020042. Epub 2004 Mar 16.
7
Adenosine deaminases acting on RNA (ADARs): RNA-editing enzymes.作用于RNA的腺苷脱氨酶(ADARs):RNA编辑酶。
Genome Biol. 2004;5(2):209. doi: 10.1186/gb-2004-5-2-209. Epub 2004 Feb 2.
8
Phylogenomic inference of protein molecular function: advances and challenges.蛋白质分子功能的系统发育基因组学推断:进展与挑战
Bioinformatics. 2004 Jan 22;20(2):170-9. doi: 10.1093/bioinformatics/bth021.
9
The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology.基因本体注释(GOA)数据库:在UniProt中与基因本体共享知识。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D262-6. doi: 10.1093/nar/gkh021.
10
GoFigure: automated Gene Ontology annotation.GoFigure:自动基因本体注释
Bioinformatics. 2003 Dec 12;19(18):2484-5. doi: 10.1093/bioinformatics/btg338.