Suppr
超能文献

大规模计算蛋白质功能预测评估。

A large-scale evaluation of computational protein function prediction.

机构信息

School of Informatics and Computing, Indiana University, Bloomington, Indiana, USA.

出版信息

Nat Methods. 2013 Mar;10(3):221-7. doi: 10.1038/nmeth.2340. Epub 2013 Jan 27.

DOI:10.1038/nmeth.2340

PMID:23353650

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3584181/

Abstract

Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based critical assessment of protein function annotation (CAFA) experiment. Fifty-four methods representing the state of the art for protein function prediction were evaluated on a target set of 866 proteins from 11 organisms. Two findings stand out: (i) today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is considerable need for improvement of currently available tools.

摘要

蛋白质功能的自动注释具有挑战性。随着测序基因组数量的快速增长，绝大多数蛋白质产物只能通过计算进行注释。如果要依赖于计算预测，那么这些方法的准确性就至关重要。本文报告了首次大规模基于社区的蛋白质功能注释（CAFA）实验的结果。54 种方法代表了蛋白质功能预测的最新技术水平，它们在来自 11 个生物体的 866 个蛋白质目标集上进行了评估。有两个发现引人注目：（i）当今最好的蛋白质功能预测算法大大优于广泛使用的第一代方法，在所有类型的目标上都有显著提高；（ii）尽管顶级方法的表现足以指导实验，但目前可用工具仍有很大的改进空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ffe7/6871008/7a80c641dea9/41592_2013_Article_BFnmeth2340_Fig1_HTML.jpg

相似文献

A large-scale evaluation of computational protein function prediction.

Nat Methods. 2013 Mar;10(3):221-7. doi: 10.1038/nmeth.2340. Epub 2013 Jan 27.

Characterizing the state of the art in the computational assignment of gene function: lessons from the first critical assessment of functional annotation (CAFA).

BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S15. doi: 10.1186/1471-2105-14-s3-s15.

In-depth performance evaluation of PFP and ESG sequence-based function prediction methods in CAFA 2011 experiment.

BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S2. doi: 10.1186/1471-2105-14-S3-S2. Epub 2013 Feb 28.

An expanded evaluation of protein function prediction methods shows an improvement in accuracy.

Genome Biol. 2016 Sep 7;17(1):184. doi: 10.1186/s13059-016-1037-6.

Protein function prediction using text-based features extracted from the biomedical literature: the CAFA challenge.

BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S14. doi: 10.1186/1471-2105-14-S3-S14. Epub 2013 Feb 28.

Using PFP and ESG Protein Function Prediction Web Servers.

Methods Mol Biol. 2017;1611:1-14. doi: 10.1007/978-1-4939-7015-5_1.

Protein function prediction by massive integration of evolutionary analyses and multiple data sources.

BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S1. doi: 10.1186/1471-2105-14-S3-S1. Epub 2013 Feb 28.

The PFP and ESG protein function prediction methods in 2014: effect of database updates and ensemble approaches.

Gigascience. 2015 Sep 14;4:43. doi: 10.1186/s13742-015-0083-4. eCollection 2015.

Ten years of predictions ... and counting.

FEBS J. 2005 Feb;272(4):881-2. doi: 10.1111/j.1742-4658.2005.04549.x.

Orthology prediction methods: a quality assessment using curated protein families.

Bioessays. 2011 Oct;33(10):769-80. doi: 10.1002/bies.201100062. Epub 2011 Aug 19.

引用本文的文献

Protein functional site annotation using local structure embeddings.

Proc Natl Acad Sci U S A. 2025 Aug 26;122(34):e2513219122. doi: 10.1073/pnas.2513219122. Epub 2025 Aug 20.

MKFGO: integrating multi-source knowledge fusion with pretrained language model for high-accuracy protein function prediction.

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf420.

SuperEdgeGO: Edge-supervised graph representation learning for enhanced protein function prediction.

PLoS Comput Biol. 2025 Aug 1;21(8):e1013343. doi: 10.1371/journal.pcbi.1013343. eCollection 2025 Aug.

GeneAgent: self-verification language agent for gene-set analysis using domain databases.

Nat Methods. 2025 Jul 28. doi: 10.1038/s41592-025-02748-6.

PlantConnectome: A knowledge graph database encompassing >71,000 plant articles.

Plant Cell. 2025 Jul 1;37(7). doi: 10.1093/plcell/koaf169.

RC-GNN: A predictive model of enzyme-reaction pairs.

bioRxiv. 2025 Jun 27:2025.06.22.660952. doi: 10.1101/2025.06.22.660952.

FINCHES: A Computational Framework for Predicting Intermolecular Interactions in Intrinsically Disordered Proteins.

Int J Mol Sci. 2025 Jun 28;26(13):6246. doi: 10.3390/ijms26136246.

PLMSearch and PLMAlign: Protein Language Model (PLM)-Based Homologous Protein Sequence Search and Alignment.

Methods Mol Biol. 2025;2941:227-241. doi: 10.1007/978-1-0716-4623-6_14.

Multi-stage attention-based extraction and fusion of protein sequence and structural features for protein function prediction.

Bioinformatics. 2025 Jun 26. doi: 10.1093/bioinformatics/btaf374.

GOBeacon: An ensemble model for protein function prediction enhanced by contrastive learning.

Protein Sci. 2025 Jul;34(7):e70182. doi: 10.1002/pro.70182.

本文引用的文献

The Pfam protein families database.

Nucleic Acids Res. 2012 Jan;40(Database issue):D290-301. doi: 10.1093/nar/gkr1065. Epub 2011 Nov 29.

The Enzyme Function Initiative.

Biochemistry. 2011 Nov 22;50(46):9950-62. doi: 10.1021/bi201312u. Epub 2011 Oct 26.

Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium.

Brief Bioinform. 2011 Sep;12(5):449-62. doi: 10.1093/bib/bbr042. Epub 2011 Aug 27.

Computational methods for identification of functional residues in protein structures.

Curr Protein Pept Sci. 2011 Sep;12(6):456-69. doi: 10.2174/138920311796957685.

Testing the ortholog conjecture with comparative functional genomic data from mammals.

PLoS Comput Biol. 2011 Jun;7(6):e1002073. doi: 10.1371/journal.pcbi.1002073. Epub 2011 Jun 9.

Analysis of protein function and its prediction from amino acid sequence.

Proteins. 2011 Jul;79(7):2086-96. doi: 10.1002/prot.23029. Epub 2011 Apr 19.

PNPASE regulates RNA import into mitochondria.

Cell. 2010 Aug 6;142(3):456-67. doi: 10.1016/j.cell.2010.06.035.

Hierarchical classification of gene ontology terms using the GOstruct method.

J Bioinform Comput Biol. 2010 Apr;8(2):357-76. doi: 10.1142/s0219720010004744.

Enzyme promiscuity: a mechanistic and evolutionary perspective.

Annu Rev Biochem. 2010;79:471-505. doi: 10.1146/annurev-biochem-030409-143718.

Bayesian Markov Random Field analysis for protein function prediction based on network data.

PLoS One. 2010 Feb 24;5(2):e9293. doi: 10.1371/journal.pone.0009293.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

大规模计算蛋白质功能预测评估。

A large-scale evaluation of computational protein function prediction.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译