Department of Biological Sciences, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland.
Arch Microbiol. 2010 Mar;192(3):151-5. doi: 10.1007/s00203-010-0549-9. Epub 2010 Feb 3.
As the protein databases continue to expand at an exponential rate, fed by daily uploads from multiple large scale genomic and metagenomic projects, the problem of assigning a function to each new protein has become the focus of significant research interest in recent times. Herein, we review the most recent advances in the field of automated function prediction (AFP). We begin by defining what is meant by biological "function" and the means of describing such functions using standardised machine readable ontologies. We then focus on the various function-prediction programs available, both sequence and structure based, and outline their associated strengths and weaknesses. Finally, we conclude with a brief overview of the future challenges and outstanding questions in the field, which still remain unanswered.
随着蛋白质数据库的持续指数级增长,每天都有来自多个大规模基因组和宏基因组项目的上传,为每个新蛋白质分配功能的问题已成为近年来研究的重点。在此,我们综述了自动化功能预测 (AFP) 领域的最新进展。我们首先定义了生物“功能”的含义,以及使用标准化机器可读本体描述此类功能的方法。然后,我们专注于现有的各种基于序列和结构的功能预测程序,并概述了它们的优缺点。最后,我们简要概述了该领域未来的挑战和悬而未决的问题。