Saccharomyces Genome Database, Department of Genetics, Stanford University, 300 Pasteur Drive, MC-5477 Stanford, CA 94305, USA.
Database (Oxford). 2013 Jul 9;2013:bat054. doi: 10.1093/database/bat054. Print 2013.
The Gene Ontology Consortium (GOC) is a community-based bioinformatics project that classifies gene product function through the use of structured controlled vocabularies. A fundamental application of the Gene Ontology (GO) is in the creation of gene product annotations, evidence-based associations between GO definitions and experimental or sequence-based analysis. Currently, the GOC disseminates 126 million annotations covering >374,000 species including all the kingdoms of life. This number includes two classes of GO annotations: those created manually by experienced biocurators reviewing the literature or by examination of biological data (1.1 million annotations covering 2226 species) and those generated computationally via automated methods. As manual annotations are often used to propagate functional predictions between related proteins within and between genomes, it is critical to provide accurate consistent manual annotations. Toward this goal, we present here the conventions defined by the GOC for the creation of manual annotation. This guide represents the best practices for manual annotation as established by the GOC project over the past 12 years. We hope this guide will encourage research communities to annotate gene products of their interest to enhance the corpus of GO annotations available to all. DATABASE URL: http://www.geneontology.org.
基因本体论联盟(GOC)是一个基于社区的生物信息学项目,通过使用结构化的受控词汇表对基因产物功能进行分类。基因本体论(GO)的一个基本应用是在基因产物注释的创建中,即在基于实验或基于序列的分析与 GO 定义之间建立有证据支持的关联。目前,GOC 传播了超过 374,000 个物种的 1.26 亿个注释,包括所有生命领域。这个数字包括两类 GO 注释:由经验丰富的生物注释员通过审查文献或通过检查生物数据创建的(覆盖 2226 个物种的 110 万个注释)和通过自动方法生成的计算注释。由于手动注释通常用于在基因组内和基因组之间的相关蛋白质之间传播功能预测,因此提供准确一致的手动注释至关重要。为此,我们在此展示了 GOC 为创建手动注释定义的约定。本指南代表了 GOC 项目在过去 12 年中建立的手动注释的最佳实践。我们希望本指南将鼓励研究社区对其感兴趣的基因产物进行注释,以增强所有可用的 GO 注释语料库。数据库 URL:http://www.geneontology.org。