Nucleic Acids Res. 2021 Jan 8;49(D1):D325-D334. doi: 10.1093/nar/gkaa1113.
The Gene Ontology Consortium (GOC) provides the most comprehensive resource currently available for computable knowledge regarding the functions of genes and gene products. Here, we report the advances of the consortium over the past two years. The new GO-CAM annotation framework was notably improved, and we formalized the model with a computational schema to check and validate the rapidly increasing repository of 2838 GO-CAMs. In addition, we describe the impacts of several collaborations to refine GO and report a 10% increase in the number of GO annotations, a 25% increase in annotated gene products, and over 9,400 new scientific articles annotated. As the project matures, we continue our efforts to review older annotations in light of newer findings, and, to maintain consistency with other ontologies. As a result, 20 000 annotations derived from experimental data were reviewed, corresponding to 2.5% of experimental GO annotations. The website (http://geneontology.org) was redesigned for quick access to documentation, downloads and tools. To maintain an accurate resource and support traceability and reproducibility, we have made available a historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations.
基因本体论联盟 (GOC) 提供了目前关于基因和基因产物功能的可计算知识的最全面资源。在这里,我们报告了该联盟在过去两年中的进展。新的 GO-CAM 注释框架得到了显著改进,我们用计算模式使其形式化,以检查和验证 2838 个 GO-CAMs 快速增长的存储库。此外,我们描述了几个合作项目来完善 GO,并报告了 GO 注释数量增加了 10%,注释的基因产物增加了 25%,以及超过 9400 篇新的科学文章被注释。随着项目的成熟,我们继续根据新的发现和与其他本体的一致性来审查旧的注释。结果,有 20000 个来自实验数据的注释被审查,占实验 GO 注释的 2.5%。网站 (http://geneontology.org) 进行了重新设计,以便快速访问文档、下载和工具。为了保持资源的准确性,并支持可追溯性和可重复性,我们提供了一个历史存档,涵盖了过去 15 年的 GO 数据,具有一致的格式和文件结构,适用于本体和注释。