Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA.
Nucleic Acids Res. 2019 Jan 8;47(D1):D649-D659. doi: 10.1093/nar/gky977.
The Genomes Online Database (GOLD) (https://gold.jgi.doe.gov) is an open online resource, which maintains an up-to-date catalog of genome and metagenome projects in the context of a comprehensive list of associated metadata. Information in GOLD is organized into four levels: Study, Biosample/Organism, Sequencing Project and Analysis Project. Currently GOLD hosts information on 33 415 Studies, 49 826 Biosamples, 313 324 Organisms, 215 881 Sequencing Projects and 174 454 Analysis Projects with a total of 541 metadata fields, of which 80 are based on controlled vocabulary (CV) terms. GOLD provides a user-friendly web interface to browse sequencing projects and launch advanced search tools across four classification levels. Users submit metadata on a wide range of Sequencing and Analysis Projects in GOLD before depositing sequence data to the Integrated Microbial Genomes (IMG) system for analysis. GOLD conforms with and supports the rules set by the Genomic Standards Consortium (GSC) Minimum Information standards. The current version of GOLD (v.7) has seen the number of projects and associated metadata increase exponentially over the years. This paper provides an update on the current status of GOLD and highlights the new features added over the last two years.
基因组在线数据库(GOLD)(https://gold.jgi.doe.gov)是一个开放的在线资源,它维护着一个最新的基因组和宏基因组项目目录,以及相关元数据的综合列表。GOLD 中的信息分为四个层次:研究、生物样本/生物、测序项目和分析项目。目前,GOLD 收录了 33415 项研究、49826 个生物样本、313324 个生物、215881 个测序项目和 174454 个分析项目,共包含 541 个元数据字段,其中 80 个字段基于受控词汇 (CV) 术语。GOLD 提供了一个用户友好的 Web 界面,可用于浏览测序项目,并在四个分类级别上启动高级搜索工具。在向用于分析的集成微生物基因组 (IMG) 系统提交序列数据之前,用户会在 GOLD 中提交广泛的测序和分析项目的元数据。GOLD 符合并支持基因组标准联盟 (GSC) 最低信息标准设定的规则。多年来,GOLD 的当前版本(v.7)的项目数量和相关元数据呈指数级增长。本文介绍了 GOLD 的最新现状,并重点介绍了过去两年中添加的新功能。