Kazusa DNA Research Institute, Kisarazu, Chiba, 292-0813, Japan.
Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan.
BMC Plant Biol. 2023 Aug 12;23(1):391. doi: 10.1186/s12870-023-04392-8.
Plant genome information is fundamental to plant research and development. Along with the increase in the number of published plant genomes, there is a need for an efficient system to retrieve various kinds of genome-related information from many plant species across plant kingdoms. Various plant databases have been developed, but no public database covers both genomic and genetic resources over a wide range of plant species.
We have developed a plant genome portal site, Plant GARDEN (Genome And Resource Database Entry: https://plantgarden.jp/en/index ), to provide diverse information related to plant genomics and genetics in divergent plant species. Elasticsearch is used as a search engine, and cross-keyword search across species is available. Web-based user interfaces (WUI) for PCs and tablet computers were independently developed to make data searches more convenient. Several types of data are stored in Plant GARDEN: reference genomes, gene sequences, PCR-based DNA markers, trait-linked DNA markers identified in genetic studies, SNPs, and in/dels on publicly available sequence read archives (SRAs). The data registered in Plant GARDEN as of March 2023 included 304 assembled genome sequences, 11,331,614 gene sequences, 419,132 DNA markers, 8,225 QTLs, and 5,934 SNP lists (gvcf files). In addition, we have re-annotated all the genes registered in Plant GARDEN by using a functional annotation tool, Hayai-Annotation, to compare the orthologous relationships among genes.
The aim of Plant GARDEN is to provide plant genome information for use in the fields of plant science as well as for plant-based industries, education, and other relevant areas. Therefore, we have designed a WUI that allows a diverse range of users to access such information in an easy-to-understand manner. Plant GARDEN will eventually include a wide range of plant species for which genome sequences are assembled, and thus the number of plant species in the database will continue to expand. We anticipate that Plant GARDEN will promote the understanding of genomes and gene diversity by facilitating comparisons of the registered sequences.
植物基因组信息是植物研究与开发的基础。随着已发表植物基因组数量的增加,需要有一种高效的系统来从多个植物物种中检索各种与基因组相关的信息。已经开发了各种植物数据库,但没有公共数据库涵盖广泛的植物物种的基因组和遗传资源。
我们开发了一个植物基因组门户站点 Plant GARDEN(基因组和资源数据库条目:https://plantgarden.jp/en/index),以提供不同植物物种中与植物基因组学和遗传学相关的多样化信息。该站点使用 Elasticsearch 作为搜索引擎,支持跨物种的关键词交叉搜索。我们还独立开发了基于网络的 PC 和平板电脑用户界面(WUI),以方便数据搜索。Plant GARDEN 中存储了几种类型的数据:参考基因组、基因序列、基于 PCR 的 DNA 标记、遗传研究中鉴定的与性状相关的 DNA 标记、SNP 和公共序列读取档案(SRA)上的插入/缺失。截至 2023 年 3 月,在 Plant GARDEN 中注册的数据包括 304 个组装基因组序列、11331614 个基因序列、419132 个 DNA 标记、8225 个 QTL 和 5934 个 SNP 列表(gvcf 文件)。此外,我们使用功能注释工具 Hayai-Annotation 重新注释了 Plant GARDEN 中注册的所有基因,以比较基因之间的同源关系。
Plant GARDEN 的目标是为植物科学领域以及以植物为基础的产业、教育和其他相关领域提供植物基因组信息。因此,我们设计了一个 WUI,使各种用户都可以以易于理解的方式访问这些信息。Plant GARDEN 将最终包含广泛的组装基因组的植物物种,因此数据库中的植物物种数量将继续扩大。我们预计,Plant GARDEN 将通过促进已注册序列的比较,促进对基因组和基因多样性的理解。