Department of Horticulture, Washington State University, 45 Johnson Hall, Pullman, WA 99164, USA.
Database (Oxford). 2021 Apr 26;2021. doi: 10.1093/database/baab023.
Tripal MegaSearch is a Tripal module for querying and downloading biological data stored in Chado. The module allows site users to select a data type, restrict the dataset by applying filters, and then customize the fields to view and download, all through a single interface. Data types are set by site administrators; examples include gene, germplasm, marker, map, QTL, genotype, phenotype and expression data. When querying for genes, users can restrict the gene dataset using filters such as name, chromosome position and functional annotation. They can then customize the fields to download, such as name, organism, type, chromosome position and functional annotations such as BLAST, KEGG, InterPro and GO terms. FASTA files can also be downloaded for the sequence data. Site administrators can choose between two data sources to serve data: Tripal MegaSearch materialized views or Chado tables. If neither data source is suitable, administrators may also create their own materialized views and serve them through the flexible, dynamic Tripal MegaSearch query form. Tripal MegaSearch is currently implemented in several databases, including the Genome Database for Rosaceae (www.rosaceae.org) and TreeGenes (https://treegenesdb.org/).
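The filter-then-select-fields workflow described above can be illustrated with a minimal Python sketch. This is not the module's implementation (Tripal is a PHP/Drupal toolkit); the records, field names and the functions `filter_genes` and `to_csv` are hypothetical and stand in for rows that a real site would read from Chado tables or a materialized view.

```python
import csv
import io

# Hypothetical gene records (assumption: invented for illustration only).
GENES = [
    {"name": "geneA", "organism": "Malus domestica", "chromosome": "Chr01",
     "start": 1000, "end": 2500, "go_term": "GO:0008150"},
    {"name": "geneB", "organism": "Malus domestica", "chromosome": "Chr01",
     "start": 5000, "end": 6200, "go_term": "GO:0003674"},
    {"name": "geneC", "organism": "Prunus persica", "chromosome": "Chr02",
     "start": 300, "end": 900, "go_term": "GO:0005575"},
]


def filter_genes(records, name_contains=None, chromosome=None,
                 start=None, end=None):
    """Keep records that satisfy every filter that was supplied."""
    result = []
    for rec in records:
        if name_contains and name_contains.lower() not in rec["name"].lower():
            continue
        if chromosome and rec["chromosome"] != chromosome:
            continue
        # Position filter: keep genes that overlap the [start, end] window.
        if start is not None and rec["end"] < start:
            continue
        if end is not None and rec["start"] > end:
            continue
        result.append(rec)
    return result


def to_csv(records, fields):
    """Serialize only the user-selected fields, in the order given."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fields,
                            extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


# Restrict the dataset, then choose which fields to download.
hits = filter_genes(GENES, chromosome="Chr01", start=0, end=3000)
print(to_csv(hits, ["name", "organism", "go_term"]))
```

Keeping the filter step separate from the field-selection step mirrors the two stages of the query form: the same filtered result could be exported with a different set of columns without being queried again.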