Department of Animal Science, Iowa State University, 2255 Kildee Hall, Ames, IA 50011, USA.
Nucleic Acids Res. 2019 Jan 8;47(D1):D701-D710. doi: 10.1093/nar/gky1084.
Successful development of biological databases requires accommodating the burgeoning volume of data from high-throughput genomics pipelines. As the volume of curated data in Animal QTLdb (https://www.animalgenome.org/QTLdb) grows exponentially, the resulting challenges must be met with rapid infrastructure development that accommodates large-scale data curation and makes metadata analysis more powerful. Over the past 15 years, the development of Animal QTLdb and CorrDB has given researchers valuable tools for using a wealth of phenotype/genotype data to study the genetic architecture of livestock traits. We have focused our efforts on data curation, improved data quality maintenance, new tool development, and database co-development in order to provide convenient platforms for users to query and analyze data. The database currently holds 158 499 QTL/associations, 10 482 correlations and 1977 heritability records, the result of an average data increase of 32% per year. In addition, we have made more than 14 functional improvements or new tool implementations since our last report. Our ultimate goals in database development are to provide infrastructure for data collection, curation, and annotation and, more importantly, to support innovative data structures for new types of data mining, data reanalysis, and networked genetic analysis that lead to the generation of new knowledge.