Vize Peter D, Liu Yu, Karimi Kamran
Departments of Biological Sciences and Computer Science, University of Calgary, Calgary, Alta., Canada.
Cytogenet Genome Res. 2015;145(3-4):278-82. doi: 10.1159/000430427. Epub 2015 May 27.
At the heart of databases is a data model referred to as a schema. Relational databases store information in tables, and the schema defines the tables and provides a map of relationships that show how the different table/data types relate to one another. In Xenbase, we were tasked to represent genomic, molecular, and biological data of both a diploid and tetraploid Xenopus species. When the database model was built over a decade ago, we had very little information on the nature of the X.laevis tetraploidization, but a Chado-based data model was proposed that could deal with the various forms of data in both species. Once the X.laevis genome was sequenced and annotated, it became clear that the data schema is very like the evolutionary schema that resulted in the X.laevis genome.
数据库的核心是一种称为模式的数据模型。关系数据库将信息存储在表中,而模式定义了这些表,并提供了关系映射,展示了不同表/数据类型之间的相互关系。在非洲爪蟾数据库(Xenbase)中,我们的任务是呈现二倍体和四倍体非洲爪蟾物种的基因组、分子和生物学数据。十多年前构建数据库模型时,我们对非洲爪蟾(X.laevis)四倍体化的本质了解甚少,但有人提出了一种基于Chado的数据模型,该模型可以处理这两个物种的各种数据形式。一旦非洲爪蟾的基因组被测序和注释,就很明显数据模式与产生非洲爪蟾基因组的进化模式非常相似。