Odronitz Florian, Hellkamp Marcel, Kollmar Martin
Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Goettingen, Germany.
BMC Genomics. 2007 Apr 17;8:103. doi: 10.1186/1471-2164-8-103.
The number of completed eukaryotic genome sequences and cDNA projects has increased exponentially in the past few years although most of them have not been published yet. In addition, many microarray analyses yielded thousands of sequenced EST and cDNA clones. For the researcher interested in single gene analyses (from a phylogenetic, a structural biology or other perspective) it is therefore important to have up-to-date knowledge about the various resources providing primary data.
The database is built around 3 central tables: species, sequencing projects and publications. The species table contains commonly and alternatively used scientific names, common names and the complete taxonomic information. For projects the sequence type and links to species project web-sites and species homepages are stored. All publications are linked to projects. The web-interface provides comprehensive search modules with detailed options and three different views of the selected data. We have especially focused on developing an elaborate taxonomic tree search tool that allows the user to instantaneously identify e.g. the closest relative to the organism of interest.
We have developed a database, called diArk, to store, organize, and present the most relevant information about completed genome projects and EST/cDNA data from eukaryotes. Currently, diArk provides information about 415 eukaryotes, 823 sequencing projects, and 248 publications.
在过去几年中,已完成的真核生物基因组序列和cDNA项目数量呈指数增长,尽管其中大多数尚未发表。此外,许多微阵列分析产生了数千个已测序的EST和cDNA克隆。因此,对于对单基因分析感兴趣的研究人员(从系统发育、结构生物学或其他角度)来说,了解提供原始数据的各种资源的最新知识非常重要。
该数据库围绕3个中心表构建:物种、测序项目和出版物。物种表包含常用和交替使用的科学名称、通用名称以及完整的分类信息。对于项目,存储序列类型以及到物种项目网站和物种主页的链接。所有出版物都与项目相关联。网络界面提供了具有详细选项的综合搜索模块以及所选数据的三种不同视图。我们特别专注于开发一种精细的分类树搜索工具,该工具允许用户立即识别例如与感兴趣的生物体关系最密切的亲属。
我们开发了一个名为diArk的数据库,用于存储、组织和呈现有关已完成的真核生物基因组项目以及EST/cDNA数据的最相关信息。目前,diArk提供有关415种真核生物、823个测序项目和248篇出版物的信息。