Verbeek F J, Lawson K A, Bard J B
Hubrecht Laboratory, Netherlands Institute for Developmental Biology, Utrecht.
Int J Dev Biol. 1999;43(7):761-71.
This paper discusses current efforts to produce databases of gene expression for the major model embryos used in developmental biology. The efforts to build these resources were motivated by the need for immediate internet access to all types of research data, and the production of these databases is a major and new challenge for bioinformatics. Thus far bioinformatics has mainly been concerned with textually oriented resources and data, much of it concerned with gene and protein sequences. Because the genetic basis of developmental biology is integrated with developmental anatomy, these databases require the use of images to link molecular data with spatial information. In order to standardise database formats, digital atlases of some model systems are being produced that include integrated anatomical descriptions and these are being linked to appropriate genetic data. Integrating such image-based, searchable data into databases makes new demands on the field of bioinformatics and we consider here the imaging modalities that are used to obtain information and we discuss in particular the production of 3D images from serial sections. Next, we consider how to integrate textual and spatial descriptions of gene expression and the key tool needed to make this possible, i.e. anatomical nomenclature. A short review of internet resources on developmental biology is also given and future prospects for the development of these databases are discussed.
本文讨论了目前为发育生物学中主要的模式胚胎建立基因表达数据库的工作。构建这些资源的工作是出于对即时通过互联网获取各类研究数据的需求,而这些数据库的建立对生物信息学来说是一项重大的新挑战。到目前为止,生物信息学主要关注的是文本导向的资源和数据,其中大部分与基因和蛋白质序列有关。由于发育生物学的遗传基础与发育解剖学相互关联,这些数据库需要利用图像将分子数据与空间信息联系起来。为了使数据库格式标准化,正在制作一些模式系统的数字图谱,其中包括综合的解剖学描述,并将这些描述与相应的遗传数据相链接。将此类基于图像的、可搜索的数据整合到数据库中,对生物信息学领域提出了新的要求,我们在此考虑用于获取信息的成像方式,并特别讨论从连续切片生成三维图像的方法。接下来,我们思考如何整合基因表达的文本和空间描述以及实现这一目标所需的关键工具,即解剖学命名法。此外,还对发育生物学的互联网资源进行了简要回顾,并讨论了这些数据库未来的发展前景。