Institut de Biologie du Développement de Marseille-Luminy, UMR6216 CNRS, F-13288 Marseille, Cedex 9, France.
Genome Res. 2010 Oct;20(10):1459-68. doi: 10.1101/gr.108175.110. Epub 2010 Jul 20.
Developmental biology aims to understand how the dynamics of embryonic shapes and organ functions are encoded in linear DNA molecules. Thanks to recent progress in genomics and imaging technologies, systemic approaches are now used in parallel with small-scale studies to establish links between genomic information and phenotypes, often described at the subcellular level. Current model organism databases, however, do not integrate heterogeneous data sets at different scales into a global view of the developmental program. Here, we present a novel, generic digital system, NISEED, and its implementation, ANISEED, to ascidians, which are invertebrate chordates suitable for developmental systems biology approaches. ANISEED hosts an unprecedented combination of anatomical and molecular data on ascidian development. This includes the first detailed anatomical ontologies for these embryos, and quantitative geometrical descriptions of developing cells obtained from reconstructed three-dimensional (3D) embryos up to the gastrula stages. Fully annotated gene model sets are linked to 30,000 high-resolution spatial gene expression patterns in wild-type and experimentally manipulated conditions and to 528 experimentally validated cis-regulatory regions imported from specialized databases or extracted from 160 literature articles. This highly structured data set can be explored via a Developmental Browser, a Genome Browser, and a 3D Virtual Embryo module. We show how integration of heterogeneous data in ANISEED can provide a system-level understanding of the developmental program through the automatic inference of gene regulatory interactions, the identification of inducing signals, and the discovery and explanation of novel asymmetric divisions.
发育生物学旨在理解线性 DNA 分子中胚胎形状和器官功能的动态是如何编码的。由于基因组学和成像技术的最新进展,系统方法现在与小规模研究并行使用,以建立基因组信息与表型之间的联系,这些表型通常在亚细胞水平上描述。然而,当前的模式生物数据库并没有将不同尺度的异构数据集整合到发育程序的全局视图中。在这里,我们提出了一种新颖的、通用的数字系统 NISEED 及其在棘皮动物中的实现 ANISEED,棘皮动物是适合发育系统生物学方法的无脊椎脊索动物。ANISEED 承载了棘皮动物发育的前所未有的解剖学和分子数据集的组合。这包括这些胚胎的第一个详细解剖学本体论,以及从重建的三维 (3D) 胚胎到原肠胚阶段的发育细胞的定量几何描述。完全注释的基因模型集与 30,000 个野生型和实验处理条件下的高分辨率空间基因表达模式以及 528 个从专门数据库导入或从 160 篇文献文章中提取的经过实验验证的顺式调控区相关联。这个高度结构化的数据集可以通过发育浏览器、基因组浏览器和 3D 虚拟胚胎模块进行探索。我们展示了如何通过自动推理基因调控相互作用、识别诱导信号以及发现和解释新的不对称分裂,将 ANISEED 中的异构数据集成来提供对发育程序的系统级理解。