1] Research Institute of Molecular Pathology (IMP), Vienna Biocenter VBC, Dr Bohr-Gasse 7, 1030 Vienna, Austria [2] Howard Hughes Medical Institute, Janelia Farm Research Campus, Ashburn, Virginia 20147, USA (B.J.D.); Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA (E.Z.K.).
Research Institute of Molecular Pathology (IMP), Vienna Biocenter VBC, Dr Bohr-Gasse 7, 1030 Vienna, Austria.
Nature. 2014 Aug 7;512(7512):91-5. doi: 10.1038/nature13395. Epub 2014 Jun 1.
Transcriptional enhancers are crucial regulators of gene expression and animal development and the characterization of their genomic organization, spatiotemporal activities and sequence properties is a key goal in modern biology. Here we characterize the in vivo activity of 7,705 Drosophila melanogaster enhancer candidates covering 13.5% of the non-coding non-repetitive genome throughout embryogenesis. 3,557 (46%) candidates are active, suggesting a high density with 50,000 to 100,000 developmental enhancers genome-wide. The vast majority of enhancers display specific spatial patterns that are highly dynamic during development. Most appear to regulate their neighbouring genes, suggesting that the cis-regulatory genome is organized locally into domains, which are supported by chromosomal domains, insulator binding and genome evolution. However, 12 to 21 per cent of enhancers appear to skip non-expressed neighbours and regulate a more distal gene. Finally, we computationally identify cis-regulatory motifs that are predictive and required for enhancer activity, as we validate experimentally. This work provides global insights into the organization of an animal regulatory genome and the make-up of enhancer sequences and confirms and generalizes principles from previous studies. All enhancer patterns are annotated manually with a controlled vocabulary and all results are available through a web interface (http://enhancers.starklab.org), including the raw images of all microscopy slides for manual inspection at arbitrary zoom levels.
转录增强子是基因表达和动物发育的关键调控因子,其基因组组织、时空活性和序列特征的描述是现代生物学的一个关键目标。在这里,我们描述了 7705 个果蝇增强子候选者在胚胎发生过程中覆盖非编码非重复基因组的 13.5%的体内活性。其中 3557 个(46%)候选者是活跃的,这表明在全基因组中有 50,000 到 100,000 个发育增强子具有很高的密度。绝大多数增强子显示出特定的空间模式,在发育过程中高度动态。大多数似乎调节其邻近基因,这表明顺式调控基因组局部组织成域,这些域得到染色体域、绝缘子结合和基因组进化的支持。然而,12%到 21%的增强子似乎跳过非表达的邻居,调节更遥远的基因。最后,我们通过计算识别出与增强子活性相关的顺式调控基序,我们通过实验验证了这些基序的预测性和必要性。这项工作提供了对动物调控基因组组织和增强子序列构成的全面了解,并证实和推广了先前研究的原则。所有的增强子模式都用受控词汇进行手动注释,所有的结果都可以通过一个网络界面(http://enhancers.starklab.org)获得,包括所有显微镜载玻片的原始图像,以便在任意缩放级别进行手动检查。