Farley Emma K, Olson Katrina M, Zhang Wei, Rokhsar Daniel S, Levine Michael S
Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544;
Department of Medicine, University of California, San Diego, CA 92093-0688;
Proc Natl Acad Sci U S A. 2016 Jun 7;113(23):6508-13. doi: 10.1073/pnas.1605085113. Epub 2016 May 6.
Transcriptional enhancers are short segments of DNA that switch genes on and off in response to a variety of intrinsic and extrinsic signals. Despite the discovery of the first enhancer more than 30 y ago, the relationship between primary DNA sequence and enhancer activity remains obscure. In particular, the importance of "syntax" (the order, orientation, and spacing of binding sites) is unclear. A high-throughput screen identified synthetic notochord enhancers that are activated by the combination of ZicL and ETS transcription factors in Ciona embryos. Manipulation of these enhancers elucidated a "regulatory code" of sequence and syntax features for notochord-specific expression. This code enabled in silico discovery of bona fide notochord enhancers, including those containing low-affinity binding sites that would be excluded by standard motif identification methods. One of the newly identified enhancers maps upstream of the known enhancer that regulates Brachyury (Ci-Bra), a key determinant of notochord specification. This newly identified Ci-Bra shadow enhancer contains binding sites with very low affinity, but optimal syntax, and therefore mediates surprisingly strong expression in the notochord. Weak binding sites are compensated by optimal syntax, whereas enhancers containing high-affinity binding affinities possess suboptimal syntax. We suggest this balance has obscured the importance of regulatory syntax, as noncanonical binding motifs are typically disregarded by enhancer detection methods. As a result, enhancers with low binding affinities but optimal syntax may be a vastly underappreciated feature of the regulatory genome.
转录增强子是DNA的短片段,可根据各种内在和外在信号来开启和关闭基因。尽管30多年前就发现了第一个增强子,但初级DNA序列与增强子活性之间的关系仍不清楚。特别是,“语法”(结合位点的顺序、方向和间距)的重要性尚不清楚。一项高通量筛选鉴定出了合成的脊索增强子,它们在海鞘胚胎中被ZicL和ETS转录因子的组合激活。对这些增强子的操作阐明了脊索特异性表达的序列和语法特征的“调控密码”。该密码使得能够通过计算机发现真正的脊索增强子,包括那些含有低亲和力结合位点的增强子,而这些位点会被标准的基序识别方法排除在外。新鉴定出的增强子之一位于已知增强子的上游,该已知增强子调控Brachyury(Ci-Bra),这是脊索特化的关键决定因素。这个新鉴定出的Ci-Bra影子增强子包含亲和力非常低但语法最优的结合位点,因此在脊索中介导了惊人的强表达。弱结合位点通过最优语法得到补偿,而含有高亲和力结合位点的增强子则具有次优语法。我们认为这种平衡掩盖了调控语法的重要性,因为增强子检测方法通常会忽略非规范的结合基序。因此,具有低结合亲和力但语法最优的增强子可能是调控基因组中一个被大大低估的特征。