van Steensel Bas, Delrow Jeffrey, Bussemaker Harmen J
Netherlands Cancer Institute, 1066 CX, Amsterdam, The Netherlands.
Proc Natl Acad Sci U S A. 2003 Mar 4;100(5):2580-5. doi: 10.1073/pnas.0438000100. Epub 2003 Feb 24.
The association of sequence-specific DNA-binding factors with their cognate target sequences in vivo depends on the local molecular context, yet this context is poorly understood. To address this issue, we have performed genomewide mapping of in vivo target genes of Drosophila GAGA factor (GAF). The resulting list of approximately 250 target genes indicates that GAF regulates many cellular pathways. We applied unbiased motif-based regression analysis to identify the sequence context that determines GAF binding. Our results confirm that GAF selectively associates with (GA)(n) repeat elements in vivo. GAF binding occurs in upstream regulatory regions, but less in downstream regions. Surprisingly, GAF binds abundantly to introns but is virtually absent from exons, even though the density of (GA)(n) is roughly the same. Intron binding occurs equally frequently in last introns compared with first introns, suggesting that GAF may not only regulate transcription initiation, but possibly also elongation. We provide evidence for cooperative binding of GAF to closely spaced (GA)(n) elements and explain the lack of GAF binding to exons by the absence of such closely spaced GA repeats. Our approach for revealing determinants of context-dependent DNA binding will be applicable to many other transcription factors.
序列特异性DNA结合因子在体内与其同源靶序列的关联取决于局部分子环境,但对这种环境的了解却很少。为了解决这个问题,我们对果蝇GAGA因子(GAF)的体内靶基因进行了全基因组定位。大约250个靶基因的结果列表表明,GAF调节许多细胞途径。我们应用基于基序的无偏回归分析来确定决定GAF结合的序列环境。我们的结果证实,GAF在体内选择性地与(GA)n重复元件结合。GAF结合发生在上游调控区域,但在下游区域较少。令人惊讶的是,GAF大量结合内含子,但外显子中几乎没有,尽管(GA)n的密度大致相同。与第一个内含子相比,最后一个内含子中内含子结合的频率相同,这表明GAF可能不仅调节转录起始,还可能调节延伸。我们提供了GAF与紧密间隔的(GA)n元件协同结合的证据,并通过缺乏这种紧密间隔的GA重复来解释GAF与外显子结合的缺乏。我们揭示上下文依赖性DNA结合决定因素的方法将适用于许多其他转录因子。