Bieda Mark, Xu Xiaoqin, Singer Michael A, Green Roland, Farnham Peggy J
Department of Pharmacology and the Genome Center, University of California-Davis, Davis, California 95616, USA.
Genome Res. 2006 May;16(5):595-605. doi: 10.1101/gr.4887606. Epub 2006 Apr 10.
The E2F family of transcription factors regulates basic cellular processes. Here, we take an unbiased approach towards identifying E2F1 target genes by examining localization of E2F1-binding sites using high-density oligonucleotide tiling arrays. To begin, we developed a statistically-based methodology for analysis of ChIP-chip data obtained from arrays that represent 30 Mb of the human genome. Using this methodology, we identified regions bound by E2F1, MYC, and RNA Polymerase II (POLR2A). We found a large number of binding sites for all three factors; extrapolation suggests there may be approximately 20,000-30,000 E2F1- and MYC-binding sites and approximately 12,000-17,000 active promoters in HeLa cells. In contrast to our results for MYC, we find that the majority of E2F1-binding sites (>80%) are located in core promoters and that 50% of the sites overlap transcription starts. Only a small fraction of E2F1 sites possess the canonical binding motif. Surprisingly, we found that approximately 30% of genes in the 30-Mb region possessed an E2F1 binding site in a core promoter and E2F1 was bound near to 83% of POLR2A-bound sites. To determine if these results were representative of the entire human genome, we performed ChIP-chip analyses of approximately 24,000 promoters and confirmed that greater than 20% of the promoters were bound by E2F1. Our results suggest that E2F1 is recruited to promoters via a method distinct from recognition of the known consensus site and point toward a new understanding of E2F1 as a factor that contributes to the regulation of a large fraction of human genes.
转录因子E2F家族调控基本的细胞过程。在此,我们采用一种无偏向性的方法,通过使用高密度寡核苷酸平铺阵列检查E2F1结合位点的定位来鉴定E2F1靶基因。首先,我们开发了一种基于统计学的方法,用于分析从代表30 Mb人类基因组的阵列获得的ChIP-chip数据。使用这种方法,我们鉴定了E2F1、MYC和RNA聚合酶II(POLR2A)结合的区域。我们发现这三种因子都有大量的结合位点;外推表明,在HeLa细胞中可能有大约20,000 - 30,000个E2F1和MYC结合位点以及大约12,000 - 17,000个活性启动子。与我们关于MYC的结果相反,我们发现大多数E2F1结合位点(>80%)位于核心启动子中,并且50%的位点与转录起始位点重叠。只有一小部分E2F1位点具有典型的结合基序。令人惊讶的是,我们发现30 Mb区域中约30%的基因在核心启动子中具有E2F1结合位点,并且E2F1在接近83%的POLR2A结合位点处结合。为了确定这些结果是否代表整个人类基因组,我们对大约24,000个启动子进行了ChIP-chip分析,并证实超过20%的启动子被E2F1结合。我们的结果表明,E2F1通过一种不同于识别已知共有位点的方法被招募到启动子上,这为将E2F1理解为一种有助于调控大部分人类基因的因子提供了新的认识。