Pares-Matos Elsie I, Milligan Jason S, Bina Minou
Department of Chemistry, Purdue University, West Lafayette, IN 47907, USA.
J Mol Biol. 2006 Mar 31;357(3):732-47. doi: 10.1016/j.jmb.2005.12.071. Epub 2006 Jan 6.
We examined several DNA segments upstream of the transcription start site of the human NF-IL6 gene to evaluate the predictions of two computational models developed to identify potential regulatory elements in the non-coding regions of genes. One model, comparative genomics, is based on the hypothesis that functional regulatory sequences can be localized in alignments of genomic DNA from several species. The other model is based on the hypothesis that protein-binding sites in genomic DNA may include sequence elements that occur frequently in proximal promoters of genes. The segments selected for DNA binding and functional evaluations included: (1) two conserved regions identified in multi-species sequence alignments; (2) a region containing several localized hits with 9-mers that ranked highly in studies of proximal promoters of human genes; and (3) two regions that were either GC-rich and/or contained tracts of G. The assays were done under nearly identical experimental conditions, using a cell line (U937) representing human monocytes/macrophages. The experiments also aimed at evaluating what effect, if any, cellular stimulation could have on the interactions of nuclear proteins with naturally occurring GC-rich elements in a human genomic DNA. In DNA binding assays, several complexes were formed with the conserved regions identified in multi-species sequence alignment. Furthermore, these regions were active in functional assays. The region containing several matches with 9-mers derived from proximal promoters of human genes was not conserved but formed several complexes with nuclear proteins including Sp1, Egr-1, and an unidentified protein. In addition, this region was active in functional assays and responded to cellular stimulations. Overall, the results of the assays suggest an important role for the sequence context of genomic DNA in protein binding and selection.
我们检测了人类NF-IL6基因转录起始位点上游的几个DNA片段,以评估为识别基因非编码区潜在调控元件而开发的两种计算模型的预测结果。一种模型是比较基因组学,其基于这样的假设:功能调控序列可定位在几个物种的基因组DNA比对中。另一种模型基于这样的假设:基因组DNA中的蛋白质结合位点可能包括在基因近端启动子中频繁出现的序列元件。选择用于DNA结合和功能评估的片段包括:(1)在多物种序列比对中鉴定出的两个保守区域;(2)一个包含几个与9聚体局部匹配的区域,这些9聚体在人类基因近端启动子研究中排名靠前;(3)两个富含GC和/或含有G序列的区域。实验在几乎相同的实验条件下进行,使用代表人类单核细胞/巨噬细胞的细胞系(U937)。这些实验还旨在评估细胞刺激对核蛋白与人类基因组DNA中天然存在的富含GC元件相互作用可能产生的影响(如果有)。在DNA结合实验中,与多物种序列比对中鉴定出的保守区域形成了几种复合物。此外,这些区域在功能实验中具有活性。包含几个与源自人类基因近端启动子的9聚体匹配的区域并不保守,但与包括Sp1、Egr-1和一种未鉴定蛋白质在内的核蛋白形成了几种复合物。此外,该区域在功能实验中具有活性,并对细胞刺激有反应。总体而言,实验结果表明基因组DNA的序列背景在蛋白质结合和选择中起重要作用。