Boulikas T
Institute of Molecular Medical Sciences, Palo Alto, California 94306.
J Cell Biochem. 1993 May;52(1):23-36. doi: 10.1002/jcb.240520105.
beta-Globin genes in primates arose during evolution by duplication of an ancestral gene, and their order of arrangement along the DNA is related to their timing of expression during development. We believe that nuclear matrix anchorage sites (MARs) along the beta-globin gene complex considered to be mass binding sites for transcription protein factors, some of which are developmental stage specific and others ubiquitous, play a decisive role in cell memory by determining the developmental stage-specific expression of the genes. The AT-rich class of MARs appears to possess a significant number of ATTA and ATTTA motifs known to be mass binding sites for homeodomain proteins that determine body formation in development. MARs also appear to harbor origins of replication, to be enriched in inverted repeats (dyad symmetry motifs) and were proposed to include the DNase I hypersensitive sites of a particular gene determined at the chromatin level. This study is an attempt to finely identify MARs at the nucleotide level along the beta-globin gene complex. Searches of a contiguous stretch of about 73.3 kb of human sequences comprising and surrounding the epsilon, gamma G-, gamma A-, delta-, and beta-globin genes of the human beta-globin gene complex for homeotic protein binding sites as well as for inverted repeats has shown that these elements are clustered nonrandomly at particular sites within the beta-globin gene complex. These sites are presumed to be the AT-rich class of MARs of the beta-globin gene complex. The inverted repeats which are characteristic of origins of replication and some promoter/enhancer regions and the homeotic protein sites seem to include the DNase I hypersensitive sites of the gene complex. Indeed, dyad symmetry sequences are present close to the four DNase I HS sites in the locus control region (LCR) of the gene complex as well as in the 5' flanking regions and the large introns of the delta- and beta-globin genes. A search of the putative MAR regions of the gene complex suggests that, in addition to their enrichment in ATTA motifs, palindromes, and DNase I hypersensitive sites, these regions may comprise TG-rich motifs and potential Z-DNA as well as polypurine and polypyrimidine blocks. From the positions of palindromes and clusters of homeodomain protein sites along the complex we propose that an extended origin of replication able to initiate at several sites is present in the LCR and two others surrounding the delta- and beta-globin genes.(ABSTRACT TRUNCATED AT 400 WORDS)
灵长类动物的β-珠蛋白基因在进化过程中通过一个祖先基因的复制而产生,它们沿着DNA的排列顺序与它们在发育过程中的表达时间相关。我们认为,β-珠蛋白基因复合体上的核基质锚定位点(MARs)被认为是转录蛋白因子的大量结合位点,其中一些是发育阶段特异性的,另一些是普遍存在的,它们通过决定基因的发育阶段特异性表达在细胞记忆中起决定性作用。富含AT的MARs类似乎拥有大量已知是同源域蛋白大量结合位点的ATTA和ATTTA基序,这些同源域蛋白在发育过程中决定身体的形成。MARs似乎还含有复制起点,富含反向重复序列(二元对称基序),并且有人提出它们包括在染色质水平确定的特定基因的DNase I超敏位点。本研究试图在核苷酸水平上精细地鉴定β-珠蛋白基因复合体上的MARs。在人类β-珠蛋白基因复合体的ε、γG-、γA-、δ-和β-珠蛋白基因及其周围约73.3 kb的连续人类序列中搜索同源异型蛋白结合位点以及反向重复序列,结果表明这些元件在β-珠蛋白基因复合体内的特定位点非随机聚集。这些位点被推测为β-珠蛋白基因复合体的富含AT的MARs类。作为复制起点以及一些启动子/增强子区域特征的反向重复序列和同源异型蛋白位点似乎包括基因复合体的DNase I超敏位点。事实上,在基因复合体的基因座控制区(LCR)以及δ-和β-珠蛋白基因的5'侧翼区域和大内含子中的四个DNase I HS位点附近存在二元对称序列。对基因复合体假定的MAR区域的搜索表明,除了富含ATTA基序、回文序列和DNase I超敏位点外,这些区域可能还包括富含TG基序和潜在的Z-DNA以及聚嘌呤和聚嘧啶区段。根据沿着复合体的回文序列和同源域蛋白位点簇的位置,我们提出在LCR以及围绕δ-和β-珠蛋白基因的另外两个区域中存在一个能够在多个位点起始的扩展复制起点。(摘要截短至400字)