Davis F C, Shelton J C, Ingham L D
Department of Microbiology and Cell Science, University of Florida, Gainesville 32611-0144.
DNA Seq. 1992;2(4):247-56. doi: 10.3109/10425179209020810.
The 4942 bp nucleotide sequence of a repeating unit from the core histone gene tandem repeat of Urechis caupo and the predicted amino acid sequences of the four core histones are presented. Putative promoter elements, including the CAP site and TATA box as well as multiple CAAT-like sequences, are identified upstream from each gene. Upstream from each core histone gene are 26 or 30 bp sequences that may have a promoter function and appear to be unique to Urechis histone genes. Located 5' to both H2A and H2B is the 26 bp sequence GGTCATGTGACTCTAATACCGCGCTG. An identical, but inverted, 26 bp sequence is present upstream of H4. Upstream from the H3 gene, two regions of a 30 bp sequence, GGTCTTGTGGCGGGAACAAATACCGCAACG, are very similar to corresponding regions of the 26 bp sequence. An additional 10 bp conserved sequence, CAGCGGGCGC, is present only upstream from the H2A and H2B genes. Conserved sequences containing a region of dyad symmetry followed by a purine-rich sequence, typical of histone mRNA termination sites, are present 27 to 36 bp 3' from the termination codon. Short repetitive DNA sequence elements are present in the spacer sequences between the H2A and H3 genes and between the H2B and H4 genes.
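The relationships among the conserved upstream elements can be explored with a few lines of Python. The sketch below is illustrative only and is not the authors' analysis: the three sequences are copied from the abstract, while the helper names (`reverse_complement`, `longest_common_substring`) are hypothetical, and it assumes that "inverted" means the reverse complement (the same element read from the opposite strand), which the abstract does not state explicitly.

```python
# Conserved upstream elements as quoted in the abstract.
SEQ_26 = "GGTCATGTGACTCTAATACCGCGCTG"      # 5' to H2A and H2B
SEQ_30 = "GGTCTTGTGGCGGGAACAAATACCGCAACG"  # 5' to H3
SEQ_10 = "CAGCGGGCGC"                      # 5' to H2A and H2B only

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the sequence as read 5'->3' on the opposite strand."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def longest_common_substring(a: str, b: str) -> str:
    """Longest contiguous run shared by a and b (dynamic programming)."""
    best_len, best_end = 0, 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len, best_end = curr[j], i
        prev = curr
    return a[best_end - best_len:best_end]


# Assumption: the H4 element is the 26 bp sequence on the opposite strand.
h4_element_guess = reverse_complement(SEQ_26)

# One shared region between the 26 bp and 30 bp elements.
shared = longest_common_substring(SEQ_26, SEQ_30)

print(len(SEQ_26), len(SEQ_30), len(SEQ_10))
print(h4_element_guess)
print(shared)
```

The longest exact match is only one of the similar regions; the abstract describes two, and the shared `GGTC` at the 5' end of both elements suggests where the other lies. A proper comparison would use an alignment that tolerates mismatches rather than exact substring matching.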