D'Andrea R, Harvey R, Wells J R
Nucleic Acids Res. 1981 Jul 10;9(13):3119-28. doi: 10.1093/nar/9.13.3119.
The DNA sequence of a chicken genomal fragment containing a histone H2A gene has been determined. It contains extensive 5' and 3' flanking regions and encodes a protein identical in sequence to the histone H2A protein isolated from chicken erythrocytes. In the 5' flanking region, a possible "TATA box" and three possible "cap sites" can be recognised upstream from the initiation codon. To the 5' side of the "TATA box" is found an unusual sequence of 21 A's interrupted by a central G residue. It occupies the same relative position as the P. miliaris H2A gene-specific 5' dyad symmetry sequence and the "CCAAT box" seen in other eukaryotic polymerase II genes but is clearly different from both. A significant feature of the 3' non-coding region is the presence of a 23 base-pair sequence that is nearly identical to a conserved region found in sea urchin histone genes. The coding region is extremely GC rich, with strong selection for these bases in the third position of codons. Not a single coding triplet ends in U. No intervening sequences were found in this gene.
已确定包含组蛋白H2A基因的鸡基因组片段的DNA序列。它包含广泛的5'和3'侧翼区域,并编码一种与从鸡红细胞中分离出的组蛋白H2A蛋白序列相同的蛋白质。在5'侧翼区域,在起始密码子上游可识别出一个可能的“TATA盒”和三个可能的“帽位点”。在“TATA盒”的5'侧发现了一个由21个A组成的不寻常序列,中间被一个G残基打断。它占据的相对位置与粟酒裂殖酵母H2A基因特异性5'二元对称序列以及其他真核生物聚合酶II基因中的“CCAAT盒”相同,但与两者明显不同。3'非编码区的一个显著特征是存在一个23个碱基对的序列,该序列与海胆组蛋白基因中发现的一个保守区域几乎相同。编码区富含GC,在密码子的第三位对这些碱基有强烈的选择。没有一个编码三联体以U结尾。该基因中未发现内含子序列。