Horowitz S, Bowen J K, Bannon G A, Gorovsky M A
Nucleic Acids Res. 1987 Jan 12;15(1):141-60. doi: 10.1093/nar/15.1.141.
The complete DNA sequence is presented of H4-II, the second of the pair of histone H4 genes of the ciliated protozoan, Tetrahymena thermophila. Both H4 genes code for the same protein. Codon usage in these and other Tetrahymena genes is severely restricted and is similar to that in yeast. Flanking regions are AT-rich (greater than or equal to 75%), relative to coding sequences (approximately 45% GC). Except for small, similarly positioned homologies, flanking sequences of the two genes are different. Canonical sequences in higher eukaryotic promoters are not obvious in these genes. Instead, short, localized, base composition eccentricities characterize the 5' flanking sequences of all Tetrahymena genes analyzed. The consensus, P yP u(A)3-4 ATGG initiates translation in these and all other known Tetrahymena genes. Nuclear transcripts and messages of both growing and starved cells begin at multiple sites, mainly at the first or second A residue following a pyrimidine. The palindrome typical of histone message 3' termini in higher organisms is not present. Downstream of both genes are sequences similar to the processing/polyadenylation signal of higher eukaryotes, although the unique 3' ends are not those predicted by the location of the signals.
本文展示了嗜热栖热四膜虫纤毛虫组蛋白H4基因对中第二个基因H4-II的完整DNA序列。两个H4基因编码相同的蛋白质。这些基因以及其他四膜虫基因的密码子使用受到严格限制,与酵母中的情况相似。相对于编码序列(约45%的GC),侧翼区域富含AT(大于或等于75%)。除了小的、位置相似的同源性外,两个基因的侧翼序列不同。在这些基因中,高等真核生物启动子中的典型序列并不明显。相反,所分析的所有四膜虫基因的5'侧翼序列的特征是短的、局部的碱基组成偏心。共有序列PyPu(A)3-4ATGG启动了这些基因以及所有其他已知四膜虫基因的翻译。生长细胞和饥饿细胞的核转录本和信使RNA从多个位点开始,主要在嘧啶后的第一个或第二个A残基处。高等生物中组蛋白信使RNA 3'末端典型的回文结构不存在。两个基因的下游都有与高等真核生物加工/聚腺苷酸化信号相似的序列,尽管独特的3'末端并非由信号位置所预测的那些。