Smith M M, Andrésson O S
J Mol Biol. 1983 Sep 25;169(3):663-90. doi: 10.1016/s0022-2836(83)80164-8.
The complete DNA sequences of two loci encoding H3 and H4 histones in Saccharomyces cerevisiae have been determined. Each locus contains one H3 and one H4 gene. The genes at each locus are divergently transcribed and the coding sequences are separated by 646 base-pairs at one locus and 676 base-pairs at the other. The H3 genes code for identical histone H3 proteins and the H4 genes code for identical histone H4 proteins. The yeast proteins differ from histones H3 and H4 of calf by 15 and 8 amino acid substitutions, respectively, and these differences are largely confined to the carboxy-terminal halves of the proteins. The genes demonstrate a bias in synonymous codon usage similar to that noted for other yeast genes. This bias is confined to the coding sequences of the genes and is specific for the reading frame encoding the proteins. The coding sequence of each gene is flanked on both sides by DNA with an A + T content of 70 to 80%. Possible regulatory sequences are located relative to the 5' and 3'-termini of the histone H3 and H4 RNA transcripts.
已测定酿酒酵母中两个编码H3和H4组蛋白的基因座的完整DNA序列。每个基因座包含一个H3基因和一个H4基因。每个基因座上的基因呈反向转录,两个基因座的编码序列分别被646个碱基对和676个碱基对隔开。H3基因编码相同的组蛋白H3蛋白,H4基因编码相同的组蛋白H4蛋白。酵母蛋白与小牛的组蛋白H3和H4分别有15个和8个氨基酸替换,这些差异主要局限于蛋白质的羧基末端部分。这些基因在同义密码子使用上表现出一种偏好,类似于其他酵母基因的情况。这种偏好局限于基因的编码序列,并且对于编码蛋白质的阅读框是特异性的。每个基因的编码序列两侧的DNA的A+T含量为70%至80%。可能的调控序列相对于组蛋白H3和H4 RNA转录本的5'和3'末端定位。