Zhou K, Brisco P R, Hinkkanen A E, Kohlhaw G B
Nucleic Acids Res. 1987 Jul 10;15(13):5261-73. doi: 10.1093/nar/15.13.5261.
Determination of the nucleotide sequence of a DNA region from Saccharomyces cerevisiae previously shown to contain the LEU3 gene revealed one long open reading frame (ORF) whose 887 codons predict the existence of a protein with a molecular mass of 100,162 daltons. The codon bias index of 0.02 suggests that LEU3 encodes a low-abundance protein. The predicted amino acid sequence contains a stretch of 31 residues near the N-terminus that is rich in cysteines and basic amino acids and shows strong homology to similar regions in five other regulatory proteins of lower eukaryotes. Additional regions with a predominance of basic amino acids are present adjacent to the cysteine-rich region. A stretch of 20 residues, 19 of which are Glu or Asp, is found in the carboxy-terminal quarter of the protein. The 5' flanking region of LEU3 contains a TATA box 111 bp upstream from the beginning of the long ORF and two transcription initiation elements (5'TCAA3') 58 and 48 bp upstream from the ORF. The 3' flanking region shows a tripartite potential termination-polyadenylation signal. The predicted 5' and 3' ends of the transcript are in very good agreement with the previously determined size of the LEU3 message. Analysis of a LEU3'-'lacZ translational fusion suggests that the LEU3 gene, whose product is involved in the specific regulation of the leucine and possibly the isoleucine-valine pathways, is itself under general amino acid control. Consistent with this observation is the finding that the 5' flanking region of LEU3 contains two perfect copies of the general control target sequence 5'TGACTC3'.
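As an illustration of how perfect copies of a short target sequence such as 5'TGACTC3' can be located in a 5' flanking region, the following is a minimal Python sketch. It is not the method used by the authors. The function names and the `toy_flank` sequence are hypothetical and invented for the example; the toy sequence is not the LEU3 flanking sequence. The sketch assumes the flanking sequence ends immediately before the first base of the ORF, and reports each hit as the distance of its leftmost base upstream of the ORF.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def find_motif(seq: str, motif: str) -> list[tuple[int, str]]:
    """Find perfect copies of `motif` on either strand of `seq`.

    Returns a sorted list of (0-based start on the given strand, strand),
    where strand is "+" for the motif as written and "-" for its
    reverse complement. Overlapping matches are included.
    """
    seq = seq.upper()
    motif = motif.upper()
    patterns = [("+", motif)]
    rc = reverse_complement(motif)
    if rc != motif:  # avoid reporting palindromic motifs twice
        patterns.append(("-", rc))

    hits = []
    for strand, pattern in patterns:
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)


def upstream_distance(flank_length: int, start: int) -> int:
    """Distance (bp) from the leftmost base of a hit to the ORF start.

    Assumes the flanking sequence ends immediately before the ORF.
    """
    return flank_length - start


# Hypothetical toy sequence, NOT the real LEU3 5' flanking region.
toy_flank = "AATGACTCGGTATAAACCGAGTCATT"

hits = find_motif(toy_flank, "TGACTC")
for start, strand in hits:
    distance = upstream_distance(len(toy_flank), start)
    print(f"strand {strand}: starts {distance} bp upstream of the ORF")
```

Searching both strands is a design choice of the sketch: a target sequence can be read from either strand, so the reverse complement (GAGTCA for TGACTC) is treated as a perfect copy as well.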