Allison R, Johnston R E, Dougherty W G
Department of Plant Pathology, North Carolina State University, Raleigh, North Carolina 27695, USA.
Virology. 1986 Oct 15;154(1):9-20. doi: 10.1016/0042-6822(86)90425-3.
The complete nucleotide sequence of the tobacco etch virus (TEV) RNA genome has been determined, except for the nucleotide(s) present at the extreme 5' terminus. The assembled TEV genomic sequence is 9496 nucleotides in length, followed by a polyadenylated tract ranging from 20 to 140 residues. A computer search of the sequence reveals the following. A 5' untranslated region, rich in adenosine and uridine, is present between nucleotides 1 and 144. A putative initiation codon, at nucleotides 145-147, marks the beginning of a large open reading frame (ORF), which ends with an opal (UGA) termination codon at positions 9307-9309. A 186-nucleotide untranslated region is present between the termination codon of the ORF and the beginning of the 3' polyadenylated region. The predicted translation product of this ORF is a 3054-amino-acid polyprotein with a molecular weight of 345,943. A function for the large (54,000 Mr) nuclear inclusion protein is suggested by a comparison of the deduced amino acid sequence with a protein data bank. This protein displays biochemical similarities to other viral RNA-dependent RNA polymerases.
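The coordinates quoted above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original work: the function name `orf_summary` and the constant names are hypothetical, and it assumes 1-based inclusive coordinates with the termination codon counted inside the ORF span.

```python
# Coordinates as quoted in the abstract (assumed 1-based, inclusive).
GENOME_LENGTH = 9496      # assembled sequence, excluding the poly(A) tract
ORF_START = 145           # first base of the putative initiation codon
ORF_END = 9309            # last base of the UGA termination codon
POLYPROTEIN_MW = 345_943  # predicted molecular weight from the abstract


def orf_summary(start: int, end: int) -> dict:
    """Return ORF length, codon count, and amino acid count (hypothetical helper)."""
    length = end - start + 1              # inclusive span in nucleotides
    if length % 3 != 0:
        raise ValueError("ORF span is not a whole number of codons")
    codons = length // 3                  # includes the termination codon
    return {
        "orf_nt": length,
        "codons": codons,
        "amino_acids": codons - 1,        # the stop codon encodes no residue
    }


summary = orf_summary(ORF_START, ORF_END)
five_prime_utr = ORF_START - 1            # nucleotides 1 through 144

# Naive count of bases after the stop codon. The abstract quotes 186 and does
# not state its counting convention, so this value is reported, not asserted.
bases_after_stop = GENOME_LENGTH - ORF_END

# Average mass per residue implied by the quoted molecular weight.
mean_residue_mass = POLYPROTEIN_MW / summary["amino_acids"]

print(summary)
print(five_prime_utr, bases_after_stop, round(mean_residue_mass, 1))
```

Under these assumptions the span 145-9309 is a whole number of codons and yields the 3054 residues stated above, with an implied average residue mass a little over 113.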