Murchie M J, McGeoch D J
J Gen Virol. 1982 Sep;62 (Pt 1):1-15. doi: 10.1099/0022-1317-62-1-1.
The nucleotide sequence of 4 kilobases of DNA from within the short region of the genome of herpes simplex virus type 1 has been determined. This portion of DNA contains the junctions of the terminal and inverted repeat sequence components with the unique sequence component and the 5'-regions of the genes which encode the Vmw 12, Vmw 68 and Vmw 175 immediate-early polypeptides. The transcription and translation initiation sites of all three genes and the 5' and 3' boundaries of the Vmw 12 and Vmw 68 gene introns have been localized on the DNA sequence and shown to be flanked by sequences which resemble those found in similar positions in other eukaryotic genes. For the Vmw 12 and Vmw 68 genes the promoters, the 5'-non-coding regions of the mRNAs, and the introns lie within the terminal and internal inverted repeats respectively while the polypeptide-coding regions lie in the short unique component. The introns consist largely of tandemly reiterated copies of a 22-nucleotide sequence: the Vmw 12 gene intron contains seven of these and the Vmw 68 gene intron five. The Vmw 175 gene is located entirely within the short repeats, of which those regions sequenced here have the extremely high G + C content of 78%, in marked contrast to the value of 66% obtained for the adjacent region of the unique sequence component. Prediction of the complete amino acid sequence of the Vmw 12 polypeptide, accounting for a mol. wt. of 9830, and of the first 523 amino-terminal amino acids of the Vmw 175 polypeptide has been made from the DNA sequence. The polypeptide-coding region of the Vmw 175 gene has an average G + C content of 80% but nevertheless specifies a wide range of amino acid types because of the maximal assignment of G and C residues to colon third base positions.
已确定单纯疱疹病毒1型基因组短区域内4千碱基DNA的核苷酸序列。这段DNA包含末端和反向重复序列成分与独特序列成分的连接处,以及编码Vmw 12、Vmw 68和Vmw 175立即早期多肽的基因的5'区域。这三个基因的转录和翻译起始位点以及Vmw 12和Vmw 68基因内含子的5'和3'边界已定位在DNA序列上,并显示其两侧的序列类似于在其他真核基因相似位置发现的序列。对于Vmw 12和Vmw 68基因,启动子、mRNA的5'非编码区和内含子分别位于末端和内部反向重复序列内,而多肽编码区位于短独特成分中。内含子主要由一个22核苷酸序列的串联重复拷贝组成:Vmw 12基因内含子包含七个这样的拷贝,Vmw 68基因内含子包含五个。Vmw 175基因完全位于短重复序列内,此处测序的那些区域的G + C含量极高,为78%,这与独特序列成分相邻区域获得的66%的值形成鲜明对比。已根据DNA序列预测了Vmw 12多肽的完整氨基酸序列,其分子量为9830,以及Vmw 175多肽的前523个氨基末端氨基酸。Vmw 175基因的多肽编码区平均G + C含量为80%,但由于G和C残基在密码子第三位的最大分配,它仍指定了广泛的氨基酸类型。