Venkatesan S, Gershowitz A, Moss B
J Virol. 1982 Nov;44(2):637-46. doi: 10.1128/JVI.44.2.637-646.1982.
The proximal part of the 10,000-base pair (bp) inverted terminal repetition of vaccinia virus DNA encodes at least three early mRNAs. A 2,236-bp segment of the repetition was sequenced to characterize two of the genes. This task was facilitated by constructing a series of recombinants containing overlapping deletions; oligonucleotide linkers with synthetic restriction sites provided points for radioactive labeling before sequencing by the chemical degradation method of Maxam and Gilbert (Methods Enzymol. 65:499-560, 1980). The ends of the transcripts were mapped by hybridizing labeled DNA fragments to early viral RNA and resolving nuclease S1-protected fragments in sequencing gels, by sequencing cDNA clones, and from the lengths of the RNAs. The nucleotide sequences for at least 60 bp upstream of both transcriptional initiation sites are more than 80% adenine . thymine rich and contain long runs of adenines and thymines with some homology to procaryotic and eucaryotic consensus sequences. The gene transcribed in the rightward direction encodes an RNA of approximately 530 nucleotides with a single open reading frame of 420 nucleotides. Preceding the first AUG, there is a heptanucleotide that can hybridize to the 3' end of 18S rRNA with only one mismatch. The derived amino acid sequence of the protein indicated a molecular weight of 15,500. The gene transcribed in the leftward direction encodes an RNA 1,000 to 1,100 nucleotides long with an open reading frame of 996 nucleotides and a leader sequence of only 5 to 6 nucleotides. The derived amino acid sequence of this protein indicated a molecular weight of 38,500. The 3' ends of the two transcripts were located within 100 bp of each other. Although there are adenine . thymine-rich clusters near the putative transcriptional termination sites, specific AATAAA polyadenylic acid signal sequences are absent.
痘苗病毒DNA的10,000碱基对(bp)反向末端重复序列的近端部分至少编码三种早期mRNA。对该重复序列的一个2,236 bp片段进行了测序,以鉴定其中的两个基因。构建一系列含有重叠缺失的重组体有助于完成这项任务;带有合成限制位点的寡核苷酸接头为通过Maxam和Gilbert的化学降解法测序(《酶学方法》65:499 - 560, 1980)之前的放射性标记提供了位点。通过将标记的DNA片段与早期病毒RNA杂交,并在测序凝胶中解析核酸酶S1保护的片段、对cDNA克隆进行测序以及根据RNA的长度来确定转录本的末端。两个转录起始位点上游至少60 bp的核苷酸序列中腺嘌呤和胸腺嘧啶含量超过80%,并且含有长串的腺嘌呤和胸腺嘧啶,与原核生物和真核生物的共有序列有一些同源性。向右转录的基因编码一个约530个核苷酸的RNA,带有一个420个核苷酸的单一开放阅读框。在第一个AUG之前,有一个七核苷酸,它与18S rRNA的3'端杂交时只有一个错配。推导的蛋白质氨基酸序列表明分子量为15,500。向左转录的基因编码一个1,000至1,100个核苷酸长的RNA,有一个996个核苷酸的开放阅读框和仅5至6个核苷酸的前导序列。推导的该蛋白质氨基酸序列表明分子量为38,500。两个转录本的3'端彼此相距在100 bp之内。尽管在假定的转录终止位点附近有富含腺嘌呤和胸腺嘧啶的簇,但不存在特定的AATAAA聚腺苷酸信号序列。