Hérissé J, Courtois G, Galibert F
Nucleic Acids Res. 1980 May 24;8(10):2173-92. doi: 10.1093/nar/8.10.2173.
The entire nucleotide sequence of the Ad. 2 EcoRI D fragment has been determined using the Maxam and Gilbert method. This sequence of 2678 bp contains informations relative to late mRNAs ending at position 78 and for which an AATAAA sequence corresponding to their 3' ends is found at residue number 833. Position of the PVIII mRNA is determined thus allowing deduction of the probable amino acid sequence of the PVIII protein. The position and the sequence of the first leader of early 3 mRNAs is determined as well as the sequence and position of the second early leader of region 3 mRNAs, which also correspond to the "y" leader of the fiber mRNA. Following the localization of an open reading frame in which an ATG could initiate protein synthesis it can be predicted that 3a, b, c mRNAs code for the 16K early protein and the probable amino acid sequence of this protein can be deduced. The CAGTTT sequence frequently present at the 5' end of a leader or of a mRNA body as well as the GGTGAG sequence which is found at the 3' end of several leaders were used to postulate the position of various early mRNAs of region 3 and to suggest the existence of an additional splicing event during the processing of mRNAs 3a, b and c. They were also used to predict the position of the additional "x" late leaders. The imbrication of information concerning (i) the family of late mRNAs ending at position 78, (ii) the position of the "x" leader and the "y" leader and (iii) the beginning of early region 3 is also depicted.
利用马克萨姆和吉尔伯特方法确定了腺病毒2型EcoRI D片段的完整核苷酸序列。这段2678 bp的序列包含了与在78位终止的晚期mRNA相关的信息,并且在833位残基处发现了一个对应于其3'末端的AATAAA序列。由此确定了PVIII mRNA的位置,从而可以推断出PVIII蛋白可能的氨基酸序列。还确定了早期3 mRNA的第一个前导序列的位置和序列,以及3区mRNA的第二个早期前导序列的序列和位置,后者也对应于纤维mRNA的“y”前导序列。在定位了一个开放阅读框(其中ATG可启动蛋白质合成)之后,可以预测3a、b、c mRNA编码16K早期蛋白,并可推断出该蛋白可能的氨基酸序列。经常出现在前导序列或mRNA主体5'末端的CAGTTT序列,以及在几个前导序列3'末端发现的GGTGAG序列,被用于推测3区各种早期mRNA的位置,并暗示在3a、b和c mRNA加工过程中存在额外的剪接事件。它们还被用于预测额外的“x”晚期前导序列的位置。还描绘了关于(i)在78位终止的晚期mRNA家族、(ii)“x”前导序列和“y”前导序列的位置以及(iii)早期3区起始的信息的交织情况。