Hérissé J, Rigolet M, de Dinechin S D, Galibert F
Nucleic Acids Res. 1981 Aug 25;9(16):4023-42. doi: 10.1093/nar/9.16.4023.
The entire nucleotide sequence between coordinates 89.5 and 100% of the Ad 2 DNA genome has been determined using the Maxam and Gilbert method. This sequence of 3766 bp contains information relative to the carboxylic end of the fiber protein and to the entire E4 region. The position within the nucleotide sequence of various open reading frames and of several consensus splicing sequences was correlated with the location by EM and Sl digestion of the E4 mRNA. This correlation allows to suggest an additional splicing event in the maturation process of i or f mRNA and to deduce the structure of most E4 mRNA. The aminoacid sequences of the corresponding proteins are deduced allowing the location of several glycosylation sites. The presence of several open reading frames with a substantial coding capacity permits to postulate on the existence of additional genes located at the 3' end of the fiber gene and the 3' end of the E4 region. The existence of these putative additional genes might explain that termination of transcription is several hundred nucleotides beyond the main known poly A addition sites of the L5 and E4 regions.
利用马克萨姆和吉尔伯特方法确定了腺病毒2型(Ad 2)DNA基因组坐标89.5%至100%之间的完整核苷酸序列。这段3766 bp的序列包含与纤维蛋白羧基末端及整个E4区域相关的信息。通过电子显微镜(EM)和Sl核酸酶消化E4 mRNA,将各种开放阅读框和几个共有剪接序列在核苷酸序列中的位置与它们的定位相关联。这种关联使得我们能够推测i或f mRNA成熟过程中存在额外的剪接事件,并推断出大多数E4 mRNA的结构。由此推导了相应蛋白质的氨基酸序列,确定了几个糖基化位点的位置。存在几个具有相当编码能力的开放阅读框,这使得我们能够推测在纤维基因的3'端和E4区域的3'端存在其他基因。这些假定的额外基因的存在可能解释了转录终止发生在L5和E4区域主要已知的多聚腺苷酸添加位点之后数百个核苷酸处。