Engler J A, Hoppe M S, van Bree M P
Gene. 1983 Jan-Feb;21(1-2):145-59. doi: 10.1016/0378-1119(83)90156-7.
The nucleotide sequence of a cloned DNA segment encoding the early region 2b from the group B human adenovirus Ad7 has been determined. When compared to Ad2, a group C adenovirus, these sequences were found to be approx. 80% homologous within the l-strand gene-coding regions. Most changes are transitions or transversions, although several deletions/insertions also occur within the N-terminal domain of one of the coding regions. The substantial nucleotide homology results in a high degree of amino acid conservation in the predicted polypeptides encoded by the early region 2b genes. Two major open reading frames, corresponding to the Mr 87000 and Mr 140000 polypeptides of Ad2, are found in the l strand of Ad7 between genome coordinates 28.5 to 23.1 and 13.8, respectively. The r strand of the DNA in this region encodes the three leader segments joined to the 5' end of the most late viral mRNAs, and also encodes the i-leader segment found between the second and third leaders on some mRNAs. The positions of the donor and acceptor splice sites of the three leaders are conserved and can be identified by homology to Ad2. Only two of the unidentified open reading frames (URF) in Ad2 (Gingeras et al., J. Biol. Chem., in press) can be found in Ad7. URF1, encoding an Mr 13500 polypeptide at genome coordinate 17, is predominantly conserved in nucleotide and amino acid sequence, but contains one half as many arginine amino acids as does URF1 of Ad2. URF2, encoding an Mr 13600 protein which lies within the i-leader region, is not well conserved in either nucleotide or amino acid sequence.
编码B组人类腺病毒Ad7早期区域2b的一段克隆DNA片段的核苷酸序列已被确定。与C组腺病毒Ad2相比,发现这些序列在l链基因编码区域内约80%同源。大多数变化是转换或颠换,尽管在一个编码区域的N端结构域内也发生了一些缺失/插入。显著的核苷酸同源性导致早期区域2b基因编码的预测多肽中氨基酸高度保守。在Ad7的l链中分别在基因组坐标28.5至23.1和13.8之间发现了两个主要的开放阅读框,分别对应于Ad2的Mr 87000和Mr 140000多肽。该区域DNA的r链编码连接到大多数晚期病毒mRNA 5'端的三个前导序列,还编码在一些mRNA的第二个和第三个前导序列之间发现的i前导序列。三个前导序列的供体和受体剪接位点的位置是保守的,并且可以通过与Ad2的同源性来鉴定。在Ad7中只能找到Ad2中两个未鉴定的开放阅读框(URF)(Gingeras等人,《生物化学杂志》,即将发表)。URF1在基因组坐标17处编码一个Mr 13500多肽,在核苷酸和氨基酸序列上主要是保守的,但精氨酸氨基酸的数量只有Ad2的URF1的一半。URF2编码一个位于i前导区域内的Mr 13600蛋白,在核苷酸或氨基酸序列上都不太保守。