Perry L J, McGeoch D J
MRC Virology Unit, University of Glasgow, U.K.
J Gen Virol. 1988 Nov;69 ( Pt 11):2831-46. doi: 10.1099/0022-1317-69-11-2831.
We have determined the DNA sequence of the long repeat region (RL) in the genome of herpes simplex virus type 1 (HSV-1) strain 17, as 9215 bp of composition 71.6% G + C. In addition, the sequences of parts of the long unique region (UL) adjacent to the terminal (TRL) and internal (IRL) copies of RL were determined (2611 and 3836 bp, respectively). Gene organization in these regions of UL was deduced from the sequences and other available data. It was proposed that the region of UL sequenced, adjacent to TRL, contains three complete genes, none with significant previous characterization, and that the region of UL adjacent to IRL also contains three genes, one encoding the immediate early protein IE63. The RL sequence contains one well characterized gene, for the protein IE110, whose organization we have described previously. Between the downstream end of the IE110 gene and UL there is a 3500 bp segment of RL in which we did not find convincing protein-coding sequences, and which thus remains of obscure functionality. Upstream of the IE110 gene is a region previously proposed by others to contain a gene. However, our sequence data are not compatible with their interpretation. We do consider it possible that the region is protein-coding, but regard gene organization here as still unresolved.
我们已确定了单纯疱疹病毒1型(HSV-1)17株基因组中长重复区域(RL)的DNA序列,长度为9215 bp,G + C含量为71.6%。此外,还确定了与RL的末端(TRL)和内部(IRL)拷贝相邻的长单一区域(UL)部分的序列(分别为2611和3836 bp)。从这些序列和其他现有数据中推导了UL这些区域的基因组织。有人提出,测序的UL区域,与TRL相邻,包含三个完整基因,之前均未得到显著表征,且与IRL相邻的UL区域也包含三个基因,其中一个编码立即早期蛋白IE63。RL序列包含一个已充分表征的基因——编码蛋白IE110的基因,我们之前已描述过其结构。在IE110基因的下游末端与UL之间有一段3500 bp的RL片段,我们在其中未找到令人信服的蛋白质编码序列,因此其功能仍不清楚。在IE110基因的上游是一个其他人之前提出含有一个基因的区域。然而,我们的序列数据与他们的解释不相符。我们确实认为该区域可能编码蛋白质,但认为这里的基因结构仍未解决。