Gompels U A, Nicholas J, Lawrence G, Jones M, Thomson B J, Martin M E, Efstathiou S, Craxton M, Macaulay H A
Department of Clinical Sciences, London School of Hygiene and Tropical Medicine, University of London, United Kingdom.
Virology. 1995 May 10;209(1):29-51. doi: 10.1006/viro.1995.1228.
The complete DNA sequence was determined for strain U1102 of human herpesvirus-6, a CD4+ T-lymphotropic virus with disease associations in immunodeficient settings and a possible complicating factor in AIDS. The genome is 159,321 bp in size, has a base composition of 43% G + C, and contains 119 open reading frames. The overall structure is 143 kb bounded by 8 kb of direct repeats, DRL (left) and DRR (right), containing 0.35 kb of terminal and junctional arrays of human telomere-like simple repeats. Since eight open reading frames are duplicated in the repeats, six span repetitive elements and three are spliced, the genome is considered to contain 102 separate genes likely to encode protein. The genes are arranged colinearly with those in the genome of the previously sequenced betaherpesvirus, human cytomegalovirus, and has a distinct arrangement of conserved genes relative to the sequenced gammaherpesviruses, herpesvirus saimiri and Epstein-Barr virus, and the alphaherpesviruses, equine herpesvirus-1, varicella-zoster virus, and herpes simplex virus. Comparisons of predicted amino acid sequences allowed the functions of many human herpesvirus-6 encoded proteins to be assigned and showed the closest relationship in overall number and similarity to human cytomegalovirus products, with approximately 67% homologous proteins as compared to the 21% identified in all herpesviruses. The features of the conserved genes and their relative order suggested a general scheme for divergence among these herpesvirus lineages. In addition to the "core" conserved genes, the genome contains four distinct gene families which may be involved in immune evasion and persistence in immune cells: two have similarity to the "chemokine" chemotactic/proinflammatory family of cytokines, one to their peptide G-protein-coupled receptors, and a fourth to the immunoglobulin superfamily.
测定了人疱疹病毒6型U1102株的完整DNA序列。人疱疹病毒6型是一种嗜CD4⁺T淋巴细胞病毒,在免疫缺陷情况下与疾病相关,并且可能是艾滋病的一个复杂因素。该基因组大小为159,321 bp,碱基组成为43%的G + C,包含119个开放阅读框。整体结构为143 kb,由8 kb的正向重复序列DRL(左)和DRR(右)界定,其中包含0.35 kb的人端粒样简单重复序列的末端和连接阵列。由于8个开放阅读框在重复序列中重复,6个跨越重复元件,3个经过剪接,因此该基因组被认为包含102个可能编码蛋白质的独立基因。这些基因与先前测序的β疱疹病毒人巨细胞病毒基因组中的基因呈共线性排列,并且相对于已测序的γ疱疹病毒猴疱疹病毒和爱泼斯坦 - 巴尔病毒以及α疱疹病毒马疱疹病毒1型、水痘 - 带状疱疹病毒和单纯疱疹病毒,其保守基因具有独特的排列方式。预测氨基酸序列的比较使得许多人疱疹病毒6型编码蛋白的功能得以确定,结果显示其在总数和相似性上与人类巨细胞病毒产物关系最为密切,与所有疱疹病毒中鉴定出的21%的同源蛋白相比,约有67%的同源蛋白。保守基因的特征及其相对顺序提示了这些疱疹病毒谱系之间分化的一般模式。除了“核心”保守基因外,该基因组还包含四个不同的基因家族,它们可能参与免疫逃逸和在免疫细胞中的持续存在:两个与细胞因子的“趋化因子”趋化/促炎家族相似,一个与它们的肽G蛋白偶联受体相似,第四个与免疫球蛋白超家族相似。