Bukh J, Purcell R H, Miller R H
Hepatitis Viruses Section, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892.
Proc Natl Acad Sci U S A. 1993 Sep 1;90(17):8234-8. doi: 10.1073/pnas.90.17.8234.
In a previous study we sequenced the 5' noncoding (NC) region of 44 isolates of hepatitis C virus (HCV) and identified heterogeneous domains that provided evidence for additional genetic groups of HCV not previously recognized. In this study we have determined the complete nucleotide sequence of the putative envelope 1 (E1) gene in 51 HCV isolates from around the world and found that they could be grouped into at least 12 distinct genotypes. The E1 gene sequence of 8 of these genotypes has not been reported previously. Although the genetic relatedness of HCV isolates determined by the previous analysis of the 5' NC region predicted the relationships observed in the E1 gene, analysis of the 5' NC sequence alone did not accurately predict all HCV genotypes. The nucleotide and amino acid sequence identities of the E1 gene among HCV isolates of the same genotype were in the range of 88.0-99.1% and 89.1-98.4%, respectively, whereas those of HCV isolates of different genotypes were in the range of 53.5-78.6% and 49.0-82.8%, respectively. The latter differences are similar to those found when comparing the envelope gene sequences of the various serotypes of the related flaviviruses as well as other RNA viruses. We found that some genotypes of HCV were widely distributed around the world, whereas others were identified only in discreet geographical regions. Four genotypes were identified exclusively in Africa and comprised the majority of HCV isolates on that continent. The E1 gene was exactly 576 nucleotides in length in all 51 HCV isolates with no in-frame stop codons. Analysis of the predicted E1 protein identified several conserved domains that may be important for maintaining its biological function: (i) eight invariant cysteine residues, (ii) three potential N-linked glycosylation sites, (iii) a domain of nine amino acids (GHRMAWDMM), and (iv) an amino acid doublet (GV) near the putative cleavage site at the C terminus of the protein. In conclusion, the discovery of at least 12 genotypes of HCV has important implications for HCV diagnosis and vaccine development.
在之前的一项研究中,我们对44株丙型肝炎病毒(HCV)的5'非编码(NC)区域进行了测序,并鉴定出了异质结构域,这些结构域为先前未被识别的HCV其他遗传组提供了证据。在本研究中,我们测定了来自世界各地的51株HCV分离株中推定的包膜1(E1)基因的完整核苷酸序列,发现它们可分为至少12种不同的基因型。其中8种基因型的E1基因序列此前尚未见报道。尽管通过先前对5'NC区域的分析所确定的HCV分离株的遗传相关性预测了在E1基因中观察到的关系,但仅对5'NC序列进行分析并不能准确预测所有的HCV基因型。同一基因型的HCV分离株之间E1基因的核苷酸和氨基酸序列同一性分别在88.0 - 99.1%和89.1 - 98.4%的范围内,而不同基因型的HCV分离株之间的同一性分别在53.5 - 78.6%和49.0 - 82.8%的范围内。后一种差异与在比较相关黄病毒以及其他RNA病毒的各种血清型的包膜基因序列时所发现的差异相似。我们发现,一些HCV基因型在世界各地广泛分布,而其他基因型仅在特定的地理区域被鉴定出来。有四种基因型仅在非洲被鉴定出,并且在该大陆的HCV分离株中占大多数。所有51株HCV分离株的E1基因长度均为576个核苷酸,且无读框内终止密码子。对预测的E1蛋白的分析鉴定出了几个可能对维持其生物学功能很重要的保守结构域:(i)八个不变的半胱氨酸残基,(ii)三个潜在的N - 糖基化位点,(iii)一个由九个氨基酸组成的结构域(GHRMAWDMM),以及(iv)在该蛋白C末端推定的切割位点附近有一个氨基酸双峰(GV)。总之,至少12种HCV基因型的发现对HCV诊断和疫苗开发具有重要意义。