Koonin E V, Dolja V V
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894.
Crit Rev Biochem Mol Biol. 1993;28(5):375-430. doi: 10.3109/10409239309078440.
Despite the rapid mutational change that is typical of positive-strand RNA viruses, enzymes mediating the replication and expression of virus genomes contain arrays of conserved sequence motifs. Proteins with such motifs include RNA-dependent RNA polymerase, putative RNA helicase, chymotrypsin-like and papain-like proteases, and methyltransferases. The genes for these proteins form partially conserved modules in large subsets of viruses. A concept of the virus genome as a relatively evolutionarily stable "core" of housekeeping genes accompanied by a much more flexible "shell" consisting mostly of genes coding for virion components and various accessory proteins is discussed. Shuffling of the "shell" genes including genome reorganization and recombination between remote groups of viruses is considered to be one of the major factors of virus evolution. Multiple alignments for the conserved viral proteins were constructed and used to generate the respective phylogenetic trees. Based primarily on the tentative phylogeny for the RNA-dependent RNA polymerase, which is the only universally conserved protein of positive-strand RNA viruses, three large classes of viruses, each consisting of distinct smaller divisions, were delineated. A strong correlation was observed between this grouping and the tentative phylogenies for the other conserved proteins as well as the arrangement of genes encoding these proteins in the virus genome. A comparable correlation with the polymerase phylogeny was not found for genes encoding virion components or for genome expression strategies. It is surmised that several types of arrangement of the "shell" genes as well as basic mechanisms of expression could have evolved independently in different evolutionary lineages. The grouping revealed by phylogenetic analysis may provide the basis for revision of virus classification, and phylogenetic taxonomy of positive-strand RNA viruses is outlined. Some of the phylogenetically derived divisions of positive-strand RNA viruses also include double-stranded RNA viruses, indicating that in certain cases the type of genome nucleic acid may not be a reliable taxonomic criterion for viruses. Hypothetical evolutionary scenarios for positive-strand RNA viruses are proposed. It is hypothesized that all positive-strand RNA viruses and some related double-stranded RNA viruses could have evolved from a common ancestor virus that contained genes for RNA-dependent RNA polymerase, a chymotrypsin-related protease that also functioned as the capsid protein, and possibly an RNA helicase.
尽管正链RNA病毒通常具有快速的突变变化,但介导病毒基因组复制和表达的酶含有一系列保守的序列基序。具有此类基序的蛋白质包括RNA依赖性RNA聚合酶、假定的RNA解旋酶、胰凝乳蛋白酶样和木瓜蛋白酶样蛋白酶以及甲基转移酶。这些蛋白质的基因在很大一部分病毒中形成部分保守的模块。文中讨论了一种概念,即病毒基因组是由相对进化稳定的管家基因“核心”以及一个更为灵活的“外壳”组成,“外壳”主要由编码病毒粒子成分和各种辅助蛋白的基因组成。包括基因组重组和远距离病毒群体之间的重组在内的“外壳”基因重排被认为是病毒进化的主要因素之一。构建了保守病毒蛋白的多序列比对,并用于生成各自的系统发育树。主要基于RNA依赖性RNA聚合酶的初步系统发育,它是正链RNA病毒唯一普遍保守的蛋白质,划分出三大类病毒,每一类又由不同的较小分类组成。观察到这种分组与其他保守蛋白的初步系统发育以及病毒基因组中编码这些蛋白的基因排列之间存在很强的相关性。对于编码病毒粒子成分的基因或基因组表达策略,未发现与聚合酶系统发育有类似的相关性。据推测,“外壳”基因的几种排列类型以及基本表达机制可能在不同的进化谱系中独立进化。系统发育分析揭示的分组可能为病毒分类的修订提供基础,并概述了正链RNA病毒的系统发育分类。正链RNA病毒的一些系统发育衍生分类中还包括双链RNA病毒,这表明在某些情况下,基因组核酸类型可能不是病毒可靠的分类标准。提出了正链RNA病毒的假设进化情景。假设所有正链RNA病毒和一些相关的双链RNA病毒可能都从一种共同的祖先病毒进化而来,该祖先病毒含有RNA依赖性RNA聚合酶、一种也起衣壳蛋白作用的胰凝乳蛋白酶相关蛋白酶,以及可能的一种RNA解旋酶。