Herniou E A, Luque T, Chen X, Vlak J M, Winstanley D, Cory J S, O'Reilly D R
Department of Biology, Imperial College, London SW7 2AZ, United Kingdom.
J Virol. 2001 Sep;75(17):8117-26. doi: 10.1128/jvi.75.17.8117-8126.2001.
Several phylogenetic methods based on whole genome sequence data were evaluated using data from nine complete baculovirus genomes. The utility of three independent character sets was assessed. The first data set comprised the sequences of the 63 genes common to these viruses. The second set of characters was based on gene order, and phylogenies were inferred using both breakpoint distance analysis and a novel method developed here, termed neighbor pair analysis. The third set recorded gene content by scoring gene presence or absence in each genome. All three data sets yielded phylogenies supporting the separation of the Nucleopolyhedrovirus (NPV) and Granulovirus (GV) genera, the division of the NPVs into groups I and II, and species relationships within group I NPVs. Generation of phylogenies based on the combined sequences of all 63 shared genes proved to be the most effective approach to resolving the relationships among the group II NPVs and the GVs. The history of gene acquisitions and losses that have accompanied baculovirus diversification was visualized by mapping the gene content data onto the phylogenetic tree. This analysis highlighted the fluid nature of baculovirus genomes, with evidence of frequent genome rearrangements and multiple gene content changes during their evolution. Of more than 416 genes identified in the genomes analyzed, only 63 are present in all nine genomes, and 200 genes are found only in a single genome. Despite this fluidity, the whole genome-based methods we describe are sufficiently powerful to recover the underlying phylogeny of the viruses.
使用来自九个完整杆状病毒基因组的数据,对几种基于全基因组序列数据的系统发育方法进行了评估。评估了三个独立特征集的效用。第一个数据集包含这些病毒共有的63个基因的序列。第二组特征基于基因顺序,并使用断点距离分析和此处开发的一种新方法(称为邻对分析)推断系统发育。第三组通过对每个基因组中基因的存在或缺失进行评分来记录基因内容。所有这三个数据集产生的系统发育树都支持核型多角体病毒(NPV)和颗粒体病毒(GV)属的分离、NPV分为I组和II组以及I组NPV内的物种关系。基于所有63个共享基因的组合序列生成系统发育树被证明是解决II组NPV和GV之间关系的最有效方法。通过将基因内容数据映射到系统发育树上,直观展示了杆状病毒多样化过程中伴随的基因获得和丢失的历史。该分析突出了杆状病毒基因组的流动性,有证据表明在其进化过程中频繁发生基因组重排和多次基因内容变化。在所分析的基因组中鉴定出的416多个基因中,只有63个存在于所有九个基因组中,200个基因仅在单个基因组中发现。尽管存在这种流动性,但我们描述的基于全基因组的方法足够强大,能够恢复病毒的潜在系统发育。