Qi Ji, Luo Hong, Hao Bailin
The Institute of Theoretical Physics, Academia Sinica, Beijing 100080, China.
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W45-7. doi: 10.1093/nar/gkh362.
Composition Vector Tree (CVTree) implements a systematic method for inferring the evolutionary relatedness of microbial organisms from the oligopeptide content of their complete proteomes (http://cvtree.cbi.pku.edu.cn). Since the first bacterial genomes were sequenced in 1995, there have been several attempts to infer prokaryote phylogeny from complete genomes. Most of them depend on sequence alignment, directly or indirectly, and in some cases need fine-tuning and adjustment. The composition vector method circumvents the ambiguity of choosing genes for phylogenetic reconstruction and avoids the need to align sequences of essentially different length and gene content. The method contains no 'free' parameters and requires no 'fine-tuning'. A bootstrap test on a phylogenetic tree of 139 organisms has shown the branchings to be stable, and they support the small subunit ribosomal RNA (SSU rRNA) tree of life in its overall structure and in many details. CVTree may provide a quick reference in prokaryote phylogenetics whenever the proteome of an organism is available, a situation that will become commonplace in the near future.
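To make the idea of comparing proteomes by oligopeptide content concrete, the following is a minimal Python sketch of an alignment-free comparison in the spirit of a composition vector approach. It is not the CVTree implementation. The abstract does not give the formulas, so the background subtraction (predicting each K-peptide frequency from its shorter sub-peptides), the cosine-based distance, the choice K = 3, and all function names are assumptions made for illustration. For brevity the sketch keeps only K-peptides actually observed in a proteome.

```python
from collections import Counter
from math import sqrt


def kmer_frequencies(proteins, k):
    """Relative frequencies of all overlapping k-peptides in a list of protein strings."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}


def composition_vector(proteins, k=3):
    """Sparse vector of relative deviations (f - f0) / f0 for observed k-peptides.

    f0 is a background estimate built from (k-1)- and (k-2)-peptide
    frequencies (an assumed Markov-style prediction). Requires k >= 3.
    """
    if k < 3:
        raise ValueError("k must be at least 3 for this sketch")
    f_k = kmer_frequencies(proteins, k)
    f_k1 = kmer_frequencies(proteins, k - 1)
    f_k2 = kmer_frequencies(proteins, k - 2)
    vec = {}
    for w, f in f_k.items():
        # Every observed k-peptide implies its sub-peptides were observed too,
        # so these lookups and the division are safe.
        f0 = f_k1[w[:-1]] * f_k1[w[1:]] / f_k2[w[1:-1]]
        vec[w] = (f - f0) / f0
    return vec


def cv_distance(u, v):
    """Distance (1 - cos) / 2 between two sparse vectors, in the range [0, 1]."""
    dot = sum(x * v.get(w, 0.0) for w, x in u.items())
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.5  # assumption: treat an empty vector as uncorrelated
    return (1.0 - dot / (norm_u * norm_v)) / 2.0


# Toy "proteomes" (hypothetical sequences, far shorter than real ones).
proteome_a = ["MKVLAAGIVKM", "MKKLLAAGG"]
proteome_b = ["MSTNPKPQRKT", "MKVLAAGIV"]

cv_a = composition_vector(proteome_a)
cv_b = composition_vector(proteome_b)
print("distance(a, a) =", cv_distance(cv_a, cv_a))
print("distance(a, b) =", cv_distance(cv_a, cv_b))
```

Applied to every pair of organisms, a distance of this kind fills a matrix that a standard distance-based tree-building method could turn into a phylogeny. No alignment or gene selection is needed, because each proteome is reduced to a single vector.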