Liu Zhihua, Meng Jihong, Sun Xiao
State Key Laboratory of Bioelectronics, Southeast University, Nanjing 210096, PR China.
Biochem Biophys Res Commun. 2008 Apr 4;368(2):223-30. doi: 10.1016/j.bbrc.2008.01.070. Epub 2008 Jan 28.
Traditional phylogenetic analysis is based on multiple sequence alignment. With the development of worldwide genome sequencing project, more and more completely sequenced genomes become available. However, traditional sequence alignment tools are impossible to deal with large-scale genome sequence. So, the development of new algorithms to infer phylogenetic relationship without alignment from whole genome information represents a new direction of phylogenetic study in the post-genome era. In the present study, a novel algorithm based on BBC (base-base correlation) is proposed to analyze the phylogenetic relationships of HEV (Hepatitis E virus). When 48 HEV genome sequences are analyzed, the phylogenetic tree that is constructed based on BBC algorithm is well consistent with that of previous study. When compared with methods of sequence alignment, the merit of BBC algorithm appears to be more rapid in calculating evolutionary distances of whole genome sequence and not requires any human intervention, such as gene identification, parameter selection. BBC algorithm can serve as an alternative to rapidly construct phylogenetic trees and infer evolutionary relationships.
传统的系统发育分析基于多序列比对。随着全球基因组测序项目的发展,越来越多的全基因组序列可供使用。然而,传统的序列比对工具无法处理大规模的基因组序列。因此,开发从全基因组信息中推断系统发育关系而无需比对的新算法代表了后基因组时代系统发育研究的一个新方向。在本研究中,提出了一种基于碱基-碱基相关性(BBC)的新算法来分析戊型肝炎病毒(HEV)的系统发育关系。当分析48条戊型肝炎病毒基因组序列时,基于BBC算法构建的系统发育树与先前研究的结果高度一致。与序列比对方法相比,BBC算法的优点在于计算全基因组序列的进化距离更快,并且不需要任何人工干预,如基因识别、参数选择。BBC算法可作为快速构建系统发育树和推断进化关系的一种替代方法。