Shih Arthur Chun-Chieh, Lee D T, Peng Chin-Lin, Wu Yu-Wei
Institute of Information Science, Academia Sinica, Taipei, 115, Taiwan.
BMC Bioinformatics. 2007 Feb 24;8:63. doi: 10.1186/1471-2105-8-63.
When aligning several hundreds or thousands of sequences, such as epidemic virus sequences or homologous/orthologous sequences of some big gene families, to reconstruct the epidemiological history or their phylogenies, how to analyze and visualize the alignment results of many sequences has become a new challenge for computational biologists. Although there are several tools available for visualization of very long sequence alignments, few of them are applicable to the alignments of many sequences.
A multiple-logo alignment visualization tool, called Phylo-mLogo, is presented in this paper. Phylo-mLogo calculates the variabilities and homogeneities of alignment sequences by base frequencies or entropies. Different from the traditional representations of sequence logos, Phylo-mLogo not only displays the global logo patterns of the whole alignment of multiple sequences, but also demonstrates their local homologous logos for each clade hierarchically. In addition, Phylo-mLogo also allows the user to focus only on the analysis of some important, structurally or functionally constrained sites in the alignment selected by the user or by built-in automatic calculation.
With Phylo-mLogo, the user can symbolically and hierarchically visualize hundreds of aligned sequences simultaneously and easily check the changes of their amino acid sites when analyzing many homologous/orthologous or influenza virus sequences. More information of Phylo-mLogo can be found at URL http://biocomp.iis.sinica.edu.tw/phylomlogo.
在比对数百或数千条序列时,比如流行病病毒序列或某些大基因家族的同源/直系同源序列,以重建其流行病学历史或系统发育关系时,如何分析和可视化众多序列的比对结果已成为计算生物学家面临的一项新挑战。尽管有几种工具可用于可视化非常长的序列比对,但其中很少有适用于多条序列比对的。
本文提出了一种名为Phylo-mLogo的多序列标识比对可视化工具。Phylo-mLogo通过碱基频率或熵来计算比对序列的变异性和同质性。与传统的序列标识表示不同,Phylo-mLogo不仅显示多条序列整体比对的全局标识模式,还能分层展示每个进化枝的局部同源标识。此外,Phylo-mLogo还允许用户仅专注于分析由用户选择或通过内置自动计算选定的比对中一些重要的、结构或功能受限的位点。
使用Phylo-mLogo,用户可以同时以符号化和分层的方式可视化数百条比对序列,并在分析许多同源/直系同源序列或流感病毒序列时轻松检查其氨基酸位点的变化。有关Phylo-mLogo的更多信息可在网址http://biocomp.iis.sinica.edu.tw/phylomlogo上找到。