Department of Bioinformatics, Biocenter, University Würzburg, Germany.
Mol Phylogenet Evol. 2012 Jun;63(3):585-8. doi: 10.1016/j.ympev.2012.01.026. Epub 2012 Feb 18.
The first step of any molecular phylogenetic analysis is the selection of the species and sequences to be included, the taxon sampling. Already here different pitfalls exist. Sequences can contain errors, annotations in databases can be inaccurate and even the taxonomic classification of a species can be wrong. Usually, these artefacts become evident only after calculation of the phylogenetic tree. Following, the taxon sampling has to be corrected iteratively. This can become tedious and time consuming, as in most cases the taxon sampling is de-coupled from the further steps of the phylogenetic analysis. Here, we present the ITS2 Workbench (http://its2.bioapps.biozentrum.uni-wuerzburg.de/), which eliminates this problem by a tight integration of taxon sampling, secondary structure prediction, multiple alignment and phylogenetic tree calculation. The ITS2 Workbench has access to more than 280,000 ITS2 sequences and their structures provided by the ITS2 database enabling sequence-structure based alignment and tree reconstruction. This allows the interactive improvement of the taxon sampling throughout the whole phylogenetic tree reconstruction process. Thus, the ITS2 Workbench enables a fast, interactive and iterative taxon sampling leading to more accurate ITS2 based phylogenies.
任何分子系统发育分析的第一步是选择要包括的物种和序列,即分类群采样。这里已经存在不同的陷阱。序列可能包含错误,数据库中的注释可能不准确,甚至一个物种的分类学分类也可能是错误的。通常,只有在计算系统发育树后,这些伪影才会变得明显。接下来,必须迭代地纠正分类群采样。这可能会变得繁琐和耗时,因为在大多数情况下,分类群采样与系统发育分析的进一步步骤是解耦的。在这里,我们介绍了 ITS2 Workbench(http://its2.bioapps.biozentrum.uni-wuerzburg.de/),它通过紧密集成分类群采样、二级结构预测、多重比对和系统发育树计算来解决这个问题。ITS2 Workbench 可以访问超过 280,000 个 ITS2 序列及其结构,这些序列及其结构由 ITS2 数据库提供,支持基于序列-结构的比对和树重建。这允许在整个系统发育树重建过程中交互式地改进分类群采样。因此,ITS2 Workbench 能够实现快速、交互式和迭代的分类群采样,从而生成更准确的基于 ITS2 的系统发育关系。