Nye Tom M W
School of Mathematics and Statistics, Newcastle University, Newcastle, UK.
Syst Biol. 2008 Oct;57(5):785-94. doi: 10.1080/10635150802424072.
Phylogenetic analysis very commonly produces several alternative trees for a given fixed set of taxa. For example, different sets of orthologous genes may be analyzed, or the analysis may sample from a distribution of probable trees. This article describes an approach to comparing and visualizing multiple alternative phylogenies via the idea of a "tree of trees" or "meta-tree." A meta-tree clusters phylogenies with similar topologies together in the same way that a phylogeny clusters species with similar DNA sequences. Leaf nodes on a meta-tree correspond to the original set of phylogenies given by some analysis, whereas interior nodes correspond to certain consensus topologies. The construction of meta-trees is motivated by analogy with construction of a most parsimonious tree for DNA data, but instead of using DNA letters, in a meta-tree the characters are partitions or splits of the set of taxa. An efficient algorithm for meta-tree construction is described that makes use of a known relationship between the majority consensus and parsimony in terms of gain and loss of splits. To illustrate these ideas meta-trees are constructed for two datasets: a set of gene trees for species of yeast and trees from a bootstrap analysis of a set of gene trees in ray-finned fish. A software tool for constructing meta-trees and comparing alternative phylogenies is available online, and the source code can be obtained from the author.
系统发育分析通常会为给定的固定分类单元集生成多个备选树。例如,可以分析不同的直系同源基因集,或者分析可以从可能树的分布中进行抽样。本文描述了一种通过“树的树”或“元树”的概念来比较和可视化多个备选系统发育树的方法。元树以与系统发育树将具有相似DNA序列的物种聚类相同的方式,将具有相似拓扑结构的系统发育树聚类在一起。元树上的叶节点对应于某些分析给出的原始系统发育树集,而内部节点对应于某些一致拓扑结构。元树的构建是通过与为DNA数据构建最简约树的类比来推动的,但在元树中,字符不是使用DNA字母,而是分类单元集的划分或分裂。描述了一种用于构建元树的有效算法,该算法利用了多数一致性和简约性在分裂的获得和损失方面的已知关系。为了说明这些想法,为两个数据集构建了元树:一组酵母物种的基因树和一组硬骨鱼基因树的自展分析得到的树。一个用于构建元树和比较备选系统发育树的软件工具可在线获取,其源代码可从作者处获得。