Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany.
German Center for Infection Research (DZIF), Braunschweig, Germany.
Sci Rep. 2018 Nov 19;8(1):17000. doi: 10.1038/s41598-018-35421-4.
Phylogeographic methods reconstruct the origin and spread of taxa by inferring locations for internal nodes of the phylogenetic tree from sampling locations of genetic sequences. This is commonly applied to study pathogen outbreaks and spread. To evaluate such reconstructions, the inferred spread paths from root to leaf nodes should be compared to other methods or references. Usually, ancestral state reconstructions are evaluated by node-wise comparisons, therefore requiring the same tree topology, which is usually unknown. Here, we present a method for comparing phylogeographies across different trees inferred from the same taxa. We compare paths of locations by calculating discrete Fréchet distances. By correcting the distances by the number of paths going through a node, we define the Fréchet tree distance as a distance measure between phylogeographies. As an application, we compare phylogeographic spread patterns on trees inferred with different methods from hemagglutinin sequences of H5N1 influenza viruses, finding that both tree inference and ancestral reconstruction cause variation in phylogeographic spread that is not directly reflected by topological differences. The method is suitable for comparing phylogeographies inferred with different tree or phylogeographic inference methods to each other or to a known ground truth, thus enabling a quality assessment of such techniques.
系统发生地理学方法通过从遗传序列采样位置推断系统发育树内部节点的位置来重建分类单元的起源和传播。这通常应用于研究病原体的爆发和传播。为了评估此类重建,应该将从根到叶节点的推断传播路径与其他方法或参考资料进行比较。通常,祖先状态重建是通过节点比较进行评估的,因此需要相同的树拓扑结构,而这通常是未知的。在这里,我们提出了一种跨不同树比较同一分类单元推断的地理学方法。我们通过计算离散 Fréchet 距离来比较位置路径。通过将距离校正为通过节点的路径数量,我们将 Fréchet 树距离定义为地理学之间的距离度量。作为应用,我们比较了从 H5N1 流感病毒血凝素序列推断的不同方法的地理学传播模式,发现树推断和祖先重建都会导致地理学传播的变化,而这些变化并不能直接反映在拓扑差异上。该方法适用于比较彼此之间或与已知真实情况的不同树或地理学推断方法推断的地理学,从而能够对这些技术进行质量评估。