Bandelt H J, Forster P, Sykes B C, Richards M B
Mathematisches Seminar, Universität Hamburg, Germany.
Genetics. 1995 Oct;141(2):743-53. doi: 10.1093/genetics/141.2.743.
Analysis of variation in the hypervariable region of mitochondrial DNA (mtDNA) has emerged as an important tool for studying human evolution and migration. However, attempts to reconstruct optimal intraspecific mtDNA phylogenies frequently fail because parallel mutation events partly obscure the true evolutionary pathways. This makes it inadvisable to present a single phylogenetic tree at the expense of neglecting equally acceptable ones. As an alternative, we propose a novel network approach for portraying mtDNA relationships. For small sample sizes (< approximately 50), an unmodified median network contains all most parsimonious trees, displays graphically the full information content of the sequence data, and can easily be generated by hand. For larger sample sizes, we reduce the complexity of the network by identifying parallelisms. This reduction procedure is guided by a compatibility argument and an additional source of phylogenetic information: the frequencies of the mitochondrial haplotypes. As a spin-off, our approach can also assist in identifying sequencing errors, which manifest themselves in implausible network substructures. We illustrate the advantages of our approach with several examples from existing data sets.
线粒体DNA(mtDNA)高变区变异分析已成为研究人类进化和迁徙的重要工具。然而,重建最佳种内mtDNA系统发育树的尝试常常失败,因为平行突变事件部分掩盖了真实的进化路径。这使得以忽视同样合理的系统发育树为代价来呈现单一的系统发育树是不可取的。作为一种替代方法,我们提出了一种描绘mtDNA关系的新颖网络方法。对于小样本量(<约50),未修改的中位数网络包含所有最简约树,以图形方式展示序列数据的全部信息内容,并且可以很容易地手工生成。对于较大样本量,我们通过识别平行性来降低网络的复杂性。这种简化过程由兼容性论证和系统发育信息的另一个来源:线粒体单倍型频率来指导。作为一个附带成果,我们的方法还可以帮助识别测序错误,这些错误会在不合理的网络子结构中表现出来。我们用现有数据集的几个例子说明了我们方法的优点。