Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA.
Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA.
Nat Commun. 2016 Sep 2;7:12758. doi: 10.1038/ncomms12758.
Phylogenetic trees reconstructed from molecular sequences are often considered more reliable than those reconstructed from morphological characters, in part because convergent evolution, which confounds phylogenetic reconstruction, is believed to be rarer for molecular sequences than for morphologies. However, neither the validity of this belief nor its underlying cause is known. Here comparing thousands of characters of each type that have been used for inferring the phylogeny of mammals, we find that on average morphological characters indeed experience much more convergences than amino acid sites, but this disparity is explained by fewer states per character rather than an intrinsically higher susceptibility to convergence for morphologies than sequences. We show by computer simulation and actual data analysis that a simple method for identifying and removing convergence-prone characters improves phylogenetic accuracy, potentially enabling, when necessary, the inclusion of morphologies and hence fossils for reliable tree inference.
从分子序列重建的系统发育树通常比从形态特征重建的系统发育树更可靠,部分原因是,趋同进化会混淆系统发育重建,人们认为它在分子序列中比在形态特征中更为罕见。然而,这种信念的有效性及其潜在原因尚不清楚。在这里,我们比较了数千种用于推断哺乳动物系统发育的每种类型的特征,发现平均而言,形态特征确实比氨基酸位点经历了更多的趋同进化,但这种差异是由每个特征的状态较少而不是形态比序列更易受趋同进化影响造成的。我们通过计算机模拟和实际数据分析表明,一种识别和去除易发生趋同进化特征的简单方法可以提高系统发育的准确性,从而在必要时能够包含形态特征和化石,以进行可靠的树推断。