Ho Simon Y, Jermiin Lars
School of Biological Sciences, University of Sydney, NSW 2006, Australia.
Syst Biol. 2004 Aug;53(4):623-37. doi: 10.1080/10635150490503035.
Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invariant sites, and covariotides, may interfere adversely with the recovery of the historical signal. The effect of the interaction of these signals on phylogenetic inference is not well understood and may, in many cases, even be underappreciated. In this study, we investigate this matter and present results based on Monte Carlo simulations. We explored the success of four phylogenetic methods in recovering the true tree from data that had evolved under conditions where the equilibrium base frequencies and substitution rates were allowed to vary among lineages. Seven scenarios with increasingly complex conditions were investigated. All of the methods tested, with the exception of neighbor-joining using LogDet distances, were sensitive to compositional convergence in nonsister lineages. Maximum parsimony was also susceptible to attraction between long edges. In many cases, however, phylogenetic inference methods can still recover the true tree when misleading signals are present, in some instances even when the historical signal is no longer dominant. These results highlight the growing need for simple methods to detect violation of the phylogenetic assumptions.
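The abstract singles out neighbor-joining with LogDet distances as the one tested method that was not sensitive to compositional convergence. As a rough illustration of why such a distance can tolerate differing base compositions, the sketch below computes one commonly used normalized form of the LogDet (paralinear) distance between two aligned nucleotide sequences. The abstract gives neither the formula nor the software used in the study, so the normalization and the function names `divergence_matrix` and `logdet_distance` are assumptions made for illustration only.

```python
import numpy as np

BASES = "ACGT"


def divergence_matrix(seq_x, seq_y):
    """Return the 4x4 matrix of joint base frequencies for two aligned sequences.

    Sites where either sequence has a character outside A, C, G, T
    (gaps, ambiguity codes) are skipped.
    """
    index = {base: i for i, base in enumerate(BASES)}
    counts = np.zeros((4, 4))
    for a, b in zip(seq_x, seq_y):
        if a in index and b in index:
            counts[index[a], index[b]] += 1
    return counts / counts.sum()


def logdet_distance(seq_x, seq_y):
    """LogDet distance in one common normalized form (an assumption here):

        d = -1/4 * [ ln det F - 1/2 * ( sum ln f_x + sum ln f_y ) ]

    where F is the divergence matrix and f_x, f_y are its row and column
    sums, i.e. the base frequencies of each sequence.
    """
    F = divergence_matrix(seq_x, seq_y)
    sign, log_det = np.linalg.slogdet(F)
    if sign <= 0:
        # Happens, for example, when a base is absent from one sequence
        # or the sequences are saturated with change.
        raise ValueError("LogDet distance is undefined: det(F) is not positive")
    f_x = F.sum(axis=1)  # base frequencies of seq_x
    f_y = F.sum(axis=0)  # base frequencies of seq_y
    return -0.25 * (log_det - 0.5 * (np.log(f_x).sum() + np.log(f_y).sum()))
```

Because the distance is built from the full matrix of joint frequencies rather than from a single proportion of differing sites, the base compositions of the two sequences enter the calculation explicitly through `f_x` and `f_y`. Identical sequences that contain all four bases give a distance of zero, and swapping the two sequences leaves the value unchanged.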