来自模糊特征的不同参数分支支持值。

Disparate parametric branch-support values from ambiguous characters.

作者信息

Simmons Mark P, Randle Christopher P

机构信息

Department of Biology, Colorado State University, Fort Collins, CO 80523-1878, USA.

Department of Biological Sciences, Sam Houston State University, Huntsville, TX 77341, USA.

出版信息

Mol Phylogenet Evol. 2014 Sep;78:66-86. doi: 10.1016/j.ympev.2014.04.029. Epub 2014 May 10.

DOI:10.1016/j.ympev.2014.04.029

PMID:24821621

Abstract

The greater power of parametric methods over parsimony is frequently observed in empirical phylogenetic analyses by providing greater resolution and higher branch support. This greater power is provided by several different factors, including some that are generally regarded as disadvantageous. In this study we used both empirical and (modified) simulated matrices to examine how Bayesian MCMC, maximum likelihood, and parsimony methods interpret ambiguous optimization of character states. We describe the information content in "redundant" terminals as well as a novel approach to help identify clades that cannot be unequivocally supported by synapomorphies in empirical matrices. Four of our main conclusions are as follows. First, the SH-like approximate likelihood ratio test is a more reliable indicator than the bootstrap of branches that are only ambiguously supported in likelihood analyses wherein only a single fully resolved optimal tree is presented. Second, bootstrap values generated by methods that only ever present a single fully resolved optimal tree are less robust to differences in taxon sampling than are those generated by more conservative methods. Third, PAUP(∗) likelihood is more resilient to producing apparently unambiguous resolution and high support from ambiguous characters than is GARLI collapse 1 and MrBayes, which in turn are more resilient than PhyML. GARLI collapse 0, IQ-TREE, and RAxML are the least resilient bootstrapping methods examined. Fourth, frequent discrepancies with respect to resolution and/or branch support may be obtained by methods that only ever present a single fully resolved optimal tree in different contexts that are apparently unique to the specific program and/or method of quantifying branch support.

摘要

在实证系统发育分析中，参数方法相对于简约法具有更大的优势，这通常体现在它能提供更高的分辨率和更强的分支支持。这种更大的优势由几个不同因素促成，其中包括一些通常被视为不利的因素。在本研究中，我们使用实证矩阵和（修改后的）模拟矩阵来检验贝叶斯MCMC、最大似然法和简约法如何解释性状状态的模糊优化。我们描述了“冗余”终端中的信息内容，以及一种新方法，以帮助识别在实证矩阵中无法由共衍征明确支持的进化枝。我们的四个主要结论如下。第一，在仅呈现单个完全解析的最优树的似然分析中，SH 类近似似然比检验是比自展法更可靠的分支支持指标，这些分支仅得到模糊支持。第二，与更保守的方法所生成的自展值相比，仅呈现单个完全解析的最优树的方法所生成的自展值对分类群抽样差异的稳健性更低。第三，与GARLI collapse 1和MrBayes相比，PAUP(∗)似然法在处理由模糊性状产生明显明确的分辨率和高支持度方面更具弹性，而GARLI collapse 1和MrBayes又比PhyML更具弹性。GARLI collapse 0、IQ-TREE和RAxML是所检验的自展法中弹性最低的。第四，在不同情境下，仅呈现单个完全解析的最优树的方法可能会在分辨率和/或分支支持方面频繁出现差异，这些差异显然是特定程序和/或量化分支支持方法所特有的。