Grant Taran, Kluge Arnold G
Division of Vertebrate Zoology, Herpetology, American Museum of Natural History, New York, NY 10024, USA.
Department of Ecology, Evolution, and Environmental Biology, Columbia University, New York, NY 10027, USA.
Cladistics. 2003 Oct;19(5):379-418. doi: 10.1111/j.1096-0031.2003.tb00311.x.
The methods of data exploration have become the centerpiece of phylogenetic inference, but without the scientific importance of those methods having been identified. We examine in some detail the procedures and justifications of Wheeler's sensitivity analysis and relative rate comparison (saturation analysis). In addition, we review methods designed to explore evidential decisiveness, clade stability, transformation series additivity, methodological concordance, sensitivity to prior probabilities (Bayesian analysis), skewness, computer-intensive tests, long-branch attraction, model assumptions (likelihood ratio test), sensitivity to amount of data, polymorphism, clade concordance index, character compatibility, partitioned analysis, spectral analysis, relative apparent synapomorphy analysis, and congruence with a "known" phylogeny. In our review, we consider a method to be scientific if it performs empirical tests, i.e., if it applies empirical data that could potentially refute the hypothesis of interest. Methods that do not perform tests, and therefore are not scientific, may nonetheless be heuristic in the scientific enterprise if they point to more weakly or ambiguously corroborated hypotheses, such propositions being more easily refuted than those that have been more severely tested and are more strongly corroborated. Based on common usage, data exploration in phylogenetics is accomplished by any method that performs sensitivity or quality analysis. Sensitivity analysis evaluates the responsiveness of results to variation or errors in parameter values and assumptions. Sensitivity analysis is generally interpreted as providing a measure of support, where conclusions that are insensitive (robust, stable) to perturbations are judged to be accurate, probable, or reliable. As an alternative to that verificationist concept, we define support objectively as the degree to which critical evidence refutes competing hypotheses. As such, degree of support is secondary to the scientific optimality criterion of maximizing explanatory power. Quality analyses purport to distinguish good, reliable, accurate data from bad, misleading, erroneous data, thereby assessing the ability of data to indicate the true phylogeny. Only the quality analysis of character compatibility can be judged scientific-and a weak test at that compared to character congruence. Methods judged to be heuristic include Bremer support, long-branch extraction, and safe taxonomic reduction, and we underscore the great heuristic potential of a posteriori analysis of patterns of transformations on the total-evidence cladogram. However, of the more than 20 kinds of data exploration methods evaluated, the vast majority is neither scientific nor heuristic. Given so little demonstrated cognitive worth, we conclude that undue emphasis has been placed on data exploration in phylogenetic inference, and we urge phylogeneticists to consider more carefully the relevance of the methods that they employ. [T]he cult of impressive technicalities or the cult of precision may get the better of us, and interfere with our search for clarity, simplicity, and truth [Popper, 1983, p. 60. Empirical papers chosen for publication are judged to be of interest to a broad systematics audience because they represent exemplary case studies involving some important contemporary issue or issues. These may be unusually thorough explorations of data, applications of new methodology, illustrations of fundamental principles, and/or investigations of interesting evolutionary questions. [Systematic Biology: Instructions for authors, 2002; italics added].
数据探索方法已成为系统发育推断的核心,但这些方法的科学重要性尚未得到明确。我们详细研究了惠勒敏感性分析和相对速率比较(饱和度分析)的程序及理由。此外,我们回顾了旨在探索证据决定性、分支稳定性、转换系列可加性、方法一致性、对先验概率的敏感性(贝叶斯分析)、偏度、计算机密集型检验、长枝吸引、模型假设(似然比检验)、对数据量的敏感性、多态性、分支一致性指数、性状兼容性、分区分析、光谱分析、相对明显共近裔性状分析以及与“已知”系统发育的一致性等方法。在我们的综述中,如果一种方法进行实证检验,即应用可能反驳感兴趣假设的实证数据,我们就认为该方法是科学的。不进行检验因而不科学的方法,在科学事业中可能仍是启发式的,前提是它们指向得到较弱或较模糊支持的假设,这类命题比那些经过更严格检验且得到更强支持的命题更容易被反驳。根据通常用法,系统发育学中的数据探索可通过任何进行敏感性或质量分析的方法来实现。敏感性分析评估结果对参数值和假设的变化或误差的响应程度。敏感性分析通常被解释为提供一种支持度衡量,对扰动不敏感(稳健、稳定)的结论被判定为准确、可能或可靠。作为对这种证实主义概念的替代,我们将支持客观地定义为关键证据反驳竞争假设的程度。因此支持度从属于使解释力最大化的科学最优性标准。质量分析旨在区分好的、可靠的、准确的数据与坏的、误导性的、错误的数据,从而评估数据指示真实系统发育的能力。只有性状兼容性的质量分析可被判定为科学——而且与性状一致性相比,这是一个较弱的检验。被判定为启发式的方法包括布雷默支持度、长枝提取和安全分类简约法,我们强调对总证据分支图上转换模式进行后验分析具有巨大的启发式潜力。然而,在评估的20多种数据探索方法中,绝大多数既不科学也不是启发式的。鉴于所展示的认知价值如此之低,我们得出结论,在系统发育推断中对数据探索的强调过度了,我们敦促系统发育学家更仔细地考虑他们所采用方法的相关性。[T]对令人印象深刻的技术细节的崇拜或对精确性的崇拜可能会战胜我们,并干扰我们对清晰性、简单性和真理的追求[波普尔,1983年,第60页]。被选用于发表的实证论文被判定对广大系统学受众有吸引力,因为它们代表了涉及一些重要当代问题的典范案例研究。这些案例可能是对数据的异常深入的探索、新方法的应用、基本原理的阐释和/或对有趣进化问题的研究。[《系统生物学:作者须知》,2002年;斜体为添加]