Suppr超能文献

合并法与串联法:应用于被子植物系统发育根部溯源的复杂分析与基本原理

Coalescence vs. concatenation: Sophisticated analyses vs. first principles applied to rooting the angiosperms.

作者信息

Simmons Mark P, Gatesy John

机构信息

Department of Biology, Colorado State University, Fort Collins, CO 80523, USA.

Department of Biology, University of California, Riverside, CA 92521, USA.

出版信息

Mol Phylogenet Evol. 2015 Oct;91:98-122. doi: 10.1016/j.ympev.2015.05.011. Epub 2015 May 19.

Abstract

It has recently been concluded that phylogenomic data from 310 nuclear genes support the clade of (Amborellales, Nymphaeales) as sister to the remaining angiosperms and that shortcut coalescent phylogenetic methods outperformed concatenation for these data. We falsify both of those conclusions here by demonstrating that discrepant results between the coalescent and concatenation analyses are primarily caused by the coalescent methods applied (MP-EST and STAR) not being robust to the highly divergent and often mis-rooted gene trees that were used. This result reinforces the expectation that low amounts of phylogenetic signal and methodological artifacts in gene-tree reconstruction can be more problematic for shortcut coalescent methods than is the assumption of a single hierarchy for all genes by concatenation methods when these approaches are applied to ancient divergences in empirical studies. We also demonstrate that a third coalescent method, ASTRAL, is more robust to mis-rooted gene trees than MP-EST or STAR, and that both Observed Variability (OV) and Tree Independent Generation of Evolutionary Rates (TIGER), which are two character subsampling procedures, are biased in favor of characters with highly asymmetrical distributions of character states when applied to this dataset. We conclude that enthusiastic application of novel tools is not a substitute for rigorous application of first principles, and that trending methods (e.g., shortcut coalescent methods applied to ancient divergences, tree-independent character subsampling), may be novel sources of previously under-appreciated, systematic errors.

摘要

最近得出的结论是,来自310个核基因的系统发育基因组数据支持(无油樟目、睡莲目)分支作为其余被子植物的姐妹分支,并且对于这些数据,快捷合并系统发育方法优于串联法。我们在此通过证明合并分析和串联分析之间的差异结果主要是由所应用的合并方法(MP-EST和STAR)对所使用的高度分化且常常错误定根的基因树不稳健所导致的,从而证伪了这两个结论。这一结果强化了这样一种预期,即当这些方法应用于实证研究中的古老分歧时,基因树重建中系统发育信号量少和方法学假象对于快捷合并方法而言可能比串联方法对所有基因采用单一层次结构的假设更成问题。我们还证明,第三种合并方法ASTRAL比MP-EST或STAR对错误定根的基因树更稳健,并且观察变异(OV)和进化速率的树独立生成(TIGER)这两种特征抽样程序在应用于该数据集时,都偏向于具有高度不对称特征状态分布的特征。我们得出结论,热情应用新工具并不能替代对第一原理的严格应用,并且流行的方法(例如应用于古老分歧的快捷合并方法、与树无关的特征抽样)可能是以前未被充分认识的系统误差的新来源。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验