Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, P.R. China.
Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Department of Plant Pathology, South China Agricultural University, Guangzhou, P.R. China.
Mol Biol Evol. 2018 Feb 1;35(2):486-503. doi: 10.1093/molbev/msx302.
The sizes of the data matrices assembled to resolve branches of the tree of life have increased dramatically, motivating the development of programs for fast, yet accurate, inference. For example, several different fast programs have been developed in the very popular maximum likelihood framework, including RAxML/ExaML, PhyML, IQ-TREE, and FastTree. Although these programs are widely used, a systematic evaluation and comparison of their performance using empirical genome-scale data matrices has so far been lacking. To address this question, we evaluated these four programs on 19 empirical phylogenomic data sets with hundreds to thousands of genes and up to 200 taxa with respect to likelihood maximization, tree topology, and computational speed. For single-gene tree inference, we found that the more exhaustive and slower strategies (ten searches per alignment) outperformed faster strategies (one tree search per alignment) using RAxML, PhyML, or IQ-TREE. Interestingly, single-gene trees inferred by the three programs yielded comparable coalescent-based species tree estimations. For concatenation-based species tree inference, IQ-TREE consistently achieved the best-observed likelihoods for all data sets, and RAxML/ExaML was a close second. In contrast, PhyML often failed to complete concatenation-based analyses, whereas FastTree was the fastest but generated lower likelihood values and more dissimilar tree topologies in both types of analyses. Finally, data matrix properties, such as the number of taxa and the strength of phylogenetic signal, sometimes substantially influenced the programs' relative performance. Our results provide real-world gene and species tree phylogenetic inference benchmarks to inform the design and execution of large-scale phylogenomic data analyses.
用于解决生命之树分支的数据集大小已经大幅增加,这促使了快速而准确的推断程序的发展。例如,在非常流行的最大似然框架中已经开发了几个不同的快速程序,包括 RAxML/ExaML、PhyML、IQ-TREE 和 FastTree。尽管这些程序被广泛使用,但迄今为止,还缺乏使用经验基因组规模数据集对其性能进行系统评估和比较的研究。为了解决这个问题,我们针对 19 个具有数百到数千个基因和多达 200 个分类群的经验系统发育基因组数据集,评估了这四个程序在似然最大化、树拓扑和计算速度方面的表现。对于单基因树推断,我们发现,使用 RAxML、PhyML 或 IQ-TREE 时,更详尽和更慢的策略(每个比对进行 10 次搜索)优于更快的策略(每个比对进行 1 次树搜索)。有趣的是,这三个程序推断的单基因树产生了可比较的基于合并的种系树估计。对于基于串联的种系树推断,IQ-TREE 始终为所有数据集实现了最佳观测似然,而 RAxML/ExaML 则紧随其后。相比之下,PhyML 经常无法完成基于串联的分析,而 FastTree 是最快的,但在两种类型的分析中产生的似然值较低,树拓扑也更不相似。最后,数据矩阵特性,如分类群的数量和系统发育信号的强度,有时会极大地影响程序的相对性能。我们的结果提供了真实世界的基因和种系树系统发育推断基准,以指导大规模系统发育基因组数据分析的设计和执行。