Physical Biosciences Division, Lawrence Berkeley National Lab, Berkeley, California, United States of America.
PLoS One. 2010 Mar 10;5(3):e9490. doi: 10.1371/journal.pone.0009490.
We recently described FastTree, a tool for inferring phylogenies for alignments with up to hundreds of thousands of sequences. Here, we describe improvements to FastTree that improve its accuracy without sacrificing scalability.
METHODOLOGY/PRINCIPAL FINDINGS: Where FastTree 1 used nearest-neighbor interchanges (NNIs) and the minimum-evolution criterion to improve the tree, FastTree 2 adds minimum-evolution subtree-pruning-regrafting (SPRs) and maximum-likelihood NNIs. FastTree 2 uses heuristics to restrict the search for better trees and estimates a rate of evolution for each site (the "CAT" approximation). Nevertheless, for both simulated and genuine alignments, FastTree 2 is slightly more accurate than a standard implementation of maximum-likelihood NNIs (PhyML 3 with default settings). Although FastTree 2 is not quite as accurate as methods that use maximum-likelihood SPRs, most of the splits that disagree are poorly supported, and for large alignments, FastTree 2 is 100-1,000 times faster. FastTree 2 inferred a topology and likelihood-based local support values for 237,882 distinct 16S ribosomal RNAs on a desktop computer in 22 hours and 5.8 gigabytes of memory.
CONCLUSIONS/SIGNIFICANCE: FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments. FastTree 2 is freely available at http://www.microbesonline.org/fasttree.
我们最近描述了 FastTree,这是一种用于推断具有多达数十万条序列的比对的系统发育的工具。在这里,我们描述了对 FastTree 的改进,这些改进提高了准确性而不牺牲可扩展性。
方法/主要发现:FastTree 1 使用最近邻交换(NNIs)和最小进化准则来改进树,而 FastTree 2 添加了最小进化子树剪枝重接(SPRs)和最大似然 NNIs。FastTree 2 使用启发式方法来限制对更好的树的搜索,并为每个位置估计进化率(“CAT”近似值)。尽管如此,对于模拟和真实的比对,FastTree 2 比标准实现的最大似然 NNIs(默认设置下的 PhyML 3)稍微准确一些。虽然 FastTree 2 不如使用最大似然 SPRs 的方法准确,但大多数不一致的分裂都支持不佳,对于大型比对,FastTree 2 的速度比 PhyML 3 快 100-1000 倍。FastTree 2 在桌面计算机上的 22 小时内和 5.8GB 的内存中推断出了 237882 个独特的 16S 核糖体 RNA 的拓扑结构和基于似然的局部支持值。
结论/意义:FastTree 2 允许对庞大的比对进行最大似然系统发育推断。FastTree 2 可在 http://www.microbesonline.org/fasttree 上免费获得。