Fleischauer Markus, Böcker Sebastian
Lehrstuhl für Bioinformatik, Friedrich-Schiller Universität , Jena , Thüringen , Germany.
PeerJ. 2016 Jun 28;4:e2172. doi: 10.7717/peerj.2172. eCollection 2016.
Supertree methods combine a set of phylogenetic trees into a single supertree. Similar to supermatrix methods, these methods provide a way to reconstruct larger parts of the Tree of Life, potentially evading the computational complexity of phylogenetic inference methods such as maximum likelihood. The supertree problem can be formalized in different ways, to cope with contradictory information in the input. Many supertree methods have been developed. Some of them solve NP-hard optimization problems like the well-known Matrix Representation with Parsimony, while others have polynomial worst-case running time but work in a greedy fashion (FlipCut). Both can profit from a set of clades that are already known to be part of the supertree. The Superfine approach shows how the Greedy Strict Consensus Merger (GSCM) can be used as preprocessing to find these clades. We introduce different scoring functions for the GSCM, a randomization, as well as a combination thereof to improve the GSCM to find more clades. This helps, in turn, to improve the resolution of the GSCM supertree. We find this modifications to increase the number of true positive clades by 18% compared to the currently used Overlap scoring.
超树方法将一组系统发育树合并为一棵单一的超树。与超矩阵方法类似,这些方法提供了一种重建生命之树更大片段的方式,有可能规避诸如最大似然法等系统发育推断方法的计算复杂性。超树问题可以用不同方式形式化,以处理输入中的矛盾信息。已经开发了许多超树方法。其中一些方法解决NP难的优化问题,如著名的简约矩阵表示法,而其他方法具有多项式最坏情况运行时间,但以贪婪方式工作(FlipCut)。两者都可以从一组已知是超树一部分的进化枝中受益。Superfine方法展示了如何将贪婪严格一致合并法(GSCM)用作预处理来找到这些进化枝。我们为GSCM引入了不同的评分函数、一种随机化方法以及它们的组合,以改进GSCM来找到更多进化枝。这反过来有助于提高GSCM超树的分辨率。我们发现,与当前使用的重叠评分相比,这些修改使真阳性进化枝的数量增加了18%。