Department of Computer Science, Iowa State University, Ames, IA 50011, USA.
BMC Bioinformatics. 2010 Nov 23;11:574. doi: 10.1186/1471-2105-11-574.
The ever-increasing wealth of genomic sequence information provides an unprecedented opportunity for large-scale phylogenetic analysis. However, species phylogeny inference is obfuscated by incongruence among gene trees due to evolutionary events such as gene duplication and loss, incomplete lineage sorting (deep coalescence), and horizontal gene transfer. Gene tree parsimony (GTP) addresses this issue by seeking a species tree that requires the minimum number of evolutionary events to reconcile a given set of incongruent gene trees. Despite its promise, the use of gene tree parsimony has been limited by the fact that existing software is either not fast enough to tackle large data sets or is restricted in the range of evolutionary events it can handle.
We introduce iGTP, a platform-independent software program that implements state-of-the-art algorithms that greatly speed up species tree inference under the duplication, duplication-loss, and deep coalescence reconciliation costs. iGTP significantly extends and improves the functionality and performance of existing gene tree parsimony software and offers advanced features such as building effective initial trees using stepwise leaf addition and the ability to have unrooted gene trees in the input. Moreover, iGTP provides a user-friendly graphical interface with integrated tree visualization software to facilitate analysis of the results.
iGTP enables, for the first time, gene tree parsimony analyses of thousands of genes from hundreds of taxa using the duplication, duplication-loss, and deep coalescence reconciliation costs, all from within a convenient graphical user interface.
基因组序列信息的不断丰富为大规模系统发育分析提供了前所未有的机会。然而,由于基因复制和丢失、不完全谱系分选(深合并)和水平基因转移等进化事件,基因树之间的不一致使得物种系统发育推断变得复杂。基因树简约(GTP)通过寻找需要最少进化事件来协调给定的一组不一致基因树的物种树来解决这个问题。尽管有很大的前景,但基因树简约的使用受到以下事实的限制:现有的软件要么不够快,无法处理大型数据集,要么在它可以处理的进化事件范围上受到限制。
我们引入了 iGTP,这是一个与平台无关的软件程序,它实现了最先进的算法,可以大大加快在复制、复制丢失和深合并协调成本下的物种树推断。iGTP 显著扩展和改进了现有基因树简约软件的功能和性能,并提供了一些高级功能,如使用逐步叶添加构建有效的初始树,以及在输入中具有无根基因树的能力。此外,iGTP 提供了一个用户友好的图形界面,集成了树可视化软件,以方便分析结果。
iGTP 首次能够使用复制、复制丢失和深合并协调成本对数百个分类单元的数千个基因进行基因树简约分析,所有这些都可以在一个方便的图形用户界面内完成。