Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA.
Syst Biol. 2009 Oct;58(5):489-500. doi: 10.1093/sysbio/syp054. Epub 2009 Sep 21.
Several methods have recently been developed to infer multilocus phylogenies by incorporating information from topological incongruence of the individual genes. In this study, we investigate 2 such methods, Bayesian concordance analysis and Bayesian estimation of species trees. Our test data are a collection of genes from cultivated rice (genus Oryza) and the most closely related wild species, generated using a high-throughput sequencing protocol and bioinformatics pipeline. Trees inferred from independent genes display levels of topological incongruence that far exceed that seen in previous data sets analyzed with these species tree methods. We identify differences in phylogenetic results between inference methods that incorporate gene tree incongruence. Finally, we discuss the challenges of scaling these analyses for data sets with thousands of gene trees and extensive levels of missing data.
最近已经开发了几种方法,通过整合来自个体基因拓扑不一致性的信息来推断多基因系统发育。在这项研究中,我们研究了两种这样的方法,贝叶斯一致性分析和物种树的贝叶斯估计。我们的测试数据是一组来自栽培稻(属稻属)和最密切相关的野生种的基因,使用高通量测序方案和生物信息学管道生成。从独立基因推断的树显示出的拓扑不一致程度远远超过以前使用这些物种树方法分析的数据集。我们确定了在整合基因树不一致性的推理方法之间的系统发育结果的差异。最后,我们讨论了为具有数千个基因树和广泛缺失数据的数据集进行这些分析的挑战。