Department of Statistics, The Ohio State University, Columbus, OH 43210, USA.
Syst Biol. 2009 Oct;58(5):478-88. doi: 10.1093/sysbio/syp055. Epub 2009 Sep 16.
As DNA sequences have become more readily available, it has become increasingly desirable to infer species phylogenies from multigene data sets. Much recent work has centered around the recognition that substantial incongruence in single-gene phylogenies necessitates the development of statistical procedures to estimate species phylogenies that appropriately model the process of evolution at the level of the individual genes. One process that gives rise to variation in the histories of individual genes is incomplete lineage sorting, which is commonly modeled by the coalescent, and thus much current work is focused on proper estimation of species phylogenies under the coalescent model. A second common source of discord in single-gene phylogenies is hybridization, a process that is ubiquitous in many groups of plants and animals. Although methods to incorporate hybridization into phylogenetic estimation have also been developed, only a handful of methods that address both coalescence and hybridization have been proposed. Here, I propose an extension of an existing model that incorporates both of these processes simultaneously by utilizing gene trees for inference in a likelihood framework. The model allows examination of the evidence for hybridization in the presence of incomplete lineage sorting due to deep coalescence via model selection using standard information criteria (e.g., Akaike information criterion and Bayesian information criterion). The potential of the method is evaluated using simulated data.
随着 DNA 序列变得更容易获取,从多基因数据集推断物种系统发育的需求变得越来越迫切。最近的许多工作都集中在认识到,单基因系统发育中的大量不一致性需要开发统计程序来估计物种系统发育,这些程序适当地模拟了单个基因水平上的进化过程。导致单个基因历史发生变化的一个过程是不完全谱系分选,这通常通过合并来建模,因此目前的许多工作都集中在合并模型下正确估计物种系统发育上。单基因系统发育中另一个常见的不一致来源是杂交,这是许多动植物群体中普遍存在的过程。尽管已经开发出将杂交纳入系统发育估计的方法,但仅提出了少数几种同时解决合并和杂交问题的方法。在这里,我提出了一种现有模型的扩展,该模型通过在似然框架中利用基因树进行推断,同时包含这两个过程。该模型允许通过使用标准信息准则(例如,Akaike 信息准则和贝叶斯信息准则)进行模型选择,在由于深度合并而导致不完全谱系分选的情况下检查杂交的证据。使用模拟数据评估了该方法的潜力。