Department of Computer Science, Rice University, 6100 Main Street, Houston, TX 77005, USA.
Syst Biol. 2011 Mar;60(2):138-49. doi: 10.1093/sysbio/syq084. Epub 2011 Jan 19.
Analyses of the increasingly available genomic data continue to reveal the extent of hybridization and its role in the evolutionary diversification of various groups of species. We show, through extensive coalescent-based simulations of multilocus data sets on phylogenetic networks, how divergence times before and after hybridization events can result in incomplete lineage sorting with gene tree incongruence signatures identical to those exhibited by hybridization. Evolutionary analysis of such data under the assumption of a species tree model can miss all hybridization events, whereas analysis under the assumption of a species network model would grossly overestimate hybridization events. These issues necessitate a paradigm shift in evolutionary analysis under these scenarios, from a model that assumes a priori a single source of gene tree incongruence to one that integrates multiple sources in a unifying framework. We propose a framework of coalescence within the branches of a phylogenetic network and show how this framework can be used to detect hybridization despite incomplete lineage sorting. We apply the model to simulated data and show that the signature of hybridization can be revealed as long as the interval between the divergence times of the species involved in hybridization is not too small. We reanalyze a data set of 106 loci from 7 in-group Saccharomyces species for which a species tree with no hybridization has been reported in the literature. Our analysis supports the hypothesis that hybridization occurred during the evolution of this group, explaining a large amount of the incongruence in the data. Our findings show that an integrative approach to gene tree incongruence and its reconciliation is needed. Our framework will help in systematically analyzing genomic data for the occurrence of hybridization and elucidating its evolutionary role.
对越来越多的基因组数据的分析不断揭示出杂交的程度及其在各种物种群体进化多样化中的作用。我们通过对基于系统发育网络的多基因座数据集进行广泛的分歧时间模拟,展示了杂交事件前后的分歧时间如何导致不完全谱系分选,其基因树不一致性特征与杂交所表现出的特征完全相同。在物种树模型下对这些数据进行进化分析可能会错过所有的杂交事件,而在物种网络模型下进行分析则会严重高估杂交事件。这些问题需要在这些情况下的进化分析中进行范式转变,从假设基因树不一致性的单一来源的模型转变为在统一框架中整合多个来源的模型。我们提出了在系统发育网络分支内进行合并的框架,并展示了如何尽管不完全谱系分选,仍可以使用该框架检测杂交。我们将该模型应用于模拟数据,并表明只要参与杂交的物种之间的分歧时间间隔不太小,就可以揭示杂交的特征。我们重新分析了文献中报道没有杂交的 7 个内组 Saccharomyces 物种的 106 个基因座的数据集。我们的分析支持了在该组进化过程中发生杂交的假设,解释了数据中大量的不一致性。我们的研究结果表明,需要采用综合方法来解决基因树不一致性及其协调问题。我们的框架将有助于系统地分析基因组数据以检测杂交并阐明其进化作用。