Institut de recherche en biologie végétale, Université de Montréal and Jardin botanique de Montréal, 4101 Sherbrooke Est, Montréal, Quebec, Canada.
Mol Ecol Resour. 2012 Jan;12(1):179-84. doi: 10.1111/j.1755-0998.2011.03065.x. Epub 2011 Sep 8.
I introduce the software JML, which tests for the presence of hybridization in multispecies sequence data sets by posterior predictive checking, following Joly, McLenachan and Lockhart (2009, American Naturalist 174, e54). Although their method could in principle be applied to any data set, the lack of appropriate software made it difficult to use. JML makes the method easy to apply and also adds improvements, such as the ability to incorporate uncertainty in the species tree topology. The software uses a posterior distribution of species trees, population sizes and branch lengths to simulate replicate sequence data sets under the coalescent with no migration. A test quantity, defined as the minimum pairwise sequence distance between sequences of two species, is then evaluated on the simulated data sets and compared with the value estimated from the original data. Because the test quantity is a good predictor of hybridization events, departure from the bifurcating species tree model can be interpreted as evidence of hybridization. Software performance in terms of computing time is evaluated for several parameters. I also show an example application of the software to detecting hybridization among native diploid North American roses.
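The comparison described above can be illustrated with a minimal Python sketch. It is not JML's code or interface. The function names, the uncorrected p-distance, the toy sequences and the placeholder "simulated" values are all assumptions made for illustration. The choice of a one-tailed comparison (how often a simulated minimum distance is as small as the observed one) is also an assumption, based on the idea that hybridization should produce distances smaller than the no-migration coalescent predicts.

```python
from itertools import product


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences.

    Assumption: an uncorrected p-distance is used here for simplicity.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def min_pairwise_distance(seqs_sp1, seqs_sp2):
    """Test quantity: smallest distance over all between-species sequence pairs."""
    return min(p_distance(a, b) for a, b in product(seqs_sp1, seqs_sp2))


def posterior_predictive_p(observed, simulated):
    """Fraction of simulated test quantities that are <= the observed one.

    Assumption: one-tailed check; a small value means the observed minimum
    distance is unusually small under the simulated (no-hybridization) model.
    """
    return sum(s <= observed for s in simulated) / len(simulated)


# Toy aligned sequences for two hypothetical species.
species_a = ["ACGTACGT", "ACGTACGA"]
species_b = ["ACGAACGA", "TCGTTCGA"]

observed = min_pairwise_distance(species_a, species_b)

# Placeholder values standing in for minimum distances computed on data sets
# simulated under the coalescent with no migration (not real results).
simulated = [0.25, 0.375, 0.125, 0.5, 0.25]

p_value = posterior_predictive_p(observed, simulated)
print(observed, p_value)
```

In a real analysis, each simulated value would come from a data set generated from one draw of the posterior distribution of species trees, population sizes and branch lengths, and many more replicates would be used.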