Hickerson Michael J, Stahl Eli, Takebayashi Naoki
Biology Department, Queens College, CUNY, Flushing, NY 11367-1597, USA.
BMC Bioinformatics. 2007 Jul 26;8:268. doi: 10.1186/1471-2105-8-268.
Although testing for simultaneous divergence (vicariance) across different population-pairs that span the same barrier to gene flow is of central importance to evolutionary biology, researchers often equate the gene tree and population/species tree thereby ignoring stochastic coalescent variance in their conclusions of temporal incongruence. In contrast to other available phylogeographic software packages, msBayes is the only one that analyses data from multiple species/population pairs under a hierarchical model.
msBayes employs approximate Bayesian computation (ABC) under a hierarchical coalescent model to test for simultaneous divergence (TSD) in multiple co-distributed population-pairs. Simultaneous isolation is tested by estimating three hyper-parameters that characterize the degree of variability in divergence times across co-distributed population pairs while allowing for variation in various within population-pair demographic parameters (sub-parameters) that can affect the coalescent. msBayes is a software package consisting of several C and R programs that are run with a Perl "front-end".
The method reasonably distinguishes simultaneous isolation from temporal incongruence in the divergence of co-distributed population pairs, even with sparse sampling of individuals. Because the estimate step is decoupled from the simulation step, one can rapidly evaluate different ABC acceptance/rejection conditions and the choice of summary statistics. Given the complex and idiosyncratic nature of testing multi-species biogeographic hypotheses, we envision msBayes as a powerful and flexible tool for tackling a wide array of difficult research questions that use population genetic data from multiple co-distributed species. The msBayes pipeline is available for download at http://msbayes.sourceforge.net/ under an open source license (GNU Public License). The msBayes pipeline is comprised of several C and R programs that are run with a Perl "front-end" and runs on Linux, Mac OS-X, and most POSIX systems. Although the current implementation is for a single locus per species-pair, future implementations will allow analysis of multi-loci data per species pair.
尽管对跨越相同基因流障碍的不同种群对间的同时分化(地理隔离)进行检测对进化生物学至关重要,但研究人员常常将基因树与种群/物种树等同起来,从而在其关于时间不一致性的结论中忽略了随机合并方差。与其他可用的系统发育地理软件包不同,msBayes是唯一一个在分层模型下分析来自多个物种/种群对数据的软件包。
msBayes在分层合并模型下采用近似贝叶斯计算(ABC)来检测多个同域分布种群对中的同时分化(TSD)。通过估计三个超参数来检验同时隔离,这三个超参数表征了同域分布种群对间分化时间的变异性程度,同时考虑了各个种群对内可能影响合并的各种种群统计参数(子参数)的变化。msBayes是一个由几个C和R程序组成的软件包,通过Perl“前端”运行。
该方法能够合理地区分同域分布种群对分化中的同时隔离和时间不一致性,即使在个体抽样稀疏的情况下也是如此。由于估计步骤与模拟步骤解耦,人们可以快速评估不同的ABC接受/拒绝条件以及汇总统计量的选择。鉴于检验多物种生物地理假设的复杂性和独特性,我们认为msBayes是一个强大且灵活的工具,可用于解决大量使用来自多个同域分布物种的种群遗传数据的难题。msBayes管道可在http://msbayes.sourceforge.net/ 以开源许可(GNU公共许可)下载。msBayes管道由几个C和R程序组成,通过Perl“前端”运行,可在Linux、Mac OS-X和大多数POSIX系统上运行。虽然当前实现是针对每个物种对的单个基因座,但未来实现将允许分析每个物种对的多位点数据。