Jones Graham, Aydin Zeynep, Oxelman Bengt
Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, SE 405 30 Göteborg, Sweden and Department of Biology, Faculty of Sciences, University of Dicle, 21280 Diyarbakir, Turkey.
Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, SE 405 30 Göteborg, Sweden and Department of Biology, Faculty of Sciences, University of Dicle, 21280 Diyarbakir, Turkey Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, SE 405 30 Göteborg, Sweden and Department of Biology, Faculty of Sciences, University of Dicle, 21280 Diyarbakir, Turkey.
Bioinformatics. 2015 Apr 1;31(7):991-8. doi: 10.1093/bioinformatics/btu770. Epub 2014 Nov 23.
The multispecies coalescent model provides a formal framework for the assignment of individual organisms to species, where the species are modeled as the branches of the sp tree. None of the available approaches so far have simultaneously co-estimated all the relevant parameters in the model, without restricting the parameter space by requiring a guide tree and/or prior assignment of individuals to clusters or species.
We present DISSECT, which explores the full space of possible clusterings of individuals and species tree topologies in a Bayesian framework. It uses an approximation to avoid the need for reversible-jump Markov Chain Monte Carlo, in the form of a prior that is a modification of the birth-death prior for the species tree. It incorporates a spike near zero in the density for node heights. The model has two extra parameters: one controls the degree of approximation and the second controls the prior distribution on the numbers of species. It is implemented as part of BEAST and requires only a few changes from a standard *BEAST analysis. The method is evaluated on simulated data and demonstrated on an empirical dataset. The method is shown to be insensitive to the degree of approximation, but quite sensitive to the second parameter, suggesting that large numbers of sequences are needed to draw firm conclusions.
http://tree.bio.ed.ac.uk/software/beast/, http://www.indriid.com/dissectinbeast.html.
Supplementary data are available at Bioinformatics online.
多物种合并模型为将个体生物归为物种提供了一个正式框架,其中物种被建模为物种树的分支。到目前为止,所有可用方法都没有在不通过要求引导树和/或事先将个体分配到聚类或物种来限制参数空间的情况下,同时共同估计模型中的所有相关参数。
我们提出了DISSECT,它在贝叶斯框架中探索个体和物种树拓扑结构的所有可能聚类空间。它采用一种近似方法来避免使用可逆跳跃马尔可夫链蒙特卡罗方法,其形式是对物种树的生死先验进行修改的先验。它在节点高度的密度中纳入了一个接近零的尖峰。该模型有两个额外参数:一个控制近似程度,另一个控制物种数量的先验分布。它作为BEAST的一部分实现,与标准的BEAST分析相比只需做少量修改。该方法在模拟数据上进行了评估,并在一个实证数据集上进行了演示。结果表明该方法对近似程度不敏感,但对第二个参数相当敏感,这表明需要大量序列才能得出确凿结论。
http://tree.bio.ed.ac.uk/software/beast/,http://www.indriid.com/dissectinbeast.html。
补充数据可在《生物信息学》在线获取。