Suppr超能文献

从基因树推断物种树的一类距离矩阵方法的改进。

Improvements to a class of distance matrix methods for inferring species trees from gene trees.

作者信息

Helmkamp Laura J, Jewett Ethan M, Rosenberg Noah A

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.

出版信息

J Comput Biol. 2012 Jun;19(6):632-49. doi: 10.1089/cmb.2012.0042.

Abstract

Among the methods currently available for inferring species trees from gene trees, the GLASS method of Mossel and Roch (2010), the Shallowest Divergence (SD) method of Maddison and Knowles (2006), the STEAC method of Liu et al. (2009), and a related method that we call Minimum Average Coalescence (MAC) are computationally efficient and provide branch length estimates. Further, GLASS and STEAC have been shown to be consistent estimators of tree topology under a multispecies coalescent model. However, divergence time estimates obtained with these methods are all systematically biased under the model because the pairwise interspecific gene divergence times on which they rely must be more ancient than the species divergence time. Jewett and Rosenberg (2012) derived an expression for the bias of GLASS and used it to propose an improved method that they termed iGLASS. Here, we derive the biases of SD, STEAC, and MAC, and we propose improved analogues of these methods that we call iSD, iSTEAC, and iMAC. We conduct simulations to compare the performance of these methods with their original counterparts and with GLASS and iGLASS, finding that each of them decreases the bias and mean squared error of pairwise divergence time estimates. The new methods can therefore contribute to improvements in the estimation of species trees from information on gene trees.

摘要

在目前可用于从基因树推断物种树的方法中,莫塞尔和罗奇(2010年)提出的GLASS方法、麦迪逊和诺尔斯(2006年)提出的最浅分歧(SD)方法、刘等人(2009年)提出的STEAC方法,以及我们称为最小平均合并(MAC)的一种相关方法,在计算上效率较高,并能提供分支长度估计。此外,在多物种合并模型下,GLASS和STEAC已被证明是树拓扑结构的一致估计量。然而,在该模型下,用这些方法获得的分歧时间估计都存在系统偏差,因为它们所依赖的种间基因对分歧时间必定比物种分歧时间更古老。朱伊特和罗森伯格(2012年)推导了GLASS偏差的表达式,并据此提出了一种改进方法,他们称之为iGLASS。在此,我们推导了SD、STEAC和MAC的偏差,并提出了这些方法的改进类似方法,我们分别称之为iSD、iSTEAC和iMAC。我们进行了模拟,以比较这些方法与其原始对应方法以及GLASS和iGLASS的性能,发现它们每一种都降低了成对分歧时间估计的偏差和均方误差。因此,这些新方法有助于从基因树信息改进物种树的估计。

相似文献

5
The gene tree delusion.基因树错觉
Mol Phylogenet Evol. 2016 Jan;94(Pt A):1-33. doi: 10.1016/j.ympev.2015.07.018. Epub 2015 Jul 31.

引用本文的文献

3
Multilocus inference of species trees and DNA barcoding.物种树的多位点推断与DNA条形码
Philos Trans R Soc Lond B Biol Sci. 2016 Sep 5;371(1702). doi: 10.1098/rstb.2015.0335.
5
The Pace of Hybrid Incompatibility Evolution in House Mice.家鼠中杂种不相容性进化的速度
Genetics. 2015 Sep;201(1):229-42. doi: 10.1534/genetics.115.179499. Epub 2015 Jul 20.
8
kdetrees: Non-parametric estimation of phylogenetic tree distributions.KD树:系统发育树分布的非参数估计
Bioinformatics. 2014 Aug 15;30(16):2280-7. doi: 10.1093/bioinformatics/btu258. Epub 2014 Apr 24.
10
Theory and applications of a deterministic approximation to the coalescent model.溯祖模型确定性近似的理论与应用
Theor Popul Biol. 2014 May;93:14-29. doi: 10.1016/j.tpb.2013.12.007. Epub 2014 Jan 7.

本文引用的文献

2
Fast and accurate methods for phylogenomic analyses.用于系统基因组分析的快速而准确的方法。
BMC Bioinformatics. 2011 Oct 5;12 Suppl 9(Suppl 9):S4. doi: 10.1186/1471-2105-12-S9-S4.
5
Species tree inference by minimizing deep coalescences.通过最小化深度合并来推断物种树。
PLoS Comput Biol. 2009 Sep;5(9):e1000501. doi: 10.1371/journal.pcbi.1000501. Epub 2009 Sep 11.
7
Maximum tree: a consistent estimator of the species tree.最大树:物种树的一种一致估计量。
J Math Biol. 2010 Jan;60(1):95-106. doi: 10.1007/s00285-009-0260-0. Epub 2009 Mar 13.
10
Rooted triple consensus and anomalous gene trees.有根三元共识和异常基因树。
BMC Evol Biol. 2008 Apr 25;8:118. doi: 10.1186/1471-2148-8-118.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验