Invariant based quartet puzzling.

Author information

Rusinko Joseph P, Hipp Brian

Affiliation

Department of Mathematics, Winthrop University, 142 Bancroft Hall, Rock Hill, SC 29733, USA.

Publication information

Algorithms Mol Biol. 2012 Dec 6;7(1):35. doi: 10.1186/1748-7188-7-35.

DOI:10.1186/1748-7188-7-35

PMID:23217018

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3549829/

Abstract

BACKGROUND

First proposed by Cavender and Felsenstein, and Lake, invariant based algorithms for phylogenetic reconstruction were widely dismissed by practicing biologists because invariants were perceived to have limited accuracy in constructing trees based on DNA sequences of reasonable length. Recent developments by algebraic geometers have led to the construction of lists of invariants which have been demonstrated to be more accurate on small sequences, but were limited in that they could only be used for trees with small numbers of taxa. We have developed and tested an invariant based quartet puzzling algorithm which is accurate and efficient for biologically reasonable data sets.

RESULTS

We found that our algorithm outperforms Maximum Likelihood based quartet puzzling on data sets simulated with low to medium evolutionary rates. For faster rates of evolution, invariant based quartet puzzling is reasonable but less effective than maximum likelihood based puzzling.

CONCLUSIONS

This is a proof of concept algorithm which is not intended to replace existing reconstruction algorithms. Rather, the conclusion is that when seeking solutions to a new wave of phylogenetic problems (super tree algorithms, gene vs. species tree, mixture models), invariant based methods should be considered. This article demonstrates that invariants are a practical, reasonable and flexible source for reconstruction techniques.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8f3/3549829/5a9c553135a2/1748-7188-7-35-1.jpg

Similar articles

Performance of a new invariants method on homogeneous and nonhomogeneous quartet trees.

Mol Biol Evol. 2007 Jan;24(1):288-93. doi: 10.1093/molbev/msl153. Epub 2006 Oct 19.

Quartet-based phylogenetic inference: improvements and limits.

Mol Biol Evol. 2001 Jun;18(6):1103-16. doi: 10.1093/oxfordjournals.molbev.a003881.

Quartet MaxCut: a fast algorithm for amalgamating quartet trees.

Mol Phylogenet Evol. 2012 Jan;62(1):1-8. doi: 10.1016/j.ympev.2011.06.021. Epub 2011 Jul 6.

Short quartet puzzling: a new quartet-based phylogeny reconstruction algorithm.

J Comput Biol. 2008 Jan-Feb;15(1):91-103. doi: 10.1089/cmb.2007.0103.

Quartets MaxCut: a divide and conquer quartets algorithm.

IEEE/ACM Trans Comput Biol Bioinform. 2010 Oct-Dec;7(4):704-18. doi: 10.1109/TCBB.2008.133.

Developing a statistically powerful measure for quartet tree inference using phylogenetic identities and Markov invariants.

J Math Biol. 2017 Dec;75(6-7):1619-1654. doi: 10.1007/s00285-017-1129-2. Epub 2017 Apr 22.

Designing Weights for Quartet-Based Methods When Data are Heterogeneous Across Lineages.

Bull Math Biol. 2023 Jun 13;85(7):68. doi: 10.1007/s11538-023-01167-y.

'Multi-SpaM': a maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees.

NAR Genom Bioinform. 2019 Oct 30;2(1):lqz013. doi: 10.1093/nargab/lqz013. eCollection 2020 Mar.

Invariant Versus Classical Quartet Inference When Evolution is Heterogeneous Across Sites and Lineages.

Syst Biol. 2016 Mar;65(2):280-91. doi: 10.1093/sysbio/syv086. Epub 2015 Nov 11.

Cited by

Spectral neighbor joining for reconstruction of latent tree Models.

SIAM J Math Data Sci. 2021;3(1):113-141. doi: 10.1137/20m1365715. Epub 2021 Feb 1.

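Developing a statistically powerful measure for quartet tree inference using phylogenetic identities and Markov invariants.

J Math Biol. 2017 Dec;75(6-7):1619-1654. doi: 10.1007/s00285-017-1129-2. Epub 2017 Apr 22.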

References

Quartet MaxCut: a fast algorithm for amalgamating quartet trees.

Mol Phylogenet Evol. 2012 Jan;62(1):1-8. doi: 10.1016/j.ympev.2011.06.021. Epub 2011 Jul 6.

MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Mol Biol Evol. 2011 Oct;28(10):2731-9. doi: 10.1093/molbev/msr121. Epub 2011 May 4.

Identifiability of two-tree mixtures for group-based models.

IEEE/ACM Trans Comput Biol Bioinform. 2011 May-Jun;8(3):710-22. doi: 10.1109/TCBB.2010.79.

Markov invariants and the isotropy subgroup of a quartet tree.

J Theor Biol. 2009 May 21;258(2):302-10. doi: 10.1016/j.jtbi.2009.01.021. Epub 2009 Feb 1.

Markov invariants, plethysms, and phylogenetics.

J Theor Biol. 2008 Aug 7;253(3):601-15. doi: 10.1016/j.jtbi.2008.04.001. Epub 2008 Apr 7.

Short quartet puzzling: a new quartet-based phylogeny reconstruction algorithm.

J Comput Biol. 2008 Jan-Feb;15(1):91-103. doi: 10.1089/cmb.2007.0103.

Performance of a new invariants method on homogeneous and nonhomogeneous quartet trees.

Mol Biol Evol. 2007 Jan;24(1):288-93. doi: 10.1093/molbev/msl153. Epub 2006 Oct 19.

Toric ideals of phylogenetic invariants.

J Comput Biol. 2005 Mar;12(2):204-28. doi: 10.1089/cmb.2005.12.204.

Phylogenetic invariants for the general Markov model of sequence mutation.

Math Biosci. 2003 Dec;186(2):113-44. doi: 10.1016/j.mbs.2003.08.004.

Suboptimal measles-mumps-rubella vaccination coverage facilitates an imported measles outbreak in Ireland.

Clin Infect Dis. 2002 Jul 1;35(1):84-6. doi: 10.1086/340708. Epub 2002 Jun 6.
