Hide and seek: placing and finding an optimal tree for thousands of homoplasy-rich sequences.

Affiliation

Biomathematics Research Centre, University of Canterbury, Christchurch, New Zealand.

Publication information

Mol Phylogenet Evol. 2013 Dec;69(3):1186-9. doi: 10.1016/j.ympev.2013.08.001. Epub 2013 Aug 9.

DOI: 10.1016/j.ympev.2013.08.001
PMID: 23939134
Abstract

Finding optimal evolutionary trees from sequence data is typically an intractable problem, and there is usually no way of knowing how close to optimal the best tree from some search truly is. The problem would seem to be particularly acute when we have many taxa and when that data has high levels of homoplasy, in which the individual characters require many changes to fit on the best tree. However, a recent mathematical result has provided a precise tool to generate a short number of high-homoplasy characters for any given tree, so that this tree is provably the optimal tree under the maximum parsimony criterion. This provides, for the first time, a rigorous way to test tree search algorithms on homoplasy-rich data, where we know in advance what the 'best' tree is. In this short note we consider just one search program (TNT) but show that it is able to locate the globally optimal tree correctly for 32,768 taxa, even though the characters in the dataset require, on average, 1148 state-changes each to fit on this tree, and the number of characters is only 57.
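
The abstract's notion of how many state-changes a character needs "to fit on" a tree can be illustrated with a minimal sketch of Fitch's small-parsimony count for a single unordered character. This is not TNT's implementation or the authors' code: the function name `fitch_score`, the nested-tuple tree encoding, and the four-taxon example are assumptions made purely for illustration. The tree is written as rooted for convenience; for unordered characters the count does not depend on where the root is placed.

```python
def fitch_score(tree, states):
    """Minimum number of state changes one character needs on a rooted binary tree.

    tree   : a leaf name (str) or a 2-tuple (left_subtree, right_subtree)
    states : dict mapping each leaf name to its observed character state
    (Both encodings are hypothetical, chosen only for this sketch.)
    """
    changes = 0

    def visit(node):
        nonlocal changes
        if isinstance(node, str):          # leaf: its state set is the observed state
            return {states[node]}
        left, right = node
        a, b = visit(left), visit(right)
        common = a & b
        if common:                         # children can agree: no change needed here
            return common
        changes += 1                       # children disagree: one change on this node
        return a | b

    visit(tree)
    return changes


# Illustrative four-taxon tree ((A,B),(C,D)) with two binary characters.
tree = (("A", "B"), ("C", "D"))
compatible = {"A": 0, "B": 0, "C": 1, "D": 1}    # fits the tree with a single change
homoplastic = {"A": 0, "B": 1, "C": 0, "D": 1}   # needs one extra (homoplastic) change

print(fitch_score(tree, compatible))   # 1
print(fitch_score(tree, homoplastic))  # 2
```

A binary character needs at least one change on any tree, so the second character carries one step of homoplasy on this tree. The maximum parsimony score of a tree is the sum of such counts over all characters, and the optimal tree is the one that minimizes that sum.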

Similar articles

1
Hide and seek: placing and finding an optimal tree for thousands of homoplasy-rich sequences.
Mol Phylogenet Evol. 2013 Dec;69(3):1186-9. doi: 10.1016/j.ympev.2013.08.001. Epub 2013 Aug 9.
2
Towards improving searches for optimal phylogenies.
Syst Biol. 2015 Jan;64(1):56-65. doi: 10.1093/sysbio/syu065. Epub 2014 Aug 26.
3
Hide and vanish: data sets where the most parsimonious tree is known but hard to find, and their implications for tree search methods.
Mol Phylogenet Evol. 2014 Oct;79:118-31. doi: 10.1016/j.ympev.2014.06.008. Epub 2014 Jun 18.
4
On defining a unique phylogenetic tree with homoplastic characters.
Mol Phylogenet Evol. 2018 May;122:95-101. doi: 10.1016/j.ympev.2018.01.020. Epub 2018 Jan 31.
5
Is homoplasy or lineage sorting the source of incongruent mtDNA and nuclear gene trees in the stiff-tailed ducks (Nomonyx-Oxyura)?
Syst Biol. 2005 Feb;54(1):35-55. doi: 10.1080/10635150590910249.
6
New approaches to phylogenetic tree search and their application to large numbers of protein alignments.
Syst Biol. 2007 Oct;56(5):727-40. doi: 10.1080/10635150701611134.
7
The size of the character state space affects the occurrence and detection of homoplasy: modelling the probability of incompatibility for unordered phylogenetic characters.
J Theor Biol. 2015 Feb 7;366:24-32. doi: 10.1016/j.jtbi.2014.10.033. Epub 2014 Nov 6.
8
A Linear Bound on the Number of States in Optimal Convex Characters for Maximum Parsimony Distance.
IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):472-477. doi: 10.1109/TCBB.2016.2543727. Epub 2016 Mar 17.
9
Characterizing the phylogenetic tree-search problem.
Syst Biol. 2012 Mar;61(2):228-39. doi: 10.1093/sysbio/syr097. Epub 2011 Nov 10.
10
Refining phylogenetic trees given additional data: an algorithm based on parsimony.
IEEE/ACM Trans Comput Biol Bioinform. 2009 Jan-Mar;6(1):118-25. doi: 10.1109/TCBB.2008.100.

Cited by

1
Phylogenetic relationships, distribution, and conservation of Roosmalens' dwarf porcupine, Coendou roosmalenorum Voss & da Silva, 2001 (Rodentia, Erethizontidae).
Zookeys. 2023 Sep 11;1179:139-155. doi: 10.3897/zookeys.1179.108766. eCollection 2023.
2
HomoplasyFinder: a simple tool to identify homoplasies on a phylogeny.
Microb Genom. 2019 Jan;5(1). doi: 10.1099/mgen.0.000245. Epub 2019 Jan 21.
3
The marker choice: Unexpected resolving power of an unexplored CO1 region for layered DNA barcoding approaches.
PLoS One. 2017 Apr 13;12(4):e0174842. doi: 10.1371/journal.pone.0174842. eCollection 2017.