The effects of sequence length, tree topology, and number of taxa on the performance of phylogenetic methods.

Authors

Charleston M A, Hendy M D, Penny D

Affiliation

Department of Mathematics, Massey University, Palmerston North, New Zealand.

Publication

J Comput Biol. 1994 Summer;1(2):133-51. doi: 10.1089/cmb.1994.1.133.

DOI: 10.1089/cmb.1994.1.133
PMID: 8790460
Abstract

Simulations were used to study the performance of several character-based and distance-based phylogenetic methods in obtaining the correct tree from pseudo-randomly generated input data. The study included all topologies of unrooted binary trees with 4 to 10 pendant vertices (taxa), inclusive. The character sequences ranged in length from 10 to 10^5 characters, increasing exponentially. The methods studied include Closest Tree, Compatibility, Li's method, Maximum Parsimony, Neighbor-joining, Neighborliness, and UPGMA. We also provide a modification to Li's method (SimpLi) which is consistent with additive data. We give estimates of the sequence lengths required for a given confidence in the output of these methods under the assumptions of molecular evolution used in this study. A notation for characterizing all tree topologies is described. We show that when the number of taxa, the maximum path length, and the minimum edge length are held constant, there is a small but significant dependence of the performance of the methods on the tree topology. We show that those methods that are consistent with the model used perform similarly, whereas the inconsistent methods, UPGMA and Li's method, perform very poorly.
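
The contrast between consistent and inconsistent methods can be illustrated with a minimal sketch. It is not taken from the paper: the four-taxon tree, its edge lengths, and the function names below are hypothetical, chosen only to show how UPGMA's clustering rule can fail on exactly additive distances when edge lengths are unequal, while a check based on the four-point condition (the property that additive-consistent distance methods rely on) still recovers the true split.

```python
from itertools import combinations

# Hypothetical true tree ((A,B),(C,D)) with unequal pendant edges
# (illustrative values, not data from the study).
pendant = {"A": 1.0, "B": 5.0, "C": 1.0, "D": 5.0}
internal = 1.0                      # length of the single internal edge
side = {"A": 0, "B": 0, "C": 1, "D": 1}


def tree_distance(x, y):
    """Path length between two taxa on the tree (exactly additive)."""
    d = pendant[x] + pendant[y]
    if side[x] != side[y]:          # the path crosses the internal edge
        d += internal
    return d


dist = {frozenset(p): tree_distance(*p) for p in combinations("ABCD", 2)}


def upgma_first_join(dist):
    """UPGMA starts by clustering the pair with the smallest distance."""
    return min(dist, key=dist.get)


def four_point_split(dist, taxa="ABCD"):
    """Pick the pairing with the smallest distance sum (four-point condition)."""
    a, b, c, d = taxa
    pairings = [((a, b), (c, d)), ((a, c), (b, d)), ((a, d), (b, c))]
    return min(pairings,
               key=lambda p: dist[frozenset(p[0])] + dist[frozenset(p[1])])


print(sorted(upgma_first_join(dist)))   # pair UPGMA would join first
print(four_point_split(dist))           # split implied by the four-point condition
```

Here the two short-edged taxa A and C are the closest pair (distance 3), so UPGMA joins them first even though they lie on opposite sides of the internal edge, whereas the smallest pairwise sum (12 for AB|CD, against 14 for the alternatives) identifies the true split. Whether this is the dominant failure mode in the simulations reported here is an assumption; the abstract states only that UPGMA and Li's method are inconsistent with the model used.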

Similar articles

1
The effects of sequence length, tree topology, and number of taxa on the performance of phylogenetic methods.
J Comput Biol. 1994 Summer;1(2):133-51. doi: 10.1089/cmb.1994.1.133.
2
Relative efficiencies of the maximum-likelihood, neighbor-joining, and maximum-parsimony methods when substitution rate varies with site.
Mol Biol Evol. 1994 Mar;11(2):261-77. doi: 10.1093/oxfordjournals.molbev.a040108.
3
Accuracy of phylogenetic trees estimated from DNA sequence data.
Mol Biol Evol. 1987 Mar;4(2):159-66. doi: 10.1093/oxfordjournals.molbev.a040432.
4
Evaluating the relationship between evolutionary divergence and phylogenetic accuracy in AFLP data sets.
Mol Biol Evol. 2010 May;27(5):988-1000. doi: 10.1093/molbev/msp315. Epub 2009 Dec 21.
5
The neighbor-joining method: a new method for reconstructing phylogenetic trees.
Mol Biol Evol. 1987 Jul;4(4):406-25. doi: 10.1093/oxfordjournals.molbev.a040454.
6
Multiple sequence alignment accuracy and phylogenetic inference.
Syst Biol. 2006 Apr;55(2):314-28. doi: 10.1080/10635150500541730.
7
Application of the character compatibility approach to generalized molecular sequence data: branching order of the proteobacterial subdivisions.
J Mol Evol. 2007 Jan;64(1):90-100. doi: 10.1007/s00239-006-0082-2. Epub 2006 Dec 9.
8
Towards building the tree of life: a simulation study for all angiosperm genera.
Syst Biol. 2005 Apr;54(2):183-96. doi: 10.1080/10635150590923254.
9
Phylogenetic mixtures on a single tree can mimic a tree of another topology.
Syst Biol. 2007 Oct;56(5):767-75. doi: 10.1080/10635150701627304.
10
Relative efficiencies of the maximum-parsimony and distance-matrix methods of phylogeny construction for restriction data.
Mol Biol Evol. 1991 May;8(3):356-65. doi: 10.1093/oxfordjournals.molbev.a040648.

Cited by

1
Regulatory context drives conservation of glycine riboswitch aptamers.
PLoS Comput Biol. 2019 Dec 20;15(12):e1007564. doi: 10.1371/journal.pcbi.1007564. eCollection 2019 Dec.
2
Molecular evolution methods to study HIV-1 epidemics.
Future Virol. 2018 May;13(6):399-404. doi: 10.2217/fvl-2017-0159. Epub 2018 May 21.
3
SpartaABC: a web server to simulate sequences with indel parameters inferred using an approximate Bayesian computation algorithm.
Nucleic Acids Res. 2017 Jul 3;45(W1):W453-W457. doi: 10.1093/nar/gkx322.
4
Multilocus genotype analysis of Escherichia coli O157 isolates from Australia and the United States provides evidence of geographic divergence.
Appl Environ Microbiol. 2013 Aug;79(16):5050-8. doi: 10.1128/AEM.01525-13. Epub 2013 Jun 14.
5
Visualizing differences in phylogenetic information content of alignments and distinction of three classes of long-branch effects.
BMC Evol Biol. 2007 Aug 28;7:147. doi: 10.1186/1471-2148-7-147.
6
Microbial composition of near-boiling silica-depositing thermal springs throughout Yellowstone National Park.
Appl Environ Microbiol. 2002 Oct;68(10):5123-35. doi: 10.1128/AEM.68.10.5123-5135.2002.
7
Complete mitochondrial DNA genome sequences show that modern birds are not descended from transitional shorebirds.
Proc Biol Sci. 2002 Apr 22;269(1493):839-46. doi: 10.1098/rspb.2002.1961.