The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support.

Author information

Müller Kai F

Affiliation

Nees-Institut für Biodiversität der Pflanzen, Rheinische Friedrich-Wilhelms-Universität Bonn, Meckenheimer Allee 170, Bonn, D-53115, Germany.

Publication information

BMC Evol Biol. 2005 Oct 29;5:58. doi: 10.1186/1471-2148-5-58.

DOI: 10.1186/1471-2148-5-58

PMID: 16255783

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1282575/

Abstract

BACKGROUND

For parsimony analyses, confidence is most commonly estimated with resampling plans (nonparametric bootstrap, jackknife) and with Bremer support (decay indices). The recent literature reveals that commonly employed parameter settings are not those recommended by theoretical considerations and previous empirical studies. The optimal search strategy to apply during resampling has previously been addressed only with the standard search strategies available in PAUP*. The compromise between search extensiveness and improved accuracy of Bremer support has received even less attention. A set of experiments was conducted on different datasets to find an empirical cut-off point beyond which increased search extensiveness no longer significantly changes Bremer support or jackknife and bootstrap proportions.
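
To make the three support measures named above concrete, the following is a minimal Python sketch, not the procedure used in the study. It shows how a bootstrap pseudo-replicate samples character columns with replacement, how a jackknife pseudo-replicate deletes a fraction of columns, and how Bremer support is the length difference between the shortest tree lacking a clade and the overall shortest tree. All function names are hypothetical, the tree searches themselves are left out, and the default deletion fraction of 0.37 (roughly 1/e) is a commonly used value assumed here, not one stated in this abstract.

```python
import random


def bootstrap_columns(n_chars, rng):
    """Nonparametric bootstrap: draw n_chars column indices with replacement."""
    return [rng.randrange(n_chars) for _ in range(n_chars)]


def jackknife_columns(n_chars, rng, delete_fraction=0.37):
    """Jackknife: drop each column independently with probability delete_fraction.

    The default of 0.37 is an assumption for illustration only.
    """
    return [i for i in range(n_chars) if rng.random() >= delete_fraction]


def resample_matrix(matrix, columns):
    """Build a pseudo-replicate matrix (taxon -> sequence) from column indices."""
    return {taxon: "".join(seq[i] for i in columns) for taxon, seq in matrix.items()}


def bremer_support(best_length, best_length_without_clade):
    """Extra steps needed before the clade is lost (decay index).

    Both lengths would come from tree searches that are not shown here.
    """
    return best_length_without_clade - best_length
```

Each pseudo-replicate matrix would then be passed to a tree search, and the proportion of replicates in which a clade appears is its bootstrap or jackknife proportion.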

RESULTS

For the number of replicates needed to estimate support accurately in resampling plans, a diagram is provided that helps to determine whether apparently different support values really differ significantly. Using random addition cycles and parsimony ratchet iterations during bootstrapping does not translate into higher support, nor does any extension of the search beyond the rather moderate effort of TBR (tree bisection and reconnection branch swapping) plus saving one tree per replicate. Instead, for very large matrices, saving more than one shortest tree per iteration and using a strict consensus tree of these yields decreased support compared to saving only one tree. Saving only one tree can therefore be interpreted as carrying a small risk of overestimating support, but this should be more than compensated by other factors that counteract an enhanced type I error. With regard to Bremer support, a rule of thumb can be derived: not much is gained relative to the surplus computational effort when searches are extended beyond 20 ratchet iterations per constrained node, at least not for datasets within the size range found in the current literature.
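
The link between the number of replicates and the precision of a support value can be illustrated with a binomial confidence interval. The sketch below is a rough illustration, not the diagram or method of the paper. It assumes that each replicate is an independent trial in which a clade is either recovered or not, and it uses the Wilson score interval with z = 1.96 for an approximate 95% interval. The function names are hypothetical, and checking for non-overlapping intervals is only a conservative heuristic, not a formal significance test.

```python
import math


def wilson_interval(proportion, replicates, z=1.96):
    """Approximate 95% Wilson score interval for a support proportion.

    Assumes replicates are independent Bernoulli trials (an assumption
    made for this sketch).
    """
    n = replicates
    denom = 1 + z * z / n
    centre = (proportion + z * z / (2 * n)) / denom
    half = z * math.sqrt(proportion * (1 - proportion) / n + z * z / (4 * n * n)) / denom
    return max(0.0, centre - half), min(1.0, centre + half)


def intervals_overlap(a, b):
    """True if two (low, high) intervals share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


if __name__ == "__main__":
    for n in (100, 1000, 10000):
        low, high = wilson_interval(0.70, n)
        print(f"{n:>6} replicates: 70% support lies in [{low:.3f}, {high:.3f}]")
```

Under these assumptions, support values of 70% and 75% cannot be told apart with 100 replicates, whereas their intervals no longer overlap with 10,000 replicates.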

CONCLUSION

In view of these results, bootstrap or jackknife proportions with narrow confidence intervals can be calculated at less expense than often thought, even for very large datasets. In particular, iterated bootstrap methods that aim to reduce the statistical bias inherent in these proportions become more feasible when the individual bootstrap searches require less time.
