• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MPBoot:快速系统发育最大简约树推断与自展近似法

MPBoot: fast phylogenetic maximum parsimony tree inference and bootstrap approximation.

作者信息

Hoang Diep Thi, Vinh Le Sy, Flouri Tomáš, Stamatakis Alexandros, von Haeseler Arndt, Minh Bui Quang

机构信息

University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam.

Department of Genetics, Evolution and Environment, University College London, Gower Street, London, WC1E 6BT, UK.

出版信息

BMC Evol Biol. 2018 Feb 2;18(1):11. doi: 10.1186/s12862-018-1131-3.

DOI:10.1186/s12862-018-1131-3
PMID:29390973
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5796505/
Abstract

BACKGROUND

The nonparametric bootstrap is widely used to measure the branch support of phylogenetic trees. However, bootstrapping is computationally expensive and remains a bottleneck in phylogenetic analyses. Recently, an ultrafast bootstrap approximation (UFBoot) approach was proposed for maximum likelihood analyses. However, such an approach is still missing for maximum parsimony.

RESULTS

To close this gap we present MPBoot, an adaptation and extension of UFBoot to compute branch supports under the maximum parsimony principle. MPBoot works for both uniform and non-uniform cost matrices. Our analyses on biological DNA and protein showed that under uniform cost matrices, MPBoot runs on average 4.7 (DNA) to 7 times (protein data) (range: 1.2-20.7) faster than the standard parsimony bootstrap implemented in PAUP*; but 1.6 (DNA) to 4.1 times (protein data) slower than the standard bootstrap with a fast search routine in TNT (fast-TNT). However, for non-uniform cost matrices MPBoot is 5 (DNA) to 13 times (protein data) (range:0.3-63.9) faster than fast-TNT. We note that MPBoot achieves better scores more frequently than PAUP* and fast-TNT. However, this effect is less pronounced if an intensive but slower search in TNT is invoked. Moreover, experiments on large-scale simulated data show that while both PAUP* and TNT bootstrap estimates are too conservative, MPBoot bootstrap estimates appear more unbiased.

CONCLUSIONS

MPBoot provides an efficient alternative to the standard maximum parsimony bootstrap procedure. It shows favorable performance in terms of run time, the capability of finding a maximum parsimony tree, and high bootstrap accuracy on simulated as well as empirical data sets. MPBoot is easy-to-use, open-source and available at http://www.cibiv.at/software/mpboot .

摘要

背景

非参数自展法被广泛用于衡量系统发育树的分支支持度。然而,自展法计算成本高昂,仍然是系统发育分析的瓶颈。最近,一种超快速自展近似(UFBoot)方法被提出用于最大似然分析。然而,最大简约法仍缺少这样一种方法。

结果

为了填补这一空白,我们提出了MPBoot,它是UFBoot的一种改编和扩展,用于在最大简约原则下计算分支支持度。MPBoot适用于均匀和非均匀成本矩阵。我们对生物DNA和蛋白质的分析表明,在均匀成本矩阵下,MPBoot的运行速度平均比PAUP中实现的标准简约自展法快4.7倍(DNA)至7倍(蛋白质数据)(范围:1.2 - 20.7);但比TNT中具有快速搜索程序的标准自展法慢1.6倍(DNA)至4.1倍(蛋白质数据)。然而,对于非均匀成本矩阵,MPBoot比快速TNT快5倍(DNA)至13倍(蛋白质数据)(范围:0.3 - 63.9)。我们注意到,MPBoot比PAUP和快速TNT更频繁地获得更好的分数。然而,如果在TNT中调用密集但较慢的搜索,这种效果就不那么明显了。此外,对大规模模拟数据的实验表明,虽然PAUP*和TNT自展估计都过于保守,但MPBoot自展估计似乎更无偏差。

结论

MPBoot为标准最大简约自展程序提供了一种有效的替代方法。它在运行时间、找到最大简约树的能力以及对模拟和实证数据集的高自展准确性方面表现出良好的性能。MPBoot易于使用、开源,可在http://www.cibiv.at/software/mpboot获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/dd704a9c38e4/12862_2018_1131_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/60d51478216e/12862_2018_1131_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/e73b201726ed/12862_2018_1131_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/8ac1b706f09b/12862_2018_1131_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/dd704a9c38e4/12862_2018_1131_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/60d51478216e/12862_2018_1131_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/e73b201726ed/12862_2018_1131_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/8ac1b706f09b/12862_2018_1131_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2dda/5796505/dd704a9c38e4/12862_2018_1131_Fig4_HTML.jpg

相似文献

1
MPBoot: fast phylogenetic maximum parsimony tree inference and bootstrap approximation.MPBoot:快速系统发育最大简约树推断与自展近似法
BMC Evol Biol. 2018 Feb 2;18(1):11. doi: 10.1186/s12862-018-1131-3.
2
Parsimony analysis of phylogenomic datasets (II): evaluation of PAUP*, MEGA and MPBoot.系统发育基因组数据集简约分析(二):PAUP*、MEGA 和 MPBoot 的评估。
Cladistics. 2022 Feb;38(1):126-146. doi: 10.1111/cla.12476. Epub 2021 Jul 14.
3
Ultrafast approximation for phylogenetic bootstrap.快速近似的系统发育自举法。
Mol Biol Evol. 2013 May;30(5):1188-95. doi: 10.1093/molbev/mst024. Epub 2013 Feb 15.
4
UFBoot2: Improving the Ultrafast Bootstrap Approximation.UFBoot2:改进超快bootstrap 逼近算法。
Mol Biol Evol. 2018 Feb 1;35(2):518-522. doi: 10.1093/molbev/msx281.
5
The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support.不同搜索策略在估计简约重抽样、自展法和布雷默支持率方面的效率。
BMC Evol Biol. 2005 Oct 29;5:58. doi: 10.1186/1471-2148-5-58.
6
Bayesian and maximum likelihood phylogenetic analyses of protein sequence data under relative branch-length differences and model violation.基于相对分支长度差异和模型违背情况下蛋白质序列数据的贝叶斯和最大似然系统发育分析。
BMC Evol Biol. 2005 Jan 28;5:8. doi: 10.1186/1471-2148-5-8.
7
PTree: pattern-based, stochastic search for maximum parsimony phylogenies.PTree:基于模式的最大简约系统发生学的随机搜索。
PeerJ. 2013 Jun 25;1:e89. doi: 10.7717/peerj.89. Print 2013.
8
Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability.系统发育可靠性的贝叶斯和最大似然自展法测度的比较
Mol Biol Evol. 2003 Feb;20(2):248-54. doi: 10.1093/molbev/msg042.
9
Molecular systematics of terraranas (Anura: Brachycephaloidea) with an assessment of the effects of alignment and optimality criteria.陆栖蛙类(无尾目:短头蟾超科)的分子系统学及比对和最优性标准的影响评估
Zootaxa. 2014 Jun 26;3825:1-132. doi: 10.11646/zootaxa.3825.1.1.
10
Early evolutionary relationships among known life forms inferred from elongation factor EF-2/EF-G sequences: phylogenetic coherence and structure of the archaeal domain.从延伸因子EF-2/EF-G序列推断已知生命形式之间的早期进化关系:古菌域的系统发育一致性和结构
J Mol Evol. 1992 May;34(5):396-405. doi: 10.1007/BF00162996.

引用本文的文献

1
Telomere attrition becomes an instrument for clonal selection in aging hematopoiesis and leukemogenesis.端粒损耗成为衰老造血和白血病发生过程中克隆选择的一种机制。
Nat Genet. 2025 Aug 28. doi: 10.1038/s41588-025-02296-x.
2
Superinfection Exclusion in Neotropical Honey Bees May Block DWV-B, an Emerging Infectious Disease Variant of Deformed Wing Virus.新热带地区蜜蜂的超级感染排除可能会阻止DWV-B,一种新出现的变形翼病毒感染性疾病变体。
Evol Appl. 2025 Aug 12;18(8):e70143. doi: 10.1111/eva.70143. eCollection 2025 Aug.
3
Detecting and quantifying clonal selection in somatic stem cells.

本文引用的文献

1
METHODS FOR FASTER PARSIMONY ANALYSIS.更快简约分析的方法。
Cladistics. 1996 Sep;12(3):199-220. doi: 10.1111/j.1096-0031.1996.tb00009.x.
2
PARSIMONY JACKKNIFING OUTPERFORMS NEIGHBOR-JOINING.简约自展法优于邻接法。
Cladistics. 1996 Jun;12(2):99-124. doi: 10.1111/j.1096-0031.1996.tb00196.x.
3
The Parsimony Ratchet, a New Method for Rapid Parsimony Analysis.简约棘轮法:一种快速简约分析的新方法。
检测和量化体细胞干细胞中的克隆选择
Nat Genet. 2025 Jul 3. doi: 10.1038/s41588-025-02217-y.
4
The long-term effects of chemotherapy on normal blood cells.化疗对正常血细胞的长期影响。
Nat Genet. 2025 Jul 1. doi: 10.1038/s41588-025-02234-x.
5
Genome-wide diversity and MHC characterisation in a critically endangered freshwater turtle susceptible to disease.一种易患疾病的极度濒危淡水龟的全基因组多样性和主要组织相容性复合体特征分析
Immunogenetics. 2025 May 6;77(1):21. doi: 10.1007/s00251-025-01378-8.
6
Timing and trajectory of BCR::ABL1-driven chronic myeloid leukaemia.BCR::ABL1驱动的慢性髓性白血病的发病时间和发展轨迹。
Nature. 2025 Apr;640(8060):982-990. doi: 10.1038/s41586-025-08817-2. Epub 2025 Apr 9.
7
The somatic mutation landscape of normal gastric epithelium.正常胃上皮的体细胞突变图谱。
Nature. 2025 Apr;640(8058):418-426. doi: 10.1038/s41586-025-08708-6. Epub 2025 Mar 19.
8
Multi-Province Outbreak of Acute Gastroenteritis Linked to Potential Novel Lineage of GII.17 Norovirus in Argentina in 2024.2024年阿根廷多省爆发与GII.17诺如病毒潜在新谱系相关的急性肠胃炎
Viruses. 2025 Feb 5;17(2):223. doi: 10.3390/v17020223.
9
Prolonged persistence of mutagenic DNA lesions in somatic cells.体细胞中诱变DNA损伤的长期持续存在。
Nature. 2025 Feb;638(8051):729-738. doi: 10.1038/s41586-024-08423-8. Epub 2025 Jan 15.
10
Exploitation of Key Regulatory Modules and Genes for High-Salt Adaptation in Schizothoracine by Weighted Gene Co-Expression Network Analysis.基于加权基因共表达网络分析挖掘裂腹鱼高盐适应关键调控模块及基因
Animals (Basel). 2024 Dec 29;15(1):56. doi: 10.3390/ani15010056.
Cladistics. 1999 Dec;15(4):407-414. doi: 10.1111/j.1096-0031.1999.tb00277.x.
4
UFBoot2: Improving the Ultrafast Bootstrap Approximation.UFBoot2:改进超快bootstrap 逼近算法。
Mol Biol Evol. 2018 Feb 1;35(2):518-522. doi: 10.1093/molbev/msx281.
5
CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP.系统发育树的置信区间:一种使用自展法的方法。
Evolution. 1985 Jul;39(4):783-791. doi: 10.1111/j.1558-5646.1985.tb00420.x.
6
IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.IQ-TREE:一种用于估计最大似然系统发育树的快速且有效的随机算法。
Mol Biol Evol. 2015 Jan;32(1):268-74. doi: 10.1093/molbev/msu300. Epub 2014 Nov 3.
7
The phylogenetic likelihood library.系统发育似然库。
Syst Biol. 2015 Mar;64(2):356-62. doi: 10.1093/sysbio/syu084. Epub 2014 Oct 30.
8
Ultrafast approximation for phylogenetic bootstrap.快速近似的系统发育自举法。
Mol Biol Evol. 2013 May;30(5):1188-95. doi: 10.1093/molbev/mst024. Epub 2013 Feb 15.
9
Pruning rogue taxa improves phylogenetic accuracy: an efficient algorithm and webservice.修剪有害分类单元可提高系统发育准确性:一种高效算法和网络服务。
Syst Biol. 2013 Jan 1;62(1):162-6. doi: 10.1093/sysbio/sys078. Epub 2012 Sep 6.
10
How many bootstrap replicates are necessary?需要多少个自展重复样本?
J Comput Biol. 2010 Mar;17(3):337-54. doi: 10.1089/cmb.2009.0179.