• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

支长估计和分歧日期:贝叶斯和最大似然框架中的误差估计。

Branch length estimation and divergence dating: estimates of error in Bayesian and maximum likelihood frameworks.

机构信息

Department of Biology, Colorado State University, Fort Collins, CO 80523-1878, USA.

出版信息

BMC Evol Biol. 2010 Jan 11;10:5. doi: 10.1186/1471-2148-10-5.

DOI:10.1186/1471-2148-10-5
PMID:20064267
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2827399/
Abstract

BACKGROUND

Estimates of divergence dates between species improve our understanding of processes ranging from nucleotide substitution to speciation. Such estimates are frequently based on molecular genetic differences between species; therefore, they rely on accurate estimates of the number of such differences (i.e. substitutions per site, measured as branch length on phylogenies). We used simulations to determine the effects of dataset size, branch length heterogeneity, branch depth, and analytical framework on branch length estimation across a range of branch lengths. We then reanalyzed an empirical dataset for plethodontid salamanders to determine how inaccurate branch length estimation can affect estimates of divergence dates.

RESULTS

The accuracy of branch length estimation varied with branch length, dataset size (both number of taxa and sites), branch length heterogeneity, branch depth, dataset complexity, and analytical framework. For simple phylogenies analyzed in a Bayesian framework, branches were increasingly underestimated as branch length increased; in a maximum likelihood framework, longer branch lengths were somewhat overestimated. Longer datasets improved estimates in both frameworks; however, when the number of taxa was increased, estimation accuracy for deeper branches was less than for tip branches. Increasing the complexity of the dataset produced more misestimated branches in a Bayesian framework; however, in an ML framework, more branches were estimated more accurately. Using ML branch length estimates to re-estimate plethodontid salamander divergence dates generally resulted in an increase in the estimated age of older nodes and a decrease in the estimated age of younger nodes.

CONCLUSIONS

Branch lengths are misestimated in both statistical frameworks for simulations of simple datasets. However, for complex datasets, length estimates are quite accurate in ML (even for short datasets), whereas few branches are estimated accurately in a Bayesian framework. Our reanalysis of empirical data demonstrates the magnitude of effects of Bayesian branch length misestimation on divergence date estimates. Because the length of branches for empirical datasets can be estimated most reliably in an ML framework when branches are <1 substitution/site and datasets are > or =1 kb, we suggest that divergence date estimates using datasets, branch lengths, and/or analytical techniques that fall outside of these parameters should be interpreted with caution.

摘要

背景

物种分歧日期的估计可以帮助我们理解从核苷酸替换到物种形成等各种过程。这些估计通常基于物种之间的分子遗传差异;因此,它们依赖于对这些差异数量(即每个位点的替换数,通过系统发育树上的分支长度来衡量)的准确估计。我们使用模拟来确定数据集大小、分支长度异质性、分支深度和分析框架对一系列分支长度的分支长度估计的影响。然后,我们重新分析了有袋类蝾螈的一个经验数据集,以确定分支长度估计不准确如何影响分歧日期的估计。

结果

分支长度估计的准确性随分支长度、数据集大小(分类单元和位点的数量)、分支长度异质性、分支深度、数据集复杂性和分析框架而变化。对于在贝叶斯框架中分析的简单系统发育,随着分支长度的增加,分支被逐渐低估;在最大似然框架中,较长的分支长度略有高估。在两个框架中,更长的数据集都改善了估计;但是,当增加分类单元的数量时,较深分支的估计准确性低于尖端分支。增加数据集的复杂性会在贝叶斯框架中产生更多的错误估计分支;然而,在 ML 框架中,更多的分支被更准确地估计。使用 ML 分支长度估计值重新估计有袋类蝾螈的分歧日期通常会导致较老节点的估计年龄增加,而较年轻节点的估计年龄减少。

结论

对于简单数据集的模拟,两个统计框架中的分支长度都被错误估计。然而,对于复杂数据集,ML 中的长度估计非常准确(即使对于短数据集),而在贝叶斯框架中,很少有分支被准确估计。我们对经验数据的重新分析表明,贝叶斯分支长度估计错误对分歧日期估计的影响程度。由于在 ML 框架中,当分支长度<1 替换/位点且数据集长度> =1 kb 时,经验数据集的分支长度可以最可靠地估计,因此我们建议,使用超出这些参数的数据集、分支长度和/或分析技术的分歧日期估计应谨慎解释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/b1ece212beea/1471-2148-10-5-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/4872fed346fe/1471-2148-10-5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/a37840f5781c/1471-2148-10-5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/a89ff7f94c71/1471-2148-10-5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/c5f71900ba10/1471-2148-10-5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/28df4a74f160/1471-2148-10-5-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/ab1d6d9c23c5/1471-2148-10-5-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/ecded4df6f4c/1471-2148-10-5-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/bde8c4ba871e/1471-2148-10-5-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/c538b077413a/1471-2148-10-5-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/09219d54c9ab/1471-2148-10-5-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/31a74c5c2bef/1471-2148-10-5-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/b5a0f64379fd/1471-2148-10-5-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/bc342677971c/1471-2148-10-5-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/b1ece212beea/1471-2148-10-5-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/4872fed346fe/1471-2148-10-5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/a37840f5781c/1471-2148-10-5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/a89ff7f94c71/1471-2148-10-5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/c5f71900ba10/1471-2148-10-5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/28df4a74f160/1471-2148-10-5-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/ab1d6d9c23c5/1471-2148-10-5-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/ecded4df6f4c/1471-2148-10-5-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/bde8c4ba871e/1471-2148-10-5-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/c538b077413a/1471-2148-10-5-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/09219d54c9ab/1471-2148-10-5-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/31a74c5c2bef/1471-2148-10-5-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/b5a0f64379fd/1471-2148-10-5-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/bc342677971c/1471-2148-10-5-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c43b/2827399/b1ece212beea/1471-2148-10-5-14.jpg

相似文献

1
Branch length estimation and divergence dating: estimates of error in Bayesian and maximum likelihood frameworks.支长估计和分歧日期:贝叶斯和最大似然框架中的误差估计。
BMC Evol Biol. 2010 Jan 11;10:5. doi: 10.1186/1471-2148-10-5.
2
Bayesian and maximum likelihood phylogenetic analyses of protein sequence data under relative branch-length differences and model violation.基于相对分支长度差异和模型违背情况下蛋白质序列数据的贝叶斯和最大似然系统发育分析。
BMC Evol Biol. 2005 Jan 28;5:8. doi: 10.1186/1471-2148-5-8.
3
The effect of ambiguous data on phylogenetic estimates obtained by maximum likelihood and Bayesian inference.歧义数据对最大似然法和贝叶斯推断得出的系统发育估计的影响。
Syst Biol. 2009 Feb;58(1):130-45. doi: 10.1093/sysbio/syp017. Epub 2009 May 22.
4
The influence of ignoring secondary structure on divergence time estimates from ribosomal RNA genes.忽略二级结构对核糖体 RNA 基因分歧时间估计的影响。
Mol Phylogenet Evol. 2014 Feb;71:214-23. doi: 10.1016/j.ympev.2013.12.003. Epub 2013 Dec 19.
5
Angiosperm divergence times: the effect of genes, codon positions, and time constraints.被子植物的分化时间:基因、密码子位置和时间限制的影响。
Evolution. 2005 Aug;59(8):1653-70. doi: 10.1554/04-565.1.
6
Empirical and Bayesian approaches to fossil-only divergence times: A study across three reptile clades.仅基于化石的分歧时间的经验方法和贝叶斯方法:对三个爬行动物类群的研究。
PLoS One. 2017 Feb 10;12(2):e0169885. doi: 10.1371/journal.pone.0169885. eCollection 2017.
7
The devil in the details: interactions between the branch-length prior and likelihood model affect node support and branch lengths in the phylogeny of the Psoraceae.细节中的魔鬼:分支长度先验和似然模型之间的相互作用影响了 Psoraceae 系统发育中的节点支持和分支长度。
Syst Biol. 2011 Jul;60(4):541-61. doi: 10.1093/sysbio/syr022. Epub 2011 Mar 24.
8
Evolutionary rates, divergence dates, and the performance of mitochondrial genes in Bayesian phylogenetic analysis.进化速率、分歧时间以及线粒体基因在贝叶斯系统发育分析中的表现
Syst Biol. 2006 Apr;55(2):289-300. doi: 10.1080/10635150500541672.
9
Phylogeny and divergence of the pinnipeds (Carnivora: Mammalia) assessed using a multigene dataset.利用多基因数据集评估鳍足类动物(食肉目:哺乳纲)的系统发育和分化。
BMC Evol Biol. 2007 Nov 9;7:216. doi: 10.1186/1471-2148-7-216.
10
When trees grow too long: investigating the causes of highly inaccurate bayesian branch-length estimates.当树木生长过长时:探究贝叶斯分支长度估计高度不准确的原因。
Syst Biol. 2010 Mar;59(2):145-61. doi: 10.1093/sysbio/syp081. Epub 2009 Dec 10.

引用本文的文献

1
Reliable estimation of tree branch lengths using deep neural networks.利用深度神经网络可靠估计树枝长度。
PLoS Comput Biol. 2024 Aug 5;20(8):e1012337. doi: 10.1371/journal.pcbi.1012337. eCollection 2024 Aug.
2
Evolution of a key enzyme of aerobic metabolism reveals Proterozoic functional subunit duplication events and an ancient origin of animals.有氧代谢关键酶的进化揭示了元古代功能亚基复制事件和动物的古老起源。
Sci Rep. 2021 Aug 3;11(1):15744. doi: 10.1038/s41598-021-95094-4.
3
Advantages of an easy-to-use DNA extraction method for minimal-destructive analysis of collection specimens.

本文引用的文献

1
The excess of transitions among nucleotide substitutions: new methods of estimating transition bias underscore its significance.核苷酸取代中转换的过剩:新的转换偏向估计方法强调了其重要性。
Trends Ecol Evol. 1996 Apr;11(4):158-62. doi: 10.1016/0169-5347(96)10009-4.
2
Dating the age of the SIV lineages that gave rise to HIV-1 and HIV-2.确定导致HIV-1和HIV-2的猴免疫缺陷病毒(SIV)谱系的年代。
PLoS Comput Biol. 2009 May;5(5):e1000377. doi: 10.1371/journal.pcbi.1000377. Epub 2009 May 1.
3
Mutational meltdown in primary endosymbionts: selection limits Muller's ratchet.
一种易于使用的DNA提取方法用于采集样本最小破坏性分析的优势。
PLoS One. 2020 Jul 8;15(7):e0235222. doi: 10.1371/journal.pone.0235222. eCollection 2020.
4
Maximize Resolution or Minimize Error? Using Genotyping-By-Sequencing to Investigate the Recent Diversification of (Cistaceae).最大化分辨率还是最小化误差?利用简化基因组测序技术研究半日花科植物的近期分化
Front Plant Sci. 2019 Nov 11;10:1416. doi: 10.3389/fpls.2019.01416. eCollection 2019.
5
Near-Full-Length Genetic Characterization of a Novel HIV-1 Unique Recombinant with Similarities to A1, CRF01_AE, and CRFO2_AG Viruses in Yaoundé, Cameroon.喀麦隆雅温得一种与A1、CRF01_AE和CRF02_AG病毒相似的新型HIV-1独特重组体的近全长基因特征分析
AIDS Res Hum Retroviruses. 2019 Aug;35(8):762-768. doi: 10.1089/AID.2019.0042. Epub 2019 Apr 16.
6
A Phylogenomic Framework and Divergence History of Cephalochordata Amphioxus.头索动物文昌鱼的系统基因组框架和分化历史
Front Physiol. 2018 Dec 18;9:1833. doi: 10.3389/fphys.2018.01833. eCollection 2018.
7
Impact of the tree prior on estimating clock rates during epidemic outbreaks.树先验对传染病爆发期间估计时钟率的影响。
Proc Natl Acad Sci U S A. 2018 Apr 17;115(16):4200-4205. doi: 10.1073/pnas.1713314115. Epub 2018 Apr 2.
8
Molecular Phylogenies indicate a Paleo-Tibetan Origin of Himalayan Lazy Toads (Scutiger).分子系统发育表明喜马拉雅旱獭(Scutiger)起源于古西藏。
Sci Rep. 2017 Jun 12;7(1):3308. doi: 10.1038/s41598-017-03395-4.
9
Phylogenetic incongruence and the evolutionary origins of cardenolide-resistant forms of Na(+) ,K(+) -ATPase in Danaus butterflies.系统发育不一致与黑脉金斑蝶中耐强心甾形式的Na(+),K(+) -ATP酶的进化起源
Evolution. 2016 Aug;70(8):1913-21. doi: 10.1111/evo.12999. Epub 2016 Jul 27.
10
Molecular clocks and the early evolution of metazoan nervous systems.分子钟与后生动物神经系统的早期演化
Philos Trans R Soc Lond B Biol Sci. 2015 Dec 19;370(1684). doi: 10.1098/rstb.2015.0046.
初级内共生体中的突变崩溃:选择限制了穆勒棘轮效应。
PLoS One. 2009;4(3):e4969. doi: 10.1371/journal.pone.0004969. Epub 2009 Mar 23.
4
Amazonian amphibian diversity is primarily derived from late Miocene Andean lineages.亚马逊地区的两栖动物多样性主要源自中新世晚期的安第斯谱系。
PLoS Biol. 2009 Mar 10;7(3):e56. doi: 10.1371/journal.pbio.1000056.
5
Removal of noisy characters from chloroplast genome-scale data suggests revision of phylogenetic placements of Amborella and Ceratophyllum.从叶绿体基因组规模数据中去除噪声字符表明需要修订无油樟和金鱼藻的系统发育位置。
J Mol Evol. 2009 Mar;68(3):197-204. doi: 10.1007/s00239-009-9206-9. Epub 2009 Feb 27.
6
Branch-length estimation bias misleads molecular dating for a vertebrate mitochondrial phylogeny.分支长度估计偏差误导了脊椎动物线粒体系统发育的分子年代测定。
Gene. 2009 Jul 15;441(1-2):132-40. doi: 10.1016/j.gene.2008.08.017. Epub 2008 Sep 6.
7
Dating of divergences within the Rattus genus phylogeny using whole mitochondrial genomes.利用全线粒体基因组对家鼠属系统发育中的分歧进行年代测定。
Mol Phylogenet Evol. 2008 Nov;49(2):460-6. doi: 10.1016/j.ympev.2008.08.001. Epub 2008 Aug 8.
8
Molecular phylogeny of reed beetles (Col., Chrysomelidae, Donaciinae): the signature of ecological specialization and geographical isolation.芦苇叶甲(鞘翅目,叶甲科,水叶甲亚科)的分子系统发育:生态特化与地理隔离的印记
Mol Phylogenet Evol. 2008 Sep;48(3):936-52. doi: 10.1016/j.ympev.2008.05.035. Epub 2008 Jun 4.
9
BEAST: Bayesian evolutionary analysis by sampling trees.BEAST:通过抽样树进行贝叶斯进化分析。
BMC Evol Biol. 2007 Nov 8;7:214. doi: 10.1186/1471-2148-7-214.
10
Dating phylogenies with hybrid local molecular clocks.用混合局部分子钟对进化关系进行定年。
PLoS One. 2007 Sep 12;2(9):e879. doi: 10.1371/journal.pone.0000879.