• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于改进基于似然性的系统发育估计的每个速率类别的独立碱基组成;5rf模型

An independent base composition of each rate class for improved likelihood-based phylogeny estimation; the 5rf model.

作者信息

Waddell Peter J, Bouckaert Remco

机构信息

School of Natural Sciences, Massey University, Palmerston North, New Zealand.

Centre for Computational Evolution, University of Auckland, Auckland, New Zealand.

出版信息

bioRxiv. 2024 Sep 7:2024.09.03.610719. doi: 10.1101/2024.09.03.610719.

DOI:10.1101/2024.09.03.610719
PMID:39282393
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11398347/
Abstract

The combination of a ime eversible Markov process with a "hidden" mixture of amma distributed relative site rates plus nvariant sites have become the most favoured options for likelihood and other probabilistic models of nucleotide evolution (e.g., tr4gi which approximates a gamma with four rate classes). However, these models assume a homogeneous and stationary distribution of nucleotide (character or base) frequencies. Here, we explore the potential benefits and pitfalls of allowing each rate category (rate class) of a 4gi mixture model to have its own base frequencies. This is achieved by starting each of the five rate classes, at the tree's oot, with its own free choice of nucleotide requencies to create a 4gi5rf model or a 5rf model in shorthand. We assess the practical identifiability of this approach with a BEAST 2 implementation, aiming to determine if it can accurately estimate credibility intervals and expected values for a wide range of plausible parameter values. Practical identifiability, as distinguished from mathematical identifiability, gauges the model's ability to identify parameters in real-world scenarios, as opposed to theoretically with infinite data. One of the most common types of phylogenetic data is mitochondrial DNA (mtDNA) protein coding sequence. It is often assumed current models analyse robustly such data and that higher likelihood/posterior probability models do better. However, this abstract shows that vertebrate mtDNA remains a very difficult type of data to fully model, and that dramatically higher likelihoods do not mean a model is measurably more accurate with respect to recovering key parameters of biological interest (e.g., monophyletic groups, their support and their ages). The 4gi5rf model considerably improves marginal likelihoods and seems to reverse some apparent errors exacerbated by the 4gi model, while introducing others. Problems appear to be linked to non-stationary DNA repair processes that alter the mutation/substitution spectra across lineages and time. We also show such problems are not unique to mtDNA and are encountered in analysing nuclear sequences. Non-stationarity of DNA repair processes mutation/substitution spectra thus pose an active challenge to obtaining reliable inferences of relationships and divergence times near the root of placental mammals, for example. An open source implementation is available under the LGPL 3.0 license in the beastbooster package for BEAST 2, available from https://github.com/rbouckaert/beastbooster.

摘要

将一个不可逆马尔可夫过程与伽马分布的相对位点速率的“隐藏”混合以及不变位点相结合,已成为核苷酸进化的似然模型和其他概率模型(例如,近似具有四个速率类别的伽马分布的tr4gi)最青睐的选择。然而,这些模型假定核苷酸(特征或碱基)频率呈均匀且平稳的分布。在此,我们探讨了允许4gi混合模型的每个速率类别(速率类)具有其自身碱基频率的潜在益处和缺陷。这是通过在树的根部为五个速率类别中的每一个开始时,自由选择其自身的核苷酸频率来实现的,从而创建一个4gi5rf模型,简称为5rf模型。我们使用BEAST 2实现来评估这种方法的实际可识别性,旨在确定它是否能够针对广泛的合理参数值准确估计可信区间和期望值。与数学可识别性不同,实际可识别性衡量的是模型在现实场景中识别参数的能力,而非理论上具有无限数据时的情况。系统发育数据最常见的类型之一是线粒体DNA(mtDNA)蛋白质编码序列。人们通常认为当前模型能有力地分析此类数据,且更高似然性/后验概率的模型表现更好。然而,本摘要表明脊椎动物mtDNA仍然是一种极难完全建模的数据类型,而且显著更高的似然性并不意味着模型在恢复生物学感兴趣的关键参数(例如,单系类群、它们的支持度及其年龄)方面更准确可测。4gi5rf模型显著提高了边际似然性,似乎扭转了一些由4gi模型加剧的明显错误,同时也引入了其他错误。问题似乎与非平稳的DNA修复过程有关,这些过程会改变谱系和时间上的突变/替换谱。我们还表明,此类问题并非mtDNA所特有,在分析核序列时也会遇到。因此,DNA修复过程突变/替换谱的非平稳性对获得胎盘哺乳动物根部附近关系和分歧时间的可靠推断构成了现实挑战。可在BEAST 2 的beastbooster包中根据LGPL 3.0许可获得开源实现,可从https://github.com/rbouckaert/beastbooster获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/1e147af804c7/nihpp-2024.09.03.610719v1-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/e27bf957fa8e/nihpp-2024.09.03.610719v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/bfbc14478a19/nihpp-2024.09.03.610719v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/75408cb4627a/nihpp-2024.09.03.610719v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/606cda2c1dff/nihpp-2024.09.03.610719v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/df397e0e716a/nihpp-2024.09.03.610719v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/5659093e5f61/nihpp-2024.09.03.610719v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/befc95290329/nihpp-2024.09.03.610719v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/72ab55ce68ef/nihpp-2024.09.03.610719v1-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/1e147af804c7/nihpp-2024.09.03.610719v1-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/e27bf957fa8e/nihpp-2024.09.03.610719v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/bfbc14478a19/nihpp-2024.09.03.610719v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/75408cb4627a/nihpp-2024.09.03.610719v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/606cda2c1dff/nihpp-2024.09.03.610719v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/df397e0e716a/nihpp-2024.09.03.610719v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/5659093e5f61/nihpp-2024.09.03.610719v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/befc95290329/nihpp-2024.09.03.610719v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/72ab55ce68ef/nihpp-2024.09.03.610719v1-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06aa/11398347/1e147af804c7/nihpp-2024.09.03.610719v1-f0009.jpg

相似文献

1
An independent base composition of each rate class for improved likelihood-based phylogeny estimation; the 5rf model.用于改进基于似然性的系统发育估计的每个速率类别的独立碱基组成;5rf模型
bioRxiv. 2024 Sep 7:2024.09.03.610719. doi: 10.1101/2024.09.03.610719.
2
Mixture models of nucleotide sequence evolution that account for heterogeneity in the substitution process across sites and across lineages.能够解释核苷酸序列进化过程中替换过程在各位置和各谱系间异质性的混合模型。
Syst Biol. 2014 Sep;63(5):726-42. doi: 10.1093/sysbio/syu036. Epub 2014 Jun 12.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
A class frequency mixture model that adjusts for site-specific amino acid frequencies and improves inference of protein phylogeny.一种根据特定位点氨基酸频率进行调整并改进蛋白质系统发育推断的类频率混合模型。
BMC Evol Biol. 2008 Dec 16;8:331. doi: 10.1186/1471-2148-8-331.
5
Estimation of phylogeny and invariant sites under the general Markov model of nucleotide sequence evolution.核苷酸序列进化的一般马尔可夫模型下的系统发育和不变位点估计。
Syst Biol. 2007 Apr;56(2):155-62. doi: 10.1080/10635150701247921.
6
Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST.马科夫调制连续时间马科夫链在 BEAST 中识别位点和分支特异性进化变化。
Syst Biol. 2021 Jan 1;70(1):181-189. doi: 10.1093/sysbio/syaa037.
7
Model Selection and Parameter Inference in Phylogenetics Using Nested Sampling.使用嵌套抽样进行系统发育学中的模型选择和参数推断。
Syst Biol. 2019 Mar 1;68(2):219-233. doi: 10.1093/sysbio/syy050.
8
bModelTest: Bayesian phylogenetic site model averaging and model comparison.bModelTest:贝叶斯系统发育位点模型平均与模型比较。
BMC Evol Biol. 2017 Feb 6;17(1):42. doi: 10.1186/s12862-017-0890-6.
9
Estimating the pattern of nucleotide substitution.估计核苷酸替换模式。
J Mol Evol. 1994 Jul;39(1):105-11. doi: 10.1007/BF00178256.
10
Enhancing mitogenomic phylogeny and resolving the relationships of extinct megafaunal placental mammals.增强线粒体基因组系统发育并解析已灭绝巨型胎盘哺乳动物的关系。
Mol Phylogenet Evol. 2021 May;158:107082. doi: 10.1016/j.ympev.2021.107082. Epub 2021 Jan 20.

本文引用的文献

1
How to Validate a Bayesian Evolutionary Model.如何验证贝叶斯进化模型。
Syst Biol. 2025 Feb 10;74(1):158-175. doi: 10.1093/sysbio/syae064.
2
An Efficient Coalescent Epoch Model for Bayesian Phylogenetic Inference.一种用于贝叶斯系统发育推断的高效合并时代模型。
Syst Biol. 2022 Oct 12;71(6):1549-1560. doi: 10.1093/sysbio/syac015.
3
Evolutionary and phylogenetic insights from a nuclear genome sequence of the extinct, giant, "subfossil" koala lemur .从已灭绝的巨型“化石”考拉狐猴的核基因组序列中获得的进化和系统发育见解。
Proc Natl Acad Sci U S A. 2021 Jun 29;118(26). doi: 10.1073/pnas.2022117118.
4
Adaptive dating and fast proposals: Revisiting the phylogenetic relaxed clock model.自适应约会和快速提议:重新审视系统发生钟模型。
PLoS Comput Biol. 2021 Feb 2;17(2):e1008322. doi: 10.1371/journal.pcbi.1008322. eCollection 2021 Feb.
5
OBAMA: OBAMA for Bayesian amino-acid model averaging.奥巴马:用于贝叶斯氨基酸模型平均的奥巴马方法。 (注:这里“OBAMA”可能是特定语境下的一种方法名称,并非指美国总统奥巴马,翻译时保留原文以便更准确对应专业内容)
PeerJ. 2020 Aug 4;8:e9460. doi: 10.7717/peerj.9460. eCollection 2020.
6
BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis.BEAST 2.5:一个用于贝叶斯进化分析的高级软件平台。
PLoS Comput Biol. 2019 Apr 8;15(4):e1006650. doi: 10.1371/journal.pcbi.1006650. eCollection 2019 Apr.
7
Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10.使用BEAST 1.10进行贝叶斯系统发育和系统动力学数据整合。
Virus Evol. 2018 Jun 8;4(1):vey016. doi: 10.1093/ve/vey016. eCollection 2018 Jan.
8
Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7.贝叶斯系统发生学中使用 Tracer 1.7 进行的后验总结
Syst Biol. 2018 Sep 1;67(5):901-904. doi: 10.1093/sysbio/syy032.
9
CONFIDENCE LIMITS ON THE MAXIMUM-LIKELIHOOD ESTIMATE OF THE HOMINOID TREE FROM MITOCHONDRIAL-DNA SEQUENCES.基于线粒体DNA序列的类人猿树最大似然估计的置信限
Evolution. 1989 May;43(3):672-677. doi: 10.1111/j.1558-5646.1989.tb04264.x.
10
bModelTest: Bayesian phylogenetic site model averaging and model comparison.bModelTest:贝叶斯系统发育位点模型平均与模型比较。
BMC Evol Biol. 2017 Feb 6;17(1):42. doi: 10.1186/s12862-017-0890-6.