• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

性状与树:基于连续性状模型的概率距离在解析系统发育、模型和数据之间相互作用中的应用

Of Traits and Trees: Probabilistic Distances under Continuous Trait Models for Dissecting the Interplay among Phylogeny, Model, and Data.

机构信息

Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA.

Department of Biology, Texas A&M University, College Station, TX 77843, USA.

出版信息

Syst Biol. 2021 Jun 16;70(4):660-680. doi: 10.1093/sysbio/syab009.

DOI:10.1093/sysbio/syab009
PMID:33587145
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8208806/
Abstract

Stochastic models of character trait evolution have become a cornerstone of evolutionary biology in an array of contexts. While probabilistic models have been used extensively for statistical inference, they have largely been ignored for the purpose of measuring distances between phylogeny-aware models. Recent contributions to the problem of phylogenetic distance computation have highlighted the importance of explicitly considering evolutionary model parameters and their impacts on molecular sequence data when quantifying dissimilarity between trees. By comparing two phylogenies in terms of their induced probability distributions that are functions of many model parameters, these distances can be more informative than traditional approaches that rely strictly on differences in topology or branch lengths alone. Currently, however, these approaches are designed for comparing models of nucleotide substitution and gene tree distributions, and thus, are unable to address other classes of traits and associated models that may be of interest to evolutionary biologists. Here, we expand the principles of probabilistic phylogenetic distances to compute tree distances under models of continuous trait evolution along a phylogeny. By explicitly considering both the degree of relatedness among species and the evolutionary processes that collectively give rise to character traits, these distances provide a foundation for comparing models and their predictions, and for quantifying the impacts of assuming one phylogenetic background over another while studying the evolution of a particular trait. We demonstrate the properties of these approaches using theory, simulations, and several empirical data sets that highlight potential uses of probabilistic distances in many scenarios. We also introduce an open-source R package named PRDATR for easy application by the scientific community for computing phylogenetic distances under models of character trait evolution.[Brownian motion; comparative methods; phylogeny; quantitative traits.].

摘要

随机特质进化模型已经成为诸多情境下进化生物学的基石。概率模型虽然被广泛用于统计推断,但在测量与系统发育相关模型之间的距离方面,它们在很大程度上被忽视了。最近,在计算系统发生距离的问题上的一些进展强调了在量化树之间的差异时,明确考虑进化模型参数及其对分子序列数据的影响的重要性。通过比较两个系统发生树的诱导概率分布,这些概率分布是许多模型参数的函数,这些距离比仅依赖拓扑或分支长度差异的传统方法更具信息量。然而,目前这些方法是为比较核苷酸替代和基因树分布模型而设计的,因此无法解决进化生物学家可能感兴趣的其他类别的特质和相关模型。在这里,我们将概率系统发生距离的原理扩展到计算系统发生树上连续特质进化模型下的树距离。通过明确考虑物种之间的亲缘关系程度以及共同导致特质的进化过程,这些距离为比较模型及其预测提供了基础,并量化了在研究特定特质的进化时,假设另一个系统发生背景相对于另一个系统发生背景的影响。我们使用理论、模拟和几个实证数据集来证明这些方法的特性,这些数据集突出了在许多情况下概率距离在许多场景中的潜在用途。我们还引入了一个名为 PRDATR 的开源 R 包,方便科学界在特质进化模型下计算系统发生距离。[布朗运动;比较方法;系统发生;数量特质。]。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/067d49253d4e/syab009f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/a3814f8cf4ee/syab009f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/270afeb6827c/syab009f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/f08423a85682/syab009f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/f1dfadf6e256/syab009f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/4997ca801b19/syab009f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/106e59fd9e73/syab009f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/a60794b55236/syab009f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/2121ed0e63b2/syab009f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/067d49253d4e/syab009f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/a3814f8cf4ee/syab009f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/270afeb6827c/syab009f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/f08423a85682/syab009f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/f1dfadf6e256/syab009f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/4997ca801b19/syab009f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/106e59fd9e73/syab009f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/a60794b55236/syab009f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/2121ed0e63b2/syab009f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/407a/8208806/067d49253d4e/syab009f9.jpg

相似文献

1
Of Traits and Trees: Probabilistic Distances under Continuous Trait Models for Dissecting the Interplay among Phylogeny, Model, and Data.性状与树:基于连续性状模型的概率距离在解析系统发育、模型和数据之间相互作用中的应用
Syst Biol. 2021 Jun 16;70(4):660-680. doi: 10.1093/sysbio/syab009.
2
Probabilistic Species Tree Distances: Implementing the Multispecies Coalescent to Compare Species Trees Within the Same Model-Based Framework Used to Estimate Them.概率种系发生树距离:在用于估计种系发生树的同一基于模型的框架内实施多物种并合,以比较种系发生树。
Syst Biol. 2020 Jan 1;69(1):194-207. doi: 10.1093/sysbio/syz031.
3
The Cauchy Process on Phylogenies: A Tractable Model for Pulsed Evolution.系统发育树上的柯西过程:脉冲进化的一个可处理模型。
Syst Biol. 2023 Dec 30;72(6):1296-1315. doi: 10.1093/sysbio/syad053.
4
Fast likelihood calculation for multivariate Gaussian phylogenetic models with shifts.具有转移的多元高斯系统发育模型的快速似然计算。
Theor Popul Biol. 2020 Feb;131:66-78. doi: 10.1016/j.tpb.2019.11.005. Epub 2019 Dec 2.
5
Comparing evolutionary rates for different phenotypic traits on a phylogeny using likelihood.在系统发育树上使用似然比较不同表型特征的进化速率。
Syst Biol. 2013 Mar;62(2):181-92. doi: 10.1093/sysbio/sys083. Epub 2012 Sep 27.
6
Testing for phylogenetic signal in comparative data: behavioral traits are more labile.比较数据中系统发育信号的检测:行为特征更不稳定。
Evolution. 2003 Apr;57(4):717-45. doi: 10.1111/j.0014-3820.2003.tb00285.x.
7
Diversity, disparity, and evolutionary rate estimation for unresolved Yule trees.未解 Yule 树的多样性、差异和进化率估计。
Syst Biol. 2013 May 1;62(3):439-55. doi: 10.1093/sysbio/syt010. Epub 2013 Feb 15.
8
A Relaxed Directional Random Walk Model for Phylogenetic Trait Evolution.一种用于系统发育性状进化的宽松定向随机游走模型。
Syst Biol. 2017 May 1;66(3):299-319. doi: 10.1093/sysbio/syw093.
9
On the distribution of interspecies correlation for Markov models of character evolution on Yule trees.关于尤尔树上性状进化的马尔可夫模型的种间相关性分布
J Theor Biol. 2015 Jan 7;364:275-83. doi: 10.1016/j.jtbi.2014.09.016. Epub 2014 Sep 18.
10
Euclidean nature of phylogenetic distance matrices.系统发育距离矩阵的欧几里得性质。
Syst Biol. 2011 Dec;60(6):826-32. doi: 10.1093/sysbio/syr066. Epub 2011 Jul 29.

引用本文的文献

1
Discriminating models of trait evolution.性状进化的判别模型。
bioRxiv. 2025 Jun 13:2025.06.12.659377. doi: 10.1101/2025.06.12.659377.
2
A Tale of Too Many Trees: A Conundrum for Phylogenetic Regression.树木过多的故事:系统发育回归的难题
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf032.
3
TraitTrainR: accelerating large-scale simulation under models of continuous trait evolution.TraitTrainR:加速连续性状进化模型下的大规模模拟

本文引用的文献

1
Information geometry for phylogenetic trees.系统发生树的信息几何。
J Math Biol. 2021 Feb 15;82(3):19. doi: 10.1007/s00285-021-01553-x.
2
On the Jensen-Shannon Symmetrization of Distances Relying on Abstract Means.关于基于抽象均值的距离的詹森 - 香农对称化
Entropy (Basel). 2019 May 11;21(5):485. doi: 10.3390/e21050485.
3
Extant timetrees are consistent with a myriad of diversification histories.现存的时间树与无数的多样化历史是一致的。
Bioinform Adv. 2024 Dec 9;5(1):vbae196. doi: 10.1093/bioadv/vbae196. eCollection 2025.
4
Piikun: an information theoretic toolkit for analysis and visualization of species delimitation metric space.Piikun:用于物种界定度量空间分析与可视化的信息论工具包。
BMC Bioinformatics. 2024 Dec 18;25(1):385. doi: 10.1186/s12859-024-05997-y.
5
New generalized metric based on branch length distance to compare B cell lineage trees.基于分支长度距离的新广义度量用于比较B细胞谱系树。
Algorithms Mol Biol. 2024 Oct 5;19(1):22. doi: 10.1186/s13015-024-00267-1.
Nature. 2020 Apr;580(7804):502-505. doi: 10.1038/s41586-020-2176-1. Epub 2020 Apr 15.
4
Fast likelihood calculation for multivariate Gaussian phylogenetic models with shifts.具有转移的多元高斯系统发育模型的快速似然计算。
Theor Popul Biol. 2020 Feb;131:66-78. doi: 10.1016/j.tpb.2019.11.005. Epub 2019 Dec 2.
5
Phylogenetic Trees and Networks Can Serve as Powerful and Complementary Approaches for Analysis of Genomic Data.系统发生树和网络可以作为分析基因组数据的强大且互补的方法。
Syst Biol. 2020 May 1;69(3):593-601. doi: 10.1093/sysbio/syz056.
6
Automatic generation of evolutionary hypotheses using mixed Gaussian phylogenetic models.使用混合高斯系统发育模型自动生成进化假说。
Proc Natl Acad Sci U S A. 2019 Aug 20;116(34):16921-16926. doi: 10.1073/pnas.1813823116. Epub 2019 Aug 2.
7
Probabilistic Species Tree Distances: Implementing the Multispecies Coalescent to Compare Species Trees Within the Same Model-Based Framework Used to Estimate Them.概率种系发生树距离:在用于估计种系发生树的同一基于模型的框架内实施多物种并合,以比较种系发生树。
Syst Biol. 2020 Jan 1;69(1):194-207. doi: 10.1093/sysbio/syz031.
8
Bayesian Estimation of Species Divergence Times Using Correlated Quantitative Characters.贝叶斯法利用相关数量性状估计物种分歧时间。
Syst Biol. 2019 Nov 1;68(6):967-986. doi: 10.1093/sysbio/syz015.
9
Statistical binning leads to profound model violation due to gene tree error incurred by trying to avoid gene tree error.统计分箱会导致严重的模型违反,因为试图避免基因树错误会导致基因树错误。
Mol Phylogenet Evol. 2019 May;134:164-171. doi: 10.1016/j.ympev.2019.02.012. Epub 2019 Feb 18.
10
Quantifying the risk of hemiplasy in phylogenetic inference.量化系统发生推断中偏体现象的风险。
Proc Natl Acad Sci U S A. 2018 Dec 11;115(50):12787-12792. doi: 10.1073/pnas.1811268115. Epub 2018 Nov 27.