有原则、实用、灵活、快速：系统发育因子分析的一种新方法。

Principled, practical, flexible, fast: a new approach to phylogenetic factor analysis.

作者信息

Hassler Gabriel W, Gallone Brigida, Aristide Leandro, Allen William L, Tolkoff Max R, Holbrook Andrew J, Baele Guy, Lemey Philippe, Suchard Marc A

机构信息

Department of Computational Medicine, David Geffen School of Medicine at UCLA, University of California, Los Angeles, United States.

VIB-KU Leuven Center for Microbiology, Leuven, Belgium.

出版信息

Methods Ecol Evol. 2022 Oct;13(10):2181-2197. doi: 10.1111/2041-210X.13920. Epub 2022 Jun 19.

DOI:10.1111/2041-210X.13920

PMID:36908682

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9997680/

Abstract

Biological phenotypes are products of complex evolutionary processes in which selective forces influence multiple biological trait measurements in unknown ways. Phylogenetic comparative methods seek to disentangle these relationships across the evolutionary history of a group of organisms. Unfortunately, most existing methods fail to accommodate high-dimensional data with dozens or even thousands of observations per taxon. Phylogenetic factor analysis offers a solution to the challenge of dimensionality. However, scientists seeking to employ this modeling framework confront numerous modeling and implementation decisions, the details of which pose computational and replicability challenges.We develop new inference techniques that increase both the computational efficiency and modeling flexibility of phylogenetic factor analysis. To facilitate adoption of these new methods, we present a practical analysis plan that guides researchers through the web of complex modeling decisions. We codify this analysis plan in an automated pipeline that distills the potentially overwhelming array of decisions into a small handful of (typically binary) choices.We demonstrate the utility of these methods and analysis plan in four real-world problems of varying scales. Specifically, we study floral phenotype and pollination in columbines, domestication in industrial yeast, life history in mammals, and brain morphology in New World monkeys.General and impactful community employment of these methods requires a data scientific analysis plan that balances flexibility, speed and ease of use, while minimizing model and algorithm tuning. Even in the presence of non-trivial phylogenetic model constraints, we show that one may analytically address latent factor uncertainty in a way that (a) aids model flexibility, (b) accelerates computation (by as much as 500-fold) and (c) decreases required tuning. These efforts coalesce to create an accessible Bayesian approach to high-dimensional phylogenetic comparative methods on large trees.

摘要

生物学表型是复杂进化过程的产物，在这些过程中，选择力以未知方式影响多种生物学性状测量。系统发育比较方法试图理清一组生物体进化历史中的这些关系。不幸的是，大多数现有方法无法处理每个分类单元有数十甚至数千个观测值的高维数据。系统发育因子分析为维度挑战提供了一种解决方案。然而，寻求采用此建模框架的科学家面临众多建模和实施决策，其细节带来了计算和可重复性挑战。我们开发了新的推理技术，提高了系统发育因子分析的计算效率和建模灵活性。为便于采用这些新方法，我们提出了一个实用的分析计划，指导研究人员应对复杂的建模决策网络。我们将此分析计划编入一个自动化流程，将潜在的大量决策提炼为少数几个（通常是二元）选择。我们在四个不同规模的实际问题中展示了这些方法和分析计划的效用。具体而言，我们研究了耧斗菜的花表型和授粉、工业酵母的驯化、哺乳动物的生活史以及新大陆猴的脑形态。这些方法的广泛且有影响力的社区应用需要一个数据科学分析计划，该计划要在灵活性、速度和易用性之间取得平衡，同时尽量减少模型和算法调整。即使存在非平凡的系统发育模型约束，我们表明可以以一种有助于（a）提高模型灵活性、（b）加速计算（多达500倍）和（c）减少所需调整的方式来分析处理潜在因子的不确定性。这些努力共同促成了一种易于使用的贝叶斯方法，用于处理大树上的高维系统发育比较方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8dd4/9997680/ab3978b9303e/nihms-1870319-f0001.jpg

相似文献

Principled, practical, flexible, fast: a new approach to phylogenetic factor analysis.有原则、实用、灵活、快速：系统发育因子分析的一种新方法。

Methods Ecol Evol. 2022 Oct;13(10):2181-2197. doi: 10.1111/2041-210X.13920. Epub 2022 Jun 19.

Erratum: Eyestalk Ablation to Increase Ovarian Maturation in Mud Crabs.勘误：切除眼柄以增加泥蟹的卵巢成熟度。

J Vis Exp. 2023 May 26(195). doi: 10.3791/6561.

Taming the BEAST-A Community Teaching Material Resource for BEAST 2.驯服BEAST——BEAST 2社区教材资源

Syst Biol. 2018 Jan 1;67(1):170-174. doi: 10.1093/sysbio/syx060.

A Penalized Likelihood Framework for High-Dimensional Phylogenetic Comparative Methods and an Application to New-World Monkeys Brain Evolution.一种用于高维系统发育比较方法的惩罚似然框架及其在新世界猴脑进化研究中的应用。

Syst Biol. 2019 Jan 1;68(1):93-116. doi: 10.1093/sysbio/syy045.

Inferring Phenotypic Trait Evolution on Large Trees With Many Incomplete Measurements.在具有许多不完整测量值的大型树上推断表型性状进化

J Am Stat Assoc. 2022;117(538):678-692. doi: 10.1080/01621459.2020.1799812. Epub 2020 Sep 16.

Accelerating Bayesian inference of dependency between mixed-type biological traits.加速混合类型生物特征间相关性的贝叶斯推断。

PLoS Comput Biol. 2023 Aug 28;19(8):e1011419. doi: 10.1371/journal.pcbi.1011419. eCollection 2023 Aug.

Simultaneously estimating evolutionary history and repeated traits phylogenetic signal: applications to viral and host phenotypic evolution.同时估计进化历史和重复性状的系统发育信号：在病毒和宿主表型进化中的应用。

Methods Ecol Evol. 2015 Jan 1;6(1):67-82. doi: 10.1111/2041-210X.12293.

πBUSS: a parallel BEAST/BEAGLE utility for sequence simulation under complex evolutionary scenarios.πBUSS：一种用于复杂进化场景下序列模拟的并行 BEAST/BEAGLE 工具。

BMC Bioinformatics. 2014 May 7;15:133. doi: 10.1186/1471-2105-15-133.

Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10.使用BEAST 1.10进行贝叶斯系统发育和系统动力学数据整合。

Virus Evol. 2018 Jun 8;4(1):vey016. doi: 10.1093/ve/vey016. eCollection 2018 Jan.

Fast likelihood calculation for multivariate Gaussian phylogenetic models with shifts.具有转移的多元高斯系统发育模型的快速似然计算。

Theor Popul Biol. 2020 Feb;131:66-78. doi: 10.1016/j.tpb.2019.11.005. Epub 2019 Dec 2.

引用本文的文献

BEAST X for Bayesian phylogenetic, phylogeographic and phylodynamic inference.用于贝叶斯系统发育、系统地理学和系统动力学推断的BEAST X。

Nat Methods. 2025 Jul 7. doi: 10.1038/s41592-025-02751-x.

Diel activity correlates with colour pattern morphology of heterobranch sea slugs.昼夜活动与裸鳃海蛞蝓的体色形态相关。

J Anim Ecol. 2025 Jun;94(6):1165-1179. doi: 10.1111/1365-2656.70036. Epub 2025 Apr 15.

Multi-response phylogenetic mixed models: concepts and application.多响应系统发育混合模型：概念与应用

Biol Rev Camb Philos Soc. 2025 Jun;100(3):1294-1316. doi: 10.1111/brv.70001. Epub 2025 Apr 7.

Leveraging graphical model techniques to study evolution on phylogenetic networks.利用图形模型技术研究系统发育网络上的进化。

Philos Trans R Soc Lond B Biol Sci. 2025 Feb 13;380(1919):20230310. doi: 10.1098/rstb.2023.0310. Epub 2025 Feb 20.

Modeling the velocity of evolving lineages and predicting dispersal patterns.模拟进化谱系的速度和预测扩散模式。

Proc Natl Acad Sci U S A. 2024 Nov 19;121(47):e2411582121. doi: 10.1073/pnas.2411582121. Epub 2024 Nov 15.

Modeling the velocity of evolving lineages and predicting dispersal patterns.模拟进化谱系的速度并预测扩散模式。

bioRxiv. 2024 Oct 28:2024.06.06.597755. doi: 10.1101/2024.06.06.597755.

Data integration in Bayesian phylogenetics.贝叶斯系统发育学中的数据整合。

Annu Rev Stat Appl. 2023;10:353-377. doi: 10.1146/annurev-statistics-033021-112532. Epub 2022 Sep 28.

Accelerating Bayesian inference of dependency between mixed-type biological traits.加速混合类型生物特征间相关性的贝叶斯推断。

PLoS Comput Biol. 2023 Aug 28;19(8):e1011419. doi: 10.1371/journal.pcbi.1011419. eCollection 2023 Aug.

Body size and life history shape the historical biogeography of tetrapods.体型和生活史塑造了四足动物的历史生物地理学。

Nat Ecol Evol. 2023 Sep;7(9):1467-1479. doi: 10.1038/s41559-023-02150-5. Epub 2023 Aug 21.

本文引用的文献

Inferring Phenotypic Trait Evolution on Large Trees With Many Incomplete Measurements.在具有许多不完整测量值的大型树上推断表型性状进化

J Am Stat Assoc. 2022;117(538):678-692. doi: 10.1080/01621459.2020.1799812. Epub 2020 Sep 16.

Relaxed Random Walks at Scale.大规模松弛随机游走。

Syst Biol. 2021 Feb 10;70(2):258-267. doi: 10.1093/sysbio/syaa056.

Variation in the strength of allometry drives rates of evolution in primate brain shape.种间体型异速变化的差异驱动灵长类动物脑形的进化速率。

Proc Biol Sci. 2020 Jul 8;287(1930):20200807. doi: 10.1098/rspb.2020.0807.

Fast likelihood calculation for multivariate Gaussian phylogenetic models with shifts.具有转移的多元高斯系统发育模型的快速似然计算。

Theor Popul Biol. 2020 Feb;131:66-78. doi: 10.1016/j.tpb.2019.11.005. Epub 2019 Dec 2.

Interspecific hybridization facilitates niche adaptation in beer yeast.种间杂交促进了啤酒酵母的生态位适应。

Nat Ecol Evol. 2019 Nov;3(11):1562-1575. doi: 10.1038/s41559-019-0997-9. Epub 2019 Oct 21.

Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10.使用BEAST 1.10进行贝叶斯系统发育和系统动力学数据整合。

Virus Evol. 2018 Jun 8;4(1):vey016. doi: 10.1093/ve/vey016. eCollection 2018 Jan.

Syst Biol. 2019 Jan 1;68(1):93-116. doi: 10.1093/sysbio/syy045.

Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7.贝叶斯系统发生学中使用 Tracer 1.7 进行的后验总结

Syst Biol. 2018 Sep 1;67(5):901-904. doi: 10.1093/sysbio/syy032.

Inference of Adaptive Shifts for Multivariate Correlated Traits.多变量相关性状的适应性变化推断。

Syst Biol. 2018 Jul 1;67(4):662-680. doi: 10.1093/sysbio/syy005.

Phylogenetic Factor Analysis.系统发育因子分析。

Syst Biol. 2018 May 1;67(3):384-399. doi: 10.1093/sysbio/syx066.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

有原则、实用、灵活、快速：系统发育因子分析的一种新方法。

Principled, practical, flexible, fast: a new approach to phylogenetic factor analysis.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献