Efficient Bayesian Species Tree Inference under the Multispecies Coalescent.

Authors

Rannala Bruce, Yang Ziheng

Affiliations

Department of Evolution and Ecology, University of California, Davis, CA 95616, USA.

Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK.

Publication information

Syst Biol. 2017 Sep 1;66(5):823-842. doi: 10.1093/sysbio/syw119.

DOI:10.1093/sysbio/syw119
PMID:28053140
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8562347/
Abstract

We develop a Bayesian method for inferring the species phylogeny under the multispecies coalescent (MSC) model. To improve the mixing properties of the Markov chain Monte Carlo (MCMC) algorithm that traverses the space of species trees, we implement two efficient MCMC proposals: the first is based on the Subtree Pruning and Regrafting (SPR) algorithm and the second is based on a node-slider algorithm. Like the Nearest-Neighbor Interchange (NNI) algorithm we implemented previously, both new algorithms propose changes to the species tree, while simultaneously altering the gene trees at multiple genetic loci to automatically avoid conflicts with the newly proposed species tree. The method integrates over gene trees, naturally taking account of the uncertainty of gene tree topology and branch lengths given the sequence data. A simulation study was performed to examine the statistical properties of the new method. The method was found to show excellent statistical performance, inferring the correct species tree with near certainty when 10 loci were included in the dataset. The prior on species trees has some impact, particularly for small numbers of loci. We analyzed several previously published datasets (both real and simulated) for rattlesnakes and Philippine shrews, in comparison with alternative methods. The results suggest that the Bayesian coalescent-based method is statistically more efficient than heuristic methods based on summary statistics, and that our implementation is computationally more efficient than alternative full-likelihood methods under the MSC. Parameter estimates for the rattlesnake data suggest drastically different evolutionary dynamics between the nuclear and mitochondrial loci, even though they support largely consistent species trees. We discuss the different challenges facing the marginal likelihood calculation and transmodel MCMC as alternative strategies for estimating posterior probabilities for species trees. 
[Bayes factor; Bayesian inference; MCMC; multispecies coalescent; nodeslider; species tree; SPR.].
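
The abstract refers to conflicts between gene trees and the species tree under the MSC. As a minimal sketch of why such conflicts arise (this is not the authors' algorithm, nor code from their implementation), the standard three-species result can be checked numerically: for species tree ((A,B),C) with one sequence per species, the gene tree matches the species tree with probability 1 - (2/3)exp(-T), where T is the internal branch length in coalescent units. The function names and the choice of coalescent units below are assumptions made for illustration only.

```python
import math
import random


def gene_tree_probs(T):
    """Gene tree topology probabilities for species tree ((A,B),C).

    Assumes one sequence per species and T = length of the internal
    branch (ancestor of A and B) in coalescent units.
    """
    discordant = math.exp(-T) / 3.0
    return {
        "((A,B),C)": 1.0 - 2.0 * discordant,  # matches the species tree
        "((A,C),B)": discordant,
        "((B,C),A)": discordant,
    }


def simulate_match_fraction(T, n_loci, seed=0):
    """Monte Carlo estimate of the fraction of loci whose gene tree
    matches the species tree (hypothetical helper for illustration)."""
    rng = random.Random(seed)
    matches = 0
    for _ in range(n_loci):
        # Lineages from A and B coalesce at rate 1 in their ancestral population.
        if rng.expovariate(1.0) < T:
            matches += 1
        # Otherwise three lineages reach the root population, where each
        # of the three pairs is equally likely to coalesce first.
        elif rng.randrange(3) == 0:
            matches += 1
    return matches / n_loci


if __name__ == "__main__":
    T = 1.0
    print(gene_tree_probs(T))
    print(simulate_match_fraction(T, n_loci=100_000))
```

With a short internal branch (small T), the three topologies approach equal probability, which is one reason a method that accounts for gene tree uncertainty across many loci can be useful.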

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f36d/8562347/fd6ed2afef47/sysbio_66_5_823_f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f36d/8562347/403822fa707b/sysbio_66_5_823_f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f36d/8562347/33af29b2ec74/sysbio_66_5_823_f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f36d/8562347/16aea30f9c9a/sysbio_66_5_823_f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f36d/8562347/997cfb778614/sysbio_66_5_823_f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f36d/8562347/b396ba1fe900/sysbio_66_5_823_f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f36d/8562347/44ef95c79305/sysbio_66_5_823_f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f36d/8562347/f86bdc431992/sysbio_66_5_823_f8.jpg

Similar articles

1
Efficient Bayesian Species Tree Inference under the Multispecies Coalescent.
Syst Biol. 2017 Sep 1;66(5):823-842. doi: 10.1093/sysbio/syw119.
2
Unguided species delimitation using DNA sequence data from multiple Loci.
Mol Biol Evol. 2014 Dec;31(12):3125-35. doi: 10.1093/molbev/msu279. Epub 2014 Oct 1.
3
Algorithmic improvements to species delimitation and phylogeny estimation under the multispecies coalescent.
J Math Biol. 2017 Jan;74(1-2):447-467. doi: 10.1007/s00285-016-1034-0. Epub 2016 Jun 10.
4
Quartet inference from SNP data under the coalescent model.
Bioinformatics. 2014 Dec 1;30(23):3317-24. doi: 10.1093/bioinformatics/btu530. Epub 2014 Aug 7.
5
Species Tree Inference with BPP Using Genomic Sequences and the Multispecies Coalescent.
Mol Biol Evol. 2018 Oct 1;35(10):2585-2593. doi: 10.1093/molbev/msy147.
6
The accuracy of species tree estimation under simulation: a comparison of methods.
Syst Biol. 2011 Mar;60(2):126-37. doi: 10.1093/sysbio/syq073. Epub 2010 Nov 18.
7
Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.
Syst Biol. 2020 Sep 1;69(5):1016-1032. doi: 10.1093/sysbio/syaa002.
8
Practical Speedup of Bayesian Inference of Species Phylogenies by Restricting the Space of Gene Trees.
Mol Biol Evol. 2020 Jun 1;37(6):1809-1818. doi: 10.1093/molbev/msaa045.
9
Bayesian Phylogenetic Inference using Relaxed-clocks and the Multispecies Coalescent.
Mol Biol Evol. 2022 Aug 3;39(8). doi: 10.1093/molbev/msac161.
10
Challenges in Species Tree Estimation Under the Multispecies Coalescent Model.
Genetics. 2016 Dec;204(4):1353-1368. doi: 10.1534/genetics.116.190173.

Cited by

1
Genomics Reveals Distinct Evolutionary Lineages in Asian Elephants.
Ecol Evol. 2025 Aug 18;15(8):e72019. doi: 10.1002/ece3.72019. eCollection 2025 Aug.
2
The Impact of Sequencing and Genotyping Errors on Bayesian Analysis of Genomic Data under the Multispecies Coalescent Model.
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf184.
3
Reticulate allopolyploidy and subsequent dysploidy drive evolution and diversification in the cotton family.
Nat Commun. 2025 Aug 12;16(1):7480. doi: 10.1038/s41467-025-62644-7.
4
Concatenation fails to describe the anomalous radiation of giant cockroaches (Blattodea: Blaberidae) despite moderate to low discordance.
BMC Ecol Evol. 2025 Jul 21;25(1):72. doi: 10.1186/s12862-025-02409-4.
5
The complete mitochondrial genomes of the two species of Astyanax (Characiformes: Acestrorhamphidae) that occur in cenotes of the Yucatán Peninsula karst aquifer: comparative analyses and their taxonomic implications.
Mol Biol Rep. 2025 Jul 10;52(1):698. doi: 10.1007/s11033-025-10788-6.
6
Inference of Gene Flow between Species from Genomic Data When the Mode, Direction, and Lineages are Misspecified.
Mol Biol Evol. 2025 Jun 4;42(6). doi: 10.1093/molbev/msaf121.
7
Comparative population pangenomes reveal unexpected complexity and fitness effects of structural variants.
bioRxiv. 2025 Feb 13:2025.02.11.637762. doi: 10.1101/2025.02.11.637762.
8
Hierarchical Heuristic Species Delimitation Under the Multispecies Coalescent Model with Migration.
Syst Biol. 2024 Nov 29;73(6):1015-1037. doi: 10.1093/sysbio/syae050.
9
Bayesian Inference Under the Multispecies Coalescent with Ancient DNA Sequences.
Syst Biol. 2024 Nov 29;73(6):964-978. doi: 10.1093/sysbio/syae047.
10
Detection of Ghost Introgression Requires Exploiting Topological and Branch Length Information.
Syst Biol. 2024 May 27;73(1):207-222. doi: 10.1093/sysbio/syad077.

References

1
Challenges in Species Tree Estimation Under the Multispecies Coalescent Model.
Genetics. 2016 Dec;204(4):1353-1368. doi: 10.1534/genetics.116.190173.
2
Maximum Likelihood Implementation of an Isolation-with-Migration Model for Three Species.
Syst Biol. 2017 May 1;66(3):379-398. doi: 10.1093/sysbio/syw063.
3
RevBayes: Bayesian Phylogenetic Inference Using Graphical Models and an Interactive Model-Specification Language.
Syst Biol. 2016 Jul;65(4):726-36. doi: 10.1093/sysbio/syw021. Epub 2016 May 28.
4
Computational Performance and Statistical Accuracy of *BEAST and Comparisons with Other Methods.
Syst Biol. 2016 May;65(3):381-96. doi: 10.1093/sysbio/syv118. Epub 2016 Jan 28.
5
Implementing and testing the multispecies coalescent model: A valuable paradigm for phylogenomics.
Mol Phylogenet Evol. 2016 Jan;94(Pt A):447-62. doi: 10.1016/j.ympev.2015.10.027. Epub 2015 Oct 27.
6
ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes.
Bioinformatics. 2015 Jun 15;31(12):i44-52. doi: 10.1093/bioinformatics/btv234.
7
The Challenges of Resolving a Rapid, Recent Radiation: Empirical and Simulated Phylogenomics of Philippine Shrews.
Syst Biol. 2015 Sep;64(5):727-40. doi: 10.1093/sysbio/syv029. Epub 2015 May 14.
8
Estimating phylogenetic trees from genome-scale data.
Ann N Y Acad Sci. 2015 Dec;1360:36-53. doi: 10.1111/nyas.12747. Epub 2015 Apr 14.
9
Likelihood-based tree reconstruction on a concatenation of aligned sequence data sets can be statistically inconsistent.
Theor Popul Biol. 2015 Mar;100C:56-62. doi: 10.1016/j.tpb.2014.12.005. Epub 2014 Dec 26.
10
Unguided species delimitation using DNA sequence data from multiple Loci.
Mol Biol Evol. 2014 Dec;31(12):3125-35. doi: 10.1093/molbev/msu279. Epub 2014 Oct 1.