• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因树间一致性的贝叶斯估计。

Bayesian estimation of concordance among gene trees.

作者信息

Ané Cécile, Larget Bret, Baum David A, Smith Stacey D, Rokas Antonis

机构信息

Department of Statistics, University of Wisconsin, USA.

出版信息

Mol Biol Evol. 2007 Feb;24(2):412-26. doi: 10.1093/molbev/msl170. Epub 2006 Nov 9.

DOI:10.1093/molbev/msl170
PMID:17095535
Abstract

Multigene sequence data have great potential for elucidating important and interesting evolutionary processes, but statistical methods for extracting information from such data remain limited. Although various biological processes may cause different genes to have different genealogical histories (and hence different tree topologies), we also may expect that the number of distinct topologies among a set of genes is relatively small compared with the number of possible topologies. Therefore evidence about the tree topology for one gene should influence our inferences of the tree topology on a different gene, but to what extent? In this paper, we present a new approach for modeling and estimating concordance among a set of gene trees given aligned molecular sequence data. Our approach introduces a one-parameter probability distribution to describe the prior distribution of concordance among gene trees. We describe a novel 2-stage Markov chain Monte Carlo (MCMC) method that first obtains independent Bayesian posterior probability distributions for individual genes using standard methods. These posterior distributions are then used as input for a second MCMC procedure that estimates a posterior distribution of gene-to-tree maps (GTMs). The posterior distribution of GTMs can then be summarized to provide revised posterior probability distributions for each gene (taking account of concordance) and to allow estimation of the proportion of the sampled genes for which any given clade is true (the sample-wide concordance factor). Further, under the assumption that the sampled genes are drawn randomly from a genome of known size, we show how one can obtain an estimate, with credibility intervals, on the proportion of the entire genome for which a clade is true (the genome-wide concordance factor). We demonstrate the method on a set of 106 genes from 8 yeast species.

摘要

多基因序列数据在阐明重要且有趣的进化过程方面具有巨大潜力,但从这类数据中提取信息的统计方法仍然有限。尽管各种生物学过程可能导致不同基因具有不同的系统发育历史(从而具有不同的树拓扑结构),但我们也可能预期,一组基因中不同拓扑结构的数量与可能的拓扑结构数量相比相对较少。因此,关于一个基因的树拓扑结构的证据应该会影响我们对另一个不同基因的树拓扑结构的推断,但影响程度如何呢?在本文中,我们提出了一种新方法,用于在给定比对分子序列数据的情况下,对一组基因树之间的一致性进行建模和估计。我们的方法引入了一个单参数概率分布来描述基因树之间一致性的先验分布。我们描述了一种新颖的两阶段马尔可夫链蒙特卡罗(MCMC)方法,该方法首先使用标准方法为各个基因获得独立的贝叶斯后验概率分布。然后,这些后验分布被用作第二个MCMC程序的输入,该程序估计基因到树映射(GTM)的后验分布。然后,可以总结GTM的后验分布,以提供每个基因的修正后验概率分布(考虑到一致性),并允许估计任何给定分支为真的抽样基因的比例(全样本一致性因子)。此外,在假设抽样基因是从已知大小的基因组中随机抽取的情况下,我们展示了如何获得对整个基因组中某个分支为真的比例的估计以及可信区间(全基因组一致性因子)。我们在来自8个酵母物种的一组106个基因上演示了该方法。

相似文献

1
Bayesian estimation of concordance among gene trees.基因树间一致性的贝叶斯估计。
Mol Biol Evol. 2007 Feb;24(2):412-26. doi: 10.1093/molbev/msl170. Epub 2006 Nov 9.
2
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
3
Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses.从贝叶斯系统发育分析估计树形拓扑结构的有效样本量
Genome Biol Evol. 2016 Aug 16;8(8):2319-32. doi: 10.1093/gbe/evw171.
4
Joint Bayesian estimation of alignment and phylogeny.比对与系统发育的联合贝叶斯估计。
Syst Biol. 2005 Jun;54(3):401-18. doi: 10.1080/10635150590947041.
5
Guided tree topology proposals for Bayesian phylogenetic inference.贝叶斯系统发育推断的引导树拓扑提议。
Syst Biol. 2012 Jan;61(1):1-11. doi: 10.1093/sysbio/syr074. Epub 2011 Aug 9.
6
Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.基于简约引导的树提议加速贝叶斯系统发育推断的收敛。
Syst Biol. 2020 Sep 1;69(5):1016-1032. doi: 10.1093/sysbio/syaa002.
7
Bayesian phylogenetic inference using DNA sequences: a Markov Chain Monte Carlo Method.使用DNA序列的贝叶斯系统发育推断:一种马尔可夫链蒙特卡罗方法。
Mol Biol Evol. 1997 Jul;14(7):717-24. doi: 10.1093/oxfordjournals.molbev.a025811.
8
Estimating species trees using multiple-allele DNA sequence data.利用多等位基因DNA序列数据估计物种树。
Evolution. 2008 Aug;62(8):2080-91. doi: 10.1111/j.1558-5646.2008.00414.x. Epub 2008 May 5.
9
Polytomies and Bayesian phylogenetic inference.多歧分类和贝叶斯系统发育推断
Syst Biol. 2005 Apr;54(2):241-53. doi: 10.1080/10635150590924208.
10
Quantifying MCMC exploration of phylogenetic tree space.量化马尔可夫链蒙特卡罗方法对系统发育树空间的探索。
Syst Biol. 2015 May;64(3):472-91. doi: 10.1093/sysbio/syv006. Epub 2015 Jan 27.

引用本文的文献

1
Leveraging Weighted Quartet Distributions for Enhanced Species Tree Inference from Genome-Wide Data.利用加权四重奏分布从全基因组数据中增强物种树推断
Genome Biol Evol. 2025 Sep 2;17(9). doi: 10.1093/gbe/evaf159.
2
Phylogenetic networks empower biodiversity research.系统发育网络助力生物多样性研究。
Proc Natl Acad Sci U S A. 2025 Aug 5;122(31):e2410934122. doi: 10.1073/pnas.2410934122. Epub 2025 Jul 28.
3
Phylogenomics, historical biogeography, and diversification of leaf traits in the Malagasy-endemic genus Uncarina (Pedaliaceae).
马达加斯加特有属翅蒴麻属(胡麻科)的系统发育基因组学、历史生物地理学及叶性状的多样化
Am J Bot. 2025 Apr;112(4):e70028. doi: 10.1002/ajb2.70028. Epub 2025 Apr 2.
4
Accurate Bayesian phylogenetic point estimation using a tree distribution parameterized by clade probabilities.使用由分支概率参数化的树分布进行精确的贝叶斯系统发育点估计。
PLoS Comput Biol. 2025 Feb 13;21(2):e1012789. doi: 10.1371/journal.pcbi.1012789. eCollection 2025 Feb.
5
The Meaning and Measure of Concordance Factors in Phylogenomics.系统发生基因组学中一致性因子的意义和度量。
Mol Biol Evol. 2024 Nov 1;41(11). doi: 10.1093/molbev/msae214.
6
Phylogenomics and Pervasive Genome-Wide Phylogenetic Discordance Among Fin Whales (Balaenoptera physalus).长须鲸(Balaenoptera physalus)的系统发育基因组学与全基因组范围内普遍存在的系统发育不一致性
Syst Biol. 2024 Nov 29;73(6):873-885. doi: 10.1093/sysbio/syae049.
7
Tropical Logistic Regression Model on Space of Phylogenetic Trees.热带进化树空间上的逻辑回归模型。
Bull Math Biol. 2024 Jul 2;86(8):99. doi: 10.1007/s11538-024-01327-8.
8
A Guide to Phylogenomic Inference.系统发育基因组推断指南。
Methods Mol Biol. 2024;2802:267-345. doi: 10.1007/978-1-0716-3838-5_11.
9
Practical Guidance and Workflows for Identifying Fast Evolving Non-Coding Genomic Elements Using PhyloAcc.利用 PhyloAcc 识别快速进化的非编码基因组元件的实用指南和工作流程。
Integr Comp Biol. 2024 Nov 21;64(5):1513-1525. doi: 10.1093/icb/icae056.
10
Phylogenomic perspectives on speciation and reproductive isolation in a North American biodiversity hotspot: an example using California sages (Salvia subgenus Audibertia: Lamiaceae).从系统基因组学角度看北美生物多样性热点地区的物种形成和生殖隔离:以加利福尼亚鼠尾草为例(鼠尾草亚属 Audibertia:唇形科)。
Ann Bot. 2024 Jul 9;134(2):295-310. doi: 10.1093/aob/mcae073.