• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种快速准确的基因树重建的贝叶斯方法。

A Bayesian approach for fast and accurate gene tree reconstruction.

机构信息

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology.

出版信息

Mol Biol Evol. 2011 Jan;28(1):273-90. doi: 10.1093/molbev/msq189. Epub 2010 Jul 25.

DOI:10.1093/molbev/msq189
PMID:20660489
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3002250/
Abstract

Recent sequencing and computing advances have enabled phylogenetic analyses to expand to both entire genomes and large clades, thus requiring more efficient and accurate methods designed specifically for the phylogenomic context. Here, we present SPIMAP, an efficient Bayesian method for reconstructing gene trees in the presence of a known species tree. We observe many improvements in reconstruction accuracy, achieved by modeling multiple aspects of evolution, including gene duplication and loss (DL) rates, speciation times, and correlated substitution rate variation across both species and loci. We have implemented and applied this method on two clades of fully sequenced species, 12 Drosophila and 16 fungal genomes as well as simulated phylogenies and find dramatic improvements in reconstruction accuracy as compared with the most popular existing methods, including those that take the species tree into account. We find that reconstruction inaccuracies of traditional phylogenetic methods overestimate the number of DL events by as much as 2-3-fold, whereas our method achieves significantly higher accuracy. We feel that the results and methods presented here will have many important implications for future investigations of gene evolution.

摘要

最近的测序和计算进展使系统发育分析能够扩展到整个基因组和大的进化枝,因此需要专门针对系统基因组学背景设计的更有效和准确的方法。在这里,我们提出了 SPIMAP,这是一种在已知物种树存在的情况下,用于重建基因树的高效贝叶斯方法。我们观察到通过对包括基因复制和丢失(DL)率、物种形成时间以及物种和基因座之间的相关替代率变化等多个进化方面进行建模,在重建准确性方面取得了许多改进。我们已经在两个完全测序的物种进化枝(12 个果蝇和 16 个真菌基因组)以及模拟的系统发育树上实现和应用了这种方法,与最流行的现有方法相比,包括那些考虑物种树的方法,重建准确性有了显著提高。我们发现传统系统发育方法的重建不准确会使 DL 事件的数量高估多达 2-3 倍,而我们的方法则实现了更高的准确性。我们认为,这里提出的结果和方法将对未来的基因进化研究有许多重要的启示。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/d1845dec1dc3/molbiolevolmsq189f07_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/2d200b47cb17/molbiolevolmsq189f01_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/75a447687e62/molbiolevolmsq189f02_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/e300638fd5aa/molbiolevolmsq189f03_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/15b4f228320c/molbiolevolmsq189f04_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/0adbc2dc0f45/molbiolevolmsq189f05_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/1c28705a9b31/molbiolevolmsq189f06_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/d1845dec1dc3/molbiolevolmsq189f07_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/2d200b47cb17/molbiolevolmsq189f01_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/75a447687e62/molbiolevolmsq189f02_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/e300638fd5aa/molbiolevolmsq189f03_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/15b4f228320c/molbiolevolmsq189f04_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/0adbc2dc0f45/molbiolevolmsq189f05_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/1c28705a9b31/molbiolevolmsq189f06_4c.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb86/3002250/d1845dec1dc3/molbiolevolmsq189f07_4c.jpg

相似文献

1
A Bayesian approach for fast and accurate gene tree reconstruction.一种快速准确的基因树重建的贝叶斯方法。
Mol Biol Evol. 2011 Jan;28(1):273-90. doi: 10.1093/molbev/msq189. Epub 2010 Jul 25.
2
Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes.通过跨多个完整基因组学习基因和物种特异性替换率进行准确的基因树重建。
Genome Res. 2007 Dec;17(12):1932-42. doi: 10.1101/gr.7105007. Epub 2007 Nov 7.
3
Species trees from gene trees: reconstructing Bayesian posterior distributions of a species phylogeny using estimated gene tree distributions.从基因树构建物种树:利用估计的基因树分布重建物种系统发育的贝叶斯后验分布。
Syst Biol. 2007 Jun;56(3):504-14. doi: 10.1080/10635150701429982.
4
Species Tree Estimation from Genome-Wide Data with guenomu.使用guenomu从全基因组数据估计物种树。
Methods Mol Biol. 2017;1525:461-478. doi: 10.1007/978-1-4939-6622-6_18.
5
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
6
Species Tree Inference Using a Mixture Model.使用混合模型进行种系发生树推断。
Mol Biol Evol. 2015 Sep;32(9):2469-82. doi: 10.1093/molbev/msv115. Epub 2015 May 11.
7
Genome-scale coestimation of species and gene trees.基因组规模的种系和基因树共估计。
Genome Res. 2013 Feb;23(2):323-30. doi: 10.1101/gr.141978.112. Epub 2012 Nov 6.
8
Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes.用于计算基因组进化简约进化情景、最后共同祖先以及原核生物进化中水平基因转移主导地位的算法。
BMC Evol Biol. 2003 Jan 6;3:2. doi: 10.1186/1471-2148-3-2.
9
The Unconstrained Diameters of the Duplication-Loss Cost and the Loss Cost.无约束复制损耗成本和损耗成本的直径。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2125-2135. doi: 10.1109/TCBB.2019.2919617. Epub 2021 Dec 8.
10
A hybrid micro-macroevolutionary approach to gene tree reconstruction.一种用于基因树重建的混合微观-宏观进化方法。
J Comput Biol. 2006 Mar;13(2):320-35. doi: 10.1089/cmb.2006.13.320.

引用本文的文献

1
soibean: High-Resolution Taxonomic Identification of Ancient Environmental DNA Using Mitochondrial Pangenome Graphs.苏拜恩:利用线粒体泛基因组图谱进行古代环境 DNA 的高分辨率分类鉴定。
Mol Biol Evol. 2024 Oct 4;41(10). doi: 10.1093/molbev/msae203.
2
Phylogenetic inference using generative adversarial networks.基于生成对抗网络的系统发育推断。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad543.
3
"Correcting" Gene Trees to be More Like Species Trees Frequently Increases Topological Error.“修正”基因树使其更像物种树会经常增加拓扑错误。

本文引用的文献

1
FastTree 2--approximately maximum-likelihood trees for large alignments.FastTree 2--用于大型比对的近似最大似然树。
PLoS One. 2010 Mar 10;5(3):e9490. doi: 10.1371/journal.pone.0009490.
2
Space of gene/species trees reconciliations and parsimonious models.基因/物种树和解空间与简约模型
J Comput Biol. 2009 Oct;16(10):1399-418. doi: 10.1089/cmb.2009.0095.
3
Ancestral population genomics: the coalescent hidden Markov model approach.祖先群体基因组学:合并隐马尔可夫模型方法。
Genome Biol Evol. 2023 Jun 1;15(6). doi: 10.1093/gbe/evad094.
4
Species Tree Inference Methods Intended to Deal with Incomplete Lineage Sorting Are Robust to the Presence of Paralogs.旨在处理不完全谱系分类的种系树推断方法对旁系同源基因的存在具有稳健性。
Syst Biol. 2022 Feb 10;71(2):367-381. doi: 10.1093/sysbio/syab056.
5
Evolution of drug resistance in HIV protease.HIV 蛋白酶耐药性的演变。
BMC Bioinformatics. 2020 Dec 30;21(Suppl 18):497. doi: 10.1186/s12859-020-03825-7.
6
Ab Initio Construction and Evolutionary Analysis of Protein-Coding Gene Families with Partially Homologous Relationships: Closely Related Drosophila Genomes as a Case Study.从头构建和进化分析具有部分同源关系的蛋白质编码基因家族:以密切相关的果蝇基因组为例。
Genome Biol Evol. 2020 Mar 1;12(3):185-202. doi: 10.1093/gbe/evaa041.
7
Consistency and convergence rate of phylogenetic inference via regularization.通过正则化进行系统发育推断的一致性和收敛速度
Ann Stat. 2018 Aug;46(4):1481-1512. doi: 10.1214/17-AOS1592. Epub 2018 Jun 27.
8
MIPhy: identify and quantify rapidly evolving members of large gene families.MIPhy:识别和量化大型基因家族中快速进化的成员。
PeerJ. 2018 May 29;6:e4873. doi: 10.7717/peerj.4873. eCollection 2018.
9
GATC: a genetic algorithm for gene tree construction under the Duplication-Transfer-Loss model of evolution.GATC:一种在进化的复制-转移-丢失模型下构建基因树的遗传算法。
BMC Genomics. 2018 May 9;19(Suppl 2):102. doi: 10.1186/s12864-018-4455-x.
10
Integrated pipeline for inferring the evolutionary history of a gene family embedded in the species tree: a case study on the STIMATE gene family.用于推断嵌入物种树中的基因家族进化历史的综合流程:以STIMATE基因家族为例
BMC Bioinformatics. 2017 Oct 3;18(1):439. doi: 10.1186/s12859-017-1850-2.
Genetics. 2009 Sep;183(1):259-74. doi: 10.1534/genetics.109.103010. Epub 2009 Jul 6.
4
Evolution of pathogenicity and sexual reproduction in eight Candida genomes.八种念珠菌基因组中致病性和有性生殖的进化
Nature. 2009 Jun 4;459(7247):657-62. doi: 10.1038/nature08064.
5
Berkeley PHOG: PhyloFacts orthology group prediction web server.伯克利PHOG:系统发育事实直系同源组预测网络服务器。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W84-9. doi: 10.1093/nar/gkp373. Epub 2009 May 12.
6
Simultaneous Bayesian gene tree reconstruction and reconciliation analysis.同时进行贝叶斯基因树重建与和解分析。
Proc Natl Acad Sci U S A. 2009 Apr 7;106(14):5714-9. doi: 10.1073/pnas.0806251106. Epub 2009 Mar 19.
7
EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.EnsemblCompara基因树:脊椎动物中完整的、可识别基因复制的系统发育树。
Genome Res. 2009 Feb;19(2):327-35. doi: 10.1101/gr.073585.107. Epub 2008 Nov 24.
8
Gene family evolution across 12 Drosophila genomes.跨越12个果蝇基因组的基因家族进化。
PLoS Genet. 2007 Nov;3(11):e197. doi: 10.1371/journal.pgen.0030197.
9
Evolution of genes and genomes on the Drosophila phylogeny.果蝇系统发育中基因和基因组的进化。
Nature. 2007 Nov 8;450(7167):203-18. doi: 10.1038/nature06341.
10
Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes.通过跨多个完整基因组学习基因和物种特异性替换率进行准确的基因树重建。
Genome Res. 2007 Dec;17(12):1932-42. doi: 10.1101/gr.7105007. Epub 2007 Nov 7.