量化系统发育推断中位点间相依进化的影响。

Quantifying the impact of dependent evolution among sites in phylogenetic inference.

机构信息

Department of Integrative Biology, University of California, Berkeley, 3060 Valley Life Sciences Building #3140, Berkeley, CA 94720-3140, USA.

出版信息

Syst Biol. 2011 Jan;60(1):60-73. doi: 10.1093/sysbio/syq074. Epub 2010 Nov 15.

DOI:10.1093/sysbio/syq074

PMID:21081481

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2997629/

Abstract

Nearly all commonly used methods of phylogenetic inference assume that characters in an alignment evolve independently of one another. This assumption is attractive for simplicity and computational tractability but is not biologically reasonable for RNAs and proteins that have secondary and tertiary structures. Here, we simulate RNA and protein-coding DNA sequence data under a general model of dependence in order to assess the robustness of traditional methods of phylogenetic inference to violation of the assumption of independence among sites. We find that the accuracy of independence-assuming methods is reduced by the dependence among sites; for proteins this reduction is relatively mild, but for RNA this reduction may be substantial. We introduce the concept of effective sequence length and its utility for considering information content in phylogenetics.

摘要

几乎所有常用的系统发育推断方法都假设比对中的特征彼此独立地进化。这种假设在简单性和计算可处理性方面很有吸引力，但对于具有二级和三级结构的 RNA 和蛋白质来说，这在生物学上是不合理的。在这里，我们模拟了依赖于一般模型的 RNA 和蛋白质编码 DNA 序列数据，以评估传统的系统发育推断方法对违反站点之间独立性假设的稳健性。我们发现，依赖于站点的方法的准确性降低了；对于蛋白质来说，这种降低相对较轻，但对于 RNA 来说，这种降低可能是实质性的。我们引入了有效序列长度的概念及其在系统发育学中考虑信息含量的有用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ca8/2997629/bd628748945b/sysbiosyq074f01_lw.jpg

相似文献

Quantifying the impact of dependent evolution among sites in phylogenetic inference.

Syst Biol. 2011 Jan;60(1):60-73. doi: 10.1093/sysbio/syq074. Epub 2010 Nov 15.

Robustness of Phylogenetic Inference to Model Misspecification Caused by Pairwise Epistasis.

Mol Biol Evol. 2021 Sep 27;38(10):4603-4615. doi: 10.1093/molbev/msab163.

An evolutionary model for protein-coding regions with conserved RNA structure.

Mol Biol Evol. 2004 Oct;21(10):1913-22. doi: 10.1093/molbev/msh199. Epub 2004 Jun 30.

Using multiple alignments and phylogenetic trees to detect RNA secondary structure.

Pac Symp Biocomput. 1996:350-67.

ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules.

Nucleic Acids Res. 2016 Jul 8;44(W1):W344-50. doi: 10.1093/nar/gkw408. Epub 2016 May 10.

Bayesian coestimation of phylogeny and sequence alignment.

BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.

Analysis of DNA sequence data: phylogenetic inference.

Methods Enzymol. 1993;224:456-87. doi: 10.1016/0076-6879(93)24035-s.

Dependence among sites in RNA evolution.

Mol Biol Evol. 2006 Aug;23(8):1525-37. doi: 10.1093/molbev/msl015. Epub 2006 May 23.

Secondary structure prediction for aligned RNA sequences.

J Mol Biol. 2002 Jun 21;319(5):1059-66. doi: 10.1016/S0022-2836(02)00308-X.

Phylogenetically enhanced statistical tools for RNA structure prediction.

Bioinformatics. 2000 Jun;16(6):501-12. doi: 10.1093/bioinformatics/16.6.501.

引用本文的文献

Developing and Applying RNA Empirical Models With Secondary Structure Insights for Orthoptera Phylogenetics.

Ecol Evol. 2025 Aug 31;15(9):e72068. doi: 10.1002/ece3.72068. eCollection 2025 Sep.

Protein Structural Phylogenetics.

Genome Biol Evol. 2025 Jul 30;17(8). doi: 10.1093/gbe/evaf139.

Evolution is coupled with branching across many granularities of life.

Proc Biol Sci. 2025 May;292(2047):20250182. doi: 10.1098/rspb.2025.0182. Epub 2025 May 28.

Reconstruction of Ancestral Protein Sequences Using Autoregressive Generative Models.

Mol Biol Evol. 2025 Apr 1;42(4). doi: 10.1093/molbev/msaf070.

A critical analysis of the current state of virus taxonomy.

Front Microbiol. 2023 Aug 3;14:1240993. doi: 10.3389/fmicb.2023.1240993. eCollection 2023.

Shifts in amino acid preferences as proteins evolve: A synthesis of experimental and theoretical work.

Protein Sci. 2021 Oct;30(10):2009-2028. doi: 10.1002/pro.4161. Epub 2021 Aug 12.

Robustness of Phylogenetic Inference to Model Misspecification Caused by Pairwise Epistasis.

Mol Biol Evol. 2021 Sep 27;38(10):4603-4615. doi: 10.1093/molbev/msab163.

A new phylogenetic protocol: dealing with model misspecification and confirmation bias in molecular phylogenetics.

NAR Genom Bioinform. 2020 Jun 23;2(2):lqaa041. doi: 10.1093/nargab/lqaa041. eCollection 2020 Jun.

ASPEN, a methodology for reconstructing protein evolution with improved accuracy using ensemble models.

Elife. 2019 Oct 17;8:e47676. doi: 10.7554/eLife.47676.

Simultaneous Bayesian inference of phylogeny and molecular coevolution.

Proc Natl Acad Sci U S A. 2019 Mar 12;116(11):5027-5036. doi: 10.1073/pnas.1813836116. Epub 2019 Feb 26.

本文引用的文献

Evidence for an ancient adaptive episode of convergent molecular evolution.

Proc Natl Acad Sci U S A. 2009 Jun 2;106(22):8986-91. doi: 10.1073/pnas.0900233106. Epub 2009 Apr 28.

Investigating protein-coding sequence evolution with probabilistic codon substitution models.

Mol Biol Evol. 2009 Feb;26(2):255-71. doi: 10.1093/molbev/msn232. Epub 2008 Oct 14.

Quantifying the impact of protein tertiary structure on molecular evolution.

Mol Biol Evol. 2007 Aug;24(8):1769-82. doi: 10.1093/molbev/msm097. Epub 2007 May 23.

A maximum likelihood framework for protein design.

BMC Bioinformatics. 2006 Jun 29;7:326. doi: 10.1186/1471-2105-7-326.

Dependence among sites in RNA evolution.

Mol Biol Evol. 2006 Aug;23(8):1525-37. doi: 10.1093/molbev/msl015. Epub 2006 May 23.

Site interdependence attributed to tertiary structure in amino acid sequence evolution.

Gene. 2005 Mar 14;347(2):207-17. doi: 10.1016/j.gene.2004.12.011. Epub 2005 Feb 19.

Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure.

Proc Natl Acad Sci U S A. 2004 May 11;101(19):7287-92. doi: 10.1073/pnas.0401799101. Epub 2004 May 3.

Protein evolution with dependence among codons due to tertiary structure.

Mol Biol Evol. 2003 Oct;20(10):1692-704. doi: 10.1093/molbev/msg184. Epub 2003 Jul 28.

Effect of nonindependent substitution on phylogenetic accuracy.

Syst Biol. 1999 Jun;48(2):317-28. doi: 10.1080/106351599260319.

How to guarantee optimal stability for most representative structures in the Protein Data Bank.

Proteins. 2001 Aug 1;44(2):79-96. doi: 10.1002/prot.1075.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

量化系统发育推断中位点间相依进化的影响。

Quantifying the impact of dependent evolution among sites in phylogenetic inference.

机构信息

Department of Integrative Biology, University of California, Berkeley, 3060 Valley Life Sciences Building #3140, Berkeley, CA 94720-3140, USA.

出版信息

Syst Biol. 2011 Jan;60(1):60-73. doi: 10.1093/sysbio/syq074. Epub 2010 Nov 15.

DOI:10.1093/sysbio/syq074

PMID:21081481

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2997629/

Abstract

摘要

量化系统发育推断中位点间相依进化的影响。

Quantifying the impact of dependent evolution among sites in phylogenetic inference.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

量化系统发育推断中位点间相依进化的影响。

Quantifying the impact of dependent evolution among sites in phylogenetic inference.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献