• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

阿达马共轭以及对位点间速率不等的序列进化进行建模。

Hadamard conjugations and modeling sequence evolution with unequal rates across sites.

作者信息

Waddell P J, Penny D, Moore T

机构信息

Department of Plant Biology and Biotechnology, School of Biological Sciences, Massey University, Palmerston North, New Zealand.

出版信息

Mol Phylogenet Evol. 1997 Aug;8(1):33-50. doi: 10.1006/mpev.1997.0405.

DOI:10.1006/mpev.1997.0405
PMID:9242594
Abstract

This paper considers the many different distributions that may approximate the distribution of site rates in DNA sequences and shows how the Hadamard conjugation may be modified to take these into account. This is done for both 2-state and 4-state data. Distributions which give simple closed forms include the gamma (gamma) distribution, the inverse Gaussian distribution (which is similar to the lognormal), and a mixture of either of these with a proportion of sites which cannot change (invariant sites). It is seen that the tail of a distribution can have major effects upon the coefficient of variation of site rates. Because the Hadamard conjugation can be used to either correct data or predict the data given the model (i.e., the likelihood of site patterns), light is shed on properties of maximum likelihood tree selection with unequal site rates. Analysis of rRNA shows how unequal rates across sites can change the optimal tree. Maximum likelihood analysis also shows that distinct distributions fit each data set, with the gamma often not being the best. Analyzing both these data and a long stretch of primate mtDNA reveals evidence of many "hidden" multiple substitutions, while signals not corresponding to the preferred biological tree generally decrease an unequal rates are allowed for. Last, we discuss the expected behavior of sequences evolving by models where stabilizing selection alone explains unequal site rates. Such models do not explain "synapomorphies" or informative changes in ancient molecules, because while stabilizing selection can vastly decrease change at a site, it will also vastly accelerate back-substitution (leaving only a covarion model to explain old synapomorphies). When and why models allowing a continuous distribution of site rates (e.g., gamma) will approximate covarion evolution requires further study.

摘要

本文考虑了许多可能近似DNA序列中位点速率分布的不同分布,并展示了如何修改哈达玛共轭以将这些因素考虑在内。这是针对二态和四态数据进行的。能给出简单封闭形式的分布包括伽马(gamma)分布、逆高斯分布(类似于对数正态分布),以及这些分布之一与一定比例不变位点(即不能变化的位点)的混合分布。可以看出,分布的尾部对位点速率的变异系数有重大影响。由于哈达玛共轭可用于校正数据或在给定模型(即位点模式的似然性)的情况下预测数据,这为具有不等位点速率的最大似然树选择的性质提供了启示。对rRNA的分析表明位点间的不等速率如何改变最优树。最大似然分析还表明,不同的分布适合每个数据集,伽马分布往往并非最佳。对这些数据以及一段较长的灵长类线粒体DNA的分析揭示了许多“隐藏”的多次替换的证据,而与首选生物学树不对应的信号在允许不等速率时通常会减少。最后,我们讨论了仅由稳定选择解释不等位点速率的模型中序列进化的预期行为。此类模型无法解释古代分子中的“共近裔性状”或信息性变化,因为虽然稳定选择可大幅减少位点处的变化,但它也会大幅加速反向替换(仅留下协变模型来解释古老的共近裔性状)。允许位点速率连续分布(如伽马分布)的模型何时以及为何会近似协变进化需要进一步研究。

相似文献

1
Hadamard conjugations and modeling sequence evolution with unequal rates across sites.阿达马共轭以及对位点间速率不等的序列进化进行建模。
Mol Phylogenet Evol. 1997 Aug;8(1):33-50. doi: 10.1006/mpev.1997.0405.
2
Taking variation of evolutionary rates between sites into account in inferring phylogenies.在推断系统发育时考虑位点间进化速率的变化。
J Mol Evol. 2001 Oct-Nov;53(4-5):447-55. doi: 10.1007/s002390010234.
3
Is homoplasy or lineage sorting the source of incongruent mtdna and nuclear gene trees in the stiff-tailed ducks (Nomonyx-Oxyura)?在硬尾鸭(Nomonyx - Oxyura)中,同塑性或谱系分选是线粒体DNA和核基因树不一致的根源吗?
Syst Biol. 2005 Feb;54(1):35-55. doi: 10.1080/10635150590910249.
4
Estimating the distribution of selection coefficients from phylogenetic data with applications to mitochondrial and viral DNA.从系统发育数据估计选择系数的分布及其在线粒体和病毒DNA中的应用。
Mol Biol Evol. 2003 Aug;20(8):1231-9. doi: 10.1093/molbev/msg147. Epub 2003 May 30.
5
General time-reversible distances with unequal rates across sites: mixing gamma and inverse Gaussian distributions with invariant sites.位点间速率不等的一般时间可逆距离:混合伽马分布和逆高斯分布以及不变位点。
Mol Phylogenet Evol. 1997 Dec;8(3):398-414. doi: 10.1006/mpev.1997.0452.
6
Testing for covarion-like evolution in protein sequences.检测蛋白质序列中的类共变进化。
Mol Biol Evol. 2007 Jan;24(1):294-305. doi: 10.1093/molbev/msl155. Epub 2006 Oct 20.
7
A simple hierarchical approach to modeling distributions of substitution rates.一种用于模拟替换率分布的简单分层方法。
Mol Biol Evol. 2005 Feb;22(2):223-34. doi: 10.1093/molbev/msi009. Epub 2004 Oct 13.
8
A dirichlet process covarion mixture model and its assessments using posterior predictive discrepancy tests.Dirichlet 过程协变量混合模型及其使用后验预测差异检验的评估。
Mol Biol Evol. 2010 Feb;27(2):371-84. doi: 10.1093/molbev/msp248. Epub 2009 Oct 12.
9
Variation in modes and rates of evolution in nuclear and mitochondrial ribosomal DNA in the mushroom genus Amanita (Agaricales, Basidiomycota): phylogenetic implications.鹅膏菌属(伞菌目,担子菌门)核核糖体DNA和线粒体核糖体DNA进化模式与速率的变异:系统发育学意义
Mol Phylogenet Evol. 2000 Jul;16(1):48-63. doi: 10.1006/mpev.2000.0782.
10
Measuring the fit of sequence data to phylogenetic model: allowing for missing data.评估序列数据与系统发育模型的拟合度:考虑缺失数据。
Mol Biol Evol. 2005 Mar;22(3):395-401. doi: 10.1093/molbev/msi002. Epub 2004 Oct 6.

引用本文的文献

1
An independent base composition of each rate class for improved likelihood-based phylogeny estimation; the 5rf model.用于改进基于似然性的系统发育估计的每个速率类别的独立碱基组成;5rf模型
bioRxiv. 2024 Sep 7:2024.09.03.610719. doi: 10.1101/2024.09.03.610719.
2
Rooting gene trees without outgroups: EP rooting.无根基因树的构建:EP 无根法。
Genome Biol Evol. 2012;4(8):709-19. doi: 10.1093/gbe/evs047. Epub 2012 May 16.
3
Measuring fit of sequence data to phylogenetic model: gain of power using marginal tests.衡量序列数据与系统发育模型的拟合度:使用边缘检验增加功效。
J Mol Evol. 2009 Oct;69(4):289-99. doi: 10.1007/s00239-009-9268-8. Epub 2009 Oct 23.
4
Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships.能否将全面的背景知识纳入替代模型以改进系统发育分析?以主要节肢动物关系为例的研究。
BMC Evol Biol. 2009 May 27;9:119. doi: 10.1186/1471-2148-9-119.
5
Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution.核苷酸替换的平稳和非平稳模型下进化距离的估计。
Proc Natl Acad Sci U S A. 1998 May 26;95(11):5899-905. doi: 10.1073/pnas.95.11.5899.