• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

为什么在成对序列比较中,差异更大的序列产生更小的非同义/同义替换率比值?

Why do more divergent sequences produce smaller nonsynonymous/synonymous rate ratios in pairwise sequence comparisons?

机构信息

Department of Genetics, Evolution, and Environment, University College London, London, WC1E 6BT, United Kingdom.

出版信息

Genetics. 2013 Sep;195(1):195-204. doi: 10.1534/genetics.113.152025. Epub 2013 Jun 21.

DOI:10.1534/genetics.113.152025
PMID:23792953
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3761301/
Abstract

Several studies have reported a negative correlation between estimates of the nonsynonymous to synonymous rate ratio (ω = dN/dS) and the sequence distance d in pairwise comparisons of the same gene from different species. That is, more divergent sequences produce smaller estimates of ω. Explanations for this negative correlation have included segregating nonsynonymous polymorphisms in closely related species and nonlinear dynamics of the ratio of two random variables. Here we study the statistical properties of the maximum-likelihood estimates of ω and d in pairwise alignments and explore the possibility that the negative correlation can be entirely explained by those properties. We show that the ω estimate is positively biased for small d and that the bias decreases with the increase of d. We also show that the estimates of ω and d are negatively correlated when ω < 1 and positively correlated when ω > 1. However, the bias in estimates of ω and the correlation between estimates of ω and d are not enough to explain the much stronger correlation observed in real data sets. We then explore the behavior of the estimates when the model is misspecified and suggest that the observed correlation may be due to protein-level selection that causes very different amino acids to be favored in different domains of the protein. Widely used models fail to account for such among-site heterogeneity and cause underestimation of the nonsynonymous rate and ω, with the bias being much stronger for distant sequences. We point out that tests of positive selection based on the ω ratio are invariant to the parameterization of the model and thus unaffected by bias in the ω estimates or the correlation between estimates of ω and d.

摘要

已有几项研究报告称,在比较不同物种同一基因的成对序列时,非同义替换与同义替换的比率(ω=dN/dS)的估计值与序列距离 d 之间存在负相关关系。也就是说,差异较大的序列产生的ω估计值较小。对于这种负相关关系的解释包括在亲缘关系较近的物种中存在分离的非同义多态性和两个随机变量比值的非线性动态。在这里,我们研究了成对比对中最大似然估计值 ω 和 d 的统计性质,并探讨了这种负相关关系是否可以完全由这些性质来解释。我们表明,对于较小的 d,ω 的估计值存在正偏差,并且随着 d 的增加,偏差会减小。我们还表明,当 ω < 1 时,ω 和 d 的估计值呈负相关,当 ω > 1 时,ω 和 d 的估计值呈正相关。然而,ω 估计值的偏差以及 ω 和 d 的估计值之间的相关性不足以解释在实际数据集观察到的更强相关性。然后,我们探索了模型误定时估计值的行为,并提出观察到的相关性可能是由于蛋白质水平的选择,导致不同蛋白质域中偏好非常不同的氨基酸。广泛使用的模型无法解释这种位置异质性,导致非同义替换率和 ω 的低估,对于距离较远的序列,偏差要强得多。我们指出,基于ω 比值的正选择检验对于模型的参数化是不变的,因此不受 ω 估计值的偏差或 ω 和 d 的估计值之间的相关性的影响。

相似文献

1
Why do more divergent sequences produce smaller nonsynonymous/synonymous rate ratios in pairwise sequence comparisons?为什么在成对序列比较中,差异更大的序列产生更小的非同义/同义替换率比值?
Genetics. 2013 Sep;195(1):195-204. doi: 10.1534/genetics.113.152025. Epub 2013 Jun 21.
2
Bayesian estimation of nonsynonymous/synonymous rate ratios for pairwise sequence comparisons.用于成对序列比较的非同义/同义率比的贝叶斯估计。
Mol Biol Evol. 2014 Jul;31(7):1902-13. doi: 10.1093/molbev/msu142. Epub 2014 Apr 18.
3
The positive correlation between dN/dS and dS in mammals is due to runs of adjacent substitutions.哺乳动物中 dN/dS 与 dS 的正相关性是由于相邻替换的连续出现。
Mol Biol Evol. 2011 Apr;28(4):1371-80. doi: 10.1093/molbev/msq320. Epub 2010 Nov 29.
4
Preponderance of slightly deleterious polymorphism in mitochondrial DNA: nonsynonymous/synonymous rate ratio is much higher within species than between species.线粒体DNA中轻度有害多态性的优势:种内非同义/同义比率远高于种间。
Mol Biol Evol. 1998 Nov;15(11):1499-505. doi: 10.1093/oxfordjournals.molbev.a025877.
5
Polymorphism Data Assist Estimation of the Nonsynonymous over Synonymous Fixation Rate Ratio ω for Closely Related Species.多态性数据辅助估计近缘物种非同义替换与同义替换固定率比值ω。
Mol Biol Evol. 2020 Jan 1;37(1):260-279. doi: 10.1093/molbev/msz203.
6
Uncorrected nucleotide bias in mtDNA can mimic the effects of positive Darwinian selection.线粒体DNA中未经校正的核苷酸偏差可模拟正向达尔文选择的效应。
Mol Biol Evol. 2008 Dec;25(12):2521-4. doi: 10.1093/molbev/msn224. Epub 2008 Oct 8.
7
Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates.当存在多核苷酸突变和多个非同义速率时,广义参数密码子模型下的位点特异性正选择的推断得到改进。
BMC Evol Biol. 2019 Jan 14;19(1):22. doi: 10.1186/s12862-018-1326-7.
8
Synonymous and nonsynonymous rate variation in nuclear genes of mammals.哺乳动物核基因中的同义突变率和非同义突变率变异
J Mol Evol. 1998 Apr;46(4):409-18. doi: 10.1007/pl00006320.
9
Molecular evolution of nuclear genes in Cupressacea, a group of conifer trees.柏科(一类针叶树)核基因的分子进化
Mol Biol Evol. 2002 May;19(5):736-47. doi: 10.1093/oxfordjournals.molbev.a004132.
10
Codon-substitution models for heterogeneous selection pressure at amino acid sites.氨基酸位点上异质选择压力的密码子替换模型。
Genetics. 2000 May;155(1):431-49. doi: 10.1093/genetics/155.1.431.

引用本文的文献

1
Myxozoan parasite genomes assembled from contaminated host data reveal extensive gene order conservation and rapid sequence evolution.从受污染的宿主数据中组装的粘孢子虫寄生虫基因组揭示了广泛的基因顺序保守性和快速的序列进化。
G3 (Bethesda). 2025 Jul 9;15(7). doi: 10.1093/g3journal/jkaf061.
2
Genomic comparison of non-photosynthetic plants from the family Balanophoraceae with their photosynthetic relatives.蛇菰科非光合植物与其光合近缘植物的基因组比较。
PeerJ. 2021 Aug 31;9:e12106. doi: 10.7717/peerj.12106. eCollection 2021.
3
Faster Rates of Molecular Sequence Evolution in Reproduction-Related Genes and in Species with Hypodermic Sperm Morphologies.生殖相关基因和具有皮下精子形态的物种中分子序列进化的更快速率。
Mol Biol Evol. 2021 Dec 9;38(12):5685-5703. doi: 10.1093/molbev/msab276.
4
has one of the most reduced and mutated plastid genomes known.拥有已知的最为简化和发生突变的质体基因组之一。
PeerJ. 2019 Sep 12;7:e7500. doi: 10.7717/peerj.7500. eCollection 2019.
5
Evolutionary dynamics of the chromatophore genome in three photosynthetic Paulinella species.三种光合 Paulinella 物种中色素体基因组的进化动态。
Sci Rep. 2019 Feb 22;9(1):2560. doi: 10.1038/s41598-019-38621-8.
6
Species groups distributed across elevational gradients reveal convergent and continuous genetic adaptation to high elevations.分布在海拔梯度上的物种群揭示了对高海拔的趋同和连续的遗传适应。
Proc Natl Acad Sci U S A. 2018 Nov 6;115(45):E10634-E10641. doi: 10.1073/pnas.1813593115. Epub 2018 Oct 22.
7
Time dependence of evolutionary metrics during the 2009 pandemic influenza virus outbreak.2009年甲型H1N1流感大流行期间进化指标的时间依赖性
Virus Evol. 2015 Jan;1(1). doi: 10.1093/ve/vev006. Epub 2015 Jan 1.
8
The rate of adaptive evolution in animal mitochondria.动物线粒体中的适应性进化速率。
Mol Ecol. 2016 Jan;25(1):67-78. doi: 10.1111/mec.13475. Epub 2015 Dec 17.
9
Heterogeneity of dN/dS Ratios at the Classical HLA Class I Genes over Divergence Time and Across the Allelic Phylogeny.经典HLA I类基因的非同义/同义替换率(dN/dS)在进化时间和等位基因系统发育中的异质性。
J Mol Evol. 2016 Jan;82(1):38-50. doi: 10.1007/s00239-015-9713-9. Epub 2015 Nov 14.
10
Declining transition/transversion ratios through time reveal limitations to the accuracy of nucleotide substitution models.随着时间推移,转换/颠换比下降揭示了核苷酸替换模型准确性的局限性。
BMC Evol Biol. 2015 Mar 11;15:36. doi: 10.1186/s12862-015-0312-6.

本文引用的文献

1
Estimating the distribution of selection coefficients from phylogenetic data using sitewise mutation-selection models.利用基于位点的突变-选择模型从系统发育数据估计选择系数的分布。
Genetics. 2012 Mar;190(3):1101-15. doi: 10.1534/genetics.111.136432. Epub 2011 Dec 29.
2
Nonlinear dynamics of nonsynonymous (dN) and synonymous (dS) substitution rates affects inference of selection.非 synonymous(dN)和同义(dS)替换率的非线性动力学影响选择的推断。
Genome Biol Evol. 2009 Aug 13;1:308-19. doi: 10.1093/gbe/evp030.
3
Quantitative prediction of molecular clock and ka/ks at short timescales.短时间尺度下分子钟和ka/ks的定量预测。
Mol Biol Evol. 2009 Nov;26(11):2595-603. doi: 10.1093/molbev/msp175. Epub 2009 Aug 6.
4
Likelihood-based clustering (LiBaC) for codon models, a method for grouping sites according to similarities in the underlying process of evolution.基于似然性的密码子模型聚类(LiBaC),一种根据潜在进化过程中的相似性对位点进行分组的方法。
Mol Biol Evol. 2008 Sep;25(9):1995-2007. doi: 10.1093/molbev/msn145. Epub 2008 Jun 26.
5
Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage.密码子替换的突变选择模型及其在估计密码子使用选择强度方面的应用。
Mol Biol Evol. 2008 Mar;25(3):568-79. doi: 10.1093/molbev/msm284. Epub 2008 Jan 3.
6
PAML 4: phylogenetic analysis by maximum likelihood.PAML 4:基于最大似然法的系统发育分析。
Mol Biol Evol. 2007 Aug;24(8):1586-91. doi: 10.1093/molbev/msm088. Epub 2007 May 4.
7
Methods for selecting fixed-effect models for heterogeneous codon evolution, with comments on their application to gene and genome data.用于选择异质密码子进化固定效应模型的方法,及其在基因和基因组数据中的应用评注。
BMC Evol Biol. 2007 Feb 8;7 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2148-7-S1-S5.
8
Comparisons of dN/dS are time dependent for closely related bacterial genomes.对于密切相关的细菌基因组,非同义替换率与同义替换率的比较是随时间变化的。
J Theor Biol. 2006 Mar 21;239(2):226-35. doi: 10.1016/j.jtbi.2005.08.037. Epub 2005 Oct 18.
9
Statistical methods for detecting molecular adaptation.检测分子适应性的统计方法。
Trends Ecol Evol. 2000 Dec 1;15(12):496-503. doi: 10.1016/s0169-5347(00)01994-7.
10
Codon-substitution models for heterogeneous selection pressure at amino acid sites.氨基酸位点上异质选择压力的密码子替换模型。
Genetics. 2000 May;155(1):431-49. doi: 10.1093/genetics/155.1.431.