• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

额外的安打:对瞬时多核苷酸变化的广泛实证支持。

Extra base hits: Widespread empirical support for instantaneous multiple-nucleotide changes.

作者信息

Lucaci Alexander G, Wisotsky Sadie R, Shank Stephen D, Weaver Steven, Kosakovsky Pond Sergei L

机构信息

Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, United States of America.

出版信息

PLoS One. 2021 Mar 12;16(3):e0248337. doi: 10.1371/journal.pone.0248337. eCollection 2021.

DOI:10.1371/journal.pone.0248337
PMID:33711070
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7954308/
Abstract

Despite many attempts to introduce evolutionary models that permit substitutions to instantly alter more than one nucleotide in a codon, the prevailing wisdom remains that such changes are rare and generally negligible or are reflective of non-biological artifacts, such as alignment errors. Codon models continue to posit that only single nucleotide change have non-zero rates. Here, we develop and test a simple hierarchy of codon-substitution models with non-zero evolutionary rates for only one-nucleotide (1H), one- and two-nucleotide (2H), or any (3H) codon substitutions. Using over 42, 000 empirical alignments, we find widespread statistical support for multiple hits: 61% of alignments prefer models with 2H allowed, and 23%-with 3H allowed. Analyses of simulated data suggest that these results are not likely to be due to simple artifacts such as model misspecification or alignment errors. Further modeling reveals that synonymous codon island jumping among codons encoding serine, especially along short branches, contributes significantly to this 3H signal. While serine codons were prominently involved in multiple-hit substitutions, there were other common exchanges contributing to better model fit. It appears that a small subset of sites in most alignments have unusual evolutionary dynamics not well explained by existing model formalisms, and that commonly estimated quantities, such as dN/dS ratios may be biased by model misspecification. Our findings highlight the need for continued evaluation of assumptions underlying workhorse evolutionary models and subsequent evolutionary inference techniques. We provide a software implementation for evolutionary biologists to assess the potential impact of extra base hits in their data in the HyPhy package and in the Datamonkey.org server.

摘要

尽管人们多次尝试引入允许替换瞬间改变密码子中多个核苷酸的进化模型,但普遍的观点仍然是,这种变化很少见,通常可以忽略不计,或者反映的是非生物学假象,比如比对错误。密码子模型仍然假定只有单核苷酸变化具有非零速率。在此,我们开发并测试了一个简单的密码子替换模型层次结构,该模型对于单核苷酸(1H)、单核苷酸和双核苷酸(2H)或任何(3H)密码子替换具有非零进化速率。使用超过42000个经验比对,我们发现对多次击中存在广泛的统计支持:61%的比对更倾向于允许2H的模型,23%的比对更倾向于允许3H的模型。对模拟数据的分析表明,这些结果不太可能归因于简单的假象,比如模型错误设定或比对错误。进一步的建模显示,编码丝氨酸的密码子之间同义密码子岛跳跃,尤其是沿着短分支的跳跃,对这个3H信号有显著贡献。虽然丝氨酸密码子在多次击中替换中显著参与,但还有其他常见的替换有助于更好地拟合模型。看来大多数比对中的一小部分位点具有不寻常的进化动态,现有模型形式无法很好地解释,而且通常估计的量,比如dN/dS比率可能会因模型错误设定而有偏差。我们的发现凸显了持续评估常用进化模型以及后续进化推断技术所依据假设的必要性。我们为进化生物学家提供了一个软件实现,以便在HyPhy软件包和Datamonkey.org服务器中评估其数据中额外碱基击中的潜在影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/cd0aace9d3b9/pone.0248337.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/af10a2770cf8/pone.0248337.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/f33fe03ed06e/pone.0248337.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/2d4925732883/pone.0248337.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/fd6b1c23133d/pone.0248337.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/cd0aace9d3b9/pone.0248337.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/af10a2770cf8/pone.0248337.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/f33fe03ed06e/pone.0248337.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/2d4925732883/pone.0248337.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/fd6b1c23133d/pone.0248337.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/175c/7954308/cd0aace9d3b9/pone.0248337.g005.jpg

相似文献

1
Extra base hits: Widespread empirical support for instantaneous multiple-nucleotide changes.额外的安打:对瞬时多核苷酸变化的广泛实证支持。
PLoS One. 2021 Mar 12;16(3):e0248337. doi: 10.1371/journal.pone.0248337. eCollection 2021.
2
Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates.当存在多核苷酸突变和多个非同义速率时,广义参数密码子模型下的位点特异性正选择的推断得到改进。
BMC Evol Biol. 2019 Jan 14;19(1):22. doi: 10.1186/s12862-018-1326-7.
3
Evolutionary Shortcuts via Multinucleotide Substitutions and Their Impact on Natural Selection Analyses.多核苷酸替换的进化捷径及其对自然选择分析的影响。
Mol Biol Evol. 2023 Jul 5;40(7). doi: 10.1093/molbev/msad150.
4
Empirical codon substitution matrix.经验密码子替换矩阵。
BMC Bioinformatics. 2005 Jun 1;6:134. doi: 10.1186/1471-2105-6-134.
5
Codon substitution models based on residue similarity and their applications.基于残基相似性的密码子替换模型及其应用。
Gene. 2012 Nov 1;509(1):136-41. doi: 10.1016/j.gene.2012.07.075. Epub 2012 Aug 10.
6
Fitting discrete probability distributions to evolutionary events.将离散概率分布拟合到进化事件中。
Science. 1971 Jun 11;172(3988):1089-96. doi: 10.1126/science.172.3988.1089.
7
An empirical codon model for protein sequence evolution.一种用于蛋白质序列进化的经验密码子模型。
Mol Biol Evol. 2007 Jul;24(7):1464-79. doi: 10.1093/molbev/msm064. Epub 2007 Mar 30.
8
PoSE: visualization of patterns of sequence evolution using PAML and MATLAB.PoSE:使用 PAML 和 MATLAB 可视化序列进化模式。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):364. doi: 10.1186/s12859-018-2335-7.
9
Codon substitution in evolution and the "saturation" of synonymous changes.进化中的密码子替换与同义变化的“饱和”
Genetics. 1983 Dec;105(4):1011-27. doi: 10.1093/genetics/105.4.1011.
10
Synonymous substitutions substantially improve evolutionary inference from highly diverged proteins.同义替换显著改善了从高度分化的蛋白质进行的进化推断。
Syst Biol. 2008 Jun;57(3):367-77. doi: 10.1080/10635150802158670.

引用本文的文献

1
A Sitewise Model of Natural Selection on Individual Antibodies via a Transformer-Encoder.一种通过Transformer编码器对个体抗体进行自然选择的位点特异性模型。
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf186.
2
Detecting Interspecific Positive Selection Using Convolutional Neural Networks.使用卷积神经网络检测种间正选择
Mol Biol Evol. 2025 Jul 1;42(7). doi: 10.1093/molbev/msaf154.
3
Comparative Phylogenetics Reveal Clade-specific Drivers of Recombination Rate Evolution Across Vertebrates.比较系统发育学揭示了脊椎动物中特定分支的重组率进化驱动因素。

本文引用的文献

1
Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.等概率离散模型低估了特定位置替代率的变化程度。
PLoS One. 2020 Mar 2;15(3):e0229493. doi: 10.1371/journal.pone.0229493. eCollection 2020.
2
Synonymous Site-to-Site Substitution Rate Variation Dramatically Inflates False Positive Rates of Selection Analyses: Ignore at Your Own Peril.同义站点间替换率的变化极大地夸大了选择分析的假阳性率:忽视后果自负。
Mol Biol Evol. 2020 Aug 1;37(8):2430-2439. doi: 10.1093/molbev/msaa037.
3
A Phenotype-Genotype Codon Model for Detecting Adaptive Evolution.
Mol Biol Evol. 2025 Apr 30;42(5). doi: 10.1093/molbev/msaf100.
4
Efficient discovery of frequently co-occurring mutations in a sequence database with matrix factorization.利用矩阵分解在序列数据库中高效发现频繁共现的突变。
PLoS Comput Biol. 2025 Apr 24;21(4):e1012391. doi: 10.1371/journal.pcbi.1012391. eCollection 2025 Apr.
5
A New Comparative Framework for Estimating Selection on Synonymous Substitutions.一种用于估计同义替换选择的新比较框架。
Mol Biol Evol. 2025 Apr 1;42(4). doi: 10.1093/molbev/msaf068.
6
A new comparative framework for estimating selection on synonymous substitutions.一种用于估计同义替换选择的新比较框架。
bioRxiv. 2025 Feb 6:2024.09.17.613331. doi: 10.1101/2024.09.17.613331.
7
Minus the Error: Testing for Positive Selection in the Presence of Residual Alignment Errors.减去误差:在存在残留比对误差的情况下检测正选择。
bioRxiv. 2025 Mar 21:2024.11.13.620707. doi: 10.1101/2024.11.13.620707.
8
The evolution of mammalian Rem2: unraveling the impact of purifying selection and coevolution on protein function, and implications for human disorders.哺乳动物Rem2的进化:揭示纯化选择和共同进化对蛋白质功能的影响及其对人类疾病的意义。
Front Bioinform. 2024 Jun 24;4:1381540. doi: 10.3389/fbinf.2024.1381540. eCollection 2024.
9
AOC: Analysis of Orthologous Collections - an application for the characterization of natural selection in protein-coding sequences.AOC:直系同源基因集分析——一种用于表征蛋白质编码序列中自然选择的应用程序。
ArXiv. 2024 Jun 13:arXiv:2406.09522v1.
10
Rapid evolution of genes with anti-cancer functions during the origins of large bodies and cancer resistance in elephants.在大象大体型和抗癌能力起源过程中具有抗癌功能的基因的快速进化。
bioRxiv. 2024 Feb 29:2024.02.27.582135. doi: 10.1101/2024.02.27.582135.
一种用于检测适应性进化的表型-基因型密码子模型。
Syst Biol. 2020 Jul 1;69(4):722-738. doi: 10.1093/sysbio/syz075.
4
HyPhy 2.5-A Customizable Platform for Evolutionary Hypothesis Testing Using Phylogenies.HyPhy 2.5-A 可定制的基于系统发生树的进化假说检验平台。
Mol Biol Evol. 2020 Jan 1;37(1):295-299. doi: 10.1093/molbev/msz197.
5
Exome-wide assessment of the functional impact and pathogenicity of multinucleotide mutations.外显子组范围内多核苷酸突变的功能影响和致病性评估。
Genome Res. 2019 Jul;29(7):1047-1056. doi: 10.1101/gr.239756.118. Epub 2019 Jun 21.
6
Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates.当存在多核苷酸突变和多个非同义速率时,广义参数密码子模型下的位点特异性正选择的推断得到改进。
BMC Evol Biol. 2019 Jan 14;19(1):22. doi: 10.1186/s12862-018-1326-7.
7
Immune genes are hotspots of shared positive selection across birds and mammals.免疫基因是鸟类和哺乳动物共有的正选择热点。
Elife. 2019 Jan 8;8:e41815. doi: 10.7554/eLife.41815.
8
Multinucleotide mutations cause false inferences of lineage-specific positive selection.多核苷酸突变导致谱系特异性正选择的错误推断。
Nat Ecol Evol. 2018 Aug;2(8):1280-1288. doi: 10.1038/s41559-018-0584-5. Epub 2018 Jul 2.
9
Phenomenological Load on Model Parameters Can Lead to False Biological Conclusions.现象学负荷对模型参数的影响可能导致错误的生物学结论。
Mol Biol Evol. 2018 Jun 1;35(6):1473-1488. doi: 10.1093/molbev/msy049.
10
Datamonkey 2.0: A Modern Web Application for Characterizing Selective and Other Evolutionary Processes.数据猴2.0:一款用于描述选择性及其他进化过程的现代网络应用程序。
Mol Biol Evol. 2018 Mar 1;35(3):773-777. doi: 10.1093/molbev/msx335.