• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在残基溶剂可及性的背景下对编码序列进化进行建模。

Modeling coding-sequence evolution within the context of residue solvent accessibility.

机构信息

Center for Computational Biology and Bioinformatics, Institute for Cellular and Molecular Biology, and Section of Integrative Biology, The University of Texas at Austin, Austin, TX 78712, USA.

出版信息

BMC Evol Biol. 2012 Sep 12;12:179. doi: 10.1186/1471-2148-12-179.

DOI:10.1186/1471-2148-12-179
PMID:22967129
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3527230/
Abstract

BACKGROUND

Protein structure mediates site-specific patterns of sequence divergence. In particular, residues in the core of a protein (solvent-inaccessible residues) tend to be more evolutionarily conserved than residues on the surface (solvent-accessible residues).

RESULTS

Here, we present a model of sequence evolution that explicitly accounts for the relative solvent accessibility of each residue in a protein. Our model is a variant of the Goldman-Yang 1994 (GY94) model in which all model parameters can be functions of the relative solvent accessibility (RSA) of a residue. We apply this model to a data set comprised of nearly 600 yeast genes, and find that an evolutionary-rate ratio ω that varies linearly with RSA provides a better model fit than an RSA-independent ω or an ω that is estimated separately in individual RSA bins. We further show that the branch length t and the transition-transverion ratio κ also vary with RSA. The RSA-dependent GY94 model performs better than an RSA-dependent Muse-Gaut 1994 (MG94) model in which the synonymous and non-synonymous rates individually are linear functions of RSA. Finally, protein core size affects the slope of the linear relationship between ω and RSA, and gene expression level affects both the intercept and the slope.

CONCLUSIONS

Structure-aware models of sequence evolution provide a significantly better fit than traditional models that neglect structure. The linear relationship between ω and RSA implies that genes are better characterized by their ω slope and intercept than by just their mean ω.

摘要

背景

蛋白质结构介导特定位置的序列差异。特别是,蛋白质核心(溶剂不可接近残基)中的残基比表面(溶剂可接近残基)中的残基更具进化保守性。

结果

在这里,我们提出了一种序列进化模型,该模型明确考虑了蛋白质中每个残基的相对溶剂可及性。我们的模型是 Goldman-Yang 1994(GY94)模型的变体,其中所有模型参数都可以是残基相对溶剂可及性(RSA)的函数。我们将该模型应用于由近 600 个酵母基因组成的数据集,并发现与 RSA 无关的 ω 或在单独的 RSA 箱中单独估计的 ω 相比,与 RSA 线性变化的进化率比 ω 提供了更好的模型拟合。我们进一步表明,分支长度 t 和转换-颠换比 κ 也随 RSA 而变化。与 RSA 相关的 GY94 模型在与 RSA 相关的 Muse-Gaut 1994(MG94)模型中的表现更好,其中同义和非同义率分别是 RSA 的线性函数。最后,蛋白质核心大小影响 ω 与 RSA 之间线性关系的斜率,而基因表达水平影响截距和斜率。

结论

与忽略结构的传统模型相比,具有结构意识的序列进化模型提供了更好的拟合。ω 与 RSA 之间的线性关系意味着基因可以通过 ω 斜率和截距而不是仅通过平均 ω 更好地描述。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/11c82936baf5/1471-2148-12-179-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/b9d956053591/1471-2148-12-179-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/7326213937a4/1471-2148-12-179-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/282702f0d5af/1471-2148-12-179-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/f9082161c565/1471-2148-12-179-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/4e866973e6a9/1471-2148-12-179-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/11c82936baf5/1471-2148-12-179-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/b9d956053591/1471-2148-12-179-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/7326213937a4/1471-2148-12-179-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/282702f0d5af/1471-2148-12-179-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/f9082161c565/1471-2148-12-179-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/4e866973e6a9/1471-2148-12-179-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b17d/3527230/11c82936baf5/1471-2148-12-179-6.jpg

相似文献

1
Modeling coding-sequence evolution within the context of residue solvent accessibility.在残基溶剂可及性的背景下对编码序列进化进行建模。
BMC Evol Biol. 2012 Sep 12;12:179. doi: 10.1186/1471-2148-12-179.
2
Integrating sequence variation and protein structure to identify sites under selection.整合序列变异和蛋白质结构以鉴定受选择的位点。
Mol Biol Evol. 2013 Jan;30(1):36-44. doi: 10.1093/molbev/mss217. Epub 2012 Sep 12.
3
The relationship between relative solvent accessibility and evolutionary rate in protein evolution.蛋白质进化中相对溶剂可及性与进化速率的关系。
Genetics. 2011 Jun;188(2):479-88. doi: 10.1534/genetics.111.128025. Epub 2011 Apr 5.
4
Independent effects of protein core size and expression on residue-level structure-evolution relationships.蛋白质核心大小和表达对残基水平结构进化关系的独立影响。
PLoS One. 2012;7(10):e46602. doi: 10.1371/journal.pone.0046602. Epub 2012 Oct 3.
5
Structural Determinants of Yeast Protein-Protein Interaction Interface Evolution at the Residue Level.酵母蛋白-蛋白相互作用界面在残基水平上进化的结构决定因素。
J Mol Biol. 2022 Oct 15;434(19):167750. doi: 10.1016/j.jmb.2022.167750. Epub 2022 Jul 16.
6
Linear regression models for solvent accessibility prediction in proteins.用于蛋白质溶剂可及性预测的线性回归模型。
J Comput Biol. 2005 Apr;12(3):355-69. doi: 10.1089/cmb.2005.12.355.
7
Structural determinants of protein evolution are context-sensitive at the residue level.蛋白质进化的结构决定因素在残基水平上是上下文敏感的。
Mol Biol Evol. 2009 Oct;26(10):2387-95. doi: 10.1093/molbev/msp146. Epub 2009 Jul 13.
8
On the relation between residue flexibility and local solvent accessibility in proteins.关于蛋白质中残基柔性与局部溶剂可及性之间的关系。
Proteins. 2009 Aug 15;76(3):617-36. doi: 10.1002/prot.22375.
9
Site-specific structural constraints on protein sequence evolutionary divergence: local packing density versus solvent exposure.蛋白质序列进化分歧的位点特异性结构约束:局部堆积密度与溶剂暴露。
Mol Biol Evol. 2014 Jan;31(1):135-9. doi: 10.1093/molbev/mst178. Epub 2013 Oct 8.
10
Roles of solvent accessibility and gene expression in modeling protein sequence evolution.溶剂可及性和基因表达在蛋白质序列进化建模中的作用。
Evol Bioinform Online. 2015 Apr 29;11:85-96. doi: 10.4137/EBO.S22911. eCollection 2015.

引用本文的文献

1
Improved prediction of site-rates from structure with averaging across homologs.通过同源物平均化提高从结构预测位点速率。
Protein Sci. 2024 Jul;33(7):e5086. doi: 10.1002/pro.5086.
2
Insertion-Deletion Events Are Depleted in Protein Regions with Predicted Secondary Structure.插入-缺失事件在具有预测二级结构的蛋白质区域中减少。
Genome Biol Evol. 2024 May 2;16(5). doi: 10.1093/gbe/evae093.
3
Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness.大规模计算机诱变实验揭示了遗传密码和密码子使用的优化,以提高蛋白质突变稳健性。

本文引用的文献

1
Integrating sequence variation and protein structure to identify sites under selection.整合序列变异和蛋白质结构以鉴定受选择的位点。
Mol Biol Evol. 2013 Jan;30(1):36-44. doi: 10.1093/molbev/mss217. Epub 2012 Sep 12.
2
The relationship between relative solvent accessibility and evolutionary rate in protein evolution.蛋白质进化中相对溶剂可及性与进化速率的关系。
Genetics. 2011 Jun;188(2):479-88. doi: 10.1534/genetics.111.128025. Epub 2011 Apr 5.
3
Translationally optimal codons associate with aggregation-prone sites in proteins.
BMC Biol. 2020 Oct 20;18(1):146. doi: 10.1186/s12915-020-00870-9.
4
Influence of mutation bias and hydrophobicity on the substitution rates and sequence entropies of protein evolution.突变偏好性和疏水性对蛋白质进化中替换率和序列熵的影响。
PeerJ. 2018 Oct 5;6:e5549. doi: 10.7717/peerj.5549. eCollection 2018.
5
Differential strengths of molecular determinants guide environment specific mutational fates.分子决定因素的差异强度指导环境特异性突变命运。
PLoS Genet. 2018 May 29;14(5):e1007419. doi: 10.1371/journal.pgen.1007419. eCollection 2018 May.
6
Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates.在氨基酸或密码子水平计算特定位点的进化速率会得出相似的速率估计值。
PeerJ. 2017 May 30;5:e3391. doi: 10.7717/peerj.3391. eCollection 2017.
7
Biophysical Models of Protein Evolution: Understanding the Patterns of Evolutionary Sequence Divergence.蛋白质进化的生物物理模型:理解进化序列分歧模式
Annu Rev Biophys. 2017 May 22;46:85-103. doi: 10.1146/annurev-biophys-070816-033819. Epub 2017 Mar 15.
8
Functional Sites Induce Long-Range Evolutionary Constraints in Enzymes.功能位点在酶中诱导长程进化限制。
PLoS Biol. 2016 May 3;14(5):e1002452. doi: 10.1371/journal.pbio.1002452. eCollection 2016 May.
9
Dissecting the roles of local packing density and longer-range effects in protein sequence evolution.剖析局部堆积密度和长程效应在蛋白质序列进化中的作用。
Proteins. 2016 Jun;84(6):841-54. doi: 10.1002/prot.25034. Epub 2016 Apr 9.
10
Intermediate divergence levels maximize the strength of structure-sequence correlations in enzymes and viral proteins.中等发散水平可使酶和病毒蛋白中结构-序列相关性的强度最大化。
Protein Sci. 2016 Jul;25(7):1341-53. doi: 10.1002/pro.2920. Epub 2016 Mar 24.
翻译最优密码子与蛋白质中的聚集倾向位点相关。
Proteomics. 2010 Dec;10(23):4163-71. doi: 10.1002/pmic.201000229.
4
CodonTest: modeling amino acid substitution preferences in coding sequences.CodonTest:建模编码序列中氨基酸替换偏好。
PLoS Comput Biol. 2010 Aug 19;6(8):e1000885. doi: 10.1371/journal.pcbi.1000885.
5
Benchmarking multi-rate codon models.基准多速率密码子模型。
PLoS One. 2010 Jul 21;5(7):e11587. doi: 10.1371/journal.pone.0011587.
6
Evolutionary fingerprinting of genes.基因的进化特征分析。
Mol Biol Evol. 2010 Mar;27(3):520-36. doi: 10.1093/molbev/msp260. Epub 2009 Oct 28.
7
Structural determinants of protein evolution are context-sensitive at the residue level.蛋白质进化的结构决定因素在残基水平上是上下文敏感的。
Mol Biol Evol. 2009 Oct;26(10):2387-95. doi: 10.1093/molbev/msp146. Epub 2009 Jul 13.
8
Computational methods for evaluating phylogenetic models of coding sequence evolution with dependence between codons.用于评估密码子间存在依赖性的编码序列进化系统发育模型的计算方法。
Mol Biol Evol. 2009 Jul;26(7):1663-76. doi: 10.1093/molbev/msp078. Epub 2009 Apr 21.
9
Translationally optimal codons associate with structurally sensitive sites in proteins.翻译最优密码子与蛋白质中结构敏感位点相关联。
Mol Biol Evol. 2009 Jul;26(7):1571-80. doi: 10.1093/molbev/msp070. Epub 2009 Apr 6.
10
Solvent exposure imparts similar selective pressures across a range of yeast proteins.溶剂暴露会对一系列酵母蛋白施加相似的选择压力。
Mol Biol Evol. 2009 May;26(5):1155-61. doi: 10.1093/molbev/msp031. Epub 2009 Feb 20.