Integrating sequence variation and protein structure to identify sites under selection.

Author information

Section of Integrative Biology, Institute for Cellular and Molecular Biology, Center for Computational Biology and Bioinformatics, University of Texas at Austin, Austin, TX, USA.

Publication information

Mol Biol Evol. 2013 Jan;30(1):36-44. doi: 10.1093/molbev/mss217. Epub 2012 Sep 12.

DOI:10.1093/molbev/mss217

PMID:22977116

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3525147/

Abstract

We present a novel method to identify sites under selection in protein-coding genes. Our method combines the traditional Goldman-Yang model of coding-sequence evolution with the information obtained from the 3D structure of the evolving protein, specifically the relative solvent accessibility (RSA) of individual residues. We develop a random-effects likelihood sites model in which rate classes are RSA dependent. The RSA dependence is modeled with linear functions. We demonstrate that our RSA-dependent model provides a significantly better fit to molecular sequence data than does a traditional, RSA-independent model. We further show that our model provides a natural, RSA-dependent neutral baseline for the evolutionary rate ratio ω = dN/dS. Sites that deviate from this neutral baseline likely experience selection pressure for function. We apply our method to the influenza proteins hemagglutinin and neuraminidase. For hemagglutinin, our method recovers positively selected sites near the sialic acid-binding site and negatively selected sites that may be important for trimerization. For neuraminidase, our method recovers the oseltamivir resistance site and otherwise suggests that few sites deviate from the neutral baseline. Our method is broadly applicable to any protein sequences for which structural data are available or can be obtained via homology modeling or threading.
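The idea of an RSA-dependent neutral baseline can be illustrated with a minimal Python sketch. It is not the authors' random-effects likelihood model: the class `LinearBaseline`, the function `classify_site`, the fixed `tolerance` cutoff, and all numeric values are assumptions introduced here for illustration only. The sketch evaluates a linear function ω(RSA) = intercept + slope × RSA and labels a site by whether its estimated ω lies above, below, or near that expectation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LinearBaseline:
    """Hypothetical linear baseline: omega(RSA) = intercept + slope * RSA."""

    intercept: float
    slope: float

    def omega(self, rsa: float) -> float:
        """Expected dN/dS at a residue with the given relative solvent accessibility."""
        if not 0.0 <= rsa <= 1.0:
            raise ValueError("RSA must lie in [0, 1]")
        # Clamp at zero because a rate ratio cannot be negative.
        return max(0.0, self.intercept + self.slope * rsa)


def classify_site(baseline: LinearBaseline, rsa: float, omega_site: float,
                  tolerance: float = 0.25) -> str:
    """Label a site relative to the baseline.

    The fixed tolerance is an assumption of this sketch; the paper itself
    works within a likelihood framework rather than a hard cutoff.
    """
    expected = baseline.omega(rsa)
    if omega_site > expected + tolerance:
        return "above baseline"
    if omega_site < expected - tolerance:
        return "below baseline"
    return "near baseline"


# Illustrative parameter values only; they are not taken from the paper.
baseline = LinearBaseline(intercept=0.1, slope=0.4)

# Made-up (RSA, estimated omega) pairs for three sites.
sites = [(0.05, 0.1), (0.9, 1.6), (0.6, 0.0)]
labels = [classify_site(baseline, rsa, w) for rsa, w in sites]

for (rsa, w), label in zip(sites, labels):
    print(f"RSA={rsa:.2f} omega={w:.2f} -> {label}")
```

Under these assumptions, a buried site with low ω sits near the baseline, whereas an exposed site is flagged only if its ω exceeds the higher expectation for exposed residues. This mirrors the abstract's point that deviation is judged relative to RSA rather than against a single protein-wide value.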

