• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质结构与进化:它们是否受源于信息论的原理全局约束?

Protein structure and evolution: are they constrained globally by a principle derived from information theory?

作者信息

Hatton Leslie, Warr Gregory

机构信息

Faculty of Science, Engineering and Computing, Kingston University, London, UK.

Medical University of South Carolina, Charleston, South Carolina, USA.

出版信息

PLoS One. 2015 May 13;10(5):e0125663. doi: 10.1371/journal.pone.0125663. eCollection 2015.

DOI:10.1371/journal.pone.0125663
PMID:25970335
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4429977/
Abstract

That the physicochemical properties of amino acids constrain the structure, function and evolution of proteins is not in doubt. However, principles derived from information theory may also set bounds on the structure (and thus also the evolution) of proteins. Here we analyze the global properties of the full set of proteins in release 13-11 of the SwissProt database, showing by experimental test of predictions from information theory that their collective structure exhibits properties that are consistent with their being guided by a conservation principle. This principle (Conservation of Information) defines the global properties of systems composed of discrete components each of which is in turn assembled from discrete smaller pieces. In the system of proteins, each protein is a component, and each protein is assembled from amino acids. Central to this principle is the inter-relationship of the unique amino acid count and total length of a protein and its implications for both average protein length and occurrence of proteins with specific unique amino acid counts. The unique amino acid count is simply the number of distinct amino acids (including those that are post-translationally modified) that occur in a protein, and is independent of the number of times that the particular amino acid occurs in the sequence. Conservation of Information does not operate at the local level (it is independent of the physicochemical properties of the amino acids) where the influences of natural selection are manifest in the variety of protein structure and function that is well understood. Rather, this analysis implies that Conservation of Information would define the global bounds within which the whole system of proteins is constrained; thus it appears to be acting to constrain evolution at a level different from natural selection, a conclusion that appears counter-intuitive but is supported by the studies described herein.

摘要

氨基酸的物理化学性质限制蛋白质的结构、功能和进化,这一点毋庸置疑。然而,从信息论推导出来的原理也可能为蛋白质的结构(以及进化)设定界限。在此,我们分析了SwissProt数据库13 - 11版本中全套蛋白质的整体特性,通过对信息论预测的实验验证表明,它们的整体结构呈现出的特性与受守恒原理指导相一致。这一原理(信息守恒)定义了由离散成分组成的系统的整体特性,其中每个成分又依次由更小的离散片段组装而成。在蛋白质系统中,每个蛋白质是一个成分,且每个蛋白质由氨基酸组装而成。该原理的核心是蛋白质独特氨基酸数量、总长度之间的相互关系,以及这对平均蛋白质长度和具有特定独特氨基酸数量的蛋白质出现情况的影响。独特氨基酸数量简单来说就是蛋白质中出现的不同氨基酸的数量(包括那些翻译后修饰的氨基酸),且与特定氨基酸在序列中出现的次数无关。信息守恒并不在局部层面起作用(它独立于氨基酸的物理化学性质),在局部层面,自然选择的影响体现在人们熟知的各种蛋白质结构和功能中。相反,该分析表明信息守恒会定义蛋白质整个系统所受限制的全局界限;因此,它似乎在一个不同于自然选择的层面上对进化起到限制作用,这一结论看似违反直觉,但得到了本文所述研究的支持。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/fc733a0f9470/pone.0125663.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/a7977ccf26e8/pone.0125663.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/f0953fb89757/pone.0125663.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/fc86dc6f4824/pone.0125663.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/9e61287b030b/pone.0125663.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/420f4ec8fabf/pone.0125663.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/6824a8935962/pone.0125663.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/fc733a0f9470/pone.0125663.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/a7977ccf26e8/pone.0125663.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/f0953fb89757/pone.0125663.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/fc86dc6f4824/pone.0125663.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/9e61287b030b/pone.0125663.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/420f4ec8fabf/pone.0125663.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/6824a8935962/pone.0125663.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98fc/4429977/fc733a0f9470/pone.0125663.g007.jpg

相似文献

1
Protein structure and evolution: are they constrained globally by a principle derived from information theory?蛋白质结构与进化:它们是否受源于信息论的原理全局约束?
PLoS One. 2015 May 13;10(5):e0125663. doi: 10.1371/journal.pone.0125663. eCollection 2015.
2
A protein evolution model with independent sites that reproduces site-specific amino acid distributions from the Protein Data Bank.一种具有独立位点的蛋白质进化模型,可从蛋白质数据库中重现位点特异性氨基酸分布。
BMC Evol Biol. 2006 May 31;6:43. doi: 10.1186/1471-2148-6-43.
3
Reduced alphabet of prebiotic amino acids optimally encodes the conformational space of diverse extant protein folds.简化的前生物氨基酸字母表最优地编码了不同现存蛋白质折叠的构象空间。
BMC Evol Biol. 2019 Jul 30;19(1):158. doi: 10.1186/s12862-019-1464-6.
4
The genetic code is not an optimal code in a model taking into account both the biosynthetic relationships between amino acids and their physicochemical properties.遗传密码在考虑到氨基酸的生物合成关系及其物理化学性质的模型中并不是最优密码。
J Theor Biol. 2018 Dec 14;459:45-51. doi: 10.1016/j.jtbi.2018.09.021. Epub 2018 Sep 20.
5
Conservation of the secondary structure of protein during evolution and the role of the genetic code.
Orig Life Evol Biosph. 1985;16(2):157-64. doi: 10.1007/BF01809469.
6
Evolution of sparsity and modularity in a model of protein allostery.蛋白质变构模型中稀疏性和模块化的演变
Phys Rev E Stat Nonlin Soft Matter Phys. 2015 Apr;91(4):042704. doi: 10.1103/PhysRevE.91.042704. Epub 2015 Apr 9.
7
Distance-dependent classification of amino acids by information theory.基于信息论的氨基酸距离相关分类。
Proteins. 2010 Aug 1;78(10):2322-8. doi: 10.1002/prot.22744.
8
Strong evidence of an information-theoretical conservation principle linking all discrete systems.有强有力的证据表明存在一条将所有离散系统联系起来的信息理论守恒原理。
R Soc Open Sci. 2019 Oct 23;6(10):191101. doi: 10.1098/rsos.191101. eCollection 2019 Oct.
9
Inferring property selection pressure from positional residue conservation.从位置残基保守性推断属性选择压力。
Appl Bioinformatics. 2004;3(2-3):167-79. doi: 10.2165/00822942-200403020-00011.
10
Distinguishing structural and functional restraints in evolution in order to identify interaction sites.区分进化中的结构和功能限制以识别相互作用位点。
J Mol Biol. 2004 Oct 1;342(5):1487-504. doi: 10.1016/j.jmb.2004.08.022.

引用本文的文献

1
The Origin of Shared Emergent Properties in Discrete Systems.离散系统中共享涌现特性的起源。
Entropy (Basel). 2025 May 26;27(6):561. doi: 10.3390/e27060561.
2
Strong evidence of an information-theoretical conservation principle linking all discrete systems.有强有力的证据表明存在一条将所有离散系统联系起来的信息理论守恒原理。
R Soc Open Sci. 2019 Oct 23;6(10):191101. doi: 10.1098/rsos.191101. eCollection 2019 Oct.

本文引用的文献

1
Advances in LC-MS/MS-based glycoproteomics: getting closer to system-wide site-specific mapping of the N- and O-glycoproteome.基于液相色谱-串联质谱的糖蛋白质组学进展:更接近N-糖蛋白质组和O-糖蛋白质组的全系统位点特异性图谱分析。
Biochim Biophys Acta. 2014 Sep;1844(9):1437-52. doi: 10.1016/j.bbapap.2014.05.002. Epub 2014 May 12.
2
Validation of the reliability of computational O-GlcNAc prediction.计算性O-连接N-乙酰葡糖胺预测可靠性的验证。
Biochim Biophys Acta. 2014 Feb;1844(2):416-21. doi: 10.1016/j.bbapap.2013.12.002. Epub 2013 Dec 9.
3
UniCarbKB: building a knowledge platform for glycoproteomics.
UniCarbKB:构建糖蛋白质组学知识库平台。
Nucleic Acids Res. 2014 Jan;42(Database issue):D215-21. doi: 10.1093/nar/gkt1128. Epub 2013 Nov 13.
4
Pandoraviruses: amoeba viruses with genomes up to 2.5 Mb reaching that of parasitic eukaryotes.潘多拉病毒:具有高达 2.5Mb 基因组的变形虫病毒,达到了寄生真核生物的大小。
Science. 2013 Jul 19;341(6143):281-6. doi: 10.1126/science.1239181.
5
N- and O-glycosylation in the murine synaptosome.鼠突触体中的 N- 和 O-糖基化。
Mol Cell Proteomics. 2013 Dec;12(12):3474-88. doi: 10.1074/mcp.M113.030007. Epub 2013 Jul 1.
6
Proteome wide purification and identification of O-GlcNAc-modified proteins using click chemistry and mass spectrometry.利用点击化学和质谱法对 O-GlcNAc 修饰蛋白进行蛋白质组范围的纯化和鉴定。
J Proteome Res. 2013 Feb 1;12(2):927-36. doi: 10.1021/pr300967y. Epub 2013 Jan 18.
7
Word decoding of protein amino Acid sequences with availability analysis: a linguistic approach.蛋白质氨基酸序列的词法解码与可用性分析:一种语言学法。
PLoS One. 2012;7(11):e50039. doi: 10.1371/journal.pone.0050039. Epub 2012 Nov 21.
8
Post-translational modification: nature's escape from genetic imprisonment and the basis for dynamic information encoding.翻译后修饰:自然摆脱基因束缚的方式及动态信息编码的基础
Wiley Interdiscip Rev Syst Biol Med. 2012 Nov-Dec;4(6):565-83. doi: 10.1002/wsbm.1185. Epub 2012 Aug 15.
9
A universal scaling law determines time reversibility and steady state of substitutions under selection.一个通用的标度律决定了选择作用下替换的时间可逆性和稳态。
Theor Popul Biol. 2012 Aug;82(1):66-76. doi: 10.1016/j.tpb.2012.03.007.
10
Global identification and characterization of both O-GlcNAcylation and phosphorylation at the murine synapse.在鼠突触中对 O-连接糖基化和磷酸化进行全局鉴定和特征分析。
Mol Cell Proteomics. 2012 Aug;11(8):215-29. doi: 10.1074/mcp.O112.018366. Epub 2012 May 29.