• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质序列的层次结构。

Hierarchical Structure of Protein Sequence.

机构信息

Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia.

Steklov Mathematical Institute and of Russian Academy of Sciences, 8 Gubkina St., 119991 Moscow, Russia.

出版信息

Int J Mol Sci. 2021 Aug 3;22(15):8339. doi: 10.3390/ijms22158339.

DOI:10.3390/ijms22158339
PMID:34361104
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8348890/
Abstract

Most non-communicable diseases are associated with dysfunction of proteins or protein complexes. The relationship between sequence and structure has been analyzed for a long time, and the analysis of the sequences organization in domains and motifs remains an actual research area. Here, we propose a mathematical method for revealing the hierarchical organization of protein sequences. The method is based on the pentapeptide as a unit of protein sequences. Employing the frequency of occurrence of pentapeptides in sequences of natural proteins and a special mathematical approach, this method revealed a hierarchical structure in the protein sequence. The method was applied to 24,647 non-homologous protein sequences with sizes ranging from 50 to 400 residues from the NRDB90 database. Statistical analysis of the branching points of the graphs revealed 11 characteristic values of y (the width of the inscribed function), showing the relationship of these multiple fragments of the sequences. Several examples illustrate how fragments of the protein spatial structure correspond to the elements of the hierarchical structure of the protein sequence. This methodology provides a promising basis for a mathematically-based classification of the elements of the spatial organization of proteins. Elements of the hierarchical structure of different levels of the hierarchy can be used to solve biotechnological and medical problems.

摘要

大多数非传染性疾病都与蛋白质或蛋白质复合物的功能障碍有关。人们已经对序列和结构之间的关系进行了长期分析,而对结构域和基序中序列组织的分析仍然是一个实际的研究领域。在这里,我们提出了一种揭示蛋白质序列层次结构的数学方法。该方法基于五肽作为蛋白质序列的单元。该方法利用天然蛋白质序列中五肽的出现频率和一种特殊的数学方法,揭示了蛋白质序列中的层次结构。该方法应用于来自 NRDB90 数据库的 24647 个大小在 50 到 400 个残基之间的非同源蛋白质序列。对图形分支点的统计分析揭示了 11 个特征值 y(内接函数的宽度),显示了这些序列片段之间的关系。几个例子说明了蛋白质空间结构的片段如何与蛋白质序列层次结构的元素相对应。这种方法为基于数学的蛋白质空间组织元素分类提供了有前途的基础。不同层次结构的层次结构元素可用于解决生物技术和医学问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/bfe54ca6a0bc/ijms-22-08339-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/19a730a5ecbc/ijms-22-08339-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/950e0a5dc41a/ijms-22-08339-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/4f496f3474f9/ijms-22-08339-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/0dcf412b9c49/ijms-22-08339-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/af920c193f65/ijms-22-08339-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/bfe54ca6a0bc/ijms-22-08339-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/19a730a5ecbc/ijms-22-08339-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/950e0a5dc41a/ijms-22-08339-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/4f496f3474f9/ijms-22-08339-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/0dcf412b9c49/ijms-22-08339-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/af920c193f65/ijms-22-08339-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a957/8348890/bfe54ca6a0bc/ijms-22-08339-g006.jpg

相似文献

1
Hierarchical Structure of Protein Sequence.蛋白质序列的层次结构。
Int J Mol Sci. 2021 Aug 3;22(15):8339. doi: 10.3390/ijms22158339.
2
Entropy Analysis of Protein Sequences Reveals a Hierarchical Organization.蛋白质序列的熵分析揭示了一种层次结构。
Entropy (Basel). 2021 Dec 7;23(12):1647. doi: 10.3390/e23121647.
3
A study of a hierarchical structure of proteins and ligand binding sites of receptors using the triangular spatial relationship-based structure comparison method and development of a size-filtering feature designed for comparing different sizes of protein structures.一项使用基于三角形空间关系的结构比较方法对蛋白质的层次结构和受体的配体结合位点进行的研究,以及一种为比较不同大小蛋白质结构而设计的尺寸过滤特征的开发。
Proteins. 2022 Jan;90(1):239-257. doi: 10.1002/prot.26215. Epub 2021 Aug 23.
4
Efficiently Mining Recurrent Substructures from Protein Three-Dimensional Structure Graphs.从蛋白质三维结构图谱中高效挖掘循环子结构
J Comput Biol. 2019 Jun;26(6):561-571. doi: 10.1089/cmb.2018.0171. Epub 2018 Dec 5.
5
Five hierarchical levels of sequence-structure correlation in proteins.蛋白质中序列-结构相关性的五个层次级别。
Appl Bioinformatics. 2004;3(2-3):97-104. doi: 10.2165/00822942-200403020-00004.
6
Automatic classification of protein structures using low-dimensional structure space mappings.利用低维结构空间映射对蛋白质结构进行自动分类。
BMC Bioinformatics. 2014;15 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2105-15-S2-S1. Epub 2014 Jan 24.
7
Functional region prediction with a set of appropriate homologous sequences--an index for sequence selection by integrating structure and sequence information with spatial statistics.使用一组合适的同源序列进行功能区域预测——一种通过将结构和序列信息与空间统计相结合来选择序列的指标。
BMC Struct Biol. 2012 May 29;12:11. doi: 10.1186/1472-6807-12-11.
8
Protein topology recognition from secondary structure sequences: application of the hidden Markov models to the alpha class proteins.从二级结构序列识别蛋白质拓扑结构:隐马尔可夫模型在α类蛋白质中的应用。
J Mol Biol. 1997 Mar 28;267(2):446-63. doi: 10.1006/jmbi.1996.0874.
9
Local descriptors of protein structure: a systematic analysis of the sequence-structure relationship in proteins using short- and long-range interactions.蛋白质结构的局部描述符:利用短程和长程相互作用对蛋白质序列-结构关系进行系统分析。
Proteins. 2009 Jun;75(4):870-84. doi: 10.1002/prot.22296.
10
A topological algorithm for identification of structural domains of proteins.一种用于识别蛋白质结构域的拓扑算法。
BMC Bioinformatics. 2007 Jul 3;8:237. doi: 10.1186/1471-2105-8-237.

引用本文的文献

1
Structural and functional prediction, evaluation, and validation in the post-sequencing era.测序后时代的结构与功能预测、评估及验证
Comput Struct Biotechnol J. 2023 Dec 25;23:446-451. doi: 10.1016/j.csbj.2023.12.031. eCollection 2024 Dec.
2
Recent Trends in Cancer Genomics and Bioinformatics Tools Development.癌症基因组学和生物信息学工具开发的最新趋势。
Int J Mol Sci. 2021 Nov 10;22(22):12146. doi: 10.3390/ijms222212146.
3
Bioinformatics Applications to Reveal Molecular Mechanisms of Gene Expression Regulation in Model Organisms.

本文引用的文献

1
A protein sequence fitness function for identifying natural and nonnatural proteins.一种用于识别天然蛋白和非天然蛋白的蛋白质序列适合度函数。
Proteins. 2020 Oct;88(10):1271-1284. doi: 10.1002/prot.25900. Epub 2020 May 28.
2
A minimum set of stable blocks for rational design of polypeptide chains.多肽链合理设计的最小稳定基序集。
Biochimie. 2019 May;160:88-92. doi: 10.1016/j.biochi.2019.02.006. Epub 2019 Feb 19.
3
Filamentous Aggregates of Tau Proteins Fulfil Standard Amyloid Criteria Provided by the Fuzzy Oil Drop (FOD) Model.
生物信息学在揭示模式生物基因表达调控的分子机制中的应用。
Int J Mol Sci. 2021 Nov 5;22(21):11973. doi: 10.3390/ijms222111973.
丝状聚集的 Tau 蛋白满足模糊油滴(FOD)模型提供的标准淀粉样蛋白标准。
Int J Mol Sci. 2018 Sep 25;19(10):2910. doi: 10.3390/ijms19102910.
4
New insight into the mechanism of mitochondrial cytochrome c function.线粒体细胞色素c功能机制的新见解。
PLoS One. 2017 May 31;12(5):e0178280. doi: 10.1371/journal.pone.0178280. eCollection 2017.
5
A vocabulary of ancient peptides at the origin of folded proteins.折叠蛋白起源处的古代肽词汇表。
Elife. 2015 Dec 14;4:e09410. doi: 10.7554/eLife.09410.
6
Truncated Variants of Serratia proteamaculans Oligopeptidase B Having Different Activities.具有不同活性的粘质沙雷氏菌寡肽酶B的截短变体
Biochemistry (Mosc). 2015 Oct;80(10):1331-43. doi: 10.1134/S0006297915100156.
7
Entropy of never born protein sequences.未出生蛋白质序列的熵。
Springerplus. 2013 Apr 30;2(1):200. doi: 10.1186/2193-1801-2-200. Print 2013 Dec.
8
Geometric parameters defining the structure of proteins--relation to early-stage folding step.定义蛋白质结构的几何参数——与早期折叠步骤的关系。
J Biomol Struct Dyn. 2011 Aug;29(1):79-104. doi: 10.1080/07391102.2011.10507376.
9
[Production and properties of human tumor necrosis factor peptide fragments].
Bioorg Khim. 2010 May-Jun;36(3):327-36. doi: 10.1134/s1068162010030040.
10
Hidden Markov Models and their Applications in Biological Sequence Analysis.隐马尔可夫模型及其在生物序列分析中的应用。
Curr Genomics. 2009 Sep;10(6):402-15. doi: 10.2174/138920209789177575.