• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

真核生物和细菌蛋白质组中无序模式和同型重复序列的出现情况。

Occurrence of disordered patterns and homorepeats in eukaryotic and bacterial proteomes.

作者信息

Lobanov M Yu, Galzitskaya O V

机构信息

Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia.

出版信息

Mol Biosyst. 2012 Jan;8(1):327-37. doi: 10.1039/c1mb05318c. Epub 2011 Oct 18.

DOI:10.1039/c1mb05318c
PMID:22009164
Abstract

Combining the motif discovery and disorder protein segment identification in PDB allows us to create the first and largest library of disordered patterns. At present the library includes 109 disordered patterns. Here we offer a comprehensive analysis of the occurrence of selected disordered patterns and 20 homorepeats of 6 residues long in 123 proteomes. 27 disordered patterns occur sparsely in all considered proteomes, but the patterns of low-complexity-homorepeats-appear more often in eukaryotic than in bacterial proteomes. A comparative analysis of the number of proteins containing homorepeats of 6 residues long and the disordered selected patterns in these proteomes has been performed. The matrices of correlation coefficients between numbers of proteins where at least once a homorepeat of six residues long for each of 20 types of amino acid residues and 109 disordered patterns from the library appears in 9 kingdoms of eukaryota and 5 phyla of bacteria have been calculated. As a rule, the correlation coefficients are higher inside the considered kingdom than between them. The largest fraction of homorepeats of 6 residues belongs to Amoebozoa proteomes (D. discoideum), 46%. Moreover, the longest uninterrupted repeats belong to S306 from D. discoideum (Amoebozoa). Homorepeats of some amino acids occur more frequently than others and the type of homorepeats varies across different proteomes, . For example, E6 appears most frequent for all considered proteomes for Chordata, Q6 for Arthropoda, S6 for Nematoda. The averaged occurrence of multiple long runs of 6 amino acids in a decreasing order for 97 eukaryotic proteomes is as follows: Q6, S6, A6, G6, N6, E6, P6, T6, D6, K6, L6, H6, R6, F6, V6, I6, Y6, C6, M6, W6, and for 26 bacterial proteomes it is A6, G6, P6, and the others occur seldom. This suggests that such short similar motifs are responsible for common functions for nonhomologous, unrelated proteins from different organisms.

摘要

结合蛋白质数据库(PDB)中的基序发现和无序蛋白质片段识别,使我们能够创建首个也是最大的无序模式库。目前该库包含109种无序模式。在此,我们对123个蛋白质组中选定的无序模式和20个6残基长的同聚物重复序列的出现情况进行了全面分析。27种无序模式在所有考虑的蛋白质组中出现频率较低,但低复杂性同聚物重复序列模式在真核生物蛋白质组中比在细菌蛋白质组中更常见。对这些蛋白质组中包含6残基长同聚物重复序列的蛋白质数量与选定的无序模式进行了比较分析。计算了20种氨基酸残基中每种至少出现一次6残基长同聚物重复序列的蛋白质数量与来自9个真核生物界和5个细菌门的库中109种无序模式之间的相关系数矩阵。通常,所考虑的界内的相关系数高于它们之间的相关系数。6残基长同聚物重复序列中最大比例属于变形虫门蛋白质组(盘基网柄菌),为46%。此外,最长的不间断重复序列属于盘基网柄菌(变形虫门)的S306。某些氨基酸的同聚物重复序列比其他的出现更频繁,并且同聚物重复序列的类型在不同蛋白质组中有所不同。例如,对于所有考虑的脊索动物蛋白质组,E6出现频率最高,对于节肢动物为Q6,对于线虫为S6。97个真核生物蛋白质组中多个6氨基酸长连续序列的平均出现频率从高到低如下:Q6、S6、A6、G6、N6、E6、P6、T6、D6、K6、L6、H6、R6、F6、V6、I6、Y6、C6、M6、W6,对于26个细菌蛋白质组则是A6、G6、P6,其他的很少出现。这表明这种短的相似基序负责不同生物体中非同源、不相关蛋白质的共同功能。

相似文献

1
Occurrence of disordered patterns and homorepeats in eukaryotic and bacterial proteomes.真核生物和细菌蛋白质组中无序模式和同型重复序列的出现情况。
Mol Biosyst. 2012 Jan;8(1):327-37. doi: 10.1039/c1mb05318c. Epub 2011 Oct 18.
2
HRaP: database of occurrence of HomoRepeats and patterns in proteomes.HRaP:同源重复和蛋白质组模式出现数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D273-8. doi: 10.1093/nar/gkt927. Epub 2013 Oct 22.
3
[Occurrence of motifs with six amino acid residues in three eukaryotic proteomes].[三种真核生物蛋白质组中六氨基酸残基基序的出现情况]
Mol Biol (Mosk). 2012 Jan-Feb;46(1):184-90.
4
Disordered patterns in clustered Protein Data Bank and in eukaryotic and bacterial proteomes.蛋白质数据库中聚集的蛋白质和真核生物及细菌蛋白质组中的紊乱模式。
PLoS One. 2011;6(11):e27142. doi: 10.1371/journal.pone.0027142. Epub 2011 Nov 4.
5
[Comparative analysis of internal repeating segments in proteins of species from the three kingdoms of life].[生命三域物种蛋白质内部重复片段的比较分析]
Yi Chuan Xue Bao. 2005 Mar;32(3):315-21.
6
How Common Is Disorder? Occurrence of Disordered Residues in Four Domains of Life.该病症有多常见?生命四个领域中无序残基的出现情况。
Int J Mol Sci. 2015 Aug 18;16(8):19490-507. doi: 10.3390/ijms160819490.
7
PolyX2: Fast Detection of Homorepeats in Large Protein Datasets.PolyX2:在大型蛋白质数据集快速检测同源重复。
Genes (Basel). 2022 Apr 25;13(5):758. doi: 10.3390/genes13050758.
8
Intrinsic disorder in the Protein Data Bank.蛋白质数据库中的内在无序状态。
J Biomol Struct Dyn. 2007 Feb;24(4):325-42. doi: 10.1080/07391102.2007.10507123.
9
Context characterization of amino acid homorepeats using evolution, position, and order.利用进化、位置和顺序对氨基酸同型重复序列进行上下文特征分析。
Proteins. 2017 Apr;85(4):709-719. doi: 10.1002/prot.25250. Epub 2017 Feb 6.
10
Structure-function relationships in protein homorepeats.蛋白质同源重复结构与功能关系。
Curr Opin Struct Biol. 2023 Dec;83:102726. doi: 10.1016/j.sbi.2023.102726. Epub 2023 Nov 2.

引用本文的文献

1
Evolutionary perspective of the CAG/CAA interplay coding for pure polyglutamine stretches in proteins.蛋白质中编码纯聚谷氨酰胺片段的CAG/CAA相互作用的进化视角。
NAR Genom Bioinform. 2025 Jun 9;7(2):lqaf075. doi: 10.1093/nargab/lqaf075. eCollection 2025 Jun.
2
What Can Be Learned by Knowing Only the Amino Acid Composition of Proteins?仅通过了解蛋白质的氨基酸组成能学到什么?
Int J Mol Sci. 2024 Dec 21;25(24):13680. doi: 10.3390/ijms252413680.
3
Predicting the involvement of polyQ- and polyA in protein-protein interactions by their amino acid context.
通过氨基酸上下文预测聚谷氨酰胺和聚腺苷酸在蛋白质-蛋白质相互作用中的参与情况。
Heliyon. 2024 Sep 14;10(18):e37861. doi: 10.1016/j.heliyon.2024.e37861. eCollection 2024 Sep 30.
4
Protein Fold Usages in Ribosomes: Another Glance to the Past.核糖体中蛋白质折叠的用途:另一种回顾过去的方式。
Int J Mol Sci. 2024 Aug 13;25(16):8806. doi: 10.3390/ijms25168806.
5
Reviewing the Structure-Function Paradigm in Polyglutamine Disorders: A Synergistic Perspective on Theoretical and Experimental Approaches.综述多聚谷氨酰胺疾病的结构-功能范式:理论和实验方法的协同视角。
Int J Mol Sci. 2024 Jun 20;25(12):6789. doi: 10.3390/ijms25126789.
6
Diversity and features of proteins with structural repeats.具有结构重复序列的蛋白质的多样性和特征
Biophys Rev. 2023 Sep 7;15(5):1159-1169. doi: 10.1007/s12551-023-01130-0. eCollection 2023 Oct.
7
Regions with two amino acids in protein sequences: A step forward from homorepeats into the low complexity landscape.蛋白质序列中含有两个氨基酸的区域:从同聚物重复迈向低复杂性格局的一步。
Comput Struct Biotechnol J. 2022 Sep 18;20:5516-5523. doi: 10.1016/j.csbj.2022.09.011. eCollection 2022.
8
Between Interactions and Aggregates: The PolyQ Balance.在相互作用和聚集体之间:聚谷氨酰胺的平衡。
Genome Biol Evol. 2021 Nov 5;13(11). doi: 10.1093/gbe/evab246.
9
Assessing the low complexity of protein sequences via the low complexity triangle.通过低复杂度三角形评估蛋白质序列的低复杂度。
PLoS One. 2020 Dec 30;15(12):e0239154. doi: 10.1371/journal.pone.0239154. eCollection 2020.
10
The features of polyglutamine regions depend on their evolutionary stability.多聚谷氨酰胺区的特征取决于其进化稳定性。
BMC Evol Biol. 2020 May 24;20(1):59. doi: 10.1186/s12862-020-01626-3.