• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

仅通过了解蛋白质的氨基酸组成能学到什么?

What Can Be Learned by Knowing Only the Amino Acid Composition of Proteins?

作者信息

Lobanov Michail Yu, Surin Alexey A, Galzitskaya Oxana V

机构信息

Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Russia.

Faculty of Informatics and Computer Engineering, MIREA-Russian Technological University, 119454 Moscow, Russia.

出版信息

Int J Mol Sci. 2024 Dec 21;25(24):13680. doi: 10.3390/ijms252413680.

DOI:10.3390/ijms252413680
PMID:39769440
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11676433/
Abstract

The amino acid composition of proteins depends on many factors. It varies in organisms that are distant in taxonomic position. The amino acid composition of proteins depends on the localization of proteins in cells and tissues and the structure of proteins. The question arises: is it possible to separate different proteomes using only the amino acid composition of proteins? Is it possible to determine, considering only its amino acid composition, to what structural class the protein under study will belong? We have developed a method and a measure that maximally separate two sets of proteins. As a result, we assign each protein an R-value, positive values of which are more characteristic of the first set, and negative ones-of the second. By studying the distribution of R in two sets, we can determine how much these sets differ in composition. Also, when examining a new protein, we can determine if it is more similar to the first set or the second. In this paper, we show that using only amino acid composition, it is possible to separate sets of proteins belonging to different organisms, as well as proteins that differ in function or structure. In all cases, we assign to proteins a measure R that maximally separates the studied sets. This approach can be further used to annotate proteins with unknown functions.

摘要

蛋白质的氨基酸组成取决于许多因素。在分类学位置上相距较远的生物体中,其氨基酸组成会有所不同。蛋白质的氨基酸组成取决于蛋白质在细胞和组织中的定位以及蛋白质的结构。问题来了:仅利用蛋白质的氨基酸组成能否分离不同的蛋白质组?仅考虑其氨基酸组成,能否确定所研究的蛋白质属于何种结构类别?我们开发了一种方法和一种度量,能最大程度地分离两组蛋白质。结果,我们为每种蛋白质赋予一个R值,其正值更具第一组的特征,负值则更具第二组的特征。通过研究两组中R值的分布,我们可以确定这两组在组成上的差异程度。此外,在检测一种新蛋白质时,我们可以确定它与第一组还是第二组更相似。在本文中,我们表明仅使用氨基酸组成,就有可能分离属于不同生物体的蛋白质组,以及功能或结构不同的蛋白质。在所有情况下,我们都为蛋白质赋予一个能最大程度分离所研究组别的度量R。这种方法可进一步用于注释功能未知的蛋白质。

相似文献

1
What Can Be Learned by Knowing Only the Amino Acid Composition of Proteins?仅通过了解蛋白质的氨基酸组成能学到什么?
Int J Mol Sci. 2024 Dec 21;25(24):13680. doi: 10.3390/ijms252413680.
2
Proteome-scale relationships between local amino acid composition and protein fates and functions.蛋白质组范围内的局部氨基酸组成与蛋白质命运和功能之间的关系。
PLoS Comput Biol. 2018 Sep 24;14(9):e1006256. doi: 10.1371/journal.pcbi.1006256. eCollection 2018 Sep.
3
Prediction of secondary structural content of proteins from their amino acid composition alone. II. The paradox with secondary structural class.仅根据氨基酸组成预测蛋白质的二级结构含量。II. 二级结构类别的悖论。
Proteins. 1996 Jun;25(2):169-79. doi: 10.1002/(SICI)1097-0134(199606)25:2<169::AID-PROT3>3.0.CO;2-D.
4
Prediction of secondary structural content of proteins from their amino acid composition alone. I. New analytic vector decomposition methods.仅根据氨基酸组成预测蛋白质的二级结构含量。I. 新的分析向量分解方法。
Proteins. 1996 Jun;25(2):157-68. doi: 10.1002/(SICI)1097-0134(199606)25:2<157::AID-PROT2>3.0.CO;2-F.
5
The modal distribution of protein isoelectric points reflects amino acid properties rather than sequence evolution.蛋白质等电点的模态分布反映的是氨基酸性质而非序列进化。
Proteomics. 2004 Apr;4(4):943-9. doi: 10.1002/pmic.200200648.
6
TMB-Hunt: an amino acid composition based method to screen proteomes for beta-barrel transmembrane proteins.TMB-Hunt:一种基于氨基酸组成的方法,用于在蛋白质组中筛选β-桶状跨膜蛋白。
BMC Bioinformatics. 2005 Mar 15;6:56. doi: 10.1186/1471-2105-6-56.
7
Atypical structural tendencies among low-complexity domains in the Protein Data Bank proteome.蛋白质数据库(PDB)蛋白质组中低复杂度结构域的非典型结构趋势。
PLoS Comput Biol. 2020 Jan 27;16(1):e1007487. doi: 10.1371/journal.pcbi.1007487. eCollection 2020 Jan.
8
Amino acid composition of proteins reduces deleterious impact of mutations.蛋白质的氨基酸组成可降低突变的有害影响。
Sci Rep. 2013 Oct 10;3:2919. doi: 10.1038/srep02919.
9
Identification of Low-Complexity Domains by Compositional Signatures Reveals Class-Specific Frequencies and Functions Across the Domains of Life.通过组成特征鉴定低复杂度结构域揭示了生命领域中特定类别出现的频率和功能。
PLoS Comput Biol. 2024 May 15;20(5):e1011372. doi: 10.1371/journal.pcbi.1011372. eCollection 2024 May.
10
Correlation and prediction of gene expression level from amino acid and dipeptide composition of its protein.基于蛋白质的氨基酸和二肽组成对基因表达水平进行相关性分析与预测。
BMC Bioinformatics. 2005 Mar 17;6:59. doi: 10.1186/1471-2105-6-59.

本文引用的文献

1
Is there a bias in the codon frequency corresponding to homo-repeats found in human proteins?在人类蛋白质中发现的同源重复序列的密码子频率是否存在偏倚?
Biosystems. 2024 Dec;246:105357. doi: 10.1016/j.biosystems.2024.105357. Epub 2024 Oct 21.
2
Comparison of deep learning models with simple method to assess the problem of antimicrobial peptides prediction.比较深度学习模型与简单方法来评估抗菌肽预测问题。
Mol Inform. 2024 May;43(5):e202200181. doi: 10.1002/minf.202200181. Epub 2023 Apr 7.
3
PANTHER: Making genome-scale phylogenetics accessible to all.
PANTHER:让所有人大开眼界的基因组系统发生学。
Protein Sci. 2022 Jan;31(1):8-22. doi: 10.1002/pro.4218. Epub 2021 Nov 25.
4
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
5
Benchmarking Inverse Statistical Approaches for Protein Structure and Design with Exactly Solvable Models.使用精确可解模型对蛋白质结构和设计的逆统计方法进行基准测试。
PLoS Comput Biol. 2016 May 13;12(5):e1004889. doi: 10.1371/journal.pcbi.1004889. eCollection 2016 May.
6
PASTA 2.0: an improved server for protein aggregation prediction.PASTA 2.0:改进的蛋白质聚集预测服务器。
Nucleic Acids Res. 2014 Jul;42(Web Server issue):W301-7. doi: 10.1093/nar/gku399. Epub 2014 May 21.
7
Ten quick tips for using the gene ontology.使用基因本体论的十条快速提示。
PLoS Comput Biol. 2013;9(11):e1003343. doi: 10.1371/journal.pcbi.1003343. Epub 2013 Nov 14.
8
Occurrence of disordered patterns and homorepeats in eukaryotic and bacterial proteomes.真核生物和细菌蛋白质组中无序模式和同型重复序列的出现情况。
Mol Biosyst. 2012 Jan;8(1):327-37. doi: 10.1039/c1mb05318c. Epub 2011 Oct 18.
9
The cell biology of smell.嗅觉的细胞生物学。
J Cell Biol. 2010 Nov 1;191(3):443-52. doi: 10.1083/jcb.201008163.
10
Genome wide exploration of the origin and evolution of amino acids.全基因组范围内氨基酸的起源和进化探索。
BMC Evol Biol. 2010 Mar 15;10:77. doi: 10.1186/1471-2148-10-77.