• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质中接触矩阵和疏水性图谱的主特征向量。

Principal eigenvector of contact matrices and hydrophobicity profiles in proteins.

作者信息

Bastolla Ugo, Porto Markus, Roman H Eduardo, Vendruscolo Michele

机构信息

Centro de Astrobiología (INTA-CSIC), 28850 Torrejón de Ardoz, Spain.

出版信息

Proteins. 2005 Jan 1;58(1):22-30. doi: 10.1002/prot.20240.

DOI:10.1002/prot.20240
PMID:15523667
Abstract

With the aim of studying the relationship between protein sequences and their native structures, we adopted vectorial representations for both sequence and structure. The structural representation was based on the principal eigenvector of the fold's contact matrix (PE). As has been recently shown, the latter encodes sufficient information for reconstructing the whole contact matrix. The sequence was represented through a hydrophobicity profile (HP), using a generalized hydrophobicity scale that we obtained from the principal eigenvector of a residue-residue interaction matrix, and denoted as interactivity scale. Using this novel scale, we defined the optimal HP of a protein fold, and, by means of stability arguments, predicted to be strongly correlated with the PE of the fold's contact matrix. This prediction was confirmed through an evolutionary analysis, which showed that the PE correlates with the HP of each individual sequence adopting the same fold and, even more strongly, with the average HP of this set of sequences. Thus, protein sequences evolve in such a way that their average HP is close to the optimal one, implying that neutral evolution can be viewed as a kind of motion in sequence space around the optimal HP. Our results indicate that the correlation coefficient between N-dimensional vectors constitutes a natural metric in the vectorial space in which we represent both protein sequences and protein structures, which we call vectorial protein space. In this way, we define a unified framework for sequence-to-sequence, sequence-to-structure and structure-to-structure alignments. We show that the interactivity scale is nearly optimal both for the comparison of sequences to sequences and sequences to structures.

摘要

为了研究蛋白质序列与其天然结构之间的关系,我们对序列和结构都采用了向量表示法。结构表示基于折叠接触矩阵的主特征向量(PE)。最近的研究表明,后者编码了重建整个接触矩阵所需的足够信息。序列通过疏水轮廓(HP)表示,使用我们从残基-残基相互作用矩阵的主特征向量获得的广义疏水标度,并将其表示为相互作用标度。使用这个新的标度,我们定义了蛋白质折叠的最佳HP,并通过稳定性论证预测其与折叠接触矩阵的PE密切相关。通过进化分析证实了这一预测,该分析表明PE与采用相同折叠的每个单独序列的HP相关,甚至与这组序列的平均HP更密切相关。因此,蛋白质序列的进化方式使得它们的平均HP接近最佳值,这意味着中性进化可以被视为围绕最佳HP在序列空间中的一种运动。我们的结果表明,N维向量之间的相关系数构成了我们表示蛋白质序列和蛋白质结构的向量空间中的一种自然度量,我们将其称为向量蛋白质空间。通过这种方式,我们定义了一个用于序列到序列、序列到结构和结构到结构比对的统一框架。我们表明,相互作用标度对于序列与序列以及序列与结构的比较几乎都是最佳的。

相似文献

1
Principal eigenvector of contact matrices and hydrophobicity profiles in proteins.蛋白质中接触矩阵和疏水性图谱的主特征向量。
Proteins. 2005 Jan 1;58(1):22-30. doi: 10.1002/prot.20240.
2
Effective connectivity profile: a structural representation that evidences the relationship between protein structures and sequences.有效连接性概况:一种证明蛋白质结构与序列之间关系的结构表示。
Proteins. 2008 Dec;73(4):872-88. doi: 10.1002/prot.22113.
3
Looking at structure, stability, and evolution of proteins through the principal eigenvector of contact matrices and hydrophobicity profiles.通过接触矩阵的主特征向量和疏水性图谱来研究蛋白质的结构、稳定性及进化。
Gene. 2005 Mar 14;347(2):219-30. doi: 10.1016/j.gene.2004.12.015.
4
Prediction of distant residue contacts with the use of evolutionary information.利用进化信息预测远距离残基接触。
Proteins. 2005 Mar 1;58(4):935-49. doi: 10.1002/prot.20370.
5
Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences.用于比对远缘相关蛋白质序列的基于统计势的氨基酸相似性矩阵。
Proteins. 2006 Aug 15;64(3):587-600. doi: 10.1002/prot.21020.
6
Designability of protein structures: a lattice-model study using the Miyazawa-Jernigan matrix.蛋白质结构的可设计性:使用宫泽-杰尔尼根矩阵的晶格模型研究
Proteins. 2002 Nov 15;49(3):403-12. doi: 10.1002/prot.10239.
7
Comparative analysis of structural properties of the C-type-lectin-like domain (CTLD).C型凝集素样结构域(CTLD)结构特性的比较分析。
Proteins. 2003 Aug 15;52(3):466-77. doi: 10.1002/prot.10626.
8
Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space.原子水平蛋白质设计的准确预测及其在扩展近最优序列空间中的应用。
Proteins. 2009 May 15;75(3):682-705. doi: 10.1002/prot.22280.
9
Evolution and similarity evaluation of protein structures in contact map space.接触图空间中蛋白质结构的进化与相似性评估
Proteins. 2005 May 1;59(2):196-204. doi: 10.1002/prot.20415.
10
Prediction of site-specific amino acid distributions and limits of divergent evolutionary changes in protein sequences.蛋白质序列中位点特异性氨基酸分布的预测及分歧进化变化的限度
Mol Biol Evol. 2005 Mar;22(3):630-8. doi: 10.1093/molbev/msi048. Epub 2004 Nov 10.

引用本文的文献

1
Polymers in Physics, Chemistry and Biology: Behavior of Linear Polymers in Fractal Structures.物理、化学和生物学中的聚合物:分形结构中线性聚合物的行为
Polymers (Basel). 2024 Dec 2;16(23):3400. doi: 10.3390/polym16233400.
2
The genetic architecture of protein interaction affinity and specificity.蛋白质相互作用亲和力和特异性的遗传结构。
Nat Commun. 2024 Oct 14;15(1):8868. doi: 10.1038/s41467-024-53195-4.
3
Selection among site-dependent structurally constrained substitution models of protein evolution by approximate Bayesian computation.
基于近似贝叶斯计算的蛋白质进化中依赖于位置的结构约束替代模型的选择。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae096.
4
Predicting and Interpreting Protein Developability Via Transfer of Convolutional Sequence Representation.通过卷积序列表示的转移来预测和解释蛋白质可开发性。
ACS Synth Biol. 2023 Sep 15;12(9):2600-2615. doi: 10.1021/acssynbio.3c00196. Epub 2023 Aug 29.
5
ParSe 2.0: A web tool to identify drivers of protein phase separation at the proteome level.ParSe 2.0:一种在蛋白质组水平上识别蛋白质相分离驱动因素的网络工具。
Protein Sci. 2023 Sep;32(9):e4756. doi: 10.1002/pro.4756.
6
Improved LC-MS identification of short homologous peptides using sequence-specific retention time predictors.采用序列特异性保留时间预测器提高短同源肽的 LC-MS 鉴定。
Anal Bioanal Chem. 2023 Jun;415(14):2715-2726. doi: 10.1007/s00216-023-04670-2. Epub 2023 Mar 31.
7
Intrinsically disordered regions that drive phase separation form a robustly distinct protein class.内在无序区域驱动相分离形成了一个截然不同的蛋白质类别。
J Biol Chem. 2023 Jan;299(1):102801. doi: 10.1016/j.jbc.2022.102801. Epub 2022 Dec 14.
8
Calcium inhibits penetration of Alzheimer's Aβ - monomers into the membrane.钙抑制阿尔茨海默病 Aβ-单体穿透细胞膜。
Proteins. 2022 Dec;90(12):2124-2143. doi: 10.1002/prot.26403. Epub 2022 Aug 10.
9
SWeeP: representing large biological sequences datasets in compact vectors.SWeeP:用紧凑向量表示大型生物序列数据集。
Sci Rep. 2020 Jan 9;10(1):91. doi: 10.1038/s41598-019-55627-4.
10
Amyloid Fibril Design: Limiting Structural Polymorphism in Alzheimer's Aβ Protofilaments.淀粉样纤维设计:限制阿尔茨海默病 Aβ原纤维的结构多态性。
J Phys Chem B. 2018 Dec 13;122(49):11535-11545. doi: 10.1021/acs.jpcb.8b07423. Epub 2018 Nov 29.