• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

有效连接性概况:一种证明蛋白质结构与序列之间关系的结构表示。

Effective connectivity profile: a structural representation that evidences the relationship between protein structures and sequences.

作者信息

Bastolla Ugo, Ortíz Angel R, Porto Markus, Teichert Florian

机构信息

Centro de Biología Molecular Severo Ochoa, (CSIC-UAM), Cantoblanco, 28049 Madrid, Spain.

出版信息

Proteins. 2008 Dec;73(4):872-88. doi: 10.1002/prot.22113.

DOI:10.1002/prot.22113
PMID:18536008
Abstract

The complexity of protein structures calls for simplified representations of their topology. The simplest possible mathematical description of a protein structure is a one-dimensional profile representing, for instance, buriedness or secondary structure. This kind of representation has been introduced for studying the sequence to structure relationship, with applications to fold recognition. Here we define the effective connectivity profile (EC), a network theoretical profile that self-consistently represents the network structure of the protein contact matrix. The EC profile makes mathematically explicit the relationship between protein structure and protein sequence, because it allows predicting the average hydrophobicity profile (HP) and the distributions of amino acids at each site for families of homologous proteins sharing the same structure. In this sense, the EC provides an analytic solution to the statistical inverse folding problem, which consists in finding the statistical properties of the set of sequences compatible with a given structure. We tested these predictions with simulations of the structurally constrained neutral (SCN) model of protein evolution with structure conservation, for single- and multi-domain proteins, and for a wide range of mutation processes, the latter producing sequences with very different hydrophobicity profiles, finding that the EC-based predictions are accurate even when only one sequence of the family is known. The EC profile is very significantly correlated with the HP for sequence-structure pairs in the PDB as well. The EC profile generalizes the properties of previously introduced structural profiles to modular proteins such as multidomain chains, and its correlation with the sequence profile is substantially improved with respect to the previously defined profiles, particularly for long proteins. Furthermore, the EC profile has a dynamic interpretation, since the EC components are strongly inversely related with the temperature factors measured in X-ray experiments, meaning that positions with large EC component are more strongly constrained in their equilibrium dynamics. Last, the EC profile allows to define a natural measure of modularity that correlates with the number of domains composing the protein, suggesting its application for domain decomposition. Finally, we show that structurally similar proteins have similar EC profiles, so that the similarity between aligned EC profiles can be used as a structure similarity measure, a property that we have recently applied for protein structure alignment. The code for computing the EC profile is available upon request writing to ubastolla@cbm.uam.es, and the structural profiles discussed in this article can be downloaded from the SLOTH webserver http://www.fkp.tu-darmstadt.de/SLOTH/.

摘要

蛋白质结构的复杂性需要对其拓扑结构进行简化表示。对蛋白质结构最简单的数学描述是一维轮廓,例如表示埋藏度或二级结构。这种表示方式已被引入用于研究序列与结构的关系,并应用于折叠识别。在这里,我们定义了有效连通性轮廓(EC),这是一种网络理论轮廓,它自洽地表示蛋白质接触矩阵的网络结构。EC轮廓从数学上明确了蛋白质结构与蛋白质序列之间的关系,因为它可以预测具有相同结构的同源蛋白质家族的平均疏水性轮廓(HP)以及每个位点氨基酸的分布。从这个意义上说,EC为统计逆折叠问题提供了一种解析解决方案,该问题在于找到与给定结构兼容的序列集的统计特性。我们使用具有结构保守性的蛋白质进化的结构受限中性(SCN)模型模拟,对单域和多域蛋白质以及广泛的突变过程进行了测试,后者产生具有非常不同疏水性轮廓的序列,发现即使仅知道家族中的一个序列,基于EC的预测也是准确的。EC轮廓与PDB中序列 - 结构对的HP也有非常显著的相关性。EC轮廓将先前引入的结构轮廓的属性推广到模块化蛋白质,如多域链,并且相对于先前定义的轮廓,其与序列轮廓的相关性有了实质性的提高,特别是对于长蛋白质。此外,EC轮廓具有动态解释,因为EC成分与X射线实验中测量的温度因子强烈负相关,这意味着具有大EC成分的位置在其平衡动力学中受到更强的限制。最后,EC轮廓允许定义一种与组成蛋白质的结构域数量相关的自然模块化度量,表明其可用于结构域分解。最后,我们表明结构相似的蛋白质具有相似的EC轮廓,因此对齐的EC轮廓之间的相似性可以用作结构相似性度量,我们最近已将此属性应用于蛋白质结构对齐。计算EC轮廓的代码可通过写信至ubastolla@cbm.uam.es索取,本文讨论的结构轮廓可从SLOTH网络服务器http://www.fkp.tu-darmstadt.de/SLOTH/下载。

相似文献

1
Effective connectivity profile: a structural representation that evidences the relationship between protein structures and sequences.有效连接性概况:一种证明蛋白质结构与序列之间关系的结构表示。
Proteins. 2008 Dec;73(4):872-88. doi: 10.1002/prot.22113.
2
Principal eigenvector of contact matrices and hydrophobicity profiles in proteins.蛋白质中接触矩阵和疏水性图谱的主特征向量。
Proteins. 2005 Jan 1;58(1):22-30. doi: 10.1002/prot.20240.
3
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
4
Looking at structure, stability, and evolution of proteins through the principal eigenvector of contact matrices and hydrophobicity profiles.通过接触矩阵的主特征向量和疏水性图谱来研究蛋白质的结构、稳定性及进化。
Gene. 2005 Mar 14;347(2):219-30. doi: 10.1016/j.gene.2004.12.015.
5
Connectivity of neutral networks, overdispersion, and structural conservation in protein evolution.蛋白质进化中神经网络的连通性、过度分散和结构保守性。
J Mol Evol. 2003 Mar;56(3):243-54. doi: 10.1007/s00239-002-2350-0.
6
Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures.通过评估序列-结构适应性预测蛋白质结构。将序列与从三维结构推导的接触谱进行比对。
J Mol Biol. 1993 Aug 5;232(3):805-25. doi: 10.1006/jmbi.1993.1433.
7
Evolution of function in protein superfamilies, from a structural perspective.从结构角度看蛋白质超家族中功能的演变。
J Mol Biol. 2001 Apr 6;307(4):1113-43. doi: 10.1006/jmbi.2001.4513.
8
Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences.用于比对远缘相关蛋白质序列的基于统计势的氨基酸相似性矩阵。
Proteins. 2006 Aug 15;64(3):587-600. doi: 10.1002/prot.21020.
9
A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence.一种用于蛋白质折叠识别的3D-1D替换矩阵,其包含序列的预测二级结构。
J Mol Biol. 1997 Apr 11;267(4):1026-38. doi: 10.1006/jmbi.1997.0924.
10
Recognition of analogous and homologous protein folds: analysis of sequence and structure conservation.相似和同源蛋白质折叠的识别:序列和结构保守性分析
J Mol Biol. 1997 Jun 13;269(3):423-39. doi: 10.1006/jmbi.1997.1019.

引用本文的文献

1
Influence of mutation bias and hydrophobicity on the substitution rates and sequence entropies of protein evolution.突变偏好性和疏水性对蛋白质进化中替换率和序列熵的影响。
PeerJ. 2018 Oct 5;6:e5549. doi: 10.7717/peerj.5549. eCollection 2018.
2
Selection on protein structure, interaction, and sequence.对蛋白质结构、相互作用和序列的选择。
Protein Sci. 2016 Jul;25(7):1168-78. doi: 10.1002/pro.2886. Epub 2016 Feb 11.
3
Maximum-Likelihood Phylogenetic Inference with Selection on Protein Folding Stability.基于蛋白质折叠稳定性选择的最大似然系统发育推断
Mol Biol Evol. 2015 Aug;32(8):2195-207. doi: 10.1093/molbev/msv085. Epub 2015 Apr 2.
4
Detecting selection on protein stability through statistical mechanical models of folding and evolution.通过折叠和进化的统计力学模型检测对蛋白质稳定性的选择。
Biomolecules. 2014 Mar 7;4(1):291-314. doi: 10.3390/biom4010291.
5
The relationship between relative solvent accessibility and evolutionary rate in protein evolution.蛋白质进化中相对溶剂可及性与进化速率的关系。
Genetics. 2011 Jun;188(2):479-88. doi: 10.1534/genetics.111.128025. Epub 2011 Apr 5.
6
Stochastic reconstruction of protein structures from effective connectivity profiles.基于有效连接图谱的蛋白质结构随机重建
PMC Biophys. 2008 Nov 26;1(1):5. doi: 10.1186/1757-5036-1-5.