• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过评估序列-结构适应性预测蛋白质结构。将序列与从三维结构推导的接触谱进行比对。

Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures.

作者信息

Ouzounis C, Sander C, Scharf M, Schneider R

机构信息

Protein Design Group, EMBL, Heidelberg, Germany.

出版信息

J Mol Biol. 1993 Aug 5;232(3):805-25. doi: 10.1006/jmbi.1993.1433.

DOI:10.1006/jmbi.1993.1433
PMID:8355272
Abstract

The problem of protein structure prediction is formulated here as that of evaluating how well an amino acid sequence fits a hypothetical structure. The simplest and most complicated approaches, secondary structure prediction and all-atom free energy calculations, can be viewed as sequence-structure fitness problems. Here, an approach of intermediate complexity is described, which involves; (1) description of a protein structure in terms of contact interface vectors, with both intra-protein and protein-solvent contacts counted, (2) derivation of sequence preferences for 2 up to 29 contact interface types, (3) generation of numerous hypothetical model structures by placing the input sequence into a large set of known three-dimensional structures in all possible alignments, (4) evaluation of these models by summing the sequence preferences over all structural positions and (5) choice of predicted three-dimensional structure as that with the best sequence-structure fitness. Evolutionary information is incorporated by using position-dependent core weights derived from multiple sequence alignments. A number of tests of the method are performed: (1) evaluation of cyclic shifts of a sequence in its native structure; (2) alignment of a sequence in its native structure, allowing gaps; (3) alignment search with a sequence or sequence fragment in a database of structures; and (4) alignment search with a structure in a database of sequences. The main results are: (1) a native sequence can very well find its native structure among a large number of alternatives, in correct alignment; (2) substructures, such as (beta alpha)n units, can be detected in spite of very low sequence similarity; (3) remote homologous can be detected, with some dependence on the set of parameters used; (4) contact interface parameters are clearly superior to classical secondary structure parameters; (5) a simple interface description in terms of just two states, protein-protein and protein-water contacts, performs surprisingly well; (6) the use of core weights considerably improves accuracy in detection of remote homologues; (7) based on a sequence database search with a myoglobin contact profile, the C-terminal domain of a viral origin of replication binding protein is predicted to have an all-helical fold. The sequence-structure fitness concept is sufficiently general to accommodate a large variety of protein structure prediction methods, including new models of intermediate complexity currently being developed.

摘要

蛋白质结构预测问题在此被阐述为评估氨基酸序列与假设结构的匹配程度。最简单和最复杂的方法,即二级结构预测和全原子自由能计算,都可被视为序列 - 结构匹配问题。这里描述了一种中等复杂度的方法,该方法包括:(1)根据接触界面向量描述蛋白质结构,同时计算蛋白质内部和蛋白质 - 溶剂接触;(2)推导2至29种接触界面类型的序列偏好;(3)通过将输入序列以所有可能的比对方式放入大量已知三维结构中,生成众多假设模型结构;(4)通过对所有结构位置的序列偏好求和来评估这些模型;(5)选择序列 - 结构匹配度最佳的预测三维结构。通过使用从多序列比对中得出的位置依赖核心权重纳入进化信息。对该方法进行了多项测试:(1)评估序列在其天然结构中的循环移位;(2)在其天然结构中对序列进行比对,允许有缺口;(3)在结构数据库中用序列或序列片段进行比对搜索;(4)在序列数据库中用结构进行比对搜索。主要结果如下:(1)天然序列能够在大量备选结构中很好地找到其天然结构,并具有正确的比对;(2)尽管序列相似性很低,如(β - α)n单元等亚结构仍可被检测到;(3)可以检测到远源同源物,这在一定程度上依赖于所使用的参数集;(4)接触界面参数明显优于经典二级结构参数;(5)仅根据蛋白质 - 蛋白质和蛋白质 - 水接触这两种状态进行的简单界面描述表现出奇地好;(6)使用核心权重显著提高了检测远源同源物的准确性;(7)基于用肌红蛋白接触谱在序列数据库中进行的搜索,预测病毒复制起始结合蛋白的C端结构域具有全螺旋折叠。序列 - 结构匹配概念足够通用,能够容纳多种蛋白质结构预测方法,包括目前正在开发的中等复杂度新模型。

相似文献

1
Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from three-dimensional structures.通过评估序列-结构适应性预测蛋白质结构。将序列与从三维结构推导的接触谱进行比对。
J Mol Biol. 1993 Aug 5;232(3):805-25. doi: 10.1006/jmbi.1993.1433.
2
Protein fold recognition by prediction-based threading.基于预测穿线法的蛋白质折叠识别
J Mol Biol. 1997 Jul 18;270(3):471-80. doi: 10.1006/jmbi.1997.1101.
3
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.一种蛋白质序列与结构分析及建模的综合方法。III. 使用多重结构比对对蛋白质结构家族中的序列保守性进行比较研究。
J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975.
4
Database of homology-derived protein structures and the structural meaning of sequence alignment.同源性衍生蛋白质结构数据库及序列比对的结构意义
Proteins. 1991;9(1):56-68. doi: 10.1002/prot.340090107.
5
Recognition of analogous and homologous protein folds: analysis of sequence and structure conservation.相似和同源蛋白质折叠的识别:序列和结构保守性分析
J Mol Biol. 1997 Jun 13;269(3):423-39. doi: 10.1006/jmbi.1997.1019.
6
Protein secondary structure prediction using local alignments.利用局部比对进行蛋白质二级结构预测。
J Mol Biol. 1997 Apr 25;268(1):31-6. doi: 10.1006/jmbi.1997.0958.
7
Analysis and prediction of functional sub-types from protein sequence alignments.基于蛋白质序列比对的功能亚类型分析与预测。
J Mol Biol. 2000 Oct 13;303(1):61-76. doi: 10.1006/jmbi.2000.4036.
8
Homology-based modeling of 3D structures of protein-protein complexes using alignments of modified sequence profiles.利用修饰序列谱比对进行蛋白质-蛋白质复合物三维结构的基于同源性的建模。
Int J Biol Macromol. 2008 Aug 15;43(2):198-208. doi: 10.1016/j.ijbiomac.2008.05.004. Epub 2008 May 21.
9
Protein structure prediction by threading methods: evaluation of current techniques.基于穿线法的蛋白质结构预测:当前技术评估
Proteins. 1995 Nov;23(3):337-55. doi: 10.1002/prot.340230308.
10
Recoverable one-dimensional encoding of three-dimensional protein structures.三维蛋白质结构的可恢复一维编码
Bioinformatics. 2005 May 15;21(10):2167-70. doi: 10.1093/bioinformatics/bti330. Epub 2005 Feb 18.

引用本文的文献

1
DPFunc: accurately predicting protein function via deep learning with domain-guided structure information.DPFunc:利用域引导的结构信息通过深度学习准确预测蛋白质功能。
Nat Commun. 2025 Jan 2;16(1):70. doi: 10.1038/s41467-024-54816-8.
2
Poorly Conserved P15 Proteins of Cileviruses Retain Elements of Common Ancestry and Putative Functionality: A Theoretical Assessment on the Evolution of Cilevirus Genomes.西病毒属中保守性较差的P15蛋白保留了共同祖先的元素和假定的功能:关于西病毒属基因组进化的理论评估
Front Plant Sci. 2021 Nov 5;12:771983. doi: 10.3389/fpls.2021.771983. eCollection 2021.
3
Analysis of the peroxiredoxin family: using active-site structure and sequence information for global classification and residue analysis.
过氧化物酶家族分析:利用活性位点结构和序列信息进行全局分类和残基分析。
Proteins. 2011 Mar;79(3):947-64. doi: 10.1002/prot.22936. Epub 2010 Dec 22.
4
Residue contact-count potentials are as effective as residue-residue contact-type potentials for ranking protein decoys.残基接触计数势在对蛋白质诱饵进行排序方面与残基-残基接触类型势一样有效。
BMC Struct Biol. 2008 Dec 8;8:53. doi: 10.1186/1472-6807-8-53.
5
Statistical potential for assessment and prediction of protein structures.用于蛋白质结构评估和预测的统计势
Protein Sci. 2006 Nov;15(11):2507-24. doi: 10.1110/ps.062416606.
6
The directional atomic solvation energy: an atom-based potential for the assignment of protein sequences to known folds.定向原子溶剂化能:一种基于原子的将蛋白质序列分配到已知折叠结构的势能。
Proc Natl Acad Sci U S A. 2002 Dec 10;99(25):16041-6. doi: 10.1073/pnas.252626399. Epub 2002 Dec 2.
7
Design of an optimal Chebyshev-expanded discrimination function for globular proteins.球状蛋白质最优切比雪夫展开判别函数的设计
Protein Sci. 2002 Aug;11(8):2010-21. doi: 10.1110/ps.0200702.
8
Statistical potentials for fold assessment.用于折叠评估的统计势
Protein Sci. 2002 Feb;11(2):430-48. doi: 10.1002/pro.110430.
9
Feasibility in the inverse protein folding protocol.反向蛋白质折叠方案的可行性。
Protein Sci. 1999 May;8(5):1001-9. doi: 10.1110/ps.8.5.1001.
10
Factors limiting the performance of prediction-based fold recognition methods.限制基于预测的折叠识别方法性能的因素。
Protein Sci. 1999 Apr;8(4):750-9. doi: 10.1110/ps.8.4.750.