Suppr超能文献

球状蛋白质中的序列-结构匹配:在超二级和三级结构测定中的应用

Sequence-structure matching in globular proteins: application to supersecondary and tertiary structure determination.

作者信息

Godzik A, Skolnick J

机构信息

Department of Molecular Biology, Scripps Research Institute, La Jolla, CA 92037.

出版信息

Proc Natl Acad Sci U S A. 1992 Dec 15;89(24):12098-102. doi: 10.1073/pnas.89.24.12098.

Abstract

A methodology designed to address the inverse globular protein-folding problem (the identification of which sequences are compatible with a given three-dimensional structure) is described. By using a library of protein finger-prints, defined by the side chain interaction pattern, it is possible to match each structure to its own sequence in an exhaustive data base search. It is shown that this is a permissive requirement for the validation of the methodology. To pass the more rigorous test of identifying proteins that are not close sequence homologs, but that have similar structure, the method has been extended to include insertions and deletions in the sequence, which is compared to the fingerprint. This allows for the identification of sequences having little or no sequence homology to the fingerprint. Examples include plastocyanin/azurin/pseudoazurin, the globin family, different families of proteases and cytochromes, including cytochromes c' and b-562, actinidin/papain, and lysozyme/alpha-lactalbumin. Turning to supersecondary structure prediction, we find that alpha/beta/alpha fragments possess sufficient specificity to identify their own and related sequences. By threading a beta-hairpin through a sequence, it is possible to predict the location of such hairpins and turns with remarkable fidelity. Thus, the method greatly extends existing techniques for the prediction of both global structural homology and local supersecondary structure.

摘要

本文描述了一种用于解决反向球状蛋白折叠问题(即确定哪些序列与给定的三维结构兼容)的方法。通过使用由侧链相互作用模式定义的蛋白指纹库,可以在详尽的数据库搜索中将每个结构与其自身的序列进行匹配。结果表明,这是该方法验证的一个宽松要求。为了通过更严格的测试,即识别序列上不相近但结构相似的蛋白质,该方法已扩展到包括序列中的插入和缺失,并将其与指纹进行比较。这使得能够识别与指纹几乎没有或没有序列同源性的序列。例子包括质体蓝素/天青蛋白/假天青蛋白、珠蛋白家族、不同家族的蛋白酶和细胞色素,包括细胞色素c'和b - 562、猕猴桃蛋白酶/木瓜蛋白酶,以及溶菌酶/α-乳白蛋白。转向超二级结构预测,我们发现α/β/α片段具有足够的特异性来识别其自身及相关序列。通过将β-发夹穿过一个序列,可以以极高的保真度预测此类发夹和转角的位置。因此,该方法极大地扩展了现有的用于预测全局结构同源性和局部超二级结构的技术。

相似文献

5
Homology Searches Using Supersecondary Structure Code.使用超二级结构编码的同源性搜索。
Methods Mol Biol. 2019;1958:329-340. doi: 10.1007/978-1-4939-9161-7_17.

引用本文的文献

2
From local structure to a global framework: recognition of protein folds.从局部结构到全局框架:蛋白质折叠的识别
J R Soc Interface. 2014 Apr 16;11(95):20131147. doi: 10.1098/rsif.2013.1147. Print 2014 Jun 6.

本文引用的文献

1
Molecular technology. Designing proteins and peptides.
Nature. 1983 Jan 20;301(5897):200. doi: 10.1038/301200a0.
10
Selection of representative protein data sets.代表性蛋白质数据集的选择。
Protein Sci. 1992 Mar;1(3):409-17. doi: 10.1002/pro.5560010313.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验