Godzik A, Skolnick J
Department of Molecular Biology, Scripps Research Institute, La Jolla, CA 92037.
Proc Natl Acad Sci U S A. 1992 Dec 15;89(24):12098-102. doi: 10.1073/pnas.89.24.12098.
A methodology designed to address the inverse globular protein-folding problem (the identification of which sequences are compatible with a given three-dimensional structure) is described. By using a library of protein finger-prints, defined by the side chain interaction pattern, it is possible to match each structure to its own sequence in an exhaustive data base search. It is shown that this is a permissive requirement for the validation of the methodology. To pass the more rigorous test of identifying proteins that are not close sequence homologs, but that have similar structure, the method has been extended to include insertions and deletions in the sequence, which is compared to the fingerprint. This allows for the identification of sequences having little or no sequence homology to the fingerprint. Examples include plastocyanin/azurin/pseudoazurin, the globin family, different families of proteases and cytochromes, including cytochromes c' and b-562, actinidin/papain, and lysozyme/alpha-lactalbumin. Turning to supersecondary structure prediction, we find that alpha/beta/alpha fragments possess sufficient specificity to identify their own and related sequences. By threading a beta-hairpin through a sequence, it is possible to predict the location of such hairpins and turns with remarkable fidelity. Thus, the method greatly extends existing techniques for the prediction of both global structural homology and local supersecondary structure.
本文描述了一种用于解决反向球状蛋白折叠问题(即确定哪些序列与给定的三维结构兼容)的方法。通过使用由侧链相互作用模式定义的蛋白指纹库,可以在详尽的数据库搜索中将每个结构与其自身的序列进行匹配。结果表明,这是该方法验证的一个宽松要求。为了通过更严格的测试,即识别序列上不相近但结构相似的蛋白质,该方法已扩展到包括序列中的插入和缺失,并将其与指纹进行比较。这使得能够识别与指纹几乎没有或没有序列同源性的序列。例子包括质体蓝素/天青蛋白/假天青蛋白、珠蛋白家族、不同家族的蛋白酶和细胞色素,包括细胞色素c'和b - 562、猕猴桃蛋白酶/木瓜蛋白酶,以及溶菌酶/α-乳白蛋白。转向超二级结构预测,我们发现α/β/α片段具有足够的特异性来识别其自身及相关序列。通过将β-发夹穿过一个序列,可以以极高的保真度预测此类发夹和转角的位置。因此,该方法极大地扩展了现有的用于预测全局结构同源性和局部超二级结构的技术。