Honig B
Department of Biochemistry and Molecular Biophysics, Columbia University, 630 West 168 St., New York, NY 10032, USA.
J Mol Biol. 1999 Oct 22;293(2):283-93. doi: 10.1006/jmbi.1999.3006.
This article is a personal perspective on the developments in the field of protein folding over approximately the last 40 years. In addition to its historical aspects, the article presents a view of the principles of protein folding with particular emphasis on the relationship of these principles to the problem of protein structure prediction. It is argued that despite much that is new, the essential elements of our current understanding of protein folding were anticipated by researchers many years ago. These elements include the recognition of the central importance of the polypeptide backbone as a determinant of protein conformation, hierarchical protein folding, and multiple folding pathways. Important areas of progress include a detailed characterization of the folding pathways of a number of proteins and a fundamental understanding of the physical chemical forces that determine protein stability. Despite these developments, fold prediction algorithms still encounter difficulties in identifying the correct fold for a given sequence. This may be due to the possibility that the free energy differences between at least a few alternate conformations of many proteins are not large. Significant progress in protein structure prediction has been due primarily to the explosive growth of sequence and structural databases. However, further progress is likely to depend in part on the ability to combine information available from databases with principles and algorithms derived from physical chemical studies of protein folding. An approach to the integration of the two areas is outlined with specific reference to the PrISM program that is a fully integrated sequence/structural-analysis/fold-recognition/homology model building software system.
本文是对近40年来蛋白质折叠领域发展的个人观点。除了其历史方面,本文还呈现了蛋白质折叠原理的观点,特别强调了这些原理与蛋白质结构预测问题的关系。有人认为,尽管有很多新的内容,但我们目前对蛋白质折叠的理解的基本要素在多年前就被研究人员预见到了。这些要素包括认识到多肽主链作为蛋白质构象决定因素的核心重要性、蛋白质的层次折叠以及多种折叠途径。重要的进展领域包括对多种蛋白质折叠途径的详细表征以及对决定蛋白质稳定性的物理化学力的基本理解。尽管有这些进展,但折叠预测算法在为给定序列识别正确折叠时仍遇到困难。这可能是由于许多蛋白质的至少一些替代构象之间的自由能差异不大的可能性。蛋白质结构预测的重大进展主要归功于序列和结构数据库的爆炸式增长。然而,进一步的进展可能部分取决于将数据库中可用信息与从蛋白质折叠的物理化学研究中得出的原理和算法相结合的能力。本文概述了将这两个领域整合的一种方法,并特别提及了PrISM程序,它是一个完全集成的序列/结构分析/折叠识别/同源性模型构建软件系统。