Hellinga H W, Richards F M
Department of Biochemistry, Duke University Medical Center, Durham, NC 27710.
Proc Natl Acad Sci U S A. 1994 Jun 21;91(13):5803-7. doi: 10.1073/pnas.91.13.5803.
Rational design of protein structure requires the identification of optimal sequences to carry out a particular function within a given backbone structure. A general solution to this problem requires that a potential function describing the energy of the system as a function of its atomic coordinates be minimized simultaneously over all available sequences and their three-dimensional atomic configurations. Here we present a method that explicitly minimizes a semiempirical potential function simultaneously in these two spaces, using a simulated annealing approach. The method takes the fixed three-dimensional coordinates of a protein backbone and stochastically generates possible sequences through the introduction of random mutations. The corresponding three-dimensional coordinates are constructed for each sequence by "redecorating" the backbone coordinates of the original structure with the corresponding side chains. These are then allowed to vary in their structure by random rotations around free torsional angles to generate a stochastic walk in configurational space. We have named this method protein simulated evolution, because, in loose analogy with natural selection, it randomly selects for allowed solutions in the sequence of a protein subject to the "selective pressure" of a potential function. Energies predicted by this method for sequences of a small group of residues in the hydrophobic core of the phage lambda cI repressor correlate well with experimentally determined biological activities. This "genetic selection by computer" approach has potential applications in protein engineering, rational protein design, and structure-based drug discovery.
蛋白质结构的合理设计需要确定在给定主链结构内执行特定功能的最佳序列。解决这个问题的一般方法要求一个描述系统能量作为其原子坐标函数的势函数,在所有可用序列及其三维原子构型上同时最小化。在这里,我们提出一种方法,使用模拟退火方法在这两个空间中明确地同时最小化一个半经验势函数。该方法采用蛋白质主链的固定三维坐标,并通过引入随机突变随机生成可能的序列。通过用相应的侧链“重新装饰”原始结构的主链坐标,为每个序列构建相应的三维坐标。然后通过围绕自由扭转角的随机旋转使它们的结构发生变化,以在构型空间中产生随机游走。我们将这种方法命名为蛋白质模拟进化,因为,与自然选择大致类似地,它在势函数的“选择压力”下,在蛋白质序列中随机选择允许的解决方案。通过这种方法预测的噬菌体λ cI 阻遏物疏水核心中一小群残基序列的能量与实验确定的生物活性密切相关。这种“计算机遗传选择”方法在蛋白质工程、合理的蛋白质设计和基于结构的药物发现中具有潜在应用。