Hanf Karl J M
Department of Physical Biochemistry, Biogen Idec, Cambridge, MA, USA.
Methods Mol Biol. 2012;899:127-44. doi: 10.1007/978-1-61779-921-1_8.
Proteins, especially antibodies, are widely used as therapeutic and diagnostic agents. Computational protein design is a powerful tool for improving the affinity and stability of these molecules. We describe a protein design method which employs the dead-end elimination (DEE) and A* discrete search algorithms with a few improvements aimed at making the procedure more useful for actual projects to design proteins for better affinity or stability. DEE/A* and related algorithms allow vast search spaces of protein sequences and their alternative side chain conformations ("rotamers") to be systematically explored, to find those with the best free energy of folding or binding. To maximize a protein design project's chance of success, it needs to find a diverse set of sequences to experimentally synthesize. It should also find structures that score well, not only on the pairwise-additive energy function which DEE/A* and related search algorithms must use, but also on a post-search energy function with accurate treatment of solvation effects. Straight DEE/A*, however, typically finds vast numbers of very similar low-energy conformations, making it infeasible to find a diverse set of sequences or conformations. Herein, we describe a three-level DEE/A* procedure that uses DEE/A* at the level of sequences, at the level of rotamers, and at an intermediate "fleximer" level, to ensure a wide variety of sequences as well as a diverse set of conformations for each sequence.A physics-based method is also described herein for calculating the free energy of folding based on a thermodynamic cycle with a model of the unfolded state. The free energies of both folding and binding may be used for the final evaluation of the designed structures. For example, when designing for improved affinity (binding), we can also ensure that stability is not degraded by screening on the free energy of folding.
蛋白质,尤其是抗体,被广泛用作治疗和诊断试剂。计算蛋白质设计是提高这些分子亲和力和稳定性的强大工具。我们描述了一种蛋白质设计方法,该方法采用死端消除(DEE)和A离散搜索算法,并进行了一些改进,旨在使该过程对实际项目更有用,以设计出具有更好亲和力或稳定性的蛋白质。DEE/A及相关算法允许系统地探索蛋白质序列及其替代侧链构象(“旋转异构体”)的巨大搜索空间,以找到具有最佳折叠或结合自由能的序列和构象。为了最大限度地提高蛋白质设计项目成功的机会,需要找到一组多样化的序列进行实验合成。它还应该找到不仅在DEE/A及相关搜索算法必须使用的成对加和能量函数上得分良好,而且在能准确处理溶剂化效应的搜索后能量函数上得分也良好的结构。然而,直接的DEE/A通常会找到大量非常相似的低能量构象,使得找到一组多样化的序列或构象变得不可行。在此,我们描述了一种三级DEE/A程序,该程序在序列水平、旋转异构体水平和中间的“柔性异构体”水平上使用DEE/A,以确保有各种各样的序列以及每个序列的多样化构象集。本文还描述了一种基于物理的方法,用于基于具有未折叠状态模型的热力学循环计算折叠自由能。折叠和结合的自由能均可用于对设计结构的最终评估。例如,在设计提高亲和力(结合)时,我们还可以通过筛选折叠自由能来确保稳定性不会降低。