Havranek James J, Baker David
Department of Biochemistry, Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
Protein Sci. 2009 Jun;18(6):1293-305. doi: 10.1002/pro.142.
Computational protein design relies on a number of approximations to efficiently search the huge sequence space available to proteins. The fixed backbone and rotamer approximations in particular are important for formulating protein design as a discrete combinatorial optimization problem. However, the resulting coarse-grained sampling of possible side-chain terminal positions is problematic for the design of protein function, which depends on precise positioning of side-chain atoms. Although backbone flexibility can greatly increase the conformation freedom of side-chain functional groups, it is not obvious which backbone movements will generate the critical constellation of atoms responsible for protein function. Here, we report an automated method for identifying protein backbone movements that can give rise to any specified set of desired side-chain atomic placements and interactions, using protein-DNA interfaces as a model system. We use a library of previously observed protein-DNA interactions (motifs) and a rotamer-based description of side-chain conformation freedom to identify placements for the protein backbone that can give rise to a favorable side-chain interaction with DNA. We describe a tree-search algorithm for identifying those combinations of interactions from the library that can be realized with minimal perturbation of the protein backbone. We compare the efficiency of this method with the alternative approach of building and screening alternate backbone conformations.
计算蛋白质设计依赖于多种近似方法,以便有效地搜索蛋白质可利用的巨大序列空间。特别是固定主链和旋转异构体近似方法,对于将蛋白质设计表述为离散组合优化问题非常重要。然而,由此产生的对可能的侧链末端位置的粗粒度采样,对于蛋白质功能设计来说是个问题,因为蛋白质功能取决于侧链原子的精确定位。尽管主链灵活性可极大增加侧链官能团的构象自由度,但并不清楚哪些主链运动将产生负责蛋白质功能的关键原子组合。在此,我们报告一种自动化方法,以蛋白质 - DNA 界面作为模型系统,识别能够产生任何指定的所需侧链原子排列和相互作用的蛋白质主链运动。我们使用先前观察到的蛋白质 - DNA 相互作用(基序)库以及基于旋转异构体的侧链构象自由度描述,来识别能够与 DNA 产生有利侧链相互作用的蛋白质主链排列。我们描述一种树搜索算法,用于从库中识别那些能够以对蛋白质主链最小扰动来实现的相互作用组合。我们将此方法的效率与构建和筛选替代主链构象的替代方法进行比较。