Dolgikh Benedikt, Woldring Daniel
Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA.
Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, USA.
Methods Mol Biol. 2022;2491:63-73. doi: 10.1007/978-1-0716-2285-8_3.
Many auspicious clinical and industrial accomplishments have improved the human condition by means of protein engineering. Despite these achievements, our incomplete understanding of the sequence-structure-function relationship prevents rapid innovation. To tackle this problem, we must develop and integrate new and existing technologies. To date, directed evolution and rational design have dominated as protein engineering principles. Even so, prior to screening for novel or improved functions, a large collection of variants, within a protein library, exist along an ambiguous mutational terrain. Complicating things further, the choice of where to initialize investigation along a vast sequence space becomes even more difficult given that the majority of any sequence lacks function entirely. Unfortunately, even when considering functionally relevant positions, random substitutions can prove to be destabilizing, causing a hindrance to an otherwise function-inducing, stability-reliant folding process. To enhance productivity in the field, we seek to address this issue of destabilization, and subsequent disfunction, at protein-protein and protein-ligand interacting regions. Herein, the process of choosing amenable positions - and amino acids at those positions - allows for a refined, knowledge-based approach to combinatorial library design. Using structural data, we perform computational stability prediction with FoldX's PositionScan and Rosetta's ddG_monomer in tandem, allowing for the refinement of our thermodynamic stability data through the comparison of results. In turn, we provide a process for selecting in silico predicted mutually stabilizing positions and avoiding overly destabilizing ones that guides the site-wise diversification of combinatorial libraries.
许多卓越的临床和工业成果借助蛋白质工程改善了人类状况。尽管取得了这些成就,但我们对序列-结构-功能关系的不完全理解阻碍了快速创新。为解决这一问题,我们必须开发并整合新的和现有的技术。迄今为止,定向进化和理性设计一直是蛋白质工程的主导原则。即便如此,在筛选新的或改进的功能之前,蛋白质文库中的大量变体存在于一个模糊的突变区域中。更复杂的是,鉴于大多数序列完全缺乏功能,在广阔的序列空间中选择从何处开始研究变得更加困难。不幸的是,即使考虑功能相关的位置,随机替换也可能被证明是不稳定的,从而阻碍原本诱导功能、依赖稳定性的折叠过程。为提高该领域的生产力,我们试图解决蛋白质-蛋白质和蛋白质-配体相互作用区域的不稳定以及随后的功能障碍问题。在此,选择合适的位置以及这些位置上的氨基酸的过程允许采用一种基于知识的精细方法来设计组合文库。利用结构数据,我们串联使用FoldX的PositionScan和Rosetta的ddG_monomer进行计算稳定性预测,通过结果比较来完善我们的热力学稳定性数据。相应地,我们提供了一个在计算机上预测相互稳定位置并避免过度不稳定位置的过程,该过程指导组合文库的位点特异性多样化。