Janssen Pharmaceuticals, Inc. , San Diego , California 92121 , United States.
J Chem Theory Comput. 2019 Oct 8;15(10):5703-5715. doi: 10.1021/acs.jctc.9b00483. Epub 2019 Sep 10.
Custom-designed ligand-binding proteins represent a promising class of macromolecules with exciting applications toward the design of new enzymes or the engineering of antibodies and small-molecule recruited proteins for therapeutic interventions. However, several challenges remain in designing a protein sequence such that the binding site organization results in high affinity interaction with a bound ligand. Here, we study the dynamics of explicitly solvated designed proteins through all-atom molecular dynamics (MD) simulations to gain insight into the causes that lead to the low affinity or instability of most of these designs, despite the prediction of their success by the computational design methodology. Simulations ranging from 500 to 1000 ns per replicate were conducted on 37 designed protein variants encompassing two distinct folds and a range of ligand affinities, resulting in more than 180 μs of combined sampling. The simulations provide retrospective insights into the properties affecting ligand affinity that can prove useful in guiding further steps of design optimization. Features indicate that entropic components are particularly important for affinity, which are not easily incorporated in the empirical models often used in design protocols. Additionally, we demonstrate that the application of machine learning approaches built upon the output from the simulations can help discriminate between successful and failed binders, such that MD could act as a screening step in protein design, resulting in a more efficient process.
定制设计的配体结合蛋白是一类很有前途的大分子,它们在设计新酶或抗体,以及为治疗干预工程设计招募小分子的蛋白质方面具有令人兴奋的应用。然而,在设计一个蛋白质序列时,仍然存在一些挑战,即结合位点的组织导致与结合配体的高亲和力相互作用。在这里,我们通过全原子分子动力学(MD)模拟研究了明确溶剂化的设计蛋白的动力学,以深入了解导致大多数这些设计亲和力低或不稳定的原因,尽管计算设计方法学预测了它们的成功。针对两种不同的折叠和一系列配体亲和力的 37 种设计蛋白变体,每个变体进行了 500 到 1000 纳秒的模拟,总共进行了超过 180 微秒的组合采样。这些模拟提供了对影响配体亲和力的性质的回溯性见解,这些见解有助于指导进一步的设计优化步骤。特征表明,熵分量对亲和力特别重要,但在设计协议中常用的经验模型中不容易包含这些分量。此外,我们证明了基于模拟输出构建的机器学习方法的应用可以帮助区分成功和失败的结合物,因此 MD 可以作为蛋白质设计中的筛选步骤,从而提高效率。