Center for Structural Biology, Vanderbilt University, Nashville, TN, United States of America.
Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, United States of America.
PLoS Comput Biol. 2018 Feb 16;14(2):e1005999. doi: 10.1371/journal.pcbi.1005999. eCollection 2018 Feb.
Computational protein design has been successful in modeling fixed backbone proteins in a single conformation. However, when modeling large ensembles of flexible proteins, current methods in protein design have been insufficient. Large barriers in the energy landscape are difficult to traverse while redesigning a protein sequence, and as a result current design methods only sample a fraction of available sequence space. We propose a new computational approach that combines traditional structure-based modeling using the Rosetta software suite with machine learning and integer linear programming to overcome limitations in the Rosetta sampling methods. We demonstrate the effectiveness of this method, which we call BROAD, by benchmarking the performance on increasing predicted breadth of anti-HIV antibodies. We use this novel method to increase predicted breadth of naturally-occurring antibody VRC23 against a panel of 180 divergent HIV viral strains and achieve 100% predicted binding against the panel. In addition, we compare the performance of this method to state-of-the-art multistate design in Rosetta and show that we can outperform the existing method significantly. We further demonstrate that sequences recovered by this method recover known binding motifs of broadly neutralizing anti-HIV antibodies. Finally, our approach is general and can be extended easily to other protein systems. Although our modeled antibodies were not tested in vitro, we predict that these variants would have greatly increased breadth compared to the wild-type antibody.
计算蛋白质设计在单一构象下对固定骨架蛋白质进行建模方面已经取得了成功。然而,在对大型柔性蛋白质集合进行建模时,目前的蛋白质设计方法还不够完善。在重新设计蛋白质序列时,能量景观中的大障碍难以跨越,因此当前的设计方法仅能对可用序列空间的一小部分进行采样。我们提出了一种新的计算方法,该方法将基于结构的传统建模与机器学习和整数线性规划相结合,以克服 Rosetta 采样方法的局限性。我们通过在不断增加的抗 HIV 抗体预测广度方面对基准测试来证明这种称为 BROAD 的方法的有效性。我们使用这种新方法来提高天然存在的抗 HIV 抗体 VRC23 对 180 种不同 HIV 病毒株的预测广度,并实现了对该面板的 100%预测结合。此外,我们将这种方法的性能与 Rosetta 中的最新多态设计进行了比较,表明我们可以显著优于现有方法。我们进一步证明,通过这种方法恢复的序列恢复了广泛中和抗 HIV 抗体的已知结合基序。最后,我们的方法是通用的,可以很容易地扩展到其他蛋白质系统。尽管我们模拟的抗体尚未在体外进行测试,但我们预测这些变体与野生型抗体相比,广度会大大增加。