Traoré Seydou, Roberts Kyle E, Allouche David, Donald Bruce R, André Isabelle, Schiex Thomas, Barbe Sophie
Université De Toulouse, INSA, UPS, INP, LISBP, 135 Avenue de Rangueil, Toulouse, F-31077, France.
INRA, UMR792, Ingénierie des Systèmes Biologiques et des Procédés, Toulouse, F-31400, France.
J Comput Chem. 2016 May 5;37(12):1048-58. doi: 10.1002/jcc.24290. Epub 2016 Feb 2.
One of the main challenges in computational protein design (CPD) is the huge size of the protein sequence and conformational space that has to be computationally explored. Recently, we showed that state-of-the-art combinatorial optimization technologies based on Cost Function Network (CFN) processing allow speeding up provable rigid backbone protein design methods by several orders of magnitudes. Building up on this, we improved and injected CFN technology into the well-established CPD package Osprey to allow all Osprey CPD algorithms to benefit from associated speedups. Because Osprey fundamentally relies on the ability of A* to produce conformations in increasing order of energy, we defined new A* strategies combining CFN lower bounds, with new side-chain positioning-based branching scheme. Beyond the speedups obtained in the new A*-CFN combination, this novel branching scheme enables a much faster enumeration of suboptimal sequences, far beyond what is reachable without it. Together with the immediate and important speedups provided by CFN technology, these developments directly benefit to all the algorithms that previously relied on the DEE/ A* combination inside Osprey* and make it possible to solve larger CPD problems with provable algorithms.
计算蛋白质设计(CPD)的主要挑战之一是必须通过计算探索的蛋白质序列和构象空间的巨大规模。最近,我们表明,基于成本函数网络(CFN)处理的先进组合优化技术可将可证明的刚性主链蛋白质设计方法加速几个数量级。在此基础上,我们对CFN技术进行了改进,并将其引入到成熟的CPD软件包Osprey中,以使所有Osprey CPD算法都能受益于相关的加速效果。由于Osprey从根本上依赖于A以能量递增顺序生成构象的能力,我们定义了新的A策略,将CFN下界与基于新的侧链定位的分支方案相结合。除了在新的A*-CFN组合中获得的加速效果外,这种新颖的分支方案还能更快地枚举次优序列,这是没有它时远远无法实现的。连同CFN技术带来的直接且重要的加速效果,这些进展直接惠及了所有以前依赖于Osprey内部DEE/A*组合的算法,并使得使用可证明的算法解决更大的CPD问题成为可能。