Department of Biochemistry, University of Oxford, Oxford OX1 3QU, U.K.
J Chem Theory Comput. 2023 Jun 27;19(12):3705-3720. doi: 10.1021/acs.jctc.3c00140. Epub 2023 Jun 7.
The structure of proteins has long been recognized to hold the key to understanding and engineering their function, and rapid advances in structural biology and protein structure prediction are now supplying researchers with an ever-increasing wealth of structural information. Most of the time, however, structures can only be determined in free energy minima, one at a time. While conformational flexibility may thus be inferred from static end-state structures, their interconversion mechanisms─a central ambition of structural biology─are often beyond the scope of direct experimentation. Given the dynamical nature of the processes in question, many studies have attempted to explore conformational transitions using molecular dynamics (MD). However, ensuring proper convergence and reversibility in the predicted transitions is extremely challenging. In particular, a commonly used technique to map out a path from a starting to a target conformation called steered MD (SMD) can suffer from starting-state dependence (hysteresis) when combined with techniques such as umbrella sampling (US) to compute the free energy profile of a transition. Here, we study this problem in detail on conformational changes of increasing complexity. We also present a new, history-independent approach that we term "MEMENTO" (Morphing End states by Modelling Ensembles with iNdependent TOpologies) to generate paths that alleviate hysteresis in the construction of conformational free energy profiles. MEMENTO utilizes template-based structure modelling to restore physically reasonable protein conformations based on coordinate interpolation (morphing) as an ensemble of plausible intermediates, from which a smooth path is picked. We compare SMD and MEMENTO on well-characterized test cases (the toy peptide deca-alanine and the enzyme adenylate kinase) before discussing its use in more complicated systems (the kinase P38α and the bacterial leucine transporter LeuT). Our work shows that for all but the simplest systems SMD paths should not in general be used to seed umbrella sampling or related techniques, unless the paths are validated by consistent results from biased runs in opposite directions. MEMENTO, on the other hand, performs well as a flexible tool to generate intermediate structures for umbrella sampling. We also demonstrate that extended end-state sampling combined with MEMENTO can aid the discovery of collective variables on a case-by-case basis.
蛋白质的结构一直被认为是理解和设计其功能的关键,结构生物学和蛋白质结构预测的快速发展现在为研究人员提供了越来越多的结构信息。然而,大多数情况下,结构只能在自由能极小值中确定,一次一个。虽然构象灵活性可以从静态终态结构中推断出来,但它们的转换机制——结构生物学的核心目标——往往超出了直接实验的范围。考虑到所讨论过程的动态性质,许多研究试图使用分子动力学 (MD) 探索构象转变。然而,确保预测的转变中适当的收敛性和可逆性极具挑战性。特别是,一种常用的从起始构象到目标构象绘制路径的技术称为导向 MD (SMD),当与伞状采样 (US) 等技术结合以计算转变的自由能曲线时,可能会受到起始态依赖性(滞后)的影响。在这里,我们详细研究了这个问题,包括越来越复杂的构象变化。我们还提出了一种新的、与历史无关的方法,称为“MEMENTO”(通过对独立拓扑的建模来重塑末端状态),以生成缓解构建构象自由能曲线时滞后的路径。MEMENTO 利用基于模板的结构建模,根据坐标插值(变形)作为合理的中间物系综,恢复基于物理的蛋白质构象,从中选择平滑的路径。我们在经过充分表征的测试案例(玩具肽 deca-alanine 和酶腺嘌呤激酶)上比较了 SMD 和 MEMENTO,然后讨论了它在更复杂系统(激酶 P38α 和细菌亮氨酸转运蛋白 LeuT)中的使用。我们的工作表明,除了最简单的系统之外,一般情况下,除非通过从相反方向进行有偏差的运行获得一致的结果来验证路径,否则不应将 SMD 路径用于播种伞状采样或相关技术。另一方面,MEMENTO 作为生成伞状采样中间结构的灵活工具表现良好。我们还证明,与 MEMENTO 结合的扩展末端采样可以帮助逐个案例发现集体变量。