Center for Computational Sciences, University of Tsukuba , Tennodai 1-1-1, Tsukuba, Ibaraki 305-8577, Japan.
J Chem Theory Comput. 2017 Mar 14;13(3):1411-1423. doi: 10.1021/acs.jctc.6b01112. Epub 2017 Feb 22.
Structural Dissimilarity Sampling (SDS) is proposed as an efficient conformational search method to promote structural transitions essential for the biological functions of proteins. In SDS, initial structures are selected based on structural dissimilarity, and conformational resampling is repeated. Conformational resampling is performed as follows: (I) arrangement of initial structures for a diverse distribution at the edge of a conformational subspace and (II) promotion of the structural transitions with multiple short-time molecular dynamics (MD) simulations restarting from the diversely distributed initial structures. Cycles of (I) and (II) are repeated to intensively promote structural transitions because conformational resampling from the initial structures would quickly expand conformational distributions toward unvisited conformational subspaces. As a demonstration, SDS was first applied to maltodextrin binding protein (MBP) in explicit water to reproduce structural transitions between the open and closed states of MBP. Structural transitions of MBP were successfully reproduced with SDS in nanosecond-order simulation times. Starting from both the open and closed forms, SDS successfully reproduced the structural transitions within 25 cycles (a total of 250 ns of simulation time). For reference, a conventional long-time (500 ns) MD simulation under NPT (300 K and 1 bar) starting from the open form failed to reproduce the structural transition. In addition to the open-closed motions of MBP, SDS was applied to folding processes of the fast-folding proteins (chignolin, Trp-cage, and villin) and successfully sampled their native states. To confirm how the selections of initial structures affected conformational sampling efficiency, numbers of base sets for characterizing structural dissimilarity of initial structures were addressed in distinct trials of SDS. The parameter searches showed that the conformational sampling efficiency was relatively insensitive with respect to the numbers of base sets, indicating the robustness of SDS for actual applications.
结构非相似性抽样 (SDS) 被提出作为一种有效的构象搜索方法,以促进蛋白质生物功能所必需的结构转变。在 SDS 中,根据结构非相似性选择初始结构,并重复构象重采样。构象重采样如下进行:(I) 初始结构的排列,以在构象子空间的边缘实现多样化分布,以及 (II) 通过从多样化分布的初始结构重新启动的多个短时间分子动力学 (MD) 模拟来促进结构转变。(I) 和 (II) 的循环重复进行,以密集地促进结构转变,因为从初始结构进行构象重采样会迅速将构象分布扩展到未访问的构象子空间。作为演示,SDS 首先应用于麦芽糖结合蛋白 (MBP) 在显式水中,以重现 MBP 的开放和关闭状态之间的结构转变。在纳秒级模拟时间内,SDS 成功地重现了 MBP 的结构转变。从开放和关闭形式开始,SDS 成功地在 25 个循环内 (总共 250 ns 的模拟时间) 重现了结构转变。作为参考,从开放形式开始的常规长时间 (500 ns) MD 模拟在 NPT (300 K 和 1 bar) 下未能重现结构转变。除了 MBP 的开-闭运动之外,SDS 还应用于快速折叠蛋白 (chignolin、Trp-cage 和 villin) 的折叠过程,并成功地采样了它们的天然状态。为了确认初始结构的选择如何影响构象采样效率,在 SDS 的不同试验中解决了用于描述初始结构结构非相似性的基集数量的问题。参数搜索表明,构象采样效率对基集数量相对不敏感,表明 SDS 对于实际应用具有稳健性。