Center for Computational Sciences , University of Tsukuba 1-1-1 Tennodai , Tsukuba , Ibaraki 305-8577 , Japan.
Institute of Chemistry - Centre for Glycomics , Dubravska cesta 9 , 84538 Bratislava , Slovakia.
J Chem Inf Model. 2019 Dec 23;59(12):5198-5206. doi: 10.1021/acs.jcim.9b00753. Epub 2019 Nov 21.
Nontargeted parallel cascade selection molecular dynamics (nt-PaCS-MD) is a method for enhanced conformational sampling of proteins. To search a broad conformational subspace, nt-PaCS-MD repeats cycles of conformational resampling from relevant initial structures. Generally, the conformational sampling efficiency of nt-PaCS-MD depends on a selection rule for the initial structures. In the original nt-PaCS-MD, the initial structures were selected by referring to structural distributions of protein configurations generated by conformational resampling (multiple short-time MD simulations). However, their structural redundancy among the initial structures was neglected for the cycles of conformational resampling, indicating that similar protein configurations might be frequently specified and resampled in every cycle in the original nt-PaCS-MD. To reduce the possibility of resampling from redundant initial structures, we propose an alternative selection rule that accounts for structural similarity among the initial structures. Specifically, a pairwise root-mean-square deviation (RMSD) is defined for all of the initial structures selected for all of the past cycles. Then a set of protein configurations with a larger pairwise RMSD is sequentially specified and resampled in the next cycle, which is regarded to as a history-dependent selection of initial structures by considering a profile of the past specified initial structures. The present scheme, termed extended nt-PaCS-MD, prevents us from resampling a set of redundant protein configurations. To check the conformational sampling efficiency of the extended nt-PaCS-MD, we used a middle-sized protein, T4 lysozyme, in explicit water. Through the assessment, this extended nt-PaCS-MD identified the open-closed transitions of T4 lysozyme more efficiently than the original nt-PaCS-MD.
非靶向平行级联选择分子动力学(nt-PaCS-MD)是一种增强蛋白质构象采样的方法。为了搜索广泛的构象子空间,nt-PaCS-MD 重复从相关初始结构中进行构象重采样的循环。通常,nt-PaCS-MD 的构象采样效率取决于初始结构的选择规则。在原始的 nt-PaCS-MD 中,初始结构是通过参考构象重采样(多次短时间 MD 模拟)生成的蛋白质构象的结构分布来选择的。然而,在构象重采样的循环中,它们在初始结构之间的结构冗余被忽略了,这表明在原始的 nt-PaCS-MD 中,相似的蛋白质构象可能在每个循环中经常被指定和重采样。为了减少从冗余初始结构中重采样的可能性,我们提出了一种替代的选择规则,该规则考虑了初始结构之间的结构相似性。具体来说,为所有过去循环中选择的所有初始结构定义了两两均方根偏差(RMSD)。然后,一组具有较大两两 RMSD 的蛋白质构象将在接下来的循环中依次指定和重采样,这被视为通过考虑过去指定的初始结构的轮廓来进行初始结构的历史依赖性选择。该方案称为扩展的 nt-PaCS-MD,可以防止我们重采样一组冗余的蛋白质构象。为了检查扩展的 nt-PaCS-MD 的构象采样效率,我们在明水中使用了一个中等大小的蛋白质 T4 溶菌酶。通过评估,与原始的 nt-PaCS-MD 相比,该扩展的 nt-PaCS-MD 更有效地识别了 T4 溶菌酶的开-闭转变。