Haxton Thomas K
Molecular Foundry, Lawrence Berkeley National Laboratory , Berkeley, California 94720, United States.
J Chem Theory Comput. 2015 Mar 10;11(3):1244-54. doi: 10.1021/ct500881x.
We introduce a method to bring nearly atomistic resolution to coarse-grained models, and we apply the method to proteins. Using a small number of coarse-grained sites (about one per eight atoms) but assigning an independent three-dimensional orientation to each site, we preferentially integrate out stiff degrees of freedom (bond lengths and angles, as well as dihedral angles in rings) that are accurately approximated by their average values, while retaining soft degrees of freedom (unconstrained dihedral angles) mostly responsible for conformational variability. We demonstrate that our scheme retains nearly atomistic resolution by mapping all experimental protein configurations in the Protein Data Bank onto coarse-grained configurations and then analytically backmapping those configurations back to all-atom configurations. This roundtrip mapping throws away all information associated with the eliminated (stiff) degrees of freedom except for their average values, which we use to construct optimal backmapping functions. Despite the 4:1 reduction in the number of degrees of freedom, we find that heavy atoms move only 0.051 Å on average during the roundtrip mapping, while hydrogens move 0.179 Å on average, an unprecedented combination of efficiency and accuracy among coarse-grained protein models. We discuss the advantages of such a high-resolution model for parametrizing effective interactions and accurately calculating observables through direct or multiscale simulations.
我们介绍了一种为粗粒度模型带来近乎原子分辨率的方法,并将该方法应用于蛋白质。使用少量的粗粒度位点(大约每八个原子一个),但为每个位点赋予独立的三维取向,我们优先整合出可以通过其平均值精确近似的刚性自由度(键长和键角,以及环中的二面角),同时保留主要负责构象变异性的柔性自由度(无约束二面角)。我们通过将蛋白质数据库中的所有实验蛋白质构象映射到粗粒度构象,然后将这些构象解析地反向映射回全原子构象,证明了我们的方案保留了近乎原子的分辨率。这种往返映射丢弃了与消除的(刚性)自由度相关的所有信息,除了它们的平均值,我们用这些平均值来构建最佳反向映射函数。尽管自由度数量减少了4:1,但我们发现在往返映射过程中,重原子平均仅移动0.051 Å,而氢原子平均移动0.179 Å,这在粗粒度蛋白质模型中是前所未有的效率与准确性的结合。我们讨论了这种高分辨率模型在参数化有效相互作用以及通过直接或多尺度模拟精确计算可观测量方面的优势。