Liebl Korbinian, Voth Gregory A
Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, United States.
J Chem Theory Comput. 2025 May 13;21(9):4846-4854. doi: 10.1021/acs.jctc.5c00063. Epub 2025 Apr 16.
Bottom-up coarse-graining refers to the development of low-resolution simulation models that are thermodynamically consistent with certain distributions from fully atomistic simulations. Force-matching and relative entropy minimization represent two major, frequently applied methods that allow to develop such bottom-up models. Nevertheless, atomistic simulations can often provide only limited sampling of the phase space. For bottom-up coarse-graining, these limitations may result in overfitting of the atomistic reference data, especially for large molecular complexes, where the learning may be agnostic of the actual affinities between binding partners. As a solution to this problem, we devise a data-driven machine learning hybrid coarse-graining concept that represents a regularized version of the relative entropy minimization approach. We demonstrate that this new approach allows one to develop coarse-grained models for molecular complexes that reproduce the targeted binding affinity but also describe the underlying complex structure accurately. The trained models therefore show diverse behavior as they can undergo frequent unbinding and binding events and are also transferable for simulating entire protein lattices, e.g., for a virus capsid.
自底向上的粗粒化是指开发与全原子模拟中的某些分布在热力学上一致的低分辨率模拟模型。力匹配和相对熵最小化是两种主要的、经常应用的方法,可用于开发此类自底向上的模型。然而,原子模拟通常只能对相空间进行有限的采样。对于自底向上的粗粒化,这些限制可能导致原子参考数据的过度拟合,特别是对于大分子复合物,其中学习可能无法识别结合伙伴之间的实际亲和力。作为这个问题的解决方案,我们设计了一种数据驱动的机器学习混合粗粒化概念,它代表了相对熵最小化方法的正则化版本。我们证明,这种新方法允许为分子复合物开发粗粒化模型,该模型不仅能重现目标结合亲和力,还能准确描述潜在的复合物结构。因此,经过训练的模型表现出多样的行为,因为它们可以频繁地经历解离和结合事件,并且还可转移用于模拟整个蛋白质晶格,例如病毒衣壳。