Walter Luis J, Bereau Tristan
Institute for Theoretical Physics, Heidelberg University Philosophenweg 19 69120 Heidelberg Germany
Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University Im Neuenheimer Feld 205 69120 Heidelberg Germany.
Chem Sci. 2025 Jul 30. doi: 10.1039/d5sc03855c.
Molecular discovery within the vast chemical space remains a significant challenge due to the immense number of possible molecules and limited scalability of conventional screening methods. To approach chemical space exploration more effectively, we have developed an active learning-based method that uses transferable coarse-grained models to compress chemical space into varying levels of resolution. By using multiple representations of chemical space with different coarse-graining resolutions, we balance combinatorial complexity and chemical detail. To identify target compounds, we first transform the discrete molecular spaces into smooth latent representations. We then perform Bayesian optimization within these latent spaces, using molecular dynamics simulations to calculate target free energies of the coarse-grained compounds. This multi-level approach effectively balances exploration and exploitation at lower and higher resolutions, respectively. We demonstrate the effectiveness of our method by optimizing molecules to enhance phase separation in phospholipid bilayers. Our funnel-like strategy not only suggests optimal compounds but also provides insight into relevant neighborhoods in chemical space. We show how this neighborhood information from lower resolutions can guide the optimization at higher resolutions, thereby providing an efficient way to navigate large chemical spaces for free energy-based molecular optimization.
由于可能的分子数量巨大以及传统筛选方法的可扩展性有限,在广阔的化学空间中进行分子发现仍然是一项重大挑战。为了更有效地进行化学空间探索,我们开发了一种基于主动学习的方法,该方法使用可转移的粗粒度模型将化学空间压缩到不同的分辨率水平。通过使用具有不同粗粒度分辨率的化学空间的多种表示,我们平衡了组合复杂性和化学细节。为了识别目标化合物,我们首先将离散的分子空间转换为平滑的潜在表示。然后,我们在这些潜在空间中进行贝叶斯优化,使用分子动力学模拟来计算粗粒度化合物的目标自由能。这种多层次方法分别在较低和较高分辨率下有效地平衡了探索和利用。我们通过优化分子以增强磷脂双层中的相分离来证明我们方法的有效性。我们的漏斗状策略不仅能提出最优化合物,还能深入了解化学空间中的相关邻域。我们展示了来自较低分辨率的这种邻域信息如何指导较高分辨率下的优化,从而提供了一种在基于自由能的分子优化中导航大型化学空间的有效方法。