Kim Hyunseung, Choi Haeyeon, Kang Dongju, Lee Won Bo, Na Jonggeol
School of Chemical and Biological Engineering, Seoul National University Republic of Korea
Department of Chemical Engineering and Materials Science, Ewha Womans University Republic of Korea
Chem Sci. 2024 Apr 24;15(21):7908-7925. doi: 10.1039/d3sc05281h. eCollection 2024 May 29.
The goal of most materials discovery is to discover materials that are superior to those currently known. Fundamentally, this is close to extrapolation, which is a weak point for most machine learning models that learn the probability distribution of data. Herein, we develop reinforcement learning-guided combinatorial chemistry, which is a rule-based molecular designer driven by trained policy for selecting subsequent molecular fragments to get a target molecule. Since our model has the potential to generate all possible molecular structures that can be obtained from combinations of molecular fragments, unknown molecules with superior properties can be discovered. We theoretically and empirically demonstrate that our model is more suitable for discovering better compounds than probability distribution-learning models. In an experiment aimed at discovering molecules that hit seven extreme target properties, our model discovered 1315 of all target-hitting molecules and 7629 of five target-hitting molecules out of 100 000 trials, whereas the probability distribution-learning models failed. Moreover, it has been confirmed that every molecule generated under the binding rules of molecular fragments is 100% chemically valid. To illustrate the performance in actual problems, we also demonstrate that our models work well on two practical applications: discovering protein docking molecules and HIV inhibitors.
大多数材料发现的目标是发现比目前已知材料更优越的材料。从根本上讲,这接近于外推法,而外推法是大多数学习数据概率分布的机器学习模型的一个弱点。在此,我们开发了强化学习引导的组合化学,这是一种基于规则的分子设计方法,由经过训练的策略驱动,用于选择后续分子片段以获得目标分子。由于我们的模型有潜力生成从分子片段组合中可以获得的所有可能的分子结构,因此可以发现具有优越性质的未知分子。我们从理论和实证上证明,我们的模型比概率分布学习模型更适合发现更好的化合物。在一个旨在发现具有七种极端目标性质的分子的实验中,我们的模型在100000次试验中发现了所有命中目标的分子中的1315个以及五个命中目标的分子中的7629个,而概率分布学习模型则失败了。此外,已经证实,在分子片段的结合规则下生成的每个分子在化学上都是100%有效的。为了说明在实际问题中的性能,我们还证明了我们的模型在两个实际应用中表现良好:发现蛋白质对接分子和HIV抑制剂。