Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
School of Computer Science, Wuhan University, Wuhan 430072, Hubei, P. R. China.
J Med Chem. 2022 Sep 22;65(18):12482-12496. doi: 10.1021/acs.jmedchem.2c01179. Epub 2022 Sep 6.
Many deep learning (DL)-based molecular generative models have been proposed to design novel molecules. These models may perform well on benchmarks, but they usually do not take real-world constraints into account, such as available training data set, synthetic accessibility, and scaffold diversity in drug discovery. In this study, a new algorithm, ChemistGA, was proposed by combining the traditional heuristic algorithm with DL, in which the crossover of the traditional genetic algorithm (GA) was redefined by DL in conjunction with GA, and an innovative backcrossing operation was implemented to generate desired molecules. Our results clearly show that ChemistGA not only retains the strength of the traditional GA but also greatly enhances the synthetic accessibility and success rate of the generated molecules with desired properties. Calculations on the two benchmarks illustrate that ChemistGA achieves impressive performance among the state-of-the-art baselines, and it opens a new avenue for the application of generative models to real-world drug discovery scenarios.
许多基于深度学习(DL)的分子生成模型被提出用于设计新分子。这些模型在基准测试中表现良好,但它们通常没有考虑到实际的约束条件,例如可用的训练数据集、合成可及性以及药物发现中的支架多样性。在这项研究中,通过将传统启发式算法与 DL 相结合,提出了一种新的算法 ChemistGA,其中传统遗传算法(GA)的交叉操作由 DL 重新定义,并结合 GA 实现了创新的回交操作,以生成所需的分子。我们的结果清楚地表明,ChemistGA 不仅保留了传统 GA 的优势,而且极大地提高了具有所需性质的生成分子的合成可及性和成功率。对两个基准的计算表明,ChemistGA 在最先进的基线中表现出色,为生成模型在实际药物发现场景中的应用开辟了新途径。