Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China.
Zhejiang Lab, Hangzhou, China.
Commun Biol. 2024 Sep 2;7(1):1074. doi: 10.1038/s42003-024-06746-w.
Target-aware drug discovery has greatly accelerated the drug discovery process to design small-molecule ligands with high binding affinity to disease-related protein targets. Conditioned on targeted proteins, previous works utilize various kinds of deep generative models and have shown great potential in generating molecules with strong protein-ligand binding interactions. However, beyond binding affinity, effective drug molecules must manifest other essential properties such as high drug-likeness, which are not explicitly addressed by current target-aware generative methods. In this article, aiming to bridge the gap of multi-objective target-aware molecule generation in the field of deep learning-based drug discovery, we propose ParetoDrug, a Pareto Monte Carlo Tree Search (MCTS) generation algorithm. ParetoDrug searches molecules on the Pareto Front in chemical space using MCTS to enable synchronous optimization of multiple properties. Specifically, ParetoDrug utilizes pretrained atom-by-atom autoregressive generative models for the exploration guidance to desired molecules during MCTS searching. Besides, when selecting the next atom symbol, a scheme named ParetoPUCT is proposed to balance exploration and exploitation. Benchmark experiments and case studies demonstrate that ParetoDrug is highly effective in traversing the large and complex chemical space to discover novel compounds with satisfactory binding affinities and drug-like properties for various multi-objective target-aware drug discovery tasks.
基于靶标的药物发现极大地加速了药物发现的过程,设计出与疾病相关蛋白靶标具有高结合亲和力的小分子配体。在靶向蛋白的条件下,以前的工作利用各种深度生成模型,并在生成具有强蛋白-配体结合相互作用的分子方面显示出巨大的潜力。然而,除了结合亲和力之外,有效的药物分子还必须表现出其他必要的特性,如高类药性,而当前基于靶标的生成方法并没有明确解决这些问题。在本文中,为了弥合基于深度学习的药物发现领域中多目标基于靶标的分子生成的差距,我们提出了 ParetoDrug,一种 Pareto 蒙特卡罗树搜索(MCTS)生成算法。ParetoDrug 使用 MCTS 在化学空间中的 Pareto 前沿上搜索分子,以实现多个特性的同步优化。具体来说,ParetoDrug 在 MCTS 搜索过程中利用预训练的原子到原子自回归生成模型来指导对期望分子的探索。此外,在选择下一个原子符号时,提出了一种名为 ParetoPUCT 的方案来平衡探索和利用。基准实验和案例研究表明,ParetoDrug 非常有效地遍历了庞大而复杂的化学空间,为各种多目标基于靶标的药物发现任务发现了具有满意结合亲和力和类药性的新型化合物。