Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China.
School of Biomedical Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China.
Nat Commun. 2023 May 25;14(1):3009. doi: 10.1038/s41467-023-38851-5.
Retrosynthesis planning, the process of identifying a set of available reactions to synthesize the target molecules, remains a major challenge in organic synthesis. Recently, computer-aided synthesis planning has gained renewed interest and various retrosynthesis prediction algorithms based on deep learning have been proposed. However, most existing methods are limited to the applicability and interpretability of model predictions, and further improvement of predictive accuracy to a more practical level is still required. In this work, inspired by the arrow-pushing formalism in chemical reaction mechanisms, we present an end-to-end architecture for retrosynthesis prediction called Graph2Edits. Specifically, Graph2Edits is based on graph neural network to predict the edits of the product graph in an auto-regressive manner, and sequentially generates transformation intermediates and final reactants according to the predicted edits sequence. This strategy combines the two-stage processes of semi-template-based methods into one-pot learning, improving the applicability in some complicated reactions, and also making its predictions more interpretable. Evaluated on the standard benchmark dataset USPTO-50k, our model achieves the state-of-the-art performance for semi-template-based retrosynthesis with a promising 55.1% top-1 accuracy.
逆合成规划,即确定一组可用反应来合成目标分子的过程,仍然是有机合成中的一个主要挑战。最近,计算机辅助合成规划重新引起了人们的兴趣,并且已经提出了各种基于深度学习的逆合成预测算法。然而,大多数现有的方法都受到模型预测的适用性和可解释性的限制,并且仍然需要进一步提高预测准确性,使其达到更实用的水平。在这项工作中,受化学反应机制中箭推形式主义的启发,我们提出了一种名为 Graph2Edits 的端到端逆合成预测架构。具体来说,Graph2Edits 基于图神经网络,以自回归的方式预测产物图的编辑,然后根据预测的编辑序列依次生成转化中间体和最终反应物。这种策略将基于半模板方法的两阶段过程结合到一锅学习中,提高了在某些复杂反应中的适用性,同时也使其预测更加可解释。在标准基准数据集 USPTO-50k 上进行评估,我们的模型在基于半模板的逆合成方面实现了最先进的性能,具有有前途的 55.1%的顶级准确率。