BNLMS, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China.
Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China.
J Chem Inf Model. 2024 Jul 22;64(14):5470-5479. doi: 10.1021/acs.jcim.4c00432. Epub 2024 Jun 28.
Computer-assisted synthesis planning has become increasingly important in drug discovery. While deep-learning models have shown remarkable progress in achieving high accuracies for single-step retrosynthetic predictions, their performances in retrosynthetic route planning need to be checked. This study compares the intricate single-step models with a straightforward template enumeration approach for retrosynthetic route planning on a real-world drug molecule data set. Despite the superior single-step accuracy of advanced models, the template enumeration method with a heuristic-based retrosynthesis knowledge score was found to surpass them in efficiency in searching the reaction space, achieving a higher or comparable solve rate within the same time frame. This counterintuitive result underscores the importance of efficiency and retrosynthesis knowledge in retrosynthesis route planning and suggests that future research should incorporate a simple template enumeration as a benchmark. It also suggests that this simple yet effective strategy should be considered alongside more complex models to better cater to the practical needs of computer-assisted synthesis planning in drug discovery.
计算机辅助合成规划在药物发现中变得越来越重要。虽然深度学习模型在实现单步反合成预测的高精度方面取得了显著进展,但它们在反合成路线规划方面的性能仍需要进行检查。本研究比较了复杂的单步模型和简单的模板枚举方法,用于对真实药物分子数据集进行反合成路线规划。尽管先进模型的单步精度更高,但基于启发式反合成知识评分的模板枚举方法在搜索反应空间的效率方面超过了它们,在相同的时间框架内实现了更高或可比的求解率。这一反直觉的结果强调了效率和反合成知识在反合成路线规划中的重要性,并表明未来的研究应将简单的模板枚举作为基准纳入其中。它还表明,这种简单而有效的策略应与更复杂的模型一起考虑,以更好地满足药物发现中计算机辅助合成规划的实际需求。