Gong Yukang, Xue Dongyu, Chuai Guohui, Yu Jing, Liu Qi
Department of Ophthalmology, Shanghai Tenth People's Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University Shanghai 200072 China
Chem Sci. 2021 Oct 9;12(43):14459-14472. doi: 10.1039/d1sc02087k. eCollection 2021 Nov 10.
Various computational methods have been developed for quantitative modeling of organic chemical reactions; however, the lack of universality as well as the requirement of large amounts of experimental data limit their broad applications. Here, we present DeepReac+, an efficient and universal computational framework for prediction of chemical reaction outcomes and identification of optimal reaction conditions based on deep active learning. Under this framework, DeepReac is designed as a graph-neural-network-based model, which directly takes 2D molecular structures as inputs and automatically adapts to different prediction tasks. In addition, carefully-designed active learning strategies are incorporated to substantially reduce the number of necessary experiments for model training. We demonstrate the universality and high efficiency of DeepReac+ by achieving the state-of-the-art results with a minimum of labeled data on three diverse chemical reaction datasets in several scenarios. Collectively, DeepReac+ has great potential and utility in the development of AI-aided chemical synthesis. DeepReac+ is freely accessible at https://github.com/bm2-lab/DeepReac.
已经开发了各种计算方法用于有机化学反应的定量建模;然而,缺乏通用性以及对大量实验数据的需求限制了它们的广泛应用。在此,我们展示了DeepReac+,这是一个基于深度主动学习的用于预测化学反应结果和识别最佳反应条件的高效通用计算框架。在此框架下,DeepReac被设计为基于图神经网络的模型,它直接将二维分子结构作为输入,并自动适应不同的预测任务。此外,还纳入了精心设计的主动学习策略,以大幅减少模型训练所需的实验数量。我们通过在几种场景下的三个不同化学反应数据集上使用最少的标记数据取得了最先进的结果,证明了DeepReac+的通用性和高效性。总体而言,DeepReac+在人工智能辅助化学合成的开发中具有巨大的潜力和实用性。可通过https://github.com/bm2-lab/DeepReac免费访问DeepReac+。