Ebi Tomoya, Sen Abhijit, Dhital Raghu N, Yamada Yoichi M A, Kaneko Hiromasa
Department of Applied Chemistry, School of Science and Technology, Meiji University, 1-1-1 Higashi-Mita, Tama-ku, Kawasaki, Kanagawa 214-8571, Japan.
RIKEN Center for Sustainable Resource Science, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.
ACS Omega. 2021 Oct 5;6(41):27578-27586. doi: 10.1021/acsomega.1c04826. eCollection 2021 Oct 19.
To improve product yields in synthetic reactions, it is important to use appropriate catalysts. In this study, we used machine learning to design catalysts for a reaction system in which both Buchwald-Hartwig-type and Suzuki-Miyaura-type cross-coupling reactions proceed simultaneously. First, using an existing dataset, yield prediction models were constructed with machine learning between experimental conditions, including the substrate and catalyst and the yields of the two products. Seven methods for calculating both the substrate and catalyst descriptors were proposed, and the predictive ability of the yield prediction models was discussed in terms of the descriptors and machine learning methods. Then, the constructed models were used to predict the compound yields for new combinations of substrates and catalysts, and the predictions were experimentally validated with high reproducibility, confirming that machine learning can predict yields from experimental conditions with high accuracy. In addition, to design catalysts that will improve the yields in our dataset, we added datasets collected from scientific papers and designed catalyst ligands. The proposed catalyst candidates were tested in actual synthetic experiments, and the experimental results exceeded the existing yields.
为了提高合成反应中的产物收率,使用合适的催化剂很重要。在本研究中,我们使用机器学习为一个反应体系设计催化剂,在该反应体系中布赫瓦尔德-哈特维希型和铃木-宫浦型交叉偶联反应同时进行。首先,利用现有的数据集,通过机器学习在包括底物和催化剂以及两种产物收率的实验条件之间构建收率预测模型。提出了七种计算底物和催化剂描述符的方法,并从描述符和机器学习方法的角度讨论了收率预测模型的预测能力。然后,使用构建的模型预测底物和催化剂新组合的化合物收率,并通过实验以高重现性验证了这些预测,证实机器学习可以从实验条件高精度地预测收率。此外,为了设计能提高我们数据集中收率的催化剂,我们添加了从科学论文中收集的数据集并设计了催化剂配体。所提出的催化剂候选物在实际合成实验中进行了测试,实验结果超过了现有收率。