Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia.
Laboratory of Chemoinformatics (UMR 7140 CNRS/UniStra), Université de Strasbourg, 4, Rue Blaise Pascal, 67000 Strasbourg, France.
Int J Mol Sci. 2021 Dec 27;23(1):248. doi: 10.3390/ijms23010248.
The selection of experimental conditions leading to a reasonable yield is an important and essential element for the automated development of a synthesis plan and the subsequent synthesis of the target compound. The classical QSPR approach, requiring one-to-one correspondence between chemical structure and a target property, can be used for optimal reaction conditions prediction only on a limited scale when only one condition component (e.g., catalyst or solvent) is considered. However, a particular reaction can proceed under several different conditions. In this paper, we describe the Likelihood Ranking Model representing an artificial neural network that outputs a list of different conditions ranked according to their suitability to a given chemical transformation. Benchmarking calculations demonstrated that our model outperformed some popular approaches to the theoretical assessment of reaction conditions, such as k Nearest Neighbors, and a recurrent artificial neural network performance prediction of condition components (reagents, solvents, catalysts, and temperature). The ability of the Likelihood Ranking model trained on a hydrogenation reactions dataset, (~42,000 reactions) from Reaxys database, to propose conditions that led to the desired product was validated experimentally on a set of three reactions with rich selectivity issues.
选择导致合理产率的实验条件是自动化合成计划开发和目标化合物后续合成的重要和必要元素。经典的定量构效关系(QSAR)方法要求化学结构与目标性质一一对应,因此仅当考虑一个条件成分(例如催化剂或溶剂)时,才能在有限的范围内预测最佳反应条件。然而,一个特定的反应可以在几种不同的条件下进行。在本文中,我们描述了可能性排序模型,它代表了一种人工神经网络,根据它们对给定化学转化的适用性,对不同的条件进行排序并输出一个列表。基准计算表明,我们的模型优于一些流行的反应条件理论评估方法,例如 k 最近邻(kNN)和条件成分(试剂、溶剂、催化剂和温度)的递归人工神经网络性能预测。在 Reaxys 数据库中对氢化反应数据集(约 42,000 个反应)进行训练的可能性排序模型,能够提出导致所需产物的条件,在一组具有丰富选择性问题的三个反应中进行了实验验证。