Singh Sukriti, Hernández-Lobato José Miguel
Department of Engineering, University of Cambridge, Cambridge, UK.
Commun Chem. 2024 Jun 14;7(1):136. doi: 10.1038/s42004-024-01219-x.
Recent years have seen a rapid growth in the application of various machine learning methods for reaction outcome prediction. Deep learning models have gained popularity due to their ability to learn representations directly from the molecular structure. Gaussian processes (GPs), on the other hand, provide reliable uncertainty estimates but are unable to learn representations from the data. We combine the feature learning ability of neural networks (NNs) with uncertainty quantification of GPs in a deep kernel learning (DKL) framework to predict the reaction outcome. The DKL model is observed to obtain very good predictive performance across different input representations. It significantly outperforms standard GPs and provides comparable performance to graph neural networks, but with uncertainty estimation. Additionally, the uncertainty estimates on predictions provided by the DKL model facilitated its incorporation as a surrogate model for Bayesian optimization (BO). The proposed method, therefore, has a great potential towards accelerating reaction discovery by integrating accurate predictive models that provide reliable uncertainty estimates with BO.
近年来,各种机器学习方法在反应结果预测中的应用迅速增长。深度学习模型因其能够直接从分子结构中学习表示而受到欢迎。另一方面,高斯过程(GPs)提供可靠的不确定性估计,但无法从数据中学习表示。我们在深度核学习(DKL)框架中将神经网络(NNs)的特征学习能力与GPs的不确定性量化相结合,以预测反应结果。观察到DKL模型在不同的输入表示上都能获得非常好的预测性能。它显著优于标准GPs,并且与图神经网络具有可比的性能,但具有不确定性估计。此外,DKL模型提供的预测不确定性估计有助于将其作为贝叶斯优化(BO)的替代模型。因此,所提出的方法通过将提供可靠不确定性估计的准确预测模型与BO相结合,在加速反应发现方面具有巨大潜力。