Optibrium Limited, Cambridge, UK.
Research and Development, International Flavors & Fragrances, Union Beach, NJ, USA.
J Comput Aided Mol Des. 2021 Nov;35(11):1125-1140. doi: 10.1007/s10822-021-00424-3. Epub 2021 Oct 30.
Predicting the sensory properties of compounds is challenging due to the subjective nature of the experimental measurements. This testing relies on a panel of human participants and is therefore also expensive and time-consuming. We describe the application of a state-of-the-art deep learning method, Alchemite™, to the imputation of sparse physicochemical and sensory data and compare the results with conventional quantitative structure-activity relationship methods and a multi-target graph convolutional neural network. The imputation model achieved a substantially higher accuracy of prediction, with improvements in R between 0.26 and 0.45 over the next best method for each sensory property. We also demonstrate that robust uncertainty estimates generated by the imputation model enable the most accurate predictions to be identified and that imputation also more accurately predicts activity cliffs, where small changes in compound structure result in large changes in sensory properties. In combination, these results demonstrate that the use of imputation, based on data from less expensive, early experiments, enables better selection of compounds for more costly studies, saving experimental time and resources.
由于实验测量的主观性,预测化合物的感官性质具有挑战性。这种测试依赖于一组人类参与者,因此也昂贵且耗时。我们描述了一种最先进的深度学习方法 Alchemite™ 的应用,用于稀疏物理化学和感官数据的插补,并将结果与传统的定量构效关系方法和多目标图卷积神经网络进行比较。插补模型实现了更高的预测准确性,对于每种感官性质,与下一个最佳方法相比,R 值提高了 0.26 到 0.45。我们还证明,插补模型生成的稳健不确定性估计可以确定最准确的预测,并且插补还可以更准确地预测活性悬崖,其中化合物结构的微小变化会导致感官性质的巨大变化。总之,这些结果表明,基于更便宜、早期实验的数据进行插补,可以更好地选择化合物进行更昂贵的研究,从而节省实验时间和资源。