Ramani Vansh, Karmakar Tarak
Department of Chemical Engineering, Indian Institute of Technology, Delhi, Hauz Khas, New Delhi 110016, India.
Department of Chemistry, Indian Institute of Technology, Delhi, Hauz Khas, New Delhi 110016, India.
J Chem Theory Comput. 2024 Aug 13;20(15):6549-6558. doi: 10.1021/acs.jctc.4c00382. Epub 2024 Jul 23.
The prediction of solubility is a complex and challenging physicochemical problem that has tremendous implications for the chemical and pharmaceutical industry. Recent advancements in machine learning methods have provided a great scope for predicting the reliable solubility of a large number of molecular systems. However, most of these methods rely on using physical properties obtained from experiments and expensive quantum chemical calculations. Here, we developed a method that utilizes a graphical representation of solute-solvent interactions using "MolMerger," which captures the strongest polar interactions between molecules using Gasteiger charges and creates a graph incorporating the true nature of the system. Using these graphs as input, a neural network learns the correlation between the structural properties of a molecule in the form of node embedding and its physicochemical properties as the output. This approach has been used to calculate molecular solubility by predicting the Log solubility values of various organic molecules and pharmaceuticals in diverse sets of solvents.
溶解度预测是一个复杂且具有挑战性的物理化学问题,对化学和制药行业有着重大影响。机器学习方法的最新进展为预测大量分子系统的可靠溶解度提供了广阔空间。然而,这些方法大多依赖于使用从实验和昂贵的量子化学计算中获得的物理性质。在此,我们开发了一种方法,该方法利用“MolMerger”对溶质 - 溶剂相互作用进行图形表示,它使用Gasteiger电荷捕获分子间最强的极性相互作用,并创建一个包含系统真实性质的图形。以这些图形为输入,神经网络学习以节点嵌入形式表示的分子结构性质与其作为输出的物理化学性质之间的相关性。这种方法已被用于通过预测各种有机分子和药物在不同溶剂组中的对数溶解度值来计算分子溶解度。