Key Laboratory of Mesoscopic Chemistry of Ministry of Education Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, P. R. China.
Phys Chem Chem Phys. 2022 Oct 5;24(38):23082-23088. doi: 10.1039/d2cp02648a.
Efficient prediction of the partition coefficient (log ) between polar and non-polar phases could shorten the cycle of drug and materials design. In this work, a descriptor, named 〈 - ACSFs〉, is proposed to take the explicit polarization effects in the polar phase and the conformation ensemble of energetic and entropic significance in the non-polar phase into consideration. The polarization effects are involved by embedding the partial charge directly derived from force fields or quantum chemistry calculations into the atom-centered symmetry functions (ACSFs), together with the entropy effects, which are averaged according to the Boltzmann distribution of different conformations taken from the similarity matrix. The model was trained with high-dimensional neural networks (HDNNs) on a public dataset PhysProp (with 41 039 samples). Satisfactory log prediction performance was achieved on three other datasets, namely, Martel (707 molecules), Star & Non-Star (266) and Huuskonen (1870). The present 〈 - ACSFs〉 model was also applicable to -carboxylic acids with the number of carbons ranging from 2 to 14 and 54 kinds of organic solvent. It is easy to apply the present method to arbitrary sized systems and give a transferable atom-based partition coefficient.
高效预测极性相与非极性相之间的分配系数(log)可以缩短药物和材料设计的周期。在这项工作中,提出了一个描述符,名为〈-ACSFs〉,用于考虑极性相中的显式极化效应以及非极性相中的能量和熵意义上的构象集合。通过将直接从力场或量子化学计算中得出的部分电荷嵌入到基于原子的对称函数(ACSFs)中,并结合根据相似性矩阵中来自不同构象的玻尔兹曼分布进行平均的熵效应,来考虑极化效应。该模型在 PhysProp 公共数据集(包含 41,039 个样本)上使用高维神经网络(HDNN)进行了训练。在另外三个数据集 Martel(707 个分子)、Star & Non-Star(266 个)和 Huuskonen(1870 个)上,模型实现了令人满意的 log 预测性能。本研究中的〈-ACSFs〉模型还适用于碳原子数为 2 到 14 的-羧酸和 54 种有机溶剂。本方法易于应用于任意大小的系统,并能提供可转移的基于原子的分配系数。