Suppr超能文献

基于极化和熵嵌入原子中心对称函数的机器学习的分子分配系数。

Molecular partition coefficient from machine learning with polarization and entropy embedded atom-centered symmetry functions.

机构信息

Key Laboratory of Mesoscopic Chemistry of Ministry of Education Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, P. R. China.

出版信息

Phys Chem Chem Phys. 2022 Oct 5;24(38):23082-23088. doi: 10.1039/d2cp02648a.

Abstract

Efficient prediction of the partition coefficient (log ) between polar and non-polar phases could shorten the cycle of drug and materials design. In this work, a descriptor, named 〈 - ACSFs〉, is proposed to take the explicit polarization effects in the polar phase and the conformation ensemble of energetic and entropic significance in the non-polar phase into consideration. The polarization effects are involved by embedding the partial charge directly derived from force fields or quantum chemistry calculations into the atom-centered symmetry functions (ACSFs), together with the entropy effects, which are averaged according to the Boltzmann distribution of different conformations taken from the similarity matrix. The model was trained with high-dimensional neural networks (HDNNs) on a public dataset PhysProp (with 41 039 samples). Satisfactory log  prediction performance was achieved on three other datasets, namely, Martel (707 molecules), Star & Non-Star (266) and Huuskonen (1870). The present 〈 - ACSFs〉 model was also applicable to -carboxylic acids with the number of carbons ranging from 2 to 14 and 54 kinds of organic solvent. It is easy to apply the present method to arbitrary sized systems and give a transferable atom-based partition coefficient.

摘要

高效预测极性相与非极性相之间的分配系数(log)可以缩短药物和材料设计的周期。在这项工作中,提出了一个描述符,名为〈-ACSFs〉,用于考虑极性相中的显式极化效应以及非极性相中的能量和熵意义上的构象集合。通过将直接从力场或量子化学计算中得出的部分电荷嵌入到基于原子的对称函数(ACSFs)中,并结合根据相似性矩阵中来自不同构象的玻尔兹曼分布进行平均的熵效应,来考虑极化效应。该模型在 PhysProp 公共数据集(包含 41,039 个样本)上使用高维神经网络(HDNN)进行了训练。在另外三个数据集 Martel(707 个分子)、Star & Non-Star(266 个)和 Huuskonen(1870 个)上,模型实现了令人满意的 log 预测性能。本研究中的〈-ACSFs〉模型还适用于碳原子数为 2 到 14 的-羧酸和 54 种有机溶剂。本方法易于应用于任意大小的系统,并能提供可转移的基于原子的分配系数。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验