Buonaiuto Michael A, Lang Andrew S I D
Department of Computing and Mathematics, Oral Roberts University, 7777 S. Lewis Avenue, Tulsa, OK 74171 USA.
Chem Cent J. 2015 Sep 24;9(1):50. doi: 10.1186/s13065-015-0131-2. eCollection 2015 Dec.
1-Octanol solubility is important in a variety of applications involving pharmacology and environmental chemistry. Current models are linear in nature and often require foreknowledge of either melting point or aqueous solubility. Here we extend the range of applicability of 1-octanol solubility models by creating a random forest model that can predict 1-octanol solubilities directly from structure.
We created a random forest model using CDK descriptors that has an out-of-bag (OOB) R value of 0.66 and an OOB mean squared error of 0.34. The model has been deployed for general use as a Shiny application.
The 1-octanol solubility model provides reasonably accurate predictions of the 1-octanol solubility of organic solutes directly from structure. The model was developed under Open Notebook Science conditions which makes it open, reproducible, and as useful as possible.Graphical abstract.
1-辛醇溶解度在涉及药理学和环境化学的各种应用中很重要。当前的模型本质上是线性的,并且通常需要熔点或水溶性的先验知识。在这里,我们通过创建一个可以直接从结构预测1-辛醇溶解度的随机森林模型,扩展了1-辛醇溶解度模型的适用范围。
我们使用CDK描述符创建了一个随机森林模型,其袋外(OOB)R值为0.66,OOB均方误差为0.34。该模型已作为一个Shiny应用程序部署以供通用。
1-辛醇溶解度模型可以直接从结构合理准确地预测有机溶质的1-辛醇溶解度。该模型是在开放笔记本科学条件下开发的,这使其具有开放性、可重复性且尽可能有用。图形摘要。